llvm-project

Commit Graph

Author	SHA1	Message	Date
Craig Topper	cfe3b0005f	[RISCV] Reorder masked builtin operands. Use clang_builtin_alias for all overloaded vector builtins. This patch makes the builtin operand order match the C operand order for all intrinsics. With this we can use clang_builtin_alias for all overloaded intrinsics. This should further reduce the test time for vector intrinsics. Differential Revision: https://reviews.llvm.org/D101700	2021-05-02 10:57:25 -07:00
Yaxun (Sam) Liu	c58a6a6fb4	[HIP] Fix device lib selection Choose optimized device lib bitcode by fp options for performance. Reviewed by: Artem Belevich, Fangrui Song Differential Revision: https://reviews.llvm.org/D101654	2021-05-01 20:31:11 -04:00
Craig Topper	f36e6e16a8	[RISCV] Add missing frontend tests for vcompress intrinsics.	2021-05-01 13:07:28 -07:00
Pratyush Das	8518742104	Fix type printing of array template args The code example: ``` constexpr const char kEta[] = "Eta"; template <const char*, typename T> class Column {}; using quick = Column<kEta,double>; void lookup() { quick c1; c1.ls(); } ``` emits error: no member named 'ls' in 'Column<&kEta, double>'. The patch fixes the printed type name by not printing the ampersand for array types. Differential Revision: https://reviews.llvm.org/D36368	2021-05-01 18:50:09 +00:00
Yaxun (Sam) Liu	0175999805	[AMDGPU] Add options -mamdgpu-ieee -mno-amdgpu-ieee The AMDGPU backend needs to know whether floating point opcodes that support exception flag gathering quiet and propagate signaling NaN inputs per IEEE754-2008, which is conveyed by a function attribute "amdgpu-ieee". "amdgpu-ieee"="false" turns this off. Without this function attribute, the backend assumes it is on for compute functions. -mamdgpu-ieee and -mno-amdgpu-ieee are added to Clang to control this function attribute. By default it is on. -mno-amdgpu-ieee requires -fno-honor-nans or equivalent. Reviewed by: Matt Arsenault Differential Revision: https://reviews.llvm.org/D77013	2021-05-01 09:02:55 -04:00
Alex Lorenz	8fc5f07fc0	[clang][driver][darwin] use the deployment target version as the SDK version when passing -platform_version to the linker The use of a valid SDK version is preferred over an empty SDK version (0.0.0) as the system's runtime might expect the linked binary to contain a valid SDK version in order for the binary to work correctly rdar://66795188	2021-04-30 18:54:02 -07:00
Nemanja Ivanovic	bfd60b36f8	[PowerPC] Add floating point overloads for vec_sldw These are added for compatibility with XLC.	2021-04-30 20:29:03 -05:00
Nemanja Ivanovic	c3da07d216	[PowerPC] Provide fastmath sqrt and div functions in altivec.h This adds the long overdue implementations of these functions that have been part of the ABI document and are now part of the "Power Vector Intrinsic Programming Reference" (PVIPR). The approach is to add new builtins and to emit code with the fast flag regardless of whether fastmath was specified on the command line. Differential revision: https://reviews.llvm.org/D101209	2021-04-30 19:17:48 -05:00
Joel E. Denny	82e99f5035	[OpenMP] Fix second debug name from map clause This patch fixes a bug from D89802. For example, without it, Clang generates x as the debug map name for both x and y in the following example: ``` #pragma omp target map(to: x, y) x = y = 1; ``` Reviewed By: jhuber6 Differential Revision: https://reviews.llvm.org/D101564	2021-04-30 16:26:59 -04:00
Vince Bridgers	a27af1d816	[analyzer] Fix assertion in SVals.h Fix assertion in SVals.h apparently caused by https://reviews.llvm.org/D89055. clang:clang/include/clang/StaticAnalyzer/Core/PathSensitive/SVals.h:596: clang::ento::loc::MemRegionVal::MemRegionVal(const clang::ento::MemRegion *): Assertion `r' failed. Backtrace: ... clang/include/clang/StaticAnalyzer/Core/PathSensitive/SVals.h:597:3 clang::QualType, clang::QualType) clang/lib/StaticAnalyzer/Core/SValBuilder.cpp:773:18 clang::QualType, clang::QualType) clang/lib/StaticAnalyzer/Core/SValBuilder.cpp:612:12 clang::QualType) clang/lib/StaticAnalyzer/Core/SValBuilder.cpp:587:12 namespace)::RegionBindingsRef const&, clang::ento::Loc, clang::QualType) clang/lib/StaticAnalyzer/Core/RegionStore.cpp:1510:24 ... Reviewed By: ASDenysPetrov Differential Revision: https://reviews.llvm.org/D101635	2021-04-30 11:00:43 -05:00
Adam Czachorowski	fbfcfdbf68	[clang] Fix assert() crash when checking undeduced arg alignment There already was a check for undeduced and incomplete types, but it failed to trigger when outer type (SubstTemplateTypeParm in test) looked fine, but inner type was not. Differential Revision: https://reviews.llvm.org/D100667	2021-04-30 16:24:33 +02:00
Anastasia Stulova	3ec82e5195	[OpenCL] Prevent adding vendor extensions for all targets Removed extension begin/end pragma as it has no effect and it is added unconditionally for all targets. Differential Revision: https://reviews.llvm.org/D92244	2021-04-30 14:42:51 +01:00
Florian Hahn	6c31295493	[clang] Refactor mustprogress handling, add it to all loops in c++11+. Currently Clang does not add mustprogress to infinite loops with a known constant condition, matching C11 behavior. The forward progress guarantee in C++11 and later should allow us to add mustprogress to any loop (http://eel.is/c++draft/intro.progress#1). This allows us to simplify the code dealing with adding mustprogress a bit. Reviewed By: aaron.ballman, lebedev.ri Differential Revision: https://reviews.llvm.org/D96418	2021-04-30 14:13:47 +01:00
Tomas Matheson	b14a6f06cc	[ARM][MVE] vcreateq lane ordering for big endian Use of bitcast resulted in lanes being swapped for vcreateq with big endian. Fix this by using vreinterpret. No code change for little endian. Adds IR lit test. Differential Revision: https://reviews.llvm.org/D101606	2021-04-30 13:48:05 +01:00
Keith Walker	109bf25e2c	[AArch64] Change __ARM_FEATURE_FP16FML macro name to __ARM_FEATURE_FP16_FML The "Arm C Language extensions" document (the current version can be found at https://developer.arm.com/documentation/101028/0012/?lang=en) states that the name of the feature test macro for the FP16 FML extension is __ARM_FEATURE_FP16_FML. Differential Revision: https://reviews.llvm.org/D101532	2021-04-30 11:03:15 +01:00
Evgeny Leviant	c81ec19fba	Fix -fdebug-pass-structure test case Pass structure can change when -O0 is given and extensions are used.	2021-04-30 10:18:23 +03:00
Wang, Pengfei	e0c7db7d8c	[MS] Preserve base register %rbx around cpuid This patch copies the implementation from cpuid.h, which preserves base register %rbx around cpuid. It fixes PR50133. Reviewed By: rnk Differential Revision: https://reviews.llvm.org/D101338	2021-04-30 10:16:25 +08:00
Akira Hatanaka	809435e390	[Sema] Don't set BlockDecl's DoesNotEscape bit if the parameter type of the function the block is passed to isn't a block pointer type This patch fixes a bug where a block passed to a function taking a parameter that doesn't have a block pointer type (e.g., id or reference to a block pointer) was marked as noescape. This partially fixes PR50043. rdar://77030453 Differential Revision: https://reviews.llvm.org/D101097	2021-04-29 16:19:48 -07:00
Akira Hatanaka	2e1d9ebd46	[ObjC][ARC] Don't enter the cleanup scope if the initializer expression isn't an ExprWithCleanups This patch fixes a bug where a temporary ObjC pointer is released before the end of the full expression. This fixes PR50043. rdar://77030453 Differential Revision: https://reviews.llvm.org/D101502	2021-04-29 16:04:30 -07:00
Alex Lorenz	6b938d2ead	Recommit "[clang][driver] Use the provided arch name for a Darwin target triple" This ensures that the Darwin driver uses a consistent target triple representation when the triple is printed out to the user. This reverts the revert commit `ab0df6c034`. Differential Revision: https://reviews.llvm.org/D100807	2021-04-29 15:00:40 -07:00
Dan Liew	2d42b2ee7b	[ASan] Rename `-fsanitize-address-destructor-kind=` to drop the `-kind` suffix. Renaming the option is based on discussions in https://reviews.llvm.org/D101122. It is normally not a good idea to rename driver flags but this flag is new enough and obscure enough that it is very unlikely to have adopters. While we're here also drop the `<kind>` metavar. It's not necessary and is actually inconsistent with the documentation in `clang/docs/ClangCommandLineReference.rst`. Differential Revision: https://reviews.llvm.org/D101491	2021-04-29 11:55:42 -07:00
Chirag Khandelwal	c204106188	[Clang][OpenMP] Frontend work for sections - D89671 This patch is child of D89671, contains the clang implementation to use the OpenMP IRBuilder's section construct. Co-author: @anchu-rajendran Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D91054	2021-04-29 19:52:27 +05:30
David Zarzycki	3eb2be67b9	Unbreak no-asserts testing	2021-04-29 10:01:37 -04:00
Evgeny Leviant	6a0283d0d2	[NewPM] Add an option to dump pass structure Patch adds -debug-pass-structure option to dump pass structure when new pass manager is used. Differential revision: https://reviews.llvm.org/D99599	2021-04-29 10:29:42 +03:00
Denys Petrov	b30521c28a	[analyzer] Wrong type cast occurs during pointer dereferencing after type punning Summary: During pointer dereferencing CastRetrievedVal uses wrong type from the Store after type punning. Namely, the pointer casts to another type and then assigns with a value of one more another type. It produces NonLoc value when Loc is expected. Differential Revision: https://reviews.llvm.org/D89055 Fixes: https://bugs.llvm.org/show_bug.cgi?id=37503 https://bugs.llvm.org/show_bug.cgi?id=49007	2021-04-29 01:03:38 +03:00
Petr Hosek	ea12d779bc	[libc++] Support per-target __config_site in per-target runtime build When using the per-target runtime build, it may be desirable to have different __config_site headers for each target where all targets cannot share a single configuration. The layout used for libc++ headers after this change is: ``` include/ c++/ v1/ <libc++ headers except for __config_site> <target1>/ c++/ v1/ __config_site <target2>/ c++/ v1/ __config_site <other targets> ``` This is the optimal layout since it avoids duplication: the only header that is per-target is __config_site, and all other headers are shared across targets. This also means that we now need two -isystem flags: one for the target-agnostic headers and one for the target-specific headers. Differential Revision: https://reviews.llvm.org/D89013	2021-04-28 14:27:16 -07:00
Arthur Eubanks	6d8d133862	Revert "[Clang] -Wunused-but-set-parameter and -Wunused-but-set-variable" This reverts commit `9b0501abc7`. False positives reported in D100581.	2021-04-28 12:47:18 -07:00
Ryan Santhirarajan	0395f9e70b	[ARM] Neon Polynomial vadd Intrinsic fix The Neon vadd intrinsics were added to the ARMSIMD intrinsic map, however due to being defined under an AArch64 guard in arm_neon.td, were not previously useable on ARM. This change rectifies that. It is important to note that poly128 is not valid on ARM, thus it was extracted out of the original arm_neon.td definition and separated for the sake of AArch64. Reviewed By: DavidSpickett Differential Revision: https://reviews.llvm.org/D100772	2021-04-28 11:59:40 -07:00
Valeriy Savchenko	602c8b4db5	[analyzer][NFC] Fix tests failing after a rebase	2021-04-28 18:55:20 +03:00
Valeriy Savchenko	ab5823867c	[analyzer] Find better description for tracked symbolic values When searching for stores and creating corresponding notes, the analyzer is more specific about the target region of the store as opposed to the stored value. While this description was tweaked for constant and undefined values, it lacked in the most general case of symbolic values. This patch tries to find a memory region, where this value is stored, to use it as a better alias for the value. rdar://76645710 Differential Revision: https://reviews.llvm.org/D101041	2021-04-28 18:37:38 +03:00
Valeriy Savchenko	e273918038	[analyzer] Track leaking object through stores Since we can report memory leaks on one variable, while the originally allocated object was stored into another one, we should explain how it got there. rdar://76645710 Differential Revision: https://reviews.llvm.org/D100852	2021-04-28 18:37:38 +03:00
Valeriy Savchenko	61ae2db2d7	[analyzer] Adjust the reported variable name in retain count checker When reporting leaks, we try to attach the leaking object to some variable, so it's easier to understand. Before the patch, we always tried to use the first variable that stored the object in question. This can get very confusing for the user, if that variable doesn't contain that object at the moment of the actual leak. In many cases, the warning is dismissed as false positive and it is effectively a false positive when we fail to properly explain the warning to the user. This patch addresses the biggest issue in cases like this. Now we check if the variable still contains the leaking symbolic object. If not, we look for the last variable to actually hold it and use that variable instead. rdar://76645710 Differential Revision: https://reviews.llvm.org/D100839	2021-04-28 18:37:37 +03:00
David Candler	b8baa2a913	[ARM][AArch64] Require appropriate features for crypto algorithms This patch changes the AArch32 crypto instructions (sha2 and aes) to require the specific sha2 or aes features. These features have already been implemented and can be controlled through the command line, but do not have the expected result (i.e. `+noaes` will not disable aes instructions). The crypto feature retains its existing meaning of both sha2 and aes. Several small changes are included due to the knock-on effect this has: - The AArch32 driver has been modified to ensure sha2/aes is correctly set based on arch/cpu/fpu selection and feature ordering. - Crypto extensions are permitted for AArch32 v8-R profile, but not enabled by default. - ACLE feature macros have been updated with the fine grained crypto algorithms. These are also used by AArch64. - Various tests updated due to the change in feature lists and macros. Reviewed By: lenary Differential Revision: https://reviews.llvm.org/D99079	2021-04-28 16:26:18 +01:00
Anton Zabaznov	f0efc00751	[OpenCL] Introduce new method for validating OpenCL target Language options are not available when a target is being created, thus, a new method is introduced. Also, some refactoring is done, such as removing OpenCL feature macros setting from TargetInfo. Reviewed By: Anastasia Differential Revision: https://reviews.llvm.org/D101087	2021-04-28 16:00:02 +03:00
Alexey Bataev	c835630c25	[OPENMP]Fix PR49098: respect firstprivate of declare target variable. Need to respect mapping/privatization of declare target variables in the target regions if explicitly specified by the user. Differential Revision: https://reviews.llvm.org/D99530	2021-04-28 05:39:10 -07:00
Hans Wennborg	2d37f2115d	Try to fix clang/test/Driver/cl-options.c on non-x86 hosts The /QIntel-jcc-erratum flag only works when targeting x86, so pass --target to the driver to do that also on non-x86 hosts.	2021-04-28 11:57:09 +02:00
Hans Wennborg	789549bea4	[clang-cl] Map /QIntel-jcc-erratum to -mbranches-within-32B-boundaries	2021-04-28 11:10:08 +02:00
Hsiangkai Wang	b358a2be52	[RISCV] Remove riscv32 test cases for vector intrinsics.	2021-04-28 15:54:25 +08:00
Petr Hosek	36430d44ed	[Driver] Use normalized triples for per-target runtimes This is a partial revert of `b4537c3f51` based on the discussion in https://reviews.llvm.org/D101194. Rather than using the getMultiarchTriple, we use the getTripleString.	2021-04-27 22:31:36 -07:00
Petr Hosek	a921d2d2fb	[Driver] Add -print-multiarch This is useful in runtimes build for example which currently try to guess the correct triple where to place libraries in the multiarch layout. Using this flag, the build system can get the correct triple directly by querying Clang. Differential Revision: https://reviews.llvm.org/D101400	2021-04-27 16:04:54 -07:00
Samuel Thibault	b13e913b31	hurd: Clean up test - Unsupported Windows to drop backslashes code - Upgrade to current gcc 10 version Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D101347	2021-04-27 13:19:17 -07:00
Samuel Thibault	932e8c3241	hurd: Detect libstdc++ include paths on Debian Hurd i386 This is a follow-up of `e92d2b80c6` ("[Driver] Detect libstdc++ include paths for native gcc (-m32 and -m64) on Debian i386") for the Debian Hurd case, which has the same multiarch name reduction from i686 to i386. i386-linux-gnu is actually Linux-only, so this moves the code of that commit to Linux.cpp, and adds the same to Hurd.cpp Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D101331	2021-04-27 13:04:41 -07:00
Samuel Thibault	9c552d27ee	hurd: Fix i386 research path `f263418402` ("[Driver] Gnu.cpp: remove obsoleted i386 triple detection from end-of-life distribution versions") dropped the i686-gnu gcc path, but GNU/Hurd's gcc is actually using it, and not i386. This fixes the gcc path and updates the tests to reflect it. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D101317	2021-04-27 12:41:18 -07:00
Petr Hosek	887bdff8ba	[Driver] Fix tests failing in per-target multiarch layout These failures were revealed by `b4537c3f51`. Differential Revision: https://reviews.llvm.org/D101348	2021-04-27 10:19:00 -07:00
Gabor Marton	4b99f9c7db	[analyzer][StdLibraryFunctionsChecker] Track dependent arguments When we report an argument constraint violation, we should track those other arguments that participate in the evaluation of the violation. By default, we depend only on the argument that is constrained, however, there are some special cases like the buffer size constraint that might be encoded in another argument(s). Differential Revision: https://reviews.llvm.org/D101358	2021-04-27 15:35:58 +02:00
Anastasia Stulova	6a92c19f3b	[C++4OpenCL] Add diagnostics for OpenCL types in templates. Refactored diagnostics for OpenCL types to allow their reuse for templates. Patch by olestrohm (Ole Strohm)! Differential Revision: https://reviews.llvm.org/D100860	2021-04-27 13:04:25 +01:00
Pushpinder Singh	59ad4e0f01	Reapply "[AMDGPU][OpenMP] Add amdgpu-arch tool to list AMD GPUs installed" This reverts commit `93604305bb`.	2021-04-27 10:47:05 +00:00
Pushpinder Singh	93604305bb	Revert "Reapply "[AMDGPU][OpenMP] Add amdgpu-arch tool to list AMD GPUs installed"" This reverts commit `15be0c41d2`.	2021-04-27 02:23:44 +00:00
Alex Lorenz	2509f9fbad	[clang] Don't crash when loading invalid VFS for the module dep collector The VFS is null when it's invalid so return early in collectVFSFromYAML.	2021-04-26 17:05:22 -07:00
Yonghong Song	a2a3ca8d97	BPF: emit debuginfo for Function of DeclRefExpr if requested Commit `e3d8ee35e4` ("reland "[DebugInfo] Support to emit debugInfo for extern variables"") added support to emit debugInfo for extern variables if requested by the target. Currently, only BPF target enables this feature by default. As BPF ecosystem grows, callback function started to get support, e.g., recently bpf_for_each_map_elem() is introduced (https://lwn.net/Articles/846504/) with a callback function as an argument. In the future we may have something like below as a demonstration of use case : extern int do_work(int); long bpf_helper(void *callback_fn, void *callback_ctx, ...); long prog_main() { struct { ... } ctx = { ... }; return bpf_helper(&do_work, &ctx, ...); } Basically bpf helper may have a callback function and the callback function is defined in another file or in the kernel. In this case, we would like to know the debuginfo types for do_work(), so the verifier can properly verify the safety of bpf_helper() call. For the following example, extern int do_work(int); long bpf_helper(void *callback_fn); long prog() { return bpf_helper(&do_work); } Currently, there is no debuginfo generated for extern function do_work(). In the IR, we have, ... define dso_local i64 @prog() local_unnamed_addr #0 !dbg !7 { entry: %call = tail call i64 @bpf_helper(i8* bitcast (i32 (i32)* @do_work to i8*)) #2, !dbg !11 ret i64 %call, !dbg !12 } ... declare dso_local i32 @do_work(i32) #1 ... This patch added support for the above callback function use case, and the generated IR looks like below: ... declare !dbg !17 dso_local i32 @do_work(i32) #1 ... !17 = !DISubprogram(name: "do_work", scope: !1, file: !1, line: 1, type: !18, flags: DIFlagPrototyped, spFlags: DISPFlagOptimized, retainedNodes: !2) !18 = !DISubroutineType(types: !19) !19 = !{!20, !20} !20 = !DIBasicType(name: "int", size: 32, encoding: DW_ATE_signed) The TargetInfo.allowDebugInfoForExternalVar is renamed to TargetInfo.allowDebugInfoForExternalRef as now it guards both extern variable and extern function debuginfo generation. Differential Revision: https://reviews.llvm.org/D100567	2021-04-26 16:53:25 -07:00
Michael Benfield	9b0501abc7	[Clang] -Wunused-but-set-parameter and -Wunused-but-set-variable These are intended to mimic warnings available in gcc. -Wunused-but-set-variable is triggered in the case of a variable which appears on the LHS of an assignment but is not otherwise used. For instance: void f() { int x; x = 0; } -Wunused-but-set-parameter works similarly, but for function parameters instead of variables. In C++, they are triggered only for scalar types; otherwise, they are triggered for all types. This is gcc's behavior. -Wunused-but-set-parameter is controlled by -Wextra, while -Wunused-but-set-variable is controlled by -Wunused. This is slightly different from gcc's behavior, but seems most consistent with clang's behavior for -Wunused-parameter and -Wunused-variable. Reviewed By: aeubanks Differential Revision: https://reviews.llvm.org/D100581	2021-04-26 15:09:03 -07:00
Alex Lorenz	ab0df6c034	Revert "[clang][driver] Use the provided arch name for a Darwin target triple" This reverts commit `6cc62043c8`. This caused a test failure on a M1 mac CI job (https://reviews.llvm.org/D100807#2718006), I will recommit this with a fix.	2021-04-26 14:57:00 -07:00
Martin Storsjö	f8de9aaef2	[llvm-rc] Add a GNU windres-like frontend to llvm-rc This primarily parses a different set of options and invokes the same resource compiler as llvm-rc normally. Additionally, it can convert directly to an object file (which in MSVC style setups is done with the separate cvtres tool, or by the linker). (GNU windres also supports other conversions; from coff object file back to .res, and from .res or object file back to .rc form; that's not yet implemented.) The other bigger complication lies in being able to imply or pass the intended target triple, to let clang find the corresponding mingw sysroot for finding include files, and for specifying the default output object machine format. It can be implied from the tool triple prefix, like `<triple>-[llvm-]windres` or picked up from the windres option e.g. `-F pe-x86-64`. In GNU windres, that option takes BFD style format names such as pe-i386 or pe-x86-64. As libbfd in binutils doesn't support Windows on ARM, there's no such canonical name for the ARM targets. Therefore, as an LLVM specific extension, this option is extended to allow passing full triples, too. Differential Revision: https://reviews.llvm.org/D100756	2021-04-26 22:04:29 +03:00
Alex Lorenz	6cc62043c8	[clang][driver] Use the provided arch name for a Darwin target triple This ensures that the Darwin driver uses a consistent target triple representation when the triple is printed out to the user. Differential Revision: https://reviews.llvm.org/D100807	2021-04-26 11:31:50 -07:00
Wael Yehia	91db11c98d	XFAIL thinlto testcases on AIX In D100350, we switched off thinLTO on the AIX OS. So, XFAIL tests that invoke clang with -flto=thin on AIX.	2021-04-26 17:02:06 +00:00
Nigel Perks	e7b6c0f398	[clang][XCore] Define __xcore__ for XCore target. The headers shipped with the XMOS XCore compiler expect __xcore__ to be defined. The __XS1B__ macro, already defined, is for the default subtarget. No other targets affected.	2021-04-26 15:06:04 +01:00
Alexey Bader	7818906ca1	[SYCL] Implement SYCL address space attributes handling Default address space (applies when no explicit address space was specified) maps to generic (4) address space. Added SYCL named address spaces `sycl_global`, `sycl_local` and `sycl_private` defined as sub-sets of the default address space. Static variables without address space now reside in global address space when compiling for a SPIR target, unless they have an explicit address space qualifier in source code. Differential Revision: https://reviews.llvm.org/D89909	2021-04-26 13:44:10 +03:00
Jeremy Morse	3c9bcf0e35	[Clang][Coroutine][DebugInfo] Relax test ordering requirement The test added in D97533 (and modified by this patch) has some overly strict printed metadata ordering requirements, specifically the interleaving of DILocalVariable nodes and DILocation nodes. Slight changes in metadata emission can easily break this unfortunately. This patch stops after clang codegen rather than allowing the coro splitter to run, and reduces the need for ordering: it picks out the DILocalVariable nodes being sought, in any order (CHECK-DAG), and doesn't examine any DILocations. The implicit CHECK-NOT is what's important: the test seeks to ensure a duplicate set of DILocalVariables aren't emitted in the same scope. Differential Revision: https://reviews.llvm.org/D100298	2021-04-26 10:07:22 +01:00
Hsiangkai Wang	16dc9afa09	[RISCV] Add REQUIRES: riscv-registered-target for RISC-V test cases.	2021-04-26 17:02:27 +08:00
Jan Svoboda	0f7d4105c6	[clang][deps] Only generate absolute paths when asked to Add option to `clang-scan-deps` to enable/disable generation of command-line arguments with absolute paths. This is essentially a revert of D100533, but with improved naming and added test. Reviewed By: dexonsmith Differential Revision: https://reviews.llvm.org/D101051	2021-04-26 10:53:41 +02:00
Hsiangkai Wang	bd32c2d648	[RISCV] Implement the vwcvt{u}.x.x.v/vncvt.x.x.w builtin. Differential Revision: https://reviews.llvm.org/D100824	2021-04-26 15:38:31 +08:00
Hsiangkai Wang	645c5f27a8	[RISCV] Implement the pseudo compare builtin. Differential Revision: https://reviews.llvm.org/D100823	2021-04-26 15:38:31 +08:00
Hsiangkai Wang	bfb3fca7e1	[RISCV] Implement the vfabs.v/vfneg.v builtin. Differential Revision: https://reviews.llvm.org/D100822	2021-04-26 15:38:31 +08:00
Hsiangkai Wang	4b2434172c	[RISCV] Implement the vmmv.m/vmnot.m builtin. Differential Revision: https://reviews.llvm.org/D100821	2021-04-26 15:38:31 +08:00
Levy Hsu	8cf54c7ff5	[RISCV] [1/2] Add IR intrinsic for Zbe extension RV32/64: bcompress bdecompress RV64 ONLY: bcompressw bdecompressw Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D101143	2021-04-25 19:14:34 -07:00
Hsiangkai Wang	d8fa5ef6a2	[RISCV] Apply clang_builtin_alias to overloaded builtins. We only apply `clang_builtin_alias` to non-masked builtins. Masked builtins could not use `clang_builtin_alias` because the operand order is different between overloaded intrinsics and builtins. A bunch of test cases need to be updated. Differential Revision: https://reviews.llvm.org/D100658	2021-04-25 18:12:19 +08:00
Hsiangkai Wang	108864397d	[Clang] Add clang attribute `clang_builtin_alias`. In some cases, we want to provide the alias name for the clang builtins. For example, the arguments must be constant integers for some RISC-V builtins. If we use wrapper functions, we cannot constrain the arguments to be constant integers. This attribute is used to achieve that purpose. Besides this, using `clang_builtin_alias` is more efficient than using wrapper functions. We use this attribute to deal with the test time issue reported in https://bugs.llvm.org/show_bug.cgi?id=49962. In our downstream testing, it could decrease the testing time from 6.3 seconds to 3.7 seconds for vloxei.c test. Differential Revision: https://reviews.llvm.org/D100611	2021-04-25 08:49:19 +08:00
David Tolnay	967ebad125	Fix null ptr crash dumping TemplateTemplateParmDecl The following program winds up with D->getDefaultArgStorage().getInheritedFrom() == nullptr during dumping the TemplateTemplateParmDecl corresponding to the template parameter of i. template <typename> struct R; template <template <typename> class = R> void i(); This patch fixes the null pointer dereference.	2021-04-24 12:28:10 -04:00
Jon Chesterfield	fc88d927e3	[clang][amdgpu] Use implicit code object version At present, clang always passes amdhsa-code-object-version on to -cc1. That is great for certainty over what object version is being used when debugging. Unfortunately, the command line argument is in AMDGPUBaseInfo.cpp in the amdgpu target. If clang is used with an llvm compiled with DLLVM_TARGETS_TO_BUILD that excludes amdgpu, this will be diagnosed (as discovered via D98658): - Unknown command line argument '--amdhsa-code-object-version=4' This means that clang, built only for X86, can be used to compile the nvptx devicertl for openmp but not the amdgpu one. That would shortly spawn fragile logic in the devicertl cmake to try to guess whether the clang used will work. This change omits the amdhsa-code-object-version parameter when it matches the default that AMDGPUBaseInfo.cpp specifies, with a comment to indicate why. As this is the only part of clang's codegen for amdgpu that depends on the target in the back end it suffices to build the openmp runtime on most (all?) systems. It is a non-functional change, though observable in the updated tests and when compiling with -###. It may cause minor disruption to the amd-stg-open branch. Revision of D98746, builds on refactor in D101077 Reviewed By: yaxunl Differential Revision: https://reviews.llvm.org/D101095	2021-04-23 23:52:50 +01:00
Thomas Lively	502f54049d	[WebAssembly] Finalize wasm_simd128.h intrinsics Adds new intrinsics for instructions that are in the final SIMD spec but did not previously have intrinsics. Also updates the names of existing intrinsics to reflect the final names of the underlying instructions in the spec. Keeps the old names as deprecated functions to ease the transition to the new names. Differential Revision: https://reviews.llvm.org/D101112	2021-04-23 13:37:27 -07:00
Nemanja Ivanovic	19b29b1ed1	[PowerPC] Provide XL-compatible builtins in altivec.h There are some interfaces in altivec.h that are not compatible between Clang and XL (although Clang is compatible with GCC). Currently, we have found 3 but there may be others. Clang/GCC signatures: vector double vec_ctf(vector signed long long) vector double vec_ctf(vector unsigned long long) vector signed long long vec_cts(vector double) vector unsigned long long vec_ctu(vector double) XL signatures: vector float vec_ctf(vector signed long long) vector float vec_ctf(vector unsigned long long) vector signed int vec_cts(vector double) vector unsigned int vec_ctu(vector double) This patch provides the XL behaviour under the __XL_COMPAT_ALTIVEC__ macro for users that rely on XL behaviour. Differential revision: https://reviews.llvm.org/D101130	2021-04-23 15:13:46 -05:00
Johannes Doerfert	cbe8b57a67	[Clang] Allow the combination of loader_uninitialized and address spaces When an object is allocated in a non-default address space we do not need to check for a constructor if it is not initialized and has a trivial constructor (which we won't call then). Reviewed By: JonChesterfield Differential Revision: https://reviews.llvm.org/D100929	2021-04-23 11:21:52 -05:00
Nemanja Ivanovic	6725b90a02	[PowerPC] Add vec_ctsl and vec_ctul to altivec.h These are added for compatibility with XLC. They are similar to vec_cts and vec_ctu except that the result is a doubleword vector regardless of the parameter type.	2021-04-23 11:03:38 -05:00
Gabor Marton	a7cb951fa4	[Analyzer][StdLibraryFunctionsChecker] Describe arg constraints In this patch, I provide a detailed explanation for each argument constraint. This explanation is added in an extra 'note' tag, which is displayed alongside the warning. Since these new notes describe clearly the constraint, there is no need to provide the number of the argument (e.g. 'Arg3') within the warning. However, I decided to keep the name of the constraint in the warning (but this could be a subject of discussion) in order to be able to identify the different kind of constraint violations easily in a bug database (e.g. CodeChecker). Differential Revision: https://reviews.llvm.org/D101060	2021-04-23 17:27:54 +02:00
Anastasia Stulova	fcb45b544d	[OpenCL] Fix typo in the test.	2021-04-23 14:36:36 +01:00
Dávid Bolvanský	2cae7025c1	Reland "[Clang] Propagate guaranteed alignment for malloc and others" This relands commit `6914a0ed2b`. Crash in InstCombine was fixed.	2021-04-23 14:05:57 +02:00
Adam Czachorowski	ddfbdbfefa	[clang] Do not crash on template specialization following a fatal error There was a missing isInvalid() check leading to an attempt to instantiate template with an empty instantiation stack. Differential Revision: https://reviews.llvm.org/D100675	2021-04-23 13:34:05 +02:00
Dávid Bolvanský	6914a0ed2b	Revert "[Clang] Propagate guaranteed alignment for malloc and others" This reverts commit `c2297544c0`. Some buildbots are broken.	2021-04-23 11:33:33 +02:00
Dávid Bolvanský	c2297544c0	[Clang] Propagate guaranteed alignment for malloc and others LLVM should be smarter about known malloc's alignment and this knowledge may enable other optimizations. Originally started as LLVM patch - https://reviews.llvm.org/D100862 but this logic should be really in Clang. Reviewed By: rjmccall Differential Revision: https://reviews.llvm.org/D100879	2021-04-23 11:07:14 +02:00
Hsiangkai Wang	43cd588c2d	[RISCV] Implement the vnot.v builtin. Differential Revision: https://reviews.llvm.org/D100820	2021-04-23 13:16:23 +08:00
Hsiangkai Wang	14cc1cb222	[RISCV] Implement the vneg.v builtin. Differential Revision: https://reviews.llvm.org/D100819	2021-04-23 11:40:18 +08:00
Fangrui Song	2786e673c7	[IR][sanitizer] Add module flag "frame-pointer" and set it for cc1 -mframe-pointer={non-leaf,all} The Linux kernel objtool diagnostic `call without frame pointer save/setup` arises in multiple instrumentation passes (asan/tsan/gcov). With the mechanism introduced in D100251, it's trivial to respect the command line -m[no-]omit-leaf-frame-pointer/-f[no-]omit-frame-pointer, so let's do it. Fix: https://github.com/ClangBuiltLinux/linux/issues/1236 (tsan) Fix: https://github.com/ClangBuiltLinux/linux/issues/1238 (asan) Also document the function attribute "frame-pointer" which is long overdue. Differential Revision: https://reviews.llvm.org/D101016	2021-04-22 18:07:30 -07:00
Jon Chesterfield	15be0c41d2	Reapply "[AMDGPU][OpenMP] Add amdgpu-arch tool to list AMD GPUs installed" This reverts commit `24c1ed3b34`.	2021-04-23 01:07:16 +01:00
Richard Smith	40beb1f84a	CWG2076: Permit implicit conversions within a single level of braces during overload resolution, even when calling a copy constructor.	2021-04-22 16:49:39 -07:00
Levy Hsu	b49337bbb9	[RISCV] [1/2] Add IR intrinsic for Zbp extension RV32/64: grev grevi gorc gorci shfl shfli unshfl unshfli RV64 ONLY: grevw greviw gorcw gorciw shflw shfli (For non-existing shfliw) unshfli (For non-existing unshfliw) Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D100830	2021-04-22 16:34:51 -07:00
Yaxun (Sam) Liu	8baba6890d	[HIP] Support overloaded math functions for hipRTC Remove the dependence on standard C++ header for overloaded math functions in HIP header since standard C++ header is not available for hipRTC. Reviewed by: Artem Belevich, Justin Lebar Differential Revision: https://reviews.llvm.org/D100794	2021-04-22 19:06:51 -04:00
Petr Hosek	d5f433d330	Revert "Re-land "[Driver] Support default libc++ library location on Darwin"" This reverts commit `6331680ad2` because this breaks the compiler-rt build.	2021-04-22 14:04:24 -07:00
Jon Chesterfield	24c1ed3b34	Revert "[AMDGPU][OpenMP] Add amdgpu-arch tool to list AMD GPUs installed" This reverts commit `722d4d8e75`. Unclear where hsa.h should be included from, see report in D99949	2021-04-22 19:39:37 +01:00
Dávid Bolvanský	abf3ca61e3	[Diagnostics] Restore -Wdeprecated warning when user-declared copy assignment operator is defined as deleted (PR45634) Solves https://bugs.llvm.org/show_bug.cgi?id=45634 Be more aggressive than GCC with -Wdeprecated-copy. Also provide -W(no-)deprecated-copy-user-provided-copy/dtor options to turn this behaviour on/off. Reviewed By: Quuxplusone Differential Revision: https://reviews.llvm.org/D79714	2021-04-22 20:35:41 +02:00
Petr Hosek	45340efb4c	[Driver] Specify -ccc-install-dir for linux-cross test This avoids test failures where extra files exist in the tree, such as the standard library built using the runtimes build. Differential Revision: https://reviews.llvm.org/D101023	2021-04-22 10:58:26 -07:00
Nemanja Ivanovic	e43963db24	[PowerPC] Add vec_load_splats to altivec.h Add these overloads for compatibility with XLC. This is a word load-and-splat.	2021-04-22 10:31:00 -05:00
Nemanja Ivanovic	a0e6189712	[PowerPC] Add vec_xlds to altivec.h Add these overloads for compatibility with XLC. This is a doubleword load-and-splat.	2021-04-22 10:31:00 -05:00
Nemanja Ivanovic	a1d325af67	[PowerPC] Add vec_roundz as alias for vec_trunc in altivec.h Add the overloads for compatibility with XLC.	2021-04-22 10:31:00 -05:00
Nemanja Ivanovic	1550c47c18	[PowerPC] Add vec_roundp as alias for vec_ceil Add the overloads for compatibility with XLC.	2021-04-22 10:30:59 -05:00
Nemanja Ivanovic	3a46667059	[PowerPC] Add vec_roundm as alias for vec_floor in altivec.h Add the overloads for compatibility with XLC.	2021-04-22 10:30:59 -05:00
Anastasia Stulova	362958ac73	[C++4OpenCL] Add extra diagnostics for kernel argument types Add restrictions on type layout (PR48099): - Types passed by pointer or reference must be standard layout types. - Types passed by value must be POD types. Patch by olestrohm (Ole Strohm)! Differential Revision: https://reviews.llvm.org/D100471	2021-04-22 15:28:04 +01:00
Sven van Haastregt	4138e7bd76	[OpenCL] Add missing C++ legacy atomics with generic https://reviews.llvm.org/D62335 added some C++ for OpenCL specific builtins to opencl-c.h, but these were not mirrored to the TableGen builtin functions yet. The TableGen builtins machinery does not have dedicated version handling for C++ for OpenCL at the moment: all builtin versioning is tied to `LangOpts.OpenCLVersion` (i.e., the OpenCL C version). As a workaround, to add builtins that are only available in C++ for OpenCL, we define a function extension guarded by the __cplusplus macro. Differential Revision: https://reviews.llvm.org/D100935 Fixes PR50041.	2021-04-22 15:08:36 +01:00
Nathan Sidwell	6ad7e87806	clang: libstdc++ LWM is 4.8.3 Document oldest libstdc++ as 4.8.3, remove a hack for a 4.6 issue. Differential Revision: https://reviews.llvm.org/D100465	2021-04-22 05:26:07 -07:00
Sander de Smalen	e951b045bf	[AArch64][SVE] Regression test all ACLE tests with C++ We found issues with a number of intrinsics when building them with C++, so it makes sense to guard these tests with some extra RUN lines to build the tests in C++ mode.	2021-04-22 13:24:04 +01:00
Valeriy Savchenko	5780dbeee6	[-Wcalled-once] Do not run analysis on Obj-C++ Objective-C++ is not yet supported. rdar://76729552 Differential Revision: https://reviews.llvm.org/D100955	2021-04-22 15:20:52 +03:00
Jan Svoboda	850e01a34d	[clang][deps] Check extra args in tests These flags are being generated by `clang-scan-deps` and it makes sense to ensure it keeps doing so.	2021-04-22 14:10:08 +02:00
Hamza Mahfooz	be2277fbf2	[Matrix] Support #pragma clang fp From https://bugs.llvm.org/show_bug.cgi?id=49739: Currently, `#pragma clang fp` are ignored for matrix types. For the code below, the `contract` fast-math flag should be added to the generated call to `llvm.matrix.multiply` and `fadd` ``` typedef float fx2x2_t __attribute__((matrix_type(2, 2))); void foo(fx2x2_t &A, fx2x2_t &C, fx2x2_t &B) { #pragma clang fp contract(fast) C = A*B + C; } ``` Reviewed By: fhahn, mibintc Differential Revision: https://reviews.llvm.org/D100834	2021-04-22 11:45:34 +01:00
Nemanja Ivanovic	3bcd0ece43	[PowerPC] Add vec_roundc as alias for vec_rint in altivec.h For compatibility with XLC, add these overloads.	2021-04-22 05:31:38 -05:00
Jan Svoboda	398f253400	[clang][deps] Include "-cc1" in the arguments To simplify tools consuming dependency scanning results, prepend the "-cc1" argument by default. Reviewed By: Bigcheese Differential Revision: https://reviews.llvm.org/D100942	2021-04-22 10:57:56 +02:00
Pushpinder Singh	722d4d8e75	[AMDGPU][OpenMP] Add amdgpu-arch tool to list AMD GPUs installed This patch adds new clang tool named amdgpu-arch which uses HSA to detect installed AMDGPU and report back latter's march. This tool is built only if system has HSA installed. The value printed by amdgpu-arch is used to fill -march when latter is not explicitly provided in -Xopenmp-target. Reviewed By: JonChesterfield, gregrodgers Differential Revision: https://reviews.llvm.org/D99949	2021-04-22 05:20:28 +00:00
Chen Zheng	26f138eed4	[Debug-Info] implement -gstrict-dwarf This patch implements -gstrict-dwarf option in clang FE. Reviewed By: dblaikie, probinson, aprantl Differential Revision: https://reviews.llvm.org/D100809	2021-04-22 00:41:25 -04:00
Giorgis Georgakoudis	a2dbfb6b72	[OpenMP] Simplify offloading parallel call codegen This revision simplifies Clang codegen for parallel regions in OpenMP GPU target offloading and corresponding changes in libomptarget: SPMD/non-SPMD parallel calls are unified under a single `kmpc_parallel_51` runtime entry point for parallel regions (which will be commonized between target, host-side parallel regions), data sharing is internalized to the runtime. Tests have been auto-generated using `update_cc_test_checks.py`. Also, the revision contains changes to OpenMPOpt for remark creation on target offloading regions. Reviewed By: jdoerfert, Meinersbur Differential Revision: https://reviews.llvm.org/D95976	2021-04-21 18:46:07 -07:00
Fangrui Song	77ac823fd2	Delete le32/le64 targets They are unused now. Note: NaCl is still used and is currently expected to be needed until 2022-06 (https://blog.chromium.org/2020/08/changes-to-chrome-app-support-timeline.html). Differential Revision: https://reviews.llvm.org/D100981	2021-04-21 18:44:12 -07:00
Fangrui Song	775a9483e5	[IR][sanitizer] Set nounwind on module ctor/dtor, additionally set uwtable if -fasynchronous-unwind-tables On ELF targets, if a function has uwtable or personality, or does not have nounwind (`needsUnwindTableEntry`), it marks that `.eh_frame` is needed in the module. Then, a function gets `.eh_frame` if `needsUnwindTableEntry` or `-g[123]` is specified. (i.e. If -g[123], every function gets `.eh_frame`. This behavior is strange but that is the status quo on GCC and Clang.) Let's take asan as an example. Other sanitizers are similar. `asan.module_[cd]tor` has no attribute. `needsUnwindTableEntry` returns true, so every function gets `.eh_frame` if `-g[123]` is specified. This is the root cause that `-fno-exceptions -fno-asynchronous-unwind-tables -g` produces .debug_frame while `-fno-exceptions -fno-asynchronous-unwind-tables -g -fsanitize=address` produces .eh_frame. This patch * sets the nounwind attribute on sanitizer module ctor/dtor. * let Clang emit a module flag metadata "uwtable" for -fasynchronous-unwind-tables. If "uwtable" is set, sanitizer module ctor/dtor additionally get the uwtable attribute. The "uwtable" mechanism is generic: synthesized functions not cloned/specialized from existing ones should consider `Function::createWithDefaultAttr` instead of `Function::create` if they want to get some default attributes which have more of module semantics. Other candidates: "frame-pointer" (https://github.com/ClangBuiltLinux/linux/issues/955 https://github.com/ClangBuiltLinux/linux/issues/1238), dso_local, etc. Differential Revision: https://reviews.llvm.org/D100251	2021-04-21 15:58:20 -07:00
Petr Hosek	f749550cfe	[libcxx] Stop using c++ subdirectory for libc++ library The new layout more closely matches the layout used by other compilers. This is only used when LLVM_ENABLE_PER_TARGET_RUNTIME_DIR is enabled. Differential Revision: https://reviews.llvm.org/D100869	2021-04-21 15:39:03 -07:00
Jonas Devlieghere	6331680ad2	Re-land "[Driver] Support default libc++ library location on Darwin" This reverts commit `05eeed9691` after fixing the impacted lldb tests in `5d1c43f333`. [Driver] Support default libc++ library location on Darwin Darwin driver currently uses libc++ headers that are part of Clang toolchain when available (by default ../include/c++/v1 relative to executable), but it completely ignores the libc++ library itself because it doesn't pass the location of libc++ library that's part of Clang (by default ../lib relative to the executable) to the linker, always using the system copy of libc++. This may lead to subtle issues when the compilation fails because the headers that are part of Clang toolchain are incompatible with the system library. Either the driver should ignore both headers as well as the library, or it should always try to use both when available. This patch changes the driver behavior to do the latter, which seems more reasonable: it makes it easy to test and use custom libc++ build on Darwin while still allowing the use of system version. This also matches the Clang driver behavior on other systems. Differential Revision: https://reviews.llvm.org/D45639	2021-04-21 14:22:13 -07:00
Yaxun (Sam) Liu	5a2d78b163	[HIP] Add option -fgpu-inline-threshold Add option -fgpu-inline-threshold for inline threshold for device compilation only. Reviewed by: Artem Belevich Differential Revision: https://reviews.llvm.org/D99233	2021-04-21 17:18:18 -04:00
Alexey Bataev	079884225a	[OPENMP]Fix PR49698: OpenMP declare mapper causes segmentation fault. The implicitly generated mappings for allocation/deallocation in mappers runtime should be mapped as implicit, also no need to clear member_of flag to avoid ref counter increment. Also, the ref counter should not be incremented for the very first element that comes from the mapper function. Differential Revision: https://reviews.llvm.org/D100673	2021-04-21 10:38:31 -07:00
Victor Campos	ee3e01627f	[Clang][ARM] Define __VFP_FP__ macro unconditionally Clang only defines __VFP_FP__ when the FPU is enabled. However, gcc defines it unconditionally. This patch aligns Clang with gcc. Reviewed By: peter.smith, rengolin Differential Revision: https://reviews.llvm.org/D100372	2021-04-21 15:03:59 +01:00
LemonBoy	8e336511f1	[clang] Recognize ppc32 as valid mcpu value The `ppc32` cpu model was introduced a while ago in `a9321059b9` as an independent copy of the `ppc` one but was never wired into clang. Reviewed By: #powerpc, nemanjai Differential Revision: https://reviews.llvm.org/D100933	2021-04-21 15:11:01 +02:00
John Brawn	5f02c308e6	Fix clang/test/Frontend/plugin-annotate-functions.c FileCheck now gives an error when there's a check for an undefined variable, which this test does in one of its NOT checks. Fix this by being a bit looser in what the test checks.	2021-04-21 14:00:38 +01:00
Sylvain Audi	8c16c8b7ef	Reland "[clang-scan-deps] Add support for clang-cl" This reverts commit `199c397482`. This time, clang-scan-deps's search for output argument in clang-cl command line will now ignore arguments preceded by "-Xclang". That way, it won't detect a /o argument in "-Xclang -ivfsoverlay -Xclang /opt/subpath" Initial patch description: clang-scan-deps contains some command line parsing and modifications. This patch adds support for clang-cl command options. Differential Revision: https://reviews.llvm.org/D92191	2021-04-21 07:56:39 -04:00
Sven van Haastregt	e2b3b89bf1	[OpenCL] Do not add builtins with unavailable types Add functionality to assign extensions to types in OpenCLBuiltins.td and use that information to filter candidates that should not be exposed if a type is not available. Differential Revision: https://reviews.llvm.org/D100209	2021-04-21 11:59:29 +01:00
Martin Storsjö	174e796c7d	[llvm-rc] Fix a new test to disambiguate macOS paths like /Users/... from options starting with a slash This should fix test failures on macOS.	2021-04-21 13:34:33 +03:00
Simon Tatham	77e170db86	[ARM][Driver][Windows] Allow command-line upgrade to Armv8. If you gave clang the options `--target=arm-pc-windows-msvc` and `-march=armv8-a+crypto` together, the crypto extension would not be enabled in the compilation, and you'd see the following warning message suggesting that the 'armv8-a' had been ignored: clang: warning: ignoring extension 'crypto' because the 'armv7-a' architecture does not support it [-Winvalid-command-line-argument] This happens because Triple::getARMCPUForArch(), for the Win32 OS, unconditionally returns "cortex-a9" (an Armv7 CPU) regardless of MArch, which overrides the architecture setting on the command line. I don't think that the combination of Windows and AArch32 _should_ unconditionally outlaw the use of the crypto extension. MSVC itself doesn't think so: you can perfectly well compile Thumb crypto code using its AArch32-targeted compiler. All the other default CPUs in the same switch statement are conditional on a particular MArch setting; this is the only one that returns a particular CPU _regardless_ of MArch. So I've fixed this one by adding a condition, so that if you ask for an architecture above v7, the default of Cortex-A9 no longer overrides it. Reviewed By: mstorsjo Differential Revision: https://reviews.llvm.org/D100937	2021-04-21 11:20:05 +01:00
Martin Storsjö	066b8f2fc6	[llvm-rc] Try to fix the Preprocessor/llvm-rc.rc test on non arm/x86 architectures When llvm-rc invokes clang for preprocessing, it uses a target triple derived from the default target. The test verifies that e.g. _WIN32 is defined when preprocessing. If running clang with e.g. -target ppc64le-windows-msvc, that particular arch/OS combination isn't hooked up, so _WIN32 doesn't get defined in that configuration. Therefore, the preprocessing test fails. Instead make llvm-rc inspect the architecture of the default target. If it's one of the known supported architectures, use it as such, otherwise set a default one (x86_64). (Clang can run preprocessing with an x86_64 target triple, even if the x86 backend isn't enabled.) Also remove superfluous llvm:: specifications on enums in llvm-rc.cpp.	2021-04-21 12:47:33 +03:00
Martin Storsjö	64bc44f5dd	[llvm-rc] Run clang to preprocess input files Allow opting out from preprocessing with a command line argument. Update tests to pass -no-preprocess to make it not try to use clang (which isn't a build level dependency of llvm-rc), but add a test that does preprocessing under clang/test/Preprocessor. Update a few options to allow them both joined (as -DFOO) and separate (-D BR), as rc.exe allows both forms of them. With the verbose flag set, this prints the preprocessing command used (which differs from what rc.exe does). Tests under llvm/test/tools/llvm-rc only test constructing the preprocessor commands, while tests under clang/test/Preprocessor test actually running the preprocessor. Differential Revision: https://reviews.llvm.org/D100755	2021-04-21 11:50:10 +03:00
ShihPo Hung	11072a0bdb	[RISCV][Clang] Add RVV AMO builtins Add vamo[swap/add/xor/and/or/min/max/minu/maxu] builtins. Reviewed By: khchen Differential Revision: https://reviews.llvm.org/D100448	2021-04-21 01:48:02 -07:00
Pushpinder Singh	0ad50bf27f	Revert "[AMDGPU][OpenMP] Add amdgpu-arch tool to list AMD GPUs installed" This reverts commit `3194761d27`.	2021-04-21 08:05:38 +00:00
Pushpinder Singh	3194761d27	[AMDGPU][OpenMP] Add amdgpu-arch tool to list AMD GPUs installed This patch adds new clang tool named amdgpu-arch which uses HSA to detect installed AMDGPU and report back latter's march. This tool is built only if system has HSA installed. The value printed by amdgpu-arch is used to fill -march when latter is not explicitly provided in -Xopenmp-target. Reviewed By: JonChesterfield, gregrodgers Differential Revision: https://reviews.llvm.org/D99949	2021-04-21 05:05:49 +00:00
Jonas Devlieghere	05eeed9691	Revert "[Driver] Support default libc++ library location on Darwin" This reverts the following commits because it breaks TestAppleSimulatorOSType.py on GreenDragon [1]. `caff17e503` `f5efe0aa04` `ae8b2cab67` [1] http://green.lab.llvm.org/green/view/LLDB/job/lldb-cmake/31346/	2021-04-20 20:42:50 -07:00
Liu, Chen3	72e4bf12ee	[X86] Support some missing intrinsics Support for _mm512_i32logather_pd, _mm512_mask_i32logather_pd, _mm512_i32logather_epi64, _mm512_mask_i32logather_epi64, _mm512_i32loscatter_pd, _mm512_mask_i32loscatter_pd, _mm512_i32loscatter_epi64, _mm512_mask_i32loscatter_epi64. Differential Revision: https://reviews.llvm.org/D100368	2021-04-21 10:50:37 +08:00
Dávid Bolvanský	9f1e2ee462	[Clang, builtins] Added aligned_alloc, memalign support	2021-04-21 00:11:54 +02:00
Petr Hosek	caff17e503	[Driver] Don't use capture for InstalledDir This is another attempt to address the issue introduced in `ae8b2cab67`. We cannot capture InstalledDir because FileCheck doesn't handle the backslashes correctly, so instead we just consume the entire path prefix which is what other tests are doing.	2021-04-20 13:43:56 -07:00
Petr Hosek	f5efe0aa04	[Driver] Support both slashes This addresses Windows breakage introduced by `ae8b2cab67`.	2021-04-20 13:25:38 -07:00
Petr Hosek	ae8b2cab67	[Driver] Support default libc++ library location on Darwin Darwin driver currently uses libc++ headers that are part of Clang toolchain when available (by default ../include/c++/v1 relative to executable), but it completely ignores the libc++ library itself because it doesn't pass the location of libc++ library that's part of Clang (by default ../lib relative to the executable) to the linker, always using the system copy of libc++. This may lead to subtle issues when the compilation fails because the headers that are part of Clang toolchain are incompatible with the system library. Either the driver should ignore both headers as well as the library, or it should always try to use both when available. This patch changes the driver behavior to do the latter, which seems more reasonable: it makes it easy to test and use custom libc++ build on Darwin while still allowing the use of system version. This also matches the Clang driver behavior on other systems. Differential Revision: https://reviews.llvm.org/D45639	2021-04-20 12:30:35 -07:00
Ahmed Bougacha	cedb5b06df	[AArch64] Don't always override CPU for arm64e. This demotes the apple-a12 CPU selection for arm64e to just be the last-resort default. Concretely, this means: - an explicitly-specified -mcpu will override the arm64e default; a user could potentially pick an invalid CPU that doesn't have v8.3a support, but that's not a major problem anymore - arm64e-apple-macos (and variants) will pick apple-m1 instead of being forced to apple-a12.	2021-04-20 08:41:04 -07:00
Ahmed Bougacha	a8a3a43792	[AArch64] Add apple-m1 CPU, and default to it for macOS. apple-m1 has the same level of ISA support as apple-a14, so this is a straightforward mechanical change. However, that also means this inherits apple-a14's v8.5a+nobti quirkiness. rdar://68287159	2021-04-20 08:41:04 -07:00
Erich Keane	0ed613612c	Ensure target-multiversioning emits deferred declarations As reported in PR50025, sometimes we would end up not emitting functions needed by inline multiversioned variants. This is because we typically use the 'deferred decl' mechanism to emit these. However, the variants are emitted after that typically happens. This fixes that by ensuring we re-run deferred decls after this happens. Also, the multiversion emission is done recursively to ensure that MV functions that require other MV functions to be emitted get emitted.	2021-04-20 08:10:26 -07:00
Thomas Preud'homme	8cee150e9a	[C++, test] Fix typo in NSS* vars The NSS FileCheck variables at the end of the CodeGenCXX/split-stacks.cpp clang testcase are off by 1, resulting in the use of an undefined variable (NSS3). One of the CHECK-NOT is also redundant because _Z8tnosplitIiEiv uses the same attribute as _Z3foov without split stack. This commit fixes that. Reviewed By: ChuanqiXu Differential Revision: https://reviews.llvm.org/D99839	2021-04-20 12:07:41 +01:00
Serguei Katkov	70193bdfc0	Re-land [GreedyRA ORE] Add Cost of spill locations into remark Re-land the patch with a fix of clang test. The cost of a spill location is computed based on the relative branch frequency where the corresponding spill/reload/copy is located. While the number itself depends heavily on the incoming IR, the total cost can be used when making changes in RA. Revert "Revert "[GreedyRA ORE] Add Cost of spill locations into remark"" This reverts commit `680f3d6de7`.	2021-04-20 16:21:07 +07:00
Alexandre Ganea	199c397482	Revert "[clang-scan-deps] Add support for clang-cl" This reverts commit `bb26fa8c28`.	2021-04-19 17:45:18 -04:00
Yaxun (Sam) Liu	d8805574c1	[CUDA][HIP] Allow non-ODR use of host var in device Reviewed by: Artem Belevich, Richard Smith Differential Revision: https://reviews.llvm.org/D98193	2021-04-19 14:45:24 -04:00
Wael Yehia	369c0e0f48	[AIX] Diagnose thinLTO usage in clang on AIX. Reviewed By: Xiangling Liao Differential Revision: https://reviews.llvm.org/D100350	2021-04-19 16:39:48 +00:00
Hans Wennborg	aa1e391402	Fix test/Frontend/print-header-includes.c I managed to run the wrong tests before committing D100759.	2021-04-19 15:39:09 +02:00
Jan Svoboda	0a92e09c07	[clang][deps] Generate the full command-line for modules This patch uses the new `CompilerInvocation::generateCC1CommandLine` to generate the full canonical command line for modular dependencies, instead of only appending additional arguments. Reviewed By: dexonsmith Differential Revision: https://reviews.llvm.org/D100534	2021-04-19 14:32:49 +02:00
Timm Bäder	320311a01b	[clang][parser] Unify rejecting (non) decl stmts with gnu attributes The comment here was introduced in `a3e01cf822` and suggests that we should handle declaration statements and non-declaration statements the same, but don't because ProhibitAttributes() can't handle GNU attributes. That has recently changed, so remove the comment and handle all statements the same. Differential Revision: https://reviews.llvm.org/D99936	2021-04-19 12:43:55 +02:00
Jan Svoboda	2b73565210	[clang][deps] Remove the -full-command-line flag This patch removes the `-full-command-line` option from `clang-scan-deps`. It's only used with `-format=experimental-full`, where omitting the command lines doesn't make much sense. There are no tests without `-full-command-line`. Depends on D100531. Reviewed By: Bigcheese Differential Revision: https://reviews.llvm.org/D100533	2021-04-19 12:28:02 +02:00
Jan Svoboda	32219c8c44	[clang][deps] Simplify function discovering .pcm and .modulemap files This patch simplifies (and renames) the `appendCommonModuleArguments` function. It no longer tries to construct the command line for explicitly building modules. Instead, it only performs the DFS traversal of modular dependencies and queries the callbacks to collect paths to `.pcm` and `.modulemap` files. This makes it more flexible and usable in two contexts: * Generating additional command line arguments for the main TU in modular build. The `std::vector<std::string>` output parameters can be used to manually generate appropriate command line flags. * Generate full command line for a module. The output parameters can be the corresponding parts of `CompilerInvocation`. (In a follow-up patch.) Reviewed By: dexonsmith Differential Revision: https://reviews.llvm.org/D100531	2021-04-19 12:11:39 +02:00
ShihPo Hung	27edaee84e	[RISCV][Driver] Make the ordering of CmdArgs consistent between RISCV::Linker and baremetal::Linker In baremetal::Linker::ConstructJob, LinkerInput is handled prior to T_Group options, but on the other side in RISCV::Linker::ConstructJob, it is opposite. We want it to be consistent whether users are using RISCV::Linker or baremetal::Linker. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D100615	2021-04-18 19:05:20 -07:00
Xun Li	5faba87938	Revert "[Coroutines] Set presplit attribute in Clang instead of CoroEarly pass" This reverts commit `fa6b54c44a`. The committed patch broke mlir tests. It seems that mlir tests depend on coroutine function properties set in CoroEarly pass.	2021-04-18 17:22:28 -07:00
Xun Li	fa6b54c44a	[Coroutines] Set presplit attribute in Clang instead of CoroEarly pass Presplit coroutines cannot be inlined. During AlwaysInliner we check if a function is a presplit coroutine, if so we skip inlining. The presplit coroutine attributes are set in CoroEarly pass. However in O0 pipeline, AlwaysInliner runs before CoroEarly, so the attribute isn't set yet and will still inline the coroutine. This causes Clang to crash: https://bugs.llvm.org/show_bug.cgi?id=49920 To fix this, we set the attributes in the Clang front-end instead of in CoroEarly pass. Reviewed By: rjmccall, ChuanqiXu Differential Revision: https://reviews.llvm.org/D100282	2021-04-18 15:41:09 -07:00
Xun Li	c0211e8d7d	Revert "[Coroutines] Move CoroEarly pass to before AlwaysInliner" This reverts commit `2b50f5a434`. Forgot to update the description of the commit to sync with phabricator. Going to redo the commit.	2021-04-18 15:38:19 -07:00
Xun Li	2b50f5a434	[Coroutines] Move CoroEarly pass to before AlwaysInliner Presplit coroutines cannot be inlined. During AlwaysInliner we check if a function is a presplit coroutine, if so we skip inlining. The presplit coroutine attributes are set in CoroEarly pass. However in O0 pipeline, AlwaysInliner runs before CoroEarly, so the attribute isn't set yet and will still inline the coroutine. This causes Clang to crash: https://bugs.llvm.org/show_bug.cgi?id=49920 Differential Revision: https://reviews.llvm.org/D100282	2021-04-18 14:54:04 -07:00
Sylvain Audi	bb26fa8c28	[clang-scan-deps] Add support for clang-cl clang-scan-deps contains some command line parsing and modifications. This patch adds support for clang-cl command options. Differential Revision: https://reviews.llvm.org/D92191	2021-04-17 14:22:51 -04:00
Sylvain Audi	488a19d00c	[clang-scan-deps] Support double-dashes in clang command lines This fixes argument injection in clang command lines, by adding them before "--". Previously, the arguments were injected at the end of the command line and could be added after "--", which would be wrongly interpreted as input file paths. This fix is needed for a subsequent patch, see D92191. Differential Revision: https://reviews.llvm.org/D95099	2021-04-17 14:22:51 -04:00
Yaxun (Sam) Liu	6823af0ca8	[HIP] Support hipRTC in header hipRTC compiles HIP device code at run time. Since the system may not have development tools installed, when a HIP program is compiled through hipRTC, there is no standard C or C++ header available. As such, the HIP headers should not depend on standard C or C++ headers when used with hipRTC. Basically when hipRTC is used, HIP headers only provide definitions of HIP device API functions. This is in line with what nvRTC does. This patch adds support of hipRTC to HIP headers in clang. Basically hipRTC defines a macro __HIPCC_RTC__ when compiling HIP code at run time. When this macro is defined, HIP headers do not include standard C/C++ headers. Reviewed by: Artem Belevich Differential Revision: https://reviews.llvm.org/D100652	2021-04-17 11:34:52 -04:00
Dávid Bolvanský	12a1f1d9d7	[Pragma] Added support for GCC unroll/nounroll GCC 8 introduced these new pragmas to control loop unrolling. We should support them for compatibility reasons and the implementation itself requires few lines of code, since everything needed is already implemented for #pragma unroll/nounroll.	2021-04-17 17:29:55 +02:00
Yaxun (Sam) Liu	d5c0f00e21	[CUDA][HIP] Mark device var used by host only Add device variables to llvm.compiler.used if they are ODR-used by either host or device functions. This is necessary to prevent them from being eliminated by whole-program optimization where the compiler has no way to know a device variable is used by some host code. Reviewed by: Artem Belevich Differential Revision: https://reviews.llvm.org/D98814	2021-04-17 11:25:25 -04:00
Yaxun (Sam) Liu	3597f02fd5	[AMDGPU] Add GlobalDCE before internalization pass The internalization pass only internalizes global variables with no users. If the global variable has some dead user, the internalization pass will not internalize it. To be able to internalize global variables with dead users, a global dce pass is needed before the internalization pass. This patch adds that. Reviewed by: Artem Belevich, Matt Arsenault Differential Revision: https://reviews.llvm.org/D98783	2021-04-17 11:25:25 -04:00
Ben Barham	1206b95e07	[ASTReader] Only mark module out of date if not already compiled If a module contains errors (ie. it was built with -fallow-pcm-with-compiler-errors and had errors) and was from the module cache, it is marked as out of date - see `a2c1054c30`. When a module is imported multiple times in the one compile, this caused it to be recompiled each time - removing the existing buffer from the module cache and replacing it. This results in various errors further down the line. Instead, only mark the module as out of date if it isn't already finalized in the module cache. Reviewed By: akyrtzi Differential Revision: https://reviews.llvm.org/D100619	2021-04-16 17:57:03 -07:00
Philip Reames	f549176ad9	[funcattrs] Add the maximal set of implied attributes to definitions Have funcattrs expand all implied attributes into the IR. This expands the infrastructure from D100400, but for definitions not declarations this time. Somewhat subtly, this mostly isn't semantic. Because the accessors did the inference, any client which used the accessor was already getting the stronger result. Clients that directly checked presence of attributes (there are some), will see a stronger result now. The old behavior can end up quite confusing for two reasons: * Without this change, we have situations where function-attrs appears to fail when inferring an attribute (as seen by a human reading IR), but that consuming code will see that it should have been implied. As a human trying to sanity check test results and study IR for optimization possibilities, this is exceedingly error-prone and confusing. (I'll note that I wasted several hours recently because of this.) * We can have transforms which trigger without the IR appearing (on inspection) to meet the preconditions. This change doesn't prevent this from happening (as the accessors still involve multiple checks), but it should make it less frequent. I'd argue in favor of deleting the extra checks out of the accessors after this lands, but I want that in its own review as a) it's purely stylistic, and b) I already know there's some disagreement. Once this lands, I'm also going to do a cleanup change which will delete some now redundant duplicate predicates in the inference code, but again, that deserves to be a change of its own. Differential Revision: https://reviews.llvm.org/D100226	2021-04-16 14:22:19 -07:00
Thomas Lively	5c729750a6	[WebAssembly] Remove saturating fp-to-int target intrinsics Use the target-independent @llvm.fptosi and @llvm.fptoui intrinsics instead. This includes removing the intrinsics for i32x4.trunc_sat_zero_f64x2_{s,u}, which are now represented in IR as a saturating truncation to a v2i32 followed by a concatenation with a zero vector. Differential Revision: https://reviews.llvm.org/D100596	2021-04-16 12:11:20 -07:00
Dávid Bolvanský	0daf273025	[Builtins] Add memory allocation builtins (PR12543)	2021-04-16 20:36:46 +02:00
Artem Belevich	eaa9ef075d	[CUDA, FDO] Filter out profiling options from GPU-side compilations. Differential Revision: https://reviews.llvm.org/D100598	2021-04-16 11:35:28 -07:00
Zakk Chen	8f683366af	[RISCV][Clang] Add RVV miscellaneous intrinsic functions. 1. vreinterpret 2. vundefined 3. LMUL truncation and extension. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D100391	2021-04-16 09:41:19 -07:00
Zakk Chen	ca9e52f67c	[RISCV][Clang] Drop the assembly tests for RVV intrinsics. We had verified the correctness of all intrinsics in downstream, so dropping the assembly tests to decrease the check-clang time. It would remove 1/3 of the RUN lines. https://reviews.llvm.org/D99151#2654154 mentions why we need to have the ASM tests before. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D100617	2021-04-16 09:30:12 -07:00
Troy Johnson	8628ed0310	[Driver] Allow both lib64 and lib in rocm-detect test. Differential Revision: https://reviews.llvm.org/D100502	2021-04-16 09:55:57 -05:00
Alexey Bataev	10c7b9f64f	[OPENMP]Fix PR49115: Incorrect results for scan directive. For combined worksharing directives need to emit the temp arrays outside of the parallel region and update them in the master thread only. Differential Revision: https://reviews.llvm.org/D100187	2021-04-16 06:25:35 -07:00
Pushpinder Singh	efc013ec4d	Revert "[AMDGPU][OpenMP] Add amdgpu-arch tool to list AMD GPUs installed" This reverts commit `7029cffc4e`.	2021-04-16 09:16:58 +00:00
Pushpinder Singh	7029cffc4e	[AMDGPU][OpenMP] Add amdgpu-arch tool to list AMD GPUs installed This patch adds new clang tool named amdgpu-arch which uses HSA to detect installed AMDGPU and report back latter's march. This tool is built only if system has HSA installed. The value printed by amdgpu-arch is used to fill -march when latter is not explicitly provided in -Xopenmp-target. Reviewed By: JonChesterfield, gregrodgers Differential Revision: https://reviews.llvm.org/D99949	2021-04-16 05:26:20 +00:00
Richard Smith	f7c9de0de5	Add triple to fix test failure. This test uses `__regcall`, support for which is target-specific.	2021-04-15 18:08:35 -07:00
Joshua Haberman	8344675908	Implemented [[clang::musttail]] attribute for guaranteed tail calls. This is a Clang-only change and depends on the existing "musttail" support already implemented in LLVM. The [[clang::musttail]] attribute goes on a return statement, not a function definition. There are several constraints that the user must follow when using [[clang::musttail]], and these constraints are verified by Sema. Tail calls are supported on regular function calls, calls through a function pointer, member function calls, and even pointer to member. Future work would be to throw a warning if a user tries to pass a pointer or reference to a local variable through a musttail call. Reviewed By: rsmith Differential Revision: https://reviews.llvm.org/D99517	2021-04-15 17:12:21 -07:00
Momchil Velikov	f9d932e673	[clang][AArch64] Correctly align HFA arguments when passed on the stack When we pass an AArch64 Homogeneous Floating-Point Aggregate (HFA) argument with increased alignment requirements, for example struct S { __attribute__ ((__aligned__(16))) double v[4]; }; Clang uses `[4 x double]` for the parameter, which is passed on the stack at alignment 8, whereas it should be at alignment 16, following Rule C.4 in AAPCS (https://github.com/ARM-software/abi-aa/blob/master/aapcs64/aapcs64.rst#642parameter-passing-rules) Currently we don't have a way to express in LLVM IR the alignment requirements of the function arguments. The align attribute is applicable to pointers only, and only for some special ways of passing arguments (e.g. byval). When implementing AAPCS32/AAPCS64, clang resorts to dubious hacks of coercing to types, which naturally have the needed alignment. We don't have enough types to cover all the cases, though. This patch introduces a new use of the stackalign attribute to control stack slot alignment, when and if an argument is passed in memory. The attribute align is left as an optimizer hint - it still applies to pointer types only and pertains to the content of the pointer, whereas the alignment of the pointer itself is determined by the stackalign attribute. For byval arguments, the stackalign attribute assumes the role previously performed by align, falling back to align if `stackalign` is absent. On the clang side, when passing arguments using the "direct" style (cf. `ABIArgInfo::Kind`), now we can optionally specify an alignment, which is emitted as the new `stackalign` attribute. Patch by Momchil Velikov and Lucas Prates. Differential Revision: https://reviews.llvm.org/D98794	2021-04-15 22:58:14 +01:00
Martin Storsjö	8e0f2e89ff	[clang] [AArch64] Fix handling of HFAs passed to Windows variadic functions The documentation says that for variadic functions, all composites are treated similarly, no special handling of HFAs/HVAs, not even for the fixed arguments of a variadic function. Differential Revision: https://reviews.llvm.org/D100467	2021-04-15 22:21:27 +03:00
cchen	e0c2125d1d	[OpenMP] Added codegen for masked directive Reviewed By: ABataev Differential Revision: https://reviews.llvm.org/D100514	2021-04-15 12:55:07 -05:00
Melanie Blower	938b863bb5	[clang][patch] Modify diagnostic level from err to warn: anyx86_interrupt_regsave Reviewed By: Aaron Ballman Differential Revision: https://reviews.llvm.org/D100511	2021-04-15 13:11:33 -04:00
Arthur Eubanks	c8f0a7c215	[NewPM] Cleanup IR printing instrumentation Being lazy with printing the banner seems hard to reason with, we should print it unconditionally first (it could also lead to duplicate banners if we have multiple functions in -filter-print-funcs). The printIR() functions were doing too many things. I separated out the call from PrintPassInstrumentation since we were essentially doing two completely separate things in printIR() from different callers. There were multiple ways to generate the name of some IR. That's all been moved to getIRName(). The printing of the IR name was also inconsistent, now it's always "IR Dump on $foo" where "$foo" is the name. For a function, it's the function name. For a loop, it's what's printed by Loop::print(), which is more detailed. For an SCC, it's the list of functions in parentheses. For a module it's "[module]", to differentiate between a possible SCC with a function called "module". To preserve D74814, we have to check if we're going to print anything at all first. This is unfortunate, but I would consider this a special case that shouldn't be handled in the core logic. Reviewed By: jamieschmeiser Differential Revision: https://reviews.llvm.org/D100231	2021-04-15 09:50:55 -07:00
Mark Johnston	99eca1bd9c	[Driver] Enable kernel address and memory sanitizers on FreeBSD Test Plan: using kernel ASAN and MSAN implementations in FreeBSD Reviewed By: emaste, dim, arichardson Differential Revision: https://reviews.llvm.org/D98286	2021-04-15 17:49:00 +01:00
Aaron Ballman	ad2d6bbb14	Fix potential infinite loop with malformed attribute syntax Double square bracket attribute arguments can be arbitrarily complex, and the attribute argument parsing logic recovers by skipping tokens. As a fallback recovery mechanism, parse recovery stops before reading a semicolon. This could lead to an infinite loop in the attribute list parsing logic.	2021-04-15 10:47:32 -04:00
Matthias Klose	56cb214b38	add test case for ignoring -flto=auto and -flto=jobserver as requested in https://reviews.llvm.org/D99501, test that the two new options are ignored. Reviewed By: tejohnson, fhahn Differential Revision: https://reviews.llvm.org/D100484	2021-04-15 12:19:14 +02:00
Martin Storsjö	ee570e2153	[clang] [test] Share patterns in CodeGen/ms_abi_aarch64.c between cases. NFC. Differential Revision: https://reviews.llvm.org/D100468	2021-04-15 11:02:14 +03:00
Zakk Chen	ea5d33dbc1	[RISCV][Clang] Add vmv and vfmv series intrinsic functions. Authored-by: Roger Ferrer Ibanez <rofirrim@gmail.com> Co-Authored-by: Zakk Chen <zakk.chen@sifive.com> Reviewed By: craig.topper, Jim Differential Revision: https://reviews.llvm.org/D100266	2021-04-14 22:22:39 -07:00
Eli Friedman	dc1ab590a0	[Sema] Fold VLA types in compound literals to constant arrays. Similar to variables with an initializer, this is never valid in standard C, so we can safely constant-fold as an extension. I ran into this construct in a couple proprietary codebases. While I'm here, drive-by fix for 090dd647: we should only fold variables with VLA types, not arbitrary variably modified types. Differential Revision: https://reviews.llvm.org/D98363	2021-04-14 17:09:59 -07:00
Philip Reames	dd985551c2	Reapply "[InferAttributes] Materialize all infered attributes for declaration"" and follow on patches. This reverts commit `ab98f2c712` and `98eea392cd`. It includes a fix for the clang test which triggered the revert. I failed to notice this one because there was another AMDGPU llvm test with a similar name and the exact same text in the error message. Odd. Since only one build bot reported the clang test, I didn't notice that one.	2021-04-14 16:38:07 -07:00
Thomas Lively	6a18cc23ef	[WebAssembly] Codegen for i64x2.extend_{low,high}_i32x4_{s,u} Removes the builtins and intrinsics used to opt in to using these instructions and replaces them with normal ISel patterns now that they are no longer prototypes. Differential Revision: https://reviews.llvm.org/D100402	2021-04-14 13:43:09 -07:00
Sterling Augustine	d2bb3cbbf8	Make test runnable on read-only file systems.	2021-04-14 13:29:51 -07:00
Alex Lorenz	c1554f32e3	[clang][FileManager] Support empty file name in getVirtualFileRef for serialized diagnostics After https://reviews.llvm.org/D90484 libclang is unable to read a serialized diagnostic file which contains a diagnostic which came from a file with an empty filename. The reason is that the serialized diagnostic reader creates a virtual file for the "" filename, which now fails after the changes in https://reviews.llvm.org/D90484. This patch restores the previous behavior in getVirtualFileRef by allowing it to construct a file entry ref with an empty name by pretending its name is "." so that the directory entry can be created. Differential Revision: https://reviews.llvm.org/D100428	2021-04-14 11:29:25 -07:00
Thomas Lively	af7925b4dd	[WebAssembly] Codegen for f64x2.convert_low_i32x4_{s,u} Add a custom DAG combine and ISD opcode for detecting patterns like (uint_to_fp (extract_subvector ...)) before the extract_subvector is expanded to ensure that they will ultimately lower to f64x2.convert_low_i32x4_{s,u} instructions. Since these instructions are no longer prototypes and can now be produced via standard IR, this commit also removes the target intrinsics and builtins that had been used to prototype the instructions. Differential Revision: https://reviews.llvm.org/D100425	2021-04-14 10:42:45 -07:00
Thomas Lively	af7ab81ce3	[WebAssembly] Use standard intrinsics for f32x4 and f64x2 ops Now that these instructions are no longer prototypes, we do not need to be careful about keeping them opt-in and can use the standard LLVM infrastructure for them. This commit removes the bespoke intrinsics we were using to represent these operations in favor of the corresponding target-independent intrinsics. The clang builtins are preserved because there is no standard way to easily represent these operations in C/C++. For consistency with the scalar codegen in the Wasm backend, the intrinsic used to represent {f32x4,f64x2}.nearest is @llvm.nearbyint even though @llvm.roundeven better captures the semantics of the underlying Wasm instruction. Replacing our use of @llvm.nearbyint with use of @llvm.roundeven is left to a potential future patch. Differential Revision: https://reviews.llvm.org/D100411	2021-04-14 09:19:27 -07:00
Hans Wennborg	f29dcbdde1	Add flag for showing skipped headers in -H / --show-includes output Consider the following set of files: a.cc: #include "a.h" a.h: #ifndef A_H #define A_H #include "b.h" #include "c.h" // This gets "skipped". #endif b.h: #ifndef B_H #define B_H #include "c.h" #endif c.h: #ifndef C_H #define C_H void c(); #endif And the output of the -H option: $ clang -c -H a.cc . ./a.h .. ./b.h ... ./c.h Note that the include of c.h in a.h is not shown in the output (GCC does the same). This is because of the include guard optimization: clang knows c.h is covered by an include guard which is already defined, so when it sees the include in a.h, it skips it. The same would have happened if #pragma once were used instead of include guards. However, a.h does include c.h, and it may be useful to show that in the -H output. This patch adds a flag for doing that. Differential revision: https://reviews.llvm.org/D100480	2021-04-14 17:01:51 +02:00
Erich Keane	92aba5ae49	CPUDispatch- allow out of line member definitions ICC permits this, and after some extensive testing it looks like we can support this with very little trouble. We intentionally don't choose to do this with attribute-target (despite it likely working as well!) because GCC does not support that, and introducing said incompatibility doesn't seem worth it.	2021-04-14 06:19:49 -07:00
Martin Storsjö	3637c5c8ec	[clang] [AArch64] Fix Windows va_arg handling for larger structs Aggregate types over 16 bytes are passed by reference. Contrary to the x86_64 ABI, smaller structs with an odd (non power of two) size are padded and passed in registers. Differential Revision: https://reviews.llvm.org/D100374	2021-04-14 14:51:53 +03:00
Liu, Chen3	1c4108ab66	[i386] Modify the alignment of __m128/__m256/__m512 vector type according i386 abi. According to i386 System V ABI: 1. when __m256 are required to be passed on the stack, the stack pointer must be aligned on a 0 mod 32 byte boundary at the time of the call. 2. when __m512 are required to be passed on the stack, the stack pointer must be aligned on a 0 mod 64 byte boundary at the time of the call. Clang currently passes an __m512 parameter as follows: 1. when target supports avx512, passing it with 64 byte alignment; 2. when target supports avx, passing it with 32 byte alignment; 3. Otherwise, passing it with 16 byte alignment. An __m256 parameter is passed as follows: 1. when target supports avx or avx512, passing it with 32 byte alignment; 2. Otherwise, passing it with 16 byte alignment. This patch passes __m128/__m256/__m512 following the i386 System V ABI and applies this to Linux only, since other System V OSes (e.g. Darwin, PS4 and FreeBSD) don't want to spend any effort dealing with the ramifications of ABI breaks at present. Differential Revision: https://reviews.llvm.org/D78564	2021-04-14 16:44:54 +08:00
Anton Bikineev	69545154cc	[Sema] Move 'char-expression-as-unsigned < 0' into a separate diagnostic This change splits '-Wtautological-unsigned-zero-compare' by reporting char-expressions-interpreted-as-unsigned under a separate diagnostic '-Wtautological-unsigned-char-zero-compare'. This is beneficial for projects that want to enable '-Wtautological-unsigned-zero-compare' but at the same time want to keep code portable for platforms with char being signed or unsigned, such as Chromium. Differential Revision: https://reviews.llvm.org/D99808	2021-04-14 01:01:40 +02:00
Sander de Smalen	204aaf8795	[AArch64][SVE] Always use overloaded methods instead of preprocessor macro. This fixes a subtle issue where: svprf(pg, ptr, SV_ALL /*is sv_pattern instead of sv_prfop*/) would be quietly accepted. With this change, the function declaration guards that the third parameter is an `enum sv_prfop`. Previously `svprf` would map directly to `__builtin_sve_svprfb`, which accepts the enum operand as a signed integer and only checks that the incoming range is valid, meaning that SV_ALL would be discarded as being outside the valid immediate range, but would have allowed SV_VL1 without issuing a warning (C) or error (C++). Reviewed By: c-rhodes Differential Revision: https://reviews.llvm.org/D100297	2021-04-13 21:12:53 +01:00
Hana Dusíková	64c24f493e	Remove warning "suggest braces" for aggregate initialization of an empty class with an aggregate base class. I recently ran into issues with aggregates and inheritance; I'm using it for creating a type-safe library where most of the types are built over "tagged" std::array. After a bit of cleaning and enabling -Wall -Wextra -pedantic I noticed that only clang in my pipeline gives me a warning. After a bit of focusing on it I found it's not helpful, and contemplated disabling the warning altogether. After a discussion with other library authors I found it's bothering more people and decided to fix it. Removes this warning: template<typename T, int N> struct StdArray { T contents[N]; }; template<typename T, int N> struct AggregateAndEmpty : StdArray<T,N> { }; AggregateAndEmpty<int, 3> p = {1, 2, 3}; // <-- warning here about omitted braces	2021-04-13 15:45:09 -04:00
Aaron Ballman	c058a71227	Correct the tablegen for checking mutually exclusive stmt attrs The previous implementation was insufficient for checking statement attribute mutual exclusion because attributed statements do not collect their attributes one-at-a-time in the same way that declarations do. So the design that was attempting to check for mutual exclusion as each attribute was processed would not ever catch a mutual exclusion in a statement. This was missed due to insufficient test coverage, which has now been added for the [[likely]] and [[unlikely]] attributes. The new approach is to check all of the attributes that are to be applied to the attributed statement in a group. This required generating another DiagnoseMutualExclusions() function into AttrParsedAttrImpl.inc.	2021-04-13 15:20:30 -04:00
ThePhD	701d70d4c2	String Literal and Wide String Literal Encoding from the Preprocessor Adds the __clang_literal_encoding__ and __clang_wide_literal_encoding__ predefined macros to expose the encoding used for string literals to the preprocessor.	2021-04-13 14:18:07 -04:00
Aaron Ballman	62328f2f29	Implement WG21 P2156R1/WG14 N2557 on duplicate attributes These proposals make the same changes to both C++ and C and remove a restriction on standard attributes appearing multiple times in the same attribute list. We could warn on the duplicate attributes, but do not. This is for consistency as we do not warn on attributes duplicated within the attribute specifier sequence. If we want to warn on duplicated standard attributes, we should do so for both situations: [[foo, foo]] and [[foo]][[foo]].	2021-04-13 12:30:04 -04:00
Aaron Ballman	5ad15f4d1c	Require commas between double square bracket attributes. Clang currently has a bug where it allows you to write [[foo bar]] and both attributes are silently accepted. This patch corrects the comma parsing rules for such attributes and handles the test case fallout, as a few tests were accidentally doing this.	2021-04-13 06:43:01 -04:00
Ben Dunbobbin	eae2d4b852	[Windows Itanium][PS4] handle dllimport/export w.r.t vtables/rtti The existing Windows Itanium patches for dllimport/export behaviour w.r.t vtables/rtti can't be adopted for PS4 due to backwards compatibility reasons (see comments on https://reviews.llvm.org/D90299). This commit adds our PS4 scheme for this to Clang. Differential Revision: https://reviews.llvm.org/D93203	2021-04-13 11:41:10 +01:00
Sander de Smalen	fa936b610f	[AArch64][SVE] Fix dup/dupq intrinsics for C++. This patch changes the builtin prototype to use 'b' (boolean) instead of the default integer element type. That fixes the dup/dupq intrinsics when compiling with C++. This patch also fixes one of the defines for __ARM_FEATURE_SVE2_BITPERM. Reviewed By: kmclaughlin Differential Revision: https://reviews.llvm.org/D100294	2021-04-13 10:55:20 +01:00
Alexey Bader	95c614afcd	[NFC][SYCL] Drop idle triple component from regression tests.	2021-04-13 08:00:21 +03:00
Freddy Ye	3fc1fe8db8	[X86] Support -march=rocketlake Reviewed By: skan, craig.topper, MaskRay Differential Revision: https://reviews.llvm.org/D100085	2021-04-13 09:48:13 +08:00
Sanjay Patel	661cc71a1c	[PassManager][PhaseOrdering] lower expects before running simplifyCFG Retry of `330619a3a6` that includes a clang test update. Original commit message: If we run passes before lowering llvm.expect intrinsics to metadata, then those passes have no way to act on the hints provided by llvm.expect. SimplifyCFG is the known offender, and we made it smarter about profile metadata in D98898 <https://reviews.llvm.org/D98898>. In the motivating example from https://llvm.org/PR49336 , this means we were ignoring the recommended method for a programmer to tell the compiler that a compare+branch is expensive. This change appears to solve that case - the metadata survives to the backend, the compare order is as expected in IR, and the backend does not do anything to reverse it. We make the same change to the old pass manager to keep things synchronized. Differential Revision: https://reviews.llvm.org/D100213	2021-04-12 15:07:53 -04:00
Sean Perry	06c8b29d23	Enable creation of large response file on z/OS Most text processing commands (eg. grep, awk) have a maximum line length limit on z/OS. The current method of using cc -E & grep fails on z/OS because of this limit. I'm changing the command to create the long line in the response file to use python. This avoids the possibility of any tools blocking the generation of the large response file. This also eliminates the need for the extra file. Reviewed By: abhina.sreeskantharajan Differential Revision: https://reviews.llvm.org/D100197	2021-04-12 15:06:05 -04:00
Artem Belevich	38cf112a6b	Allow applying attributes to subset of allowed subjects. Differential Revision: https://reviews.llvm.org/D100136	2021-04-12 09:33:33 -07:00
Esme-Yi	dff922f39b	Reland [DebugInfo] Fix the mismatching between C++ language tags and Dwarf versions."" This reverts commit `c965e14a12`.	2021-04-12 11:05:55 +00:00
Esme-Yi	c965e14a12	Revert "[DebugInfo] Fix the mismatching between C++ language tags and Dwarf versions." This reverts commit `62fa9b9388`.	2021-04-12 10:36:46 +00:00
Sander de Smalen	6bf806b3e2	[AArch64] ACLE: Fix issue for mismatching enum types with builtins. This patch fixes an issue with the SVE prefetch and qinc/qdec intrinsics that take an `enum` argument, but where the builtin prototype encodes these as `int`. Some code in SemaDecl found the mismatch and chose to forget about the builtin altogether, which meant that any future code using that builtin would fail. The code that forgets about the builtin was actually obsolete after D77491 and should have been removed. This patch now removes that code. This patch also fixes another issue with the SVE prefetch intrinsic when built with C++, where the builtin didn't accept the correct pointer type, which should be `const void *`. Reviewed By: tambre Differential Revision: https://reviews.llvm.org/D100046	2021-04-12 11:16:28 +01:00
Sven van Haastregt	731bf28a60	[OpenCL] Accept .rgba in OpenCL 3.0 The .rgba vector component accessors are supported in OpenCL C 3.0. Previously, the diagnostic would check `OpenCLVersion` for version 2.2 (value 220) and report those accessors are an OpenCL 2.2 feature. However, there is no "OpenCL C version 2.2", so change the check and diagnostic text to 3.0 only. A spurious `OpenCLVersion` argument was passed into the diagnostic; remove that. Differential Revision: https://reviews.llvm.org/D99969	2021-04-12 09:30:06 +01:00
Esme-Yi	62fa9b9388	[DebugInfo] Fix the mismatching between C++ language tags and Dwarf versions. Summary: The tags DW_LANG_C_plus_plus_14 and DW_LANG_C_plus_plus_11, introduced in Dwarf-5, are unexpected in previous versions. Fixing the mismatching doesn't have any drawbacks for any other debuggers, but helps dbx. Reviewed By: aprantl, shchenz Differential Revision: https://reviews.llvm.org/D99250	2021-04-12 07:42:54 +00:00
Freddy Ye	5cb47be410	[X86] Remove FeatureCLWB from FeaturesICLClient Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D100279	2021-04-12 12:08:59 +08:00
yifeng.dongyifeng	3a6a80b641	[Clang][Coroutine][DebugInfo] In c++ coroutine, clang will emit different debug info variables for parameters and move-parameters. The first one is the real parameter of the coroutine function; the other exists only for copying parameters to the coroutine frame. Consider the following C++ code: ``` struct coro { ... }; coro foo(struct test & t) { ... co_await suspend_always(); ... co_await suspend_always(); ... co_await suspend_always(); } int main(int argc, char *argv[]) { auto c = foo(...); c.handle.resume(); ... } ``` Function foo is a standard coroutine function with only one parameter, named t (ignoring this at first). When we compile this function with LLVM, we get the following IR: ``` !2921 = distinct !DISubprogram(name: "foo", linkageName: "_ZN6Object3fooE4test", scope: !2211, file: !45, line: 48, type: !2329, scopeLine: 48, flags: DIFlagPrototyped | DIFlagAllCallsDescribed, spFlags: DISPFlagDefinition | DISPFlagOptimized, unit: !44, declaration: !2328, retainedNodes: !2922) !2924 = !DILocalVariable(name: "t", arg: 2, scope: !2921, file: !45, line: 48, type: !838) ... !2926 = !DILocalVariable(name: "t", scope: !2921, type: !838, flags: DIFlagArtificial) ``` There are two `the same` DIVariables named t in the same DWARF scope for foo.resume. When we use llvm-dwarfdump to dump the DWARF info of this ELF file, we get the following output: ``` 0x00006684: DW_TAG_subprogram DW_AT_low_pc (0x00000000004013a0) DW_AT_high_pc (0x00000000004013a8) DW_AT_frame_base (DW_OP_reg7 RSP) DW_AT_object_pointer (0x0000669c) DW_AT_GNU_all_call_sites (true) DW_AT_specification (0x00005b5c "_ZN6Object3fooE4test") 0x000066a5: DW_TAG_formal_parameter DW_AT_name ("t") DW_AT_decl_file ("/disk1/yifeng.dongyifeng/my_code/llvm/build/bin/coro-debug-1.cpp") DW_AT_decl_line (48) DW_AT_type (0x00004146 "test") 0x000066ba: DW_TAG_variable DW_AT_name ("t") DW_AT_type (0x00004146 "test") DW_AT_artificial (true) ``` The ELF file also has two 't' entries in the same scope. Unfortunately, this may confuse the debugger, which then fails to print parameters at O0 or above. This patch makes coroutine parameters and move parameters use the same DIVar to fix the problems described above. Test Plan: check-clang Reviewed By: aprantl, jmorse Differential Revision: https://reviews.llvm.org/D97533	2021-04-12 11:10:47 +08:00
Zakk Chen	59d5b8c27b	[RISCV][Clang] Add some RVV Permutation intrinsic functions. Support the following instructions. 1. Vector Slide Instructions 2. Vector Register Gather Instructions 3. Vector Compress Instruction Authored-by: Roger Ferrer Ibanez <rofirrim@gmail.com> Co-Authored-by: Zakk Chen <zakk.chen@sifive.com> Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D100127	2021-04-11 19:19:02 -07:00
Zakk Chen	a8fc0e445c	[RISCV][Clang] Add all RVV Mask intrinsic functions. 1. Redefine vpopc and vfirst IR intrinsic so it could adapt on clang tablegen generator which always appends a type for vl in IntrinsicType of clang codegen. 2. Remove `c` type transformer and add `u` and `l` for unsigned long and long type. Authored-by: Roger Ferrer Ibanez <rofirrim@gmail.com> Co-Authored-by: Zakk Chen <zakk.chen@sifive.com> Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D100120	2021-04-11 19:19:02 -07:00
Zakk Chen	e5a8219264	[RISCV][Clang] Add more RVV load/store intrinsic functions. Support the following instructions. 1. Mask load and store 2. Vector Strided Instructions 3. Vector Indexed Store Instructions Authored-by: Roger Ferrer Ibanez <rofirrim@gmail.com> Co-Authored-by: Zakk Chen <zakk.chen@sifive.com> Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D99965	2021-04-11 19:19:02 -07:00
Zakk Chen	c680b0dabf	[RISCV][Clang] Add all RVV Reduction intrinsic functions. Authored-by: Roger Ferrer Ibanez <rofirrim@gmail.com> Co-Authored-by: Zakk Chen <zakk.chen@sifive.com> Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D99964	2021-04-11 19:19:01 -07:00
Zakk Chen	07c3854a75	[RISCV][Clang] Add RVV merge intrinsic functions. Authored-by: Roger Ferrer Ibanez <rofirrim@gmail.com> Co-Authored-by: Zakk Chen <zakk.chen@sifive.com> Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D99963	2021-04-11 19:19:01 -07:00
Zakk Chen	01fa222b6d	[RISCV][Clang] Add RVV Type-Convert intrinsic functions. Fix extension macro condition. Support below instructions: 1. Single-Width Floating-Point/Integer Type-Convert Instructions 2. Widening Floating-Point/Integer Type-Convert Instructions 3. Narrowing Floating-Point/Integer Type-Convert Instructions Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D99742	2021-04-11 19:19:01 -07:00
Zakk Chen	5f7739b60e	[RISCV][Clang] Add some RVV Floating-Point intrinsic functions. Support vfclass, vfmerge, vfrec7, vfrsqrt7, vfsqrt instructions. Reviewed By: craig.topper Authored-by: Roger Ferrer Ibanez <rofirrim@gmail.com> Co-Authored-by: Zakk Chen <zakk.chen@sifive.com> Differential Revision: https://reviews.llvm.org/D99741	2021-04-11 19:19:01 -07:00
Zakk Chen	98a3ff9d05	[RISCV][Clang] Add more RVV Floating-Point intrinsic functions. Support below instructions. 1. Vector Widening Floating-Point Add/Subtract Instructions 2. Vector Widening Floating-Point Multiply 3. Vector Single-Width Floating-Point Fused Multiply-Add Instructions 4. Vector Widening Floating-Point Fused Multiply-Add Instructions 5. Vector Floating-Point Compare Instructions Reviewed By: craig.topper, HsiangKai Authored-by: Roger Ferrer Ibanez <rofirrim@gmail.com> Co-Authored-by: Zakk Chen <zakk.chen@sifive.com> Differential Revision: https://reviews.llvm.org/D99669	2021-04-11 19:19:01 -07:00
Zakk Chen	007ea0e736	[RISCV][Clang] Add some RVV Floating-Point intrinsic functions. Support the following instructions which have the same class. 1. Vector Single-Width Floating-Point Subtract Instructions 2. Vector Single-Width Floating-Point Multiply/Divide Instructions 3. Vector Floating-Point MIN/MAX Instructions 4. Vector Floating-Point Sign-Injection Instructions Reviewed By: craig.topper Authored-by: Roger Ferrer Ibanez <rofirrim@gmail.com> Co-Authored-by: Zakk Chen <zakk.chen@sifive.com> Differential Revision: https://reviews.llvm.org/D99668	2021-04-11 19:19:01 -07:00
Zakk Chen	ccc624bfd4	[RISCV][Clang] Add RVV Widening Integer Add/Subtract intrinsic functions. Reviewed By: craig.topper Authored-by: Roger Ferrer Ibanez <rofirrim@gmail.com> Co-Authored-by: Zakk Chen <zakk.chen@sifive.com> Differential Revision: https://reviews.llvm.org/D99526	2021-04-11 19:19:01 -07:00
Saurabh Jha	71ab6c98a0	[Matrix] Implement C-style explicit type conversions for matrix types. This implements C-style type conversions for matrix types, as specified in clang/docs/MatrixTypes.rst. Fixes PR47141. Reviewed By: fhahn Differential Revision: https://reviews.llvm.org/D99037	2021-04-10 11:48:41 +01:00
Hsiangkai Wang	471ae42c04	[RISCV][Clang] Add RVV vleff intrinsic functions. Reviewed By: craig.topper, liaolucy, jrtc27, khchen Differential Revision: https://reviews.llvm.org/D99151	2021-04-10 17:10:19 +08:00
Roman Lebedev	6270b3a1ea	Temporarily revert "[CGCall] Annotate `this` argument with alignment" As per @jyknight, "It seems like there's a bug with vtable thunks getting the wrong information." See https://reviews.llvm.org/D99790#2680857, https://godbolt.org/z/MxhYMe1q7 This reverts commit `0aa0458f14`.	2021-04-10 10:43:16 +03:00
Ben Shi	4f173c0c42	[clang][AVR] Support variable decorator '__flash' Reviewed By: Anastasia Differential Revision: https://reviews.llvm.org/D96853	2021-04-10 11:23:55 +08:00
cchen	1a43fd2769	[OpenMP51] Initial support for masked directive and filter clause Adds basic parsing/sema/serialization support for the #pragma omp masked directive. Reviewed By: ABataev Differential Revision: https://reviews.llvm.org/D99995	2021-04-09 14:00:36 -05:00
Alex Richardson	dc4abca766	Handle alloc_size attribute on function pointers I have been trying to statically find and analyze all calls to heap allocation functions to determine how many of them use sizes known at compile time vs only at runtime. While doing so I saw that quite a few projects use replaceable function pointers for heap allocation and noticed that clang was not able to annotate functions pointers with alloc_size. I have changed the Sema checks to allow alloc_size on all function pointers and typedefs for function pointers now and added checks that these attributes are propagated to the LLVM IR correctly. With this patch we can also compute __builtin_object_size() for calls to allocation function pointers with the alloc_size attribute. Reviewed By: aaron.ballman, erik.pilkington Differential Revision: https://reviews.llvm.org/D55212	2021-04-09 18:49:38 +01:00
Matheus Izvekov	1819222860	[clang] tests: cleanup, update and add some new ones This reworks a small set of tests, as preparatory work for implementing P2266. * Run for more standard versions, including c++2b. * Normalize file names and run commands. * Adds some extra tests. New Coroutine tests taken from Aaron Puchert's D68845. Signed-off-by: Matheus Izvekov <mizvekov@gmail.com> Reviewed By: thakis Differential Revision: https://reviews.llvm.org/D99225	2021-04-09 17:24:08 +02:00
Yaxun (Sam) Liu	25942d7c49	[AMDGPU] Allow relaxed/consume memory order for atomic inc/dec Reviewed by: Jon Chesterfield Differential Revision: https://reviews.llvm.org/D100144	2021-04-09 09:23:41 -04:00
David Blaikie	eb8a28e2cf	DebugInfo: Include inline namespaces in template specialization parameter names This ensures these types have distinct names if they are distinct types (eg: if one is an instantiation with a type in one inline namespace, and another from a type with the same simple name, but in a different inline namespace).	2021-04-08 17:37:55 -07:00
Xiangling Liao	d508561798	[AIX] Support init priority attribute Differential Revision: https://reviews.llvm.org/D99291	2021-04-08 15:40:09 -04:00
Craig Topper	02ef9963e1	[RISCV] Prevent __builtin_riscv_orc_b_64 from being compiled for an RV32 target. The backend can't handle this and will throw a fatal error from type legalization. It's easy enough to fix that for this intrinsic by just splitting the IR intrinsic since it works on individual bytes. There will be other intrinsics in the future that would be harder to support through splitting, for example grev, gorc, and shfl. Those would require a compare and a select be inserted to check the MSB of their control input. This patch adds support for preventing this in the frontend with a nice diagnostic. Reviewed By: frasercrmck Differential Revision: https://reviews.llvm.org/D99984	2021-04-08 11:34:56 -07:00
Valeriy Savchenko	663ac91ed1	[analyzer] Fix false positives in inner pointer checker (PR49628) This patch supports std::data and std::addressof functions. rdar://73463300 Differential Revision: https://reviews.llvm.org/D99260	2021-04-08 20:30:12 +03:00
Valeriy Savchenko	4b958dd6bc	[analyzer] Fix crash on spaceship operator (PR47511) rdar://68954187 Differential Revision: https://reviews.llvm.org/D99181	2021-04-08 20:28:05 +03:00
Dávid Bolvanský	2cb8c10342	Revert "Reduce the number of attributes attached to each function" This reverts commit `053dc95839`. It causes perf regressions - see discussion in D97116.	2021-04-08 17:28:57 +02:00
Valeriy Savchenko	9f0d8bac14	[analyzer] Fix dead store checker false positive It is common to zero-initialize not only scalar variables, but also structs. This is also defensive programming and we shouldn't complain about that. rdar://34122265 Differential Revision: https://reviews.llvm.org/D99262	2021-04-08 16:12:42 +03:00
Fangrui Song	8ac5e44061	[Driver] Drop $DEFAULT_TRIPLE-$name as a fallback program name D13340 introduced this behavior which is not needed even for mips. This was raised on https://lists.llvm.org/pipermail/cfe-dev/2020-May/065437.html but no action was taken. This was raised again in https://lists.llvm.org/pipermail/cfe-dev/2021-April/067974.html "The LLVM host/target TRIPLE padding drama on Debian" as it caused confusion. This patch drops the behavior. Differential Revision: https://reviews.llvm.org/D99996	2021-04-07 21:01:10 -07:00
Jinsong Ji	a723310b41	[Driver][test] Test intended target only `6fe7de90b9` changed GNU toolchain, and added new RUN line to test expected behavior. The change is for GNU toolchain only, so this will fail other toolchain, eg: AIX. Update the test with `-target` to test GNU tool chain only. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D99901	2021-04-07 20:08:26 +00:00
Jennifer Yu	ebf2dc3328	Fix missing generate capture expression for novariants condition.	2021-04-07 12:35:49 -07:00
Aaron En Ye Shi	df59850038	[HIP] Fix rocm-detect.hip test path The ROCm installation directory may be another directory, llvm/ inside the build directory. Reviewed By: yaxunl Differential Revision: https://reviews.llvm.org/D100045	2021-04-07 17:20:59 +00:00
Sander de Smalen	672f673004	[SVE] Remove checks for warnings in scalable-vector tests. After D98856 these tests will by default break (fatal_error) if any of the wrong interfaces are used, so there's no longer a need to have a RUN line that checks for a warning message emitted by the compiler.	2021-04-07 15:59:32 +01:00
Florian Hahn	7ca4dd8217	[Clang] Extend test coverage for -f[no-]finite-loops options. Extend test coverage by checking various standard versions with -f[no-]finite-loops. Suggested as part of D96418.	2021-04-07 13:15:49 +01:00
Balazs Benics	f0e102c1a3	[analyzer][NFC] Add tests for extents If we allocate memory, the extent of the MemRegion will be the symbolic value of the size parameter. This way, if that symbol gets constrained, the extent will be also constrained. This test demonstrates that the extent is indeed the same symbol. Reviewed By: NoQ Differential Revision: https://reviews.llvm.org/D99959	2021-04-07 13:43:19 +02:00
Valeriy Savchenko	77f1e096e8	[-Wcompletion-handler] Don't recognize init methods as conventional rdar://75704162 Differential Revision: https://reviews.llvm.org/D99601	2021-04-07 13:50:01 +03:00
Valeriy Savchenko	4821c15691	[analyzer] Fix body farm for Obj-C++ properties When property is declared in a superclass (or in a protocol), it still can be of CXXRecord type and Sema could've already generated a body for us. This patch joins two branches and two ways of acquiring IVar in order to reuse the existing code. And prevent us from generating l-value to r-value casts for C++ types. rdar://67416721 Differential Revision: https://reviews.llvm.org/D99194	2021-04-07 13:44:43 +03:00
Sven van Haastregt	35bc7569f8	[OpenCL] Add as_size/ptrdiff/intptr/uintptr_t operators size_t and friends are built-in scalar data types and s6.4.4.2 of the OpenCL C Specification says the as_type() operator must be available for these data types. Differential Revision: https://reviews.llvm.org/D98959	2021-04-07 10:16:41 +01:00
Roman Lebedev	2829094a8e	Reland [InstCombine] Fold `((X - Y) - Z)` to `X - (Y + Z)` (PR49858) This reverts commit `a547b4e26b`, relanding commit `31d219d299`, which was reverted because there was a conflicting inverse transform, which was causing an endless combine loop, which has now been adjusted. Original commit message: https://alive2.llvm.org/ce/z/67w-wQ We prefer `add`s over `sub`, and this particular xform allows further folds to happen: Fixes https://bugs.llvm.org/show_bug.cgi?id=49858	2021-04-07 12:06:25 +03:00
Thomas Preud'homme	e018698bec	[clang, test] Fix use of undef FileCheck var Clang test CodeGen/libcalls.c contains CHECK-NOT directives using a variable defined in a CHECK directive with a different prefix never enabled together, therefore causing the variable to be undefined in that CHECK-NOT. The intent of the test is to check that some declarations do not have the same attribute as when compiling the test without -fmath-errno. This commit instead changes all CHECK-NOT directives to CHECK directives, checking that they all use the same attribute. It also adds an extra CHECK for that prefix to check the expected attributes these functions should have when compiling with -fmath-errno. Reviewed By: rjmccall Differential Revision: https://reviews.llvm.org/D99898	2021-04-07 09:43:58 +01:00
Roman Lebedev	0aa0458f14	[CGCall] Annotate `this` argument with alignment As it is being noted in D99249, lack of alignment information on `this` has been preventing LICM from happening. For some time now, lack of alignment attribute does not imply natural alignment, but an alignment of `1`. Also, we used to treat dereferenceable as implying alignment, but we no longer do, so it's a bugfix. Differential Revision: https://reviews.llvm.org/D99790	2021-04-07 11:02:01 +03:00
Petr Hosek	000cf84cf1	Revert "[NFC][Clang] Speculative fix for builtins-ppc-quadword-noi128.c" This reverts commit `849d372943` which depends on `31d219d299` that was reverted.	2021-04-06 23:22:08 -07:00
Weverything	401826800e	Add missing CHECK lines in test	2021-04-06 18:00:31 -07:00
Yaxun (Sam) Liu	86175d5fed	Minor fix for test hip-code-object-version.hip Changed the order of checking of v2 and v3. Change-Id: Ifea8197b398afdfb0aa1bd40140cda30f00f0c17	2021-04-06 20:32:16 -04:00
Yaxun (Sam) Liu	4fd05e0ad7	[HIP] Change to code object v4 Change to code object v4 by default to match ROCm 4.1. Reviewed by: Artem Belevich Differential Revision: https://reviews.llvm.org/D99235	2021-04-06 20:22:58 -04:00
Hansang Bae	3da61ddae7	[OpenMP] Define omp_is_initial_device() variants in omp.h omp_is_initial_device() is marked as a built-in function in the current compiler, and user code guarded by this call may be optimized away, resulting in undesired behavior in some cases. This patch provides a possible fix for such cases by defining the routine as a variant function and removing it from builtin list. Differential Revision: https://reviews.llvm.org/D99447	2021-04-06 16:58:01 -05:00
Aaron Puchert	dfec26b186	Thread safety analysis: Don't warn about managed locks on join points We already did so for scoped locks acquired in the constructor, this change extends the treatment to deferred locks and scoped unlocking, so locks acquired outside of the constructor. Obviously this makes things more consistent. Originally I thought this was a bad idea, because obviously it introduces false negatives when it comes to double locking, but these are typically easily found in tests, and the primary goal of the Thread safety analysis is not to find double locks but race conditions. Since the scoped lock will release the mutex anyway when the scope ends, the inconsistent state is just temporary and probably fine. Reviewed By: delesley Differential Revision: https://reviews.llvm.org/D98747	2021-04-06 22:29:48 +02:00
Yaxun (Sam) Liu	61d065e21f	Let clang atomic builtins fetch add/sub support floating point types Recently atomicrmw started to support fadd/fsub: https://reviews.llvm.org/D53965 However clang atomic builtins fetch add/sub still does not support emitting atomicrmw fadd/fsub. This patch adds that. Reviewed by: John McCall, Artem Belevich, Matt Arsenault, JF Bastien, James Y Knight, Louis Dionne, Olivier Giroux Differential Revision: https://reviews.llvm.org/D71726	2021-04-06 15:44:00 -04:00
Alexandre Ganea	8fbc05acd5	[Windows] Add test coverage for line endings when rewriting includes Validate that we're properly generating a single line ending on Windows when using -frewrite-includes. Otherwise we're breaking split-line macros. The test fails before `23929af383`. See discussion in https://reviews.llvm.org/D96363#2650460 and D99426 Differential Revision: https://reviews.llvm.org/D99973	2021-04-06 15:38:19 -04:00
Paul Robinson	04b3c8c52c	Pass -fcrash-diagnostics-dir along to LLVM This allows frontend and backend diagnostic files to all go into the same place. Have it control the Windows (mini-)dump location. Differential Revision: https://reviews.llvm.org/D99199	2021-04-06 09:30:52 -07:00
Ben Langmuir	93c87fc06e	[index] Improve macro indexing support The major change here is to index macro occurrences in more places than before, specifically * In non-expansion references such as `#if`, `#ifdef`, etc. * When the macro is a reference to a builtin macro such as __LINE__. * When using the preprocessor state instead of callbacks, we now include all definition locations and undefinitions instead of just the latest one (which may also have had the wrong location previously). * When indexing an existing module file (.pcm), we now include module macros, and we no longer report unrelated preprocessor macros during indexing the module, which could have caused duplication. Additionally, we now correctly obey the system symbol filter for macros, so by default in system headers only definition/undefinition occurrences are reported, but it can be configured to report references as well if desired. Extends FileIndexRecord to support occurrences of macros. Since the design of this type is to keep a single list of entities organized by source location, we incorporate macros into the existing DeclOccurrence struct. Differential Revision: https://reviews.llvm.org/D99758	2021-04-06 09:12:14 -07:00
Erik Pilkington	b660abc80d	[ObjC] Add a command line flag that disables recognition of objc_direct for testability Programmers would like to be able to test direct methods by calling them from a different linkage unit or mocking them, both of which are impossible. This patch adds a flag that effectively disables the attribute, which will fix this when enabled in testable builds. rdar://71190891 Differential revision: https://reviews.llvm.org/D95845	2021-04-06 11:17:01 -04:00
Roman Lebedev	849d372943	[NFC][Clang] Speculative fix for builtins-ppc-quadword-noi128.c	2021-04-06 16:15:23 +03:00
Zakk Chen	f2a3601aa5	[RISCV][Clang] Add all RVV Fixed-Point Arithmetic intrinsic functions. Reviewed By: HsiangKai Authored-by: Roger Ferrer Ibanez <rofirrim@gmail.com> Co-Authored-by: Zakk Chen <zakk.chen@sifive.com> Differential Revision: https://reviews.llvm.org/D99610	2021-04-06 03:12:45 -07:00
Zakk Chen	fe252b509e	[RISCV][Clang] Add more RVV Integer intrinsic functions. Support below instructions. 1. Vector Integer Add-with-Carry / Subtract-with-Borrow Instructions 2. Vector Integer Comparison Instructions 3. Vector Widening Integer Multiply-Add Instructions Reviewed By: HsiangKai Authored-by: Roger Ferrer Ibanez <rofirrim@gmail.com> Co-Authored-by: Zakk Chen <zakk.chen@sifive.com> Differential Revision: https://reviews.llvm.org/D99528	2021-04-06 03:11:28 -07:00
Zakk Chen	f720c22e77	[RISCV][Clang] Add RVV Widening Integer Extension intrinsic functions. Reviewed By: HsiangKai Authored-by: Roger Ferrer Ibanez <rofirrim@gmail.com> Co-Authored-by: Zakk Chen <zakk.chen@sifive.com> Differential Revision: https://reviews.llvm.org/D99527	2021-04-06 03:10:14 -07:00
Zakk Chen	0a18ea01f1	[RISCV][Clang] Add RVV vnsra, vnsrl and vwmul intrinsic functions. Reviewed By: craig.topper Authored-by: Roger Ferrer Ibanez <rofirrim@gmail.com> Co-Authored-by: Zakk Chen <zakk.chen@sifive.com> Differential Revision: https://reviews.llvm.org/D99525	2021-04-06 03:07:36 -07:00
Zakk Chen	66c05609e0	[RISCV][Clang] Add some RVV Integer intrinsic functions. 1. Rename RVVBinBuiltin to RVVOutputOp1Builtin because it is not related to the number of operands. 2. Add RVV Integer instructions which use RVVOutputOp1Builtin. Reviewed By: craig.topper Authored-by: Roger Ferrer Ibanez <rofirrim@gmail.com> Co-Authored-by: Zakk Chen <zakk.chen@sifive.com> Differential Revision: https://reviews.llvm.org/D99524	2021-04-06 03:07:36 -07:00
Balázs Kéri	bee4813789	[clang][Checkers] Fix PthreadLockChecker state cleanup at dead symbol. It is possible that an entry in 'DestroyRetVal' lives longer than an entry in 'LockMap' if not removed at checkDeadSymbols. The added test case demonstrates this. Reviewed By: NoQ Differential Revision: https://reviews.llvm.org/D98504	2021-04-06 11:15:29 +02:00
Thomas Preud'homme	828ec9e9e5	[OpenCL, test] Fix use of undef FileCheck var Clang test CodeGenOpenCL/fpmath.cl uses a variable defined in an earlier CHECK-NOT directive. However, by definition the pattern in that directive is not supposed to occur so no variable will be defined. This commit solves the issue by using a regex match with the same regex as in the definition. It also changes the definition into a regex match since no variable is going to be defined. Reviewed By: yaxunl Differential Revision: https://reviews.llvm.org/D99857	2021-04-05 21:11:39 +01:00
Jennifer Yu	7078ef4722	[OPENMP51]Initial support for nocontext clause. Added basic parsing/sema/serialization support for the 'nocontext' clause. Differential Revision: https://reviews.llvm.org/D99848	2021-04-05 11:45:49 -07:00
Charusso	89d210fe1a	[analyzer] DynamicSize: Debug facility This patch adds two debug functions to ExprInspectionChecker to dump out the dynamic extent and element count of symbolic values: dumpExtent(), dumpElementCount().	2021-04-05 19:17:52 +02:00
Charusso	df64f471d1	[analyzer] DynamicSize: Store the dynamic size This patch introduces a way to store the size. Reviewed By: NoQ Differential Revision: https://reviews.llvm.org/D69726	2021-04-05 19:04:53 +02:00
Erik Pilkington	803b79221e	[SemaObjC] Fix a -Wbridge-cast false-positive Clang used to emit a bad -Wbridge-cast diagnostic on the cast in the attached test. This was because, after `09abecef7`, struct __CFString was not added to lookup, so the objc_bridge attribute wasn't getting duplicated onto the most recent declaration, causing us to fail to find it in getObjCBridgeAttr. This patch fixes this by instead walking through the redeclarations to find an appropriate bridge attribute. rdar://72823399 Differential revision: https://reviews.llvm.org/D99661	2021-04-05 11:41:40 -04:00
Thomas Preud'homme	4dd3e0feca	[DebugInfo, CallSites, test] Fix use of undef FileCheck var Clang test CodeGen/debug-info-extern-call.c tries to check for the absence of a sequence of instructions with several CHECK-NOT with one of those directives using a variable defined in another. However CHECK-NOT are checked independently so that is using a variable defined in a pattern that should not occur in the input. This commit removes the CHECK-NOT for the retained line attribute definition since the CHECK-NOT on the compile unit will already check that there are no retained lines. Reviewed By: djtodoro Differential Revision: https://reviews.llvm.org/D99830	2021-04-05 11:39:24 +01:00
Yaxun (Sam) Liu	907af84396	[CUDA][HIP] rename -fcuda-flush-denormals-to-zero Rename it to -fgpu-flush-denormals-to-zero. Reviewed by: Artem Belevich Differential Revision: https://reviews.llvm.org/D99688	2021-04-05 00:13:51 -04:00
Thomas Preud'homme	292726b644	[HIP, test] Fix use of undef FileCheck var Clang test CodeGenCUDA/kernel-stub-name.cu uses the never-defined DKERN variable in a CHECK-NOT directive. This commit replaces the variable by a regex, thereby avoiding the issue. Reviewed By: yaxunl Differential Revision: https://reviews.llvm.org/D99832	2021-04-04 19:30:49 +01:00
Thomas Preud'homme	a41b5100e4	[HIP-Clang, test] Fix use of undef FileCheck var Commit `8129521318` changed a line defining PREFIX in clang test CodeGenCUDA/device-stub.cu into a CHECK-NOT directive. All following lines using PREFIX are therefore using an undefined variable since the pattern defining PREFIX is not supposed to occur and CHECK-NOT are checked independently. This commit replaces all uses of PREFIX by the regex used to define it, thereby avoiding the problem. Reviewed By: yaxunl Differential Revision: https://reviews.llvm.org/D99831	2021-04-04 19:30:27 +01:00
Fangrui Song	e92d2b80c6	[Driver] Detect libstdc++ include paths for native gcc (-m32 and -m64) on Debian i386 Take gcc-8 on Debian i386 as an example. The target-specific libstdc++ search path (`GPLUSPLUS_TOOL_INCLUDE_DIR`) uses the multiarch name `i386-linux-gnu`, instead of the triple of the GCC installation `i686-linux-gnu` (the directory under `usr/lib/gcc/`): ``` /usr/include/c++/8 /usr/include/i386-linux-gnu/c++/8 /usr/include/c++/8/backward ``` Clang currently detects `/usr/lib/gcc/i686-linux-gnu/8/../../../include/i686-linux-gnu/c++/8`. This patch changes the second i686-linux-gnu to i386-linux-gnu so that `/usr/include/i386-linux-gnu/c++/8` can be found. Fix PR49827 - this was somehow regressed by my previous libstdc++ include path cleanups and fixes for gcc-cross, but it seems that the paths were never properly tested before. Differential Revision: https://reviews.llvm.org/D99852	2021-04-04 10:15:12 -07:00
Aaron Ballman	241d42c382	Speculative fix for failing build bot. This attempts to resolve an issue found by http://45.33.8.238/macm1/6821/step_6.txt	2021-04-04 10:58:56 -04:00
Timm Bäder	1b4800c262	[clang][parser] Set source ranges for GNU-style attributes Set the source ranges for parsed GNU-style attributes in ParseGNUAttributes(), the same way that ParseCXX11Attributes() does it. Differential Revision: https://reviews.llvm.org/D75844	2021-04-04 07:59:22 +02:00
Thomas Preud'homme	1cc9d949a1	[C++20, test] Fix use of undef FileCheck variable Commit `f495de43bd` forgot two lines when removing checks for strong and weak equality, resulting in the use of an undefined FileCheck variable. Reviewed By: Quuxplusone Differential Revision: https://reviews.llvm.org/D99838	2021-04-04 00:05:48 +01:00
Thomas Preud'homme	95f448aa86	[PGO, test] Fix typo in FileCheck var Reviewed By: xur Differential Revision: https://reviews.llvm.org/D99821	2021-04-03 08:44:46 +01:00
Matheus Izvekov	bac74a50e9	[clang] NFC: remove trailing white spaces from some tests Differential Revision: https://reviews.llvm.org/D99826	2021-04-03 03:18:22 +02:00
Aaron Ballman	4be8a26951	Use tablegen to diagnose mutually exclusive attributes Currently, when one or more attributes are mutually exclusive, the developer adding the attribute has to manually emit diagnostics. In practice, this is highly error prone, especially for declaration attributes, because such checking is not trivial. Redeclarations require you to write a "merge" function to diagnose mutually exclusive attributes and most attributes get this wrong. This patch introduces a table-generated way to specify that a group of two or more attributes are mutually exclusive: def : MutualExclusions<[Attr1, Attr2, Attr3]>; This works for both statement and declaration attributes (but not type attributes) and the checking is done either from the common attribute diagnostic checking code or from within mergeDeclAttribute() when merging redeclarations.	2021-04-02 16:34:42 -04:00
Jennifer Yu	cb424fee3d	[OPENMP5.1]Initial support for novariants clause. Added basic parsing/sema/serialization support for the 'novariants' clause.	2021-04-02 13:19:01 -07:00
Levy Hsu	f78d932cf2	[RISCV] Add IR intrinsics for Zbc extension Header files are included in a separate patch in case the name needs to be changed. RV32 / 64: clmul clmulh clmulr Differential Revision: https://reviews.llvm.org/D99711	2021-04-02 12:09:13 -07:00
Levy Hsu	944adbf285	Recommit "[RISCV] Add IR intrinsic for Zbb extension" Forgot to amend the Author. Original commit message: Header files are included in a separate patch in case the name needs to be changed. RV32 / 64: orc.b Differential Revision: https://reviews.llvm.org/D99320	2021-04-02 11:50:19 -07:00
Craig Topper	1f0b309f24	Revert "[RISCV] Add IR intrinsic for Zbb extension" This reverts commit `1808194590`. I forgot to change the author.	2021-04-02 11:47:02 -07:00
Craig Topper	1808194590	[RISCV] Add IR intrinsic for Zbb extension Header files are included in a separate patch in case the name needs to be changed. RV32 / 64: orc.b	2021-04-02 11:23:57 -07:00
Levy Hsu	b001d574d7	[RISCV] Add IR intrinsic for Zbr extension Implementation for RISC-V Zbr extension intrinsic. Header files are included in separate patch in case the name needs to be changed RV32 / 64: crc32b crc32h crc32w crc32cb crc32ch crc32cw RV64 Only: crc32d crc32cd Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D99009	2021-04-02 10:58:45 -07:00
Marek Kurdej	2ec7f639c4	[clang-cl] [Sema] Do not prefer integral conversion over floating-to-integral for MS compatibility 19.28 and higher. As of MSVC 19.28 (2019 Update 8), integral conversion is no longer preferred over floating-to-integral, and so MSVC is more standard conformant and will generate a compiler error on ambiguous call. Cf. https://godbolt.org/z/E8xsdqKsb. Initially found during the review of D99641. Reviewed By: rnk Differential Revision: https://reviews.llvm.org/D99663	2021-04-02 08:58:22 +02:00
Chen Zheng	f026e1f520	[debug-info][XCOFF] set `-gno-column-info` by default for DBX For DBX, it does not handle column info well. Set -gno-column-info by default for DBX. Reviewed By: dblaikie Differential Revision: https://reviews.llvm.org/D99703	2021-04-01 21:29:11 -04:00
Thomas Preud'homme	2c3db73341	[OpenMP, test] Fix use of undef VAR_PRIV FileCheck var Remove the CHECK-NOT directive referring to as-of-yet undefined VAR_PRIV variable since the pattern of the following CHECK-NOT in the same CHECK-NOT block covers a superset of the case caught by the first CHECK-NOT. Reviewed By: ABataev Differential Revision: https://reviews.llvm.org/D99775	2021-04-02 00:39:21 +01:00
Thomas Preud'homme	58e458935b	[OpenMP, test] Fix use of undef DECL FileCheck var OpenMP test target_data_use_device_ptr_if_codegen contains a CHECK-NOT directive using an undefined DECL FileCheck variable. It seems copied from target_data_use_device_ptr_codegen where there's a CHECK for a load that defined the variable. Since there is no corresponding load in this testcase, the simplest is to simply forbid any store and get rid of the variable altogether. Reviewed By: ABataev Differential Revision: https://reviews.llvm.org/D99771	2021-04-02 00:36:56 +01:00
Thomas Preud'homme	d222a07d30	[OpenMP, test] Fix uses of undef SVAR FileCheck var Fix the many cases of use of undefined SIVAR/SVAR/SFVAR in OpenMP private_codegen tests, due to a missing BLOCK directive to capture the IR variable when it is declared. It also fixes a few typos in their use. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D99770	2021-04-02 00:36:14 +01:00
cchen	cba422264c	[OpenMP51] Accept `primary` as proc bind affinity policy in Clang Reviewed By: ABataev Differential Revision: https://reviews.llvm.org/D99622	2021-04-01 18:07:12 -05:00
Fangrui Song	6fe7de90b9	[Driver] -nostdinc -nostdinc++: don't warn for -Wunused-command-line-argument	2021-04-01 14:37:34 -07:00
Jian Cai	76d9bc7278	Reland "Add support to -Wa,--version in clang" This relands commit `3cc3c0f835` with fixed test cases, which was reverted by commit `bf2479c347`.	2021-04-01 13:47:56 -07:00
Joseph Huber	69ca50bd7d	[OpenMP] Pass mapping names to add components in a user defined mapper Summary: Currently the mapping names are not passed to the mapper components that set up the array region. This means array mappings will not have their names available in the runtime. This patch fixes this by passing the argument name to the region correctly. This means that the mapped variable's name will be the declared mapper that placed it on the device. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D99681	2021-04-01 15:51:03 -04:00
Timm Bäder	908a267b5a	Revert "[clang][parser] Set source ranges for GNU-style attributes" This reverts commit `1ea9fa8c50`.	2021-04-01 17:32:40 +02:00
Timm Bäder	1ea9fa8c50	[clang][parser] Set source ranges for GNU-style attributes Set the source ranges for parsed GNU-style attributes in ParseGNUAttributes(), the same way that ParseCXX11Attributes() does it. Differential Revision: https://reviews.llvm.org/D75844	2021-04-01 17:25:23 +02:00
Balázs Kéri	df4fa53fdd	[clang][Checkers] Extend PthreadLockChecker state dump (NFC). Add printing of map 'DestroyRetVal'. Reviewed By: steakhal Differential Revision: https://reviews.llvm.org/D98502	2021-04-01 11:59:00 +02:00
Harald van Dijk	1d463c2a38	[Driver] Fix architecture triplets and search paths for Linux x32 Currently, support for the x32 ABI is handled as a multilib to the x86_64 target only. However, full self-hosting x32 systems treating it as a separate architecture with its own architecture triplets as well as search paths exist as well, in Debian's x32 port and elsewhere. This adds the missing architecture triplets and search paths so that clang can work as a native compiler on x32, and updates the tests so that they pass when using an x32 libdir suffix. Additionally, we would previously also assume that objects from any x86_64-linux-gnu GCC installation could be used to target x32. This changes the logic so that only GCC installations that include x32 support are used when targeting x32, meaning x86_64-linux-gnux32 GCC installations, and x86_64-linux-gnu and i686-linux-gnu GCC installations that include x32 multilib support. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D52050	2021-04-01 09:47:56 +01:00
Chen Zheng	bfcd21876a	[debug-info] support new tuning debugger type DBX for XCOFF DWARF Based on this debugger type, for now, we plan to: 1: use inline string by default for XCOFF DWARF 2: generate no column info for debug line table. Reviewed By: aprantl Differential Revision: https://reviews.llvm.org/D99400	2021-04-01 00:11:30 -04:00
Nick Desaulniers	bf2479c347	Revert "Add support to -Wa,--version in clang" This reverts commit `3cc3c0f835`. Breaks non-linux platforms. https://reviews.llvm.org/D99556#2662706 Signed-off-by: Nick Desaulniers <ndesaulniers@google.com>	2021-03-31 17:02:13 -07:00
Jian Cai	3cc3c0f835	Add support to -Wa,--version in clang Clang currently only supports -Wa,--version when -no-integrated-as is used. This adds support to -Wa,--version with -integrated-as. Link: https://github.com/ClangBuiltLinux/linux/issues/1320 Reviewed By: nickdesaulniers, MaskRay Differential Revision: https://reviews.llvm.org/D99556	2021-03-31 16:29:02 -07:00
Alexey Bataev	a28e835e94	[OPENMP]Fix PR48885: Crash in passing firstprivate args to tasks on Apple M1. Need to bitcast the function pointer passed as a parameter to the real type to avoid possible problem with calling conventions. Differential Revision: https://reviews.llvm.org/D99521	2021-03-31 13:00:58 -07:00
Alexey Bataev	66da4f6fc9	[OPENMP]Fix PR48658: [OpenMP 5.0] Compiler crash when OpenMP atomic sync hints used. No need to consider hint clause kind as the main atomic clause kind at the codegen. Differential Revision: https://reviews.llvm.org/D99611	2021-03-31 12:58:24 -07:00
Petr Hosek	fcf6800506	[Driver] Move detectLibcxxIncludePath to ToolChain This helper method is useful even outside of Gnu toolchains, so move it to ToolChain so it can be reused in other toolchains such as Fuchsia. Differential Revision: https://reviews.llvm.org/D88452	2021-03-31 10:50:44 -07:00
Thomas Lively	45783d0e8a	[WebAssembly] Implement i64x2 comparisons Removes the prototype builtin and intrinsic for i64x2.eq and implements that instruction as well as the other i64x2 comparison instructions in the final SIMD spec. Unsigned comparisons were not included in the final spec, so they still need to be scalarized via a custom lowering. Differential Revision: https://reviews.llvm.org/D99623	2021-03-31 10:46:17 -07:00
Timm Bäder	5018e15fdf	[clang][parser] Allow GNU-style attributes in explicit template... ... instantiations They are currently not being diagnosed because ProhibitAttributes() does not handle attribute lists with an invalid source range. But once it does, we need to allow GNU attributes in this place. Additionally, start optionally diagnosing empty attr lists in ProhibitCXX11Attributes(), since ProhibitAttribute() does it. Differential Revision: https://reviews.llvm.org/D97362	2021-03-31 16:44:19 +02:00
Luís Marques	a8cf32baf5	[RISCV] Add XFAIL riscv32 for known issue with the old pass manager See D80668, rG7b4832648a63 and https://bugs.llvm.org/show_bug.cgi?id=46117 for details of the issue. Differential Revision: https://reviews.llvm.org/D99108	2021-03-31 15:18:32 +01:00
Anton Bikineev	dc7ebd2cb0	[C++2b] Support size_t literals This adds support for C++2b's z/uz suffixes for size_t literals (P0330).	2021-03-31 13:36:23 +00:00
Balázs Kéri	ffcb4b43b7	Revert "[clang][Checkers] Extend PthreadLockChecker state dump (NFC)." This reverts commit `49c0ab6d76`. Test failures showed up because of non-deterministic output.	2021-03-31 15:28:53 +02:00
Balázs Kéri	49c0ab6d76	[clang][Checkers] Extend PthreadLockChecker state dump (NFC). Add printing of map 'DestroyRetVal'. Reviewed By: steakhal Differential Revision: https://reviews.llvm.org/D98502	2021-03-31 11:19:42 +02:00
Jim Lin	32ca5a037a	[RISCV] Refine pre-define macro tests 1. Undefined macro test for rv32i and rv64i. a. Reorder it with canonical order. b. Add missing undefined macro check. c. Append defined value to `__riscv_a`, `__riscv_f` and `__riscv_c` to distinguish with `__riscv_arch_test`, `__riscv_cmodel_medlow` and `__riscv_float_abi_soft`. They have the same prefix. 2. Move abi macro test below f and d. 3. Unify coding style for newline. Reviewed By: HsiangKai Differential Revision: https://reviews.llvm.org/D99631	2021-03-31 14:06:20 +08:00
Ta-Wei Tu	99fd066227	[clang][Sema] Don't try to initialize implicit variable of invalid anonymous union/struct This fixes https://bugs.llvm.org/show_bug.cgi?id=49534, where the call to the constructor of the anonymous union is checked and triggers assertion failure when trying to retrieve the alignment of the `this` argument (which is a union with virtual function). The extra check for alignment was introduced in D97187. Reviewed By: tmatheson Differential Revision: https://reviews.llvm.org/D98548	2021-03-31 09:05:45 +08:00
Richard Smith	9eef0fae2b	Fix test expectations for %diff documentation.	2021-03-30 17:48:08 -07:00
Richard Smith	1705136590	Fix pluralization error in diagnostic, and move C++ testcase to proper directory.	2021-03-30 16:18:55 -07:00
Wei Mi	d535a05ca1	[ThinLTO] During module importing, close one source module before open another one for distributed mode. Currently during module importing, ThinLTO opens all the source modules, collects functions to be imported and appends them to the destination module, then leaves all the modules open throughout the lto backend pipeline. This patch refactors it in the way that one source module will be closed before another source module is opened. All the source modules will be closed after importing phase is done. It will save some amount of memory when there are many source modules to be imported. Note that this patch only changes the distributed thinlto mode. For in process thinlto mode, one source module is shared across different thinlto backend threads so it is not changed in this patch. Differential Revision: https://reviews.llvm.org/D99554	2021-03-30 14:37:29 -07:00
Mike Rice	b7899ba0e8	[OPENMP51]Initial support for the dispatch directive. Added basic parsing/sema/serialization support for dispatch directive. Differential Revision: https://reviews.llvm.org/D99537	2021-03-30 14:12:53 -07:00
Alexey Bataev	e2c7bf08cc	[OPENMP]Fix PR48607: Crash during clang openmp codegen for firstprivate() of `float _Complex`. Need to cast the argument for the debug wrapper function call to the corresponding parameter type to avoid crash. Differential Revision: https://reviews.llvm.org/D99617	2021-03-30 13:39:45 -07:00
Matheus Izvekov	3ad6dd5d8f	[clang] Use decltype((E)) for compound requirement type constraint See PR45088. Compound requirement type constraints were using decltype(E) instead of decltype((E)), as per `[expr.prim.req]p1.3.3`. Since neither instantiation nor type dependence should matter for the constraints, this uses an approach where a `decltype` type is not built, and just the canonical type of the expression after template instantiation is used on the requirement. Signed-off-by: Matheus Izvekov <mizvekov@gmail.com> Reviewed By: rsmith Differential Revision: https://reviews.llvm.org/D98160	2021-03-30 22:00:33 +02:00
Kevin Petit	9d25ce743a	[OpenCL] Fix parsing of opencl-c.h in CL 3.0 Ensure that the cl_khr_3d_image_writes pragma is enabled by making cl_khr_3d_image_writes an optional core feature in CL 3.0 in addition to being an available extension in 1.0 onwards and a core feature in CL 2.0. https://reviews.llvm.org/D99425 Signed-off-by: Kevin Petit <kevin.petit@arm.com>	2021-03-30 16:17:46 +01:00
Alexey Bataev	bd334c790f	[OPENMP]Fix test checks for 32bit targets, NFC.	2021-03-30 07:45:12 -07:00
Valeriy Savchenko	af7e1f07ac	[analyzer] Fix crash when reasoning about C11 atomics (PR49422) rdar://75020762 Differential Revision: https://reviews.llvm.org/D99274	2021-03-30 16:04:19 +03:00
Valeriy Savchenko	90377308de	[analyzer] Support allocClassWithName in OSObjectCStyleCast checker `allocClassWithName` allocates an object with the given type. The type is actually provided as a string argument (type's name). This creates a possibility for not particularly useful warnings from the analyzer. In order to combat with those, this patch checks for casts of the `allocClassWithName` results to types mentioned directly as its argument. All other uses of this method should be reasoned about as before. rdar://72165694 Differential Revision: https://reviews.llvm.org/D99500	2021-03-30 15:58:06 +03:00
Gabor Marton	efa7df1682	[Analyzer] Track RValue expressions It makes sense to track rvalue expressions in the case of special concrete integer values. The most notable special value is zero (later we may find other values). By tracking the origin of 0, we can provide a better explanation for users e.g. in case of division by 0 warnings. When the divisor is a product of a multiplication then now we can show which operand (or both) was (were) zero and why. Differential Revision: https://reviews.llvm.org/D99344	2021-03-30 14:48:38 +02:00
Alexey Bataev	1696b8ae96	[OPENMP]Fix PR48740: OpenMP declare reduction in C does not require an initializer If no initializer-clause is specified, the private variables will be initialized following the rules for initialization of objects with static storage duration. Need to adjust the implementation to the current version of the standard. Differential Revision: https://reviews.llvm.org/D99539	2021-03-30 05:38:20 -07:00
Marek Kurdej	a99b8ae390	[clang] [PR49736] [C++2b] Correctly reject lambdas with requires clause and no parameter list This fixes http://llvm.org/PR49736 caused by implementing http://wg21.link/P1102 (https://reviews.llvm.org/rG0620e6f4b76a9725dbd82454d58c5a68a7e47074), by correctly allowing requires-clause only: 1) directly after template-parameter-list 2) after lambda-specifiers iff parameter-declaration-clause is present (2nd kind of lambda-declarator) Reviewed By: aaron.ballman Differential Revision: https://reviews.llvm.org/D99489	2021-03-30 13:53:55 +02:00
Raphael Isemann	1cbba533ec	[ObjC][CodeGen] Fix missing debug info in situations where an instance and class property have the same identifier Since the introduction of class properties in Objective-C it is possible to declare a class and an instance property with the same identifier in an interface/protocol. Right now Clang just generates debug information for whatever property comes first in the source file. The second property is ignored as it's filtered out by the set of already emitted properties (which is just using the identifier of the property to check for equivalence). I don't think generating debug info in this case was ever supported as the identifier filter is in place since `7123bca7fb` (which precedes the introduction of class properties). This patch expands the filter to take into account identifier + whether the property is class/instance. This ensures that both properties are emitted in this special situation. Reviewed By: aprantl Differential Revision: https://reviews.llvm.org/D99512	2021-03-30 11:07:16 +02:00
Johannes Doerfert	03cc8a1ba0	[OpenMP][NFC] Move the `noinline` to the parallel entry point The `noinline` for non-SPMD parallel functions is probably not necessary but as long as we use it we should put it on the outermost parallel function, which is the wrapper, not the actual outlined function. Resolves PR49752 Reviewed By: JonChesterfield Differential Revision: https://reviews.llvm.org/D99506	2021-03-30 01:12:45 -05:00
Hsiangkai Wang	5821a58d8e	[RISCV] Add inline asm constraint 'vr' and 'vm' in Clang for RISC-V 'V'. Add asm constraint 'vr' for vector registers. Add asm constraint 'vm' for vector mask registers. Differential Revision: https://reviews.llvm.org/D98616	2021-03-30 09:47:27 +08:00
Fanbo Meng	bd8dd580ff	[NFC] clang-formatting zos-alignment.c Reviewed By: abhina.sreeskantharajan Differential Revision: https://reviews.llvm.org/D99514	2021-03-29 16:48:10 -04:00
Florian Hahn	d3ff65dc11	[Clang] Fix line numbers in CHECK lines.	2021-03-29 17:37:48 +01:00
Florian Hahn	9320ac9b49	[Clang] Only run test when X86 backend is built. After `c773d0f973` the remark is only emitted if the loop is profitable to vectorize, but cannot be vectorized. Hence, it depends on X86-specific cost-modeling.	2021-03-29 17:27:01 +01:00
Fanbo Meng	f1e0c7fdd7	[SystemZ][z/OS] Add test of leading zero length bitfield in const/volatile struct Reviewed By: abhina.sreeskantharajan Differential Revision: https://reviews.llvm.org/D99508	2021-03-29 12:06:30 -04:00
Alexey Bataev	0411b23319	[OPENMP]Map data field with l-value reference types. Added initial support for the mapping of the data members with l-value reference types. Differential Revision: https://reviews.llvm.org/D98812	2021-03-29 07:07:09 -07:00
Alexey Bataev	f6f21dcd6c	[OPENMP]Fix PR49636: Assertion `(!Entry.getAddress() || Entry.getAddress() == Addr) && "Resetting with the new address."' failed. The original issue is caused by the fact that the variable is allocated with incorrect type i1 instead of i8. This causes the bitcasting of the declaration to i8 type and the bitcast expression does not match the original variable. To fix the problem, the UndefValue initializer and the original variable should be emitted with type i8, not i1. Differential Revision: https://reviews.llvm.org/D99297	2021-03-29 06:55:57 -07:00
Fanbo Meng	0858f0e09e	[SystemZ][z/OS] Set maximum value to truncate attribute aligned to for static variables on z/OS target On z/OS there is a hard limitation on the maximum requestable alignment in aligned attribute for static variables. We need to truncate values greater than that. Reviewed By: abhina.sreeskantharajan Differential Revision: https://reviews.llvm.org/D98864	2021-03-29 09:44:33 -04:00
Alexey Bataev	dcf96178cb	[OPENMP]Fix PR49052: Clang crashed when compiling target code with assert(0). Need to insert a basic block during generation of the target region to avoid crash for the GPU to be able always calling a cleanup action. This cleanup action is required for the correct emission of the target region for the GPU. Differential Revision: https://reviews.llvm.org/D99445	2021-03-29 06:36:06 -07:00
Matt Arsenault	9a0c9402fa	Reapply "OpaquePtr: Turn inalloca into a type attribute" This reverts commit `07e46367ba`.	2021-03-29 08:55:30 -04:00
Oliver Stannard	07e46367ba	Revert "Reapply "OpaquePtr: Turn inalloca into a type attribute"" Reverting because test 'Bindings/Go/go.test' is failing on most buildbots. This reverts commit `fc9df30991`.	2021-03-29 11:32:22 +01:00
Fangrui Song	2a28d1d3b7	[Driver] Linux.cpp: move resource directory before /usr/local/include for non-musl This follows GCC and simplifies code. /usr/local/include and TOOL_INCLUDE_DIR should not conflict with the resource directory include so users should not observe any difference.	2021-03-28 12:44:21 -07:00
Fangrui Song	53c98d85a8	[Driver] Suppress libstdc++/libc++ path with -nostdinc This follows GCC. Having libstdc++/libc++ include paths is not useful anyway because libstdc++/libc++ header files cannot find features.h. While here, suppress -stdlib++-isystem with -nostdlibinc.	2021-03-28 11:30:27 -07:00
Matt Arsenault	fc9df30991	Reapply "OpaquePtr: Turn inalloca into a type attribute" This reverts commit `20d5c42e0e`.	2021-03-28 13:35:21 -04:00
Nico Weber	20d5c42e0e	Revert "OpaquePtr: Turn inalloca into a type attribute" This reverts commit `4fefed6563`. Broke check-clang everywhere.	2021-03-28 13:02:52 -04:00
Zakk Chen	821547cabb	[RISCV][Clang] Update new overloading rules for RVV intrinsics. RVV intrinsics has new overloading rule, please see `82aac7dad4` Changed: 1. Rename `generic` to `overloaded` because the new rule is not using C11 generic. 2. Change HasGeneric to HasNoMaskedOverloaded because all masked operations support overloading api. 3. Add more overloaded tests due to overloading rule changed. Differential Revision: https://reviews.llvm.org/D99189	2021-03-28 09:04:35 -07:00
Matt Arsenault	4fefed6563	OpaquePtr: Turn inalloca into a type attribute I think byval/sret and the others are close to being able to rip out the code to support the missing type case. A lot of this code is shared with inalloca, so catch this up to the others so that can happen.	2021-03-28 11:12:23 -04:00
Fangrui Song	dcaa0293c1	[test] Add UNSUPPORTED: system-windows to linux-ld.c We should have a test verifying / \ for Windows but have such a long test specifically for Linux cross compilation suffer from Windows \ is too troublesome.	2021-03-27 16:46:30 -07:00
Fangrui Song	87a9f42fc1	[Driver] Remove an incorrect library path for multilib This is incorrect (adding a path with unrelated libraries) but benign in practice because previous paths take precedence.	2021-03-27 16:36:21 -07:00
Fangrui Song	19e45696f5	[Driver] Remove an unneeded multiarch library path which ends with ../../.. Neither vanilla nor Debian GCC has the patch, which usually duplicates $sysroot/usr/lib.	2021-03-27 15:46:06 -07:00
Giorgis Georgakoudis	8bc2c662d9	[Utils] Add prefix parameter in update test checks to avoid FileCheck conflicts IR values convert to check prefix FileCheck variables for IR checks. For example, nameless values, e.g., %0, convert to check prefix TMP FileCheck variables, e.g., [[TMP0:%.*]]. This check prefix may clash with named values that have the same name and that causes auto-generated tests to fail. Currently a warning is emitted to change the names of the IR values but this is not always possible, if for example they are generated by clang. Manual intervention to fix the FileCheck variable names is too tedious. This patch add a parameter to prefix conflicting FileCheck variable names with a user-provided string to automate the process. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D99415	2021-03-26 11:49:42 -07:00
Anastasia Stulova	6e46f0b628	[OpenCL] Fix AST check in address-space-templates test Differential Revision: https://reviews.llvm.org/D99258	2021-03-26 14:24:30 +00:00
Fanbo Meng	6f91cf75d7	[SystemZ][z/OS] Ignore leading zero width bitfield alignment on z/OS target Zero length bitfield alignment is not respected if they are leading members on z/OS target. Reviewed By: abhina.sreeskantharajan Differential Revision: https://reviews.llvm.org/D98890	2021-03-26 10:10:33 -04:00
Richard Smith	4f3ea27dac	Stop this test from dropping a .s file in the current directory.	2021-03-25 18:22:18 -07:00
Richard Smith	11bf268864	Add a target triple to fix test failure on targets that don't support __int128.	2021-03-25 17:05:36 -07:00
Fangrui Song	ed956554f9	[Triple][Driver] Add muslx32 environment and use /lib/ld-musl-x32.so.1 for -dynamic-linker Differential Revision: https://reviews.llvm.org/D99308	2021-03-25 16:25:47 -07:00
David Stone	4b5baa5b82	Handle 128-bits IntegerLiterals in StmtPrinter This fixes PR35677: "int128_t or uint128_t as non-type template parameter causes crash when considering invalid constructor".	2021-03-25 17:27:13 -04:00
Xun Li	f490a5969b	[OpenMP][InstrProfiling] Fix a missing instr profiling counter When emitting a function body there needs to be an instr profiling counter emitted. Otherwise instr profiling won't work for this function. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D98135	2021-03-25 13:52:36 -07:00
Richard Smith	622f8de4f2	PR49724: Fix deduction of null member pointers. Previously we created an implicit cast of the wrong kind, which we'd later fail to constant-evaluate, resulting in deduction failure.	2021-03-25 13:47:22 -07:00
Xun Li	c7a39c833a	[Coroutine][Clang] Force emit lifetime intrinsics for Coroutines tl;dr Correct implementation of Coroutines requires having lifetime intrinsics available. Coroutine functions are functions that can be suspended and resumed later. To do so, data that need to stay alive after suspension must be put on the heap (i.e. the coroutine frame). The optimizer is responsible for analyzing each AllocaInst and figuring out whether it should be put on the stack or the frame. In most cases, for data that we are unable to accurately analyze lifetime, we can just conservatively put them on the heap. Unfortunately, there exist a few cases where certain data MUST be put on the stack, not on the heap. Without lifetime intrinsics, we are unable to correctly analyze those data's lifetime. To dig into more details, there exist cases where at certain code points, the current coroutine frame may have already been destroyed. Hence no frame access would be allowed beyond that point. The following is a common code pattern called "Symmetric Transfer" in coroutine: ``` auto tmp = await_suspend(); __builtin_coro_resume(tmp.address()); return; ``` In the above code example, `await_suspend()` returns a new coroutine handle, which we will obtain the address and then resume that coroutine. This essentially "transferred" from the current coroutine to a different coroutine. During the call to `await_suspend()`, the current coroutine may be destroyed, which should be fine because we are not accessing any data afterwards. However when LLVM is emitting IR for the above code, it needs to emit an AllocaInst for `tmp`. It will then call the `address` function on tmp. `address` function is a member function of coroutine, and there is no way for the LLVM optimizer to know that it does not capture the `tmp` pointer. So when the optimizer looks at it, it has to conservatively assume that `tmp` may escape and hence put it on the heap. Furthermore, in some cases `address` call would be inlined, which will generate a bunch of store/load instructions that move the `tmp` pointer around. Those stores will also make the compiler think that `tmp` might escape. To summarize, it's really difficult for the mid-end to figure out that the `tmp` data is short-lived. I made some attempt in D98638, but it appears to be way too complex and is basically doing the same thing as inserting lifetime intrinsics in coroutines. Also, for reference, we already force emitting lifetime intrinsics in O0 for AlwaysInliner: https://github.com/llvm/llvm-project/blob/main/llvm/lib/Passes/PassBuilder.cpp#L1893 Differential Revision: https://reviews.llvm.org/D99227	2021-03-25 13:46:20 -07:00
Leonard Chan	1abaadb30d	[clang][driver] Support HWASan in the Fuchsia toolchain These contain clang driver changes for supporting HWASan on Fuchsia. This includes hwasan multilibs and the dylib path change. Differential Revision: https://reviews.llvm.org/D99361	2021-03-25 13:36:23 -07:00
Yaxun (Sam) Liu	cc9477166a	[CUDA][HIP] add __builtin_get_device_side_mangled_name Add builtin function __builtin_get_device_side_mangled_name to get device side mangled name for functions and global variables, which can be used to get symbol address of kernels or variables by mangled name in dynamically loaded bundled code objects at run time. Reviewed by: Artem Belevich Differential Revision: https://reviews.llvm.org/D99301	2021-03-25 15:25:29 -04:00
Gabor Marton	015c39882e	[Analyzer] Infer 0 value when the divisible is 0 (bug fix) Currently, we infer 0 if the divisible of the modulo op is 0: int a = x < 0; // a can be 0 int b = a % y; // b is either 1 % sym or 0 However, we don't when the op is / : int a = x < 0; // a can be 0 int b = a / y; // b is either 1 / sym or 0 / sym This commit fixes the discrepancy. Differential Revision: https://reviews.llvm.org/D99343	2021-03-25 18:25:06 +01:00
Djordje Todorovic	8420a53324	[Debugify] Expose original debug info preservation check as CC1 option In order to test the preservation of the original Debug Info metadata in your projects, a front end option could be very useful, since users usually report that a concrete entity (e.g. variable x, or function fn2()) is missing debug info. The [0] is an example of running the utility on GDB Project. This depends on: D82546 and D82545. Differential Revision: https://reviews.llvm.org/D82547	2021-03-25 05:29:42 -07:00
Chuanqi Xu	20b4f484d1	[Driver] Add -fno-split-stack Summary: Add -fno-split-stack and rename CC1 option from `-split-stacks` to `-fsplit-stack`. Test Plan: check-all Differential Revision: https://reviews.llvm.org/D99245	2021-03-25 14:18:28 +08:00
Fangrui Song	cdd993fab3	[Driver] Use -dynamic-linker /lib/ld-musl-i386.so.1 for i?86-linux-musl Noticed by Khem Raj	2021-03-24 19:44:53 -07:00
Nathan Chancellor	ef58ae86ba	[RISCV] Fix mcount name GCC's name for this symbol is _mcount, which the Linux kernel expects in a few different places: $ echo 'int main(void) { return 0; }' | riscv32-linux-gcc -c -pg -o tmp.o -x c - $ llvm-objdump -dr tmp.o | grep mcount 0000000c: R_RISCV_CALL _mcount $ echo 'int main(void) { return 0; }' | riscv64-linux-gcc -c -pg -o tmp.o -x c - $ llvm-objdump -dr tmp.o | grep mcount 000000000000000c: R_RISCV_CALL _mcount $ echo 'int main(void) { return 0; }' | clang -c -pg -o tmp.o --target=riscv32-linux-gnu -x c - $ llvm-objdump -dr tmp.o | grep mcount 0000000a: R_RISCV_CALL_PLT mcount $ echo 'int main(void) { return 0; }' | clang -c -pg -o tmp.o --target=riscv64-linux-gnu -x c - $ llvm-objdump -dr tmp.o | grep mcount 000000000000000a: R_RISCV_CALL_PLT mcount Set MCountName to "_mcount" in RISCVTargetInfo then prevent it from getting overridden in certain OSTargetInfo constructors. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D98881 Signed-off-by: Nathan Chancellor <nathan@kernel.org>	2021-03-24 18:11:37 -07:00
Yuanfang Chen	217f0f735a	[Clang][Sema] Implement GCC -Wcast-function-type ``` Warn when a function pointer is cast to an incompatible function pointer. In a cast involving function types with a variable argument list only the types of initial arguments that are provided are considered. Any parameter of pointer-type matches any other pointer-type. Any benign differences in integral types are ignored, like int vs. long on ILP32 targets. Likewise type qualifiers are ignored. The function type void (*) (void) is special and matches everything, which can be used to suppress this warning. In a cast involving pointer to member types this warning warns whenever the type cast is changing the pointer to member type. This warning is enabled by -Wextra. ``` Reviewed By: rsmith Differential Revision: https://reviews.llvm.org/D97831	2021-03-24 16:04:18 -07:00
Fangrui Song	bfbfd83f14	[Driver] Linux.cpp: delete unneeded D.getVFS().exists checks Not only can this save unneeded filesystem stats, it can make `clang --sysroot=/path/to/debian-sysroot -c a.cc` work (get `-internal-isystem $sysroot/usr/include/x86_64-linux-gnu`) even without `lib/x86_64-linux-gnu/`. This should make thakis happy.	2021-03-24 15:25:36 -07:00
Alexey Bataev	9e9f6eba84	[OPENMP]Fix PR49468: Declare target should allow empty sequences and namespaces. The empty declare target/end declare target region should not cause an error emission. Differential Revision: https://reviews.llvm.org/D99288	2021-03-24 12:53:33 -07:00
Heejin Ahn	a6aae5f7fc	[WebAssembly] Don't inline -emscripten-cxx-exceptions-allowed functions Functions specified in `-emscripten-cxx-exceptions-allowed`, which is set by Emscripten's `EXCEPTION_CATCHING_ALLOWED` setting, can be inlined in LLVM middle ends before we reach WebAssemblyLowerEmscriptenEHSjLj pass in the wasm backend and thus don't get transformed for exception catching. This fixes the issue by adding `--force-attribute=FUNC_NAME:noinline` for each function name in `-emscripten-cxx-exceptions-allowed`, which adds `noinline` attribute to the specified function and thus excludes the function from inlining candidates in optimization passes. Fixes the remaining half of https://github.com/emscripten-core/emscripten/issues/10721. Reviewed By: sbc100 Differential Revision: https://reviews.llvm.org/D99259	2021-03-24 12:27:49 -07:00
Nathan James	279ea930fa	[clang] Add fixit for Wreorder-ctor Create fix-it hints to fix the order of constructors. To make this a lot simpler, I've grouped all the warnings for each out of order initializer into 1. This is necessary as fixing one initializer would often interfere with other initializers. Reviewed By: aaron.ballman Differential Revision: https://reviews.llvm.org/D98745	2021-03-24 19:22:53 +00:00
Alexey Bataev	7654bb6303	[OPENMP]Fix PR48571: critical/master in outlined contexts cause crash. If emit inlined region for master/critical directives, no need to clear lambda/block context data, otherwise the variables cannot be found and it causes a crash at compile time. Differential Revision: https://reviews.llvm.org/D99280	2021-03-24 10:15:24 -07:00
Aaron Puchert	a6a1c3051d	Fix false negative in -Wthread-safety-attributes The original implementation didn't fire on non-template classes when a base class was an instantiation of a template with a dependent base. In that case the base of the base is dependent as seen from the base, but not from the class we're interested in, which isn't a template. Also it simplifies the code a lot. Reviewed By: aaron.ballman Differential Revision: https://reviews.llvm.org/D98724	2021-03-24 17:45:25 +01:00
Marek Kurdej	0620e6f4b7	[clang] [C++2b] [P1102] Accept lambdas without parameter list (). As an extension, accept such lambdas in previous standards with a warning. * http://eel.is/c++draft/expr.prim.lambda * http://wg21.link/P1102 Reviewed By: aaron.ballman Differential Revision: https://reviews.llvm.org/D98433	2021-03-24 14:42:27 +01:00
Stefan Pintilie	91f4c11133	[PowerPC] Add mprivileged option Add an option to tell the compiler that it can use privileged instructions. This patch only adds the option. Backend implementation will be added in a future patch. Reviewed By: lei, amyk Differential Revision: https://reviews.llvm.org/D99193	2021-03-24 08:33:22 -05:00
Haojian Wu	cfc36bf017	[clang] Treat variable-length array of incomplete element type as incomplete type. Differential Revision: https://reviews.llvm.org/D99165	2021-03-24 14:22:15 +01:00
Anastasia Stulova	d1c8a151df	[OpenCL] Added distinct file extension for C++ for OpenCL. Files compiled with C++ for OpenCL mode can now have a distinct file extension - clcpp, then clang driver picks the compilation mode automatically (-x clcpp) without the use of -cl-std=clc++. Differential Revision: https://reviews.llvm.org/D96771	2021-03-24 13:07:04 +00:00
Stefan Pintilie	0e4f5f3ea6	[PowerPC] Change option to mrop-protect In order to have the same option on power PC LLVM and power PC gcc the option will be changed from -mrop-protection to -mrop-protect. The feature will be off by default and turned on when the option is used. Reviewed By: lei, amyk Differential Revision: https://reviews.llvm.org/D99185	2021-03-24 05:51:35 -05:00
Ella Ma	1d8fc086ae	[clang][lit] Allow test cases to use the compiler that are used to compile Clang Required by D83660. Test cases may want to use the host compiler to compile some mocks for the test case. This patch adds two substitutions `%host_cc` and `%host_cxx` to use the host compilers set via variable `CMAKE_C_COMPILER` and `CMAKE_CXX_COMPILER`. Patch by Ella Ma! Reviewed By: steakhal Differential Revision: https://reviews.llvm.org/D98918	2021-03-24 11:32:57 +01:00
Nemanja Ivanovic	4020932706	[PowerPC] Make altivec.h work with AIX which has no __int128 There are a number of functions in altivec.h that use vector __int128 which isn't supported on AIX. Those functions need to be guarded for targets that don't support the type. Furthermore, the functions that produce quadword instructions without using the type need a builtin. This patch adds the macro guards to altivec.h using the __SIZEOF_INT128__ which is only defined on targets that support the __int128 type.	2021-03-24 00:35:51 -05:00
Zakk Chen	88c2d4c8eb	[RISCV][Clang] Add RVV Vector Indexed Load intrinsic functions. Support Complex type transformer to define more complexity legal type. Overall our downstream implementation there are only four instructions need to use complex type transformer, it's not a common case. I still feel using a string for prototypes is simple and clear. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D98848	2021-03-23 19:18:50 -07:00
Richard Smith	4259301aaf	Support #__private_macro and #__public_macro in local submodule visibility mode.	2021-03-23 16:54:28 -07:00
Bruno Cardoso Lopes	431e3138a1	[CGAtomic] Lift stronger requirements on cmpxch and support acquire failure mode - Fix `emitAtomicCmpXchgFailureSet` to support release/acquire (succ/fail) memory order. - Remove stronger checks for cmpxch. Effectively, this addresses http://wg21.link/p0418 Differential Revision: https://reviews.llvm.org/D98995	2021-03-23 16:45:37 -07:00
Fangrui Song	c4f65ef78f	[test] Add --sysroot= to make gcc-toolchain.cpp stable	2021-03-23 13:32:30 -07:00
Arthur O'Dwyer	5f1de9cab1	[C++20] [P1825] Fix bugs with implicit-move from variables of reference type. Review D88220 turns out to have some pretty severe bugs, but I think this patch fixes them. Paper P1825 is supposed to enable implicit move from "non-volatile objects and rvalue references to non-volatile object types." Instead, what was committed seems to have enabled implicit move from "non-volatile things of all kinds, except that if they're rvalue references then they must also refer to non-volatile things." In other words, D88220 accidentally enabled implicit move from lvalue object references (super yikes!) and also from non-object references (such as references to functions). These two cases are now fixed and regression-tested. Differential Revision: https://reviews.llvm.org/D98971	2021-03-23 14:12:06 -04:00
Nancy Wang	f46c41febb	[SystemZ][z/OS] fix lit test related to alignment This patch fixes lit test case failures related to alignment. On z/OS, the maximum alignment value for 64-bit mode is 16. It also fixes clang/test/Layout/itanium-union-bitfield.cpp: attribute ((aligned(4))) is needed for the bit-field member in the union on z/OS because a single bit-field has one-byte alignment; this makes sure the size and alignment have the correct values on z/OS. Differential Revision: https://reviews.llvm.org/D98793	2021-03-23 13:15:19 -04:00
Timm Bäder	bc6b139392	[clang][parser] Don't prohibit attributes on objc @try/@throw This line has a TODO comment, but the answer to it seems to be "no" given that clang itself uses attributes on @try statements in its tests. This ProhibitAttributes() statement is also dead code since ProhibitAttributes() does not handle GNU attributes at the moment but those are the only attributes valid in objc. Differential Revision: https://reviews.llvm.org/D97371	2021-03-23 15:26:25 +01:00
Zakk Chen	0bc1959f51	[RISCV][NFC] Fix RVV intrinsic tests. 1. Skip the temporary file 2. Test cc1 with -S to verify codegen work well. Add '-target-feature +m' because the backend requires it to calculate the vscaled size/offset. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D99082	2021-03-23 06:06:05 -07:00
Kadir Cetinkaya	8f80c66bd2	[clang] Fix a crash when CTAD fails Differential Revision: https://reviews.llvm.org/D99145	2021-03-23 13:03:30 +01:00
Nemanja Ivanovic	2f782a796a	[PowerPC] Add more missing overloads to altivec.h Add overloads that perform subtraction on v1i128 that take and produce vector unsigned char to avoid needing to use __int128. The overloads are suffixed with _u128 and are needed for targets where __int128 isn't supported (AIX).	2021-03-23 05:52:36 -05:00
Nemanja Ivanovic	54e4654f04	[PowerPC] Add more missing overloads to altivec.h Add overloads that perform addition on v1i128 that take and produce vector unsigned char to avoid needing to use __int128. The overloads are suffixed with _u128 and are needed for targets where __int128 isn't supported (AIX).	2021-03-23 05:09:19 -05:00
Nemanja Ivanovic	10cc5bcd86	[PowerPC] Add more missing overloads to altivec.h Add vec_permi as a synonym for vec_xxpermdi (but only for doubleword vectors).	2021-03-22 23:09:41 -05:00
Nemanja Ivanovic	b5e96e0ad6	[PowerPC] Add more missing overloads to altivec.h Add vec_gbb as a synonym for vec_vgbbd but for doubleword vectors.	2021-03-22 22:25:28 -05:00
Nemanja Ivanovic	d8e574c8e6	[PowerPC] Add more missing overloads to altivec.h Add vec_cvf as a synonym for vec_doublee/vec_floate.	2021-03-22 22:08:43 -05:00
Zakk Chen	1ea07ee453	Revert "[RISCV][NFC] Fix RVV intrinsic tests." This reverts commit `ab082b582d`.	2021-03-22 18:51:48 -07:00
Zakk Chen	ab082b582d	[RISCV][NFC] Fix RVV intrinsic tests. 1. Skip the temporary file 2. Test cc1 with -S to verify codegen work well. Add '-target-feature +m' because the backend requires it to calculate the vscaled size/offset. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D99082	2021-03-22 18:24:03 -07:00
Nemanja Ivanovic	bef2cb9062	[PowerPC] Add more missing overloads to altivec.h Add vec_ctd which is similar to vec_ctf except the return type is vector double rather than vector float.	2021-03-22 20:23:07 -05:00
Amara Emerson	66af90b46e	[darwin][driver] Pass through -global-isel LLVM flags to ld. GlobalISel is currently not enabled when using -flto since the front-end -mllvm flags don't get passed through. This change fixes this for Darwin platforms. We have to do this in the driver because the code generator choice isn't embedded into the bitcode file. Differential Revision: https://reviews.llvm.org/D99126	2021-03-22 17:23:06 -07:00
Yaxun (Sam) Liu	282bf9eaf7	[HIP] Fix ROCm detection ROCm has changed installation path to /opt/rocm-{release}. Add detection for that. Also support ROCM_PATH environment variable. Reviewed by: Artem Belevich Differential Revision: https://reviews.llvm.org/D98867	2021-03-22 16:10:02 -04:00
Fangrui Song	3e32e8c588	[test] Bring back the improved arm and $sysroot/usr/include/i386-linux-gnu tests `21b211a8f2` was reverted temporarily to give Fuchsia some time for migrating to a better sysroot, but the tests can be restored separately.	2021-03-22 12:08:46 -07:00
Petr Hosek	21b211a8f2	Revert "[Driver] Clean up Debian multiarch /usr/include/<triplet> madness" This reverts commit `874bdc8e61` which broke the use of older Debian sysroots.	2021-03-22 11:58:28 -07:00
Petr Hosek	933d146f38	Revert "[Driver] -m32: Add /usr/include/i386-linux-gnu for Debian" This reverts commit `82f6e0dde2` which hasn't addressed the `874bdc8e61` issue.	2021-03-22 11:58:28 -07:00
Bradley Smith	48f5a392cb	[IR] Add vscale_range IR function attribute This attribute represents the minimum and maximum values vscale can take. For now this attribute is not hooked up to anything during codegen, this will be added in the future when such codegen is considered stable. Additionally hook up the -msve-vector-bits=<x> clang option to emit this attribute. Differential Revision: https://reviews.llvm.org/D98030	2021-03-22 12:05:06 +00:00
Sven van Haastregt	2bbc9bccf0	[OpenCL] Support template parameters for as_type Implement the TreeTransform for AsTypeExpr. Split `BuildAsTypeExpr` out of `ActOnAsTypeExpr`, such that we can call the Build method from the TreeTransform. Fixes PR47979. Differential Revision: https://reviews.llvm.org/D98855	2021-03-22 11:59:05 +00:00
Sven van Haastregt	20d93267e1	[OpenCL] Use -fdeclare-opencl-builtins for some tests This speeds up the test running times, as the large `opencl-c.h` header no longer needs to be parsed.	2021-03-22 09:46:28 +00:00
Fangrui Song	82f6e0dde2	[Driver] -m32: Add /usr/include/i386-linux-gnu for Debian	2021-03-22 01:27:06 -07:00
Valeriy Savchenko	3085bda2b3	[analyzer][solver] Fix infeasible constraints (PR49642) Additionally, this patch puts an assertion checking for feasible constraints in every place where constraints are assigned to states. Differential Revision: https://reviews.llvm.org/D98948	2021-03-22 11:02:02 +03:00
Fangrui Song	874bdc8e61	[Driver] Clean up Debian multiarch /usr/include/<triplet> madness Debian multiarch additionally adds /usr/include/<triplet> and somehow Android borrowed the idea. (Note /usr/<triplet>/include is already an include dir...). On Debian, we should just assume a GCC installation is available and use its triple.	2021-03-21 22:40:38 -07:00
Fangrui Song	6a4fbf14ef	[test] Add test for cross compiling on Linux	2021-03-21 15:37:35 -07:00
Fangrui Song	72ac988dc7	[test] Delete obsoleted debian_multiarch_tree and ubuntu_13.04_multiarch_tree They are quite outdated. Delete them to avoid unnecessary test churn.	2021-03-21 15:37:34 -07:00
Roman Lebedev	e3a4701627	[clang][CodeGen] Lower Likelihood attributes to @llvm.expect intrin instead of branch weights `08196e0b2e` exposed LowerExpectIntrinsic's internal implementation detail in the form of LikelyBranchWeight/UnlikelyBranchWeight options to the outside. While this isn't incorrect from the results viewpoint, this is suboptimal from the layering viewpoint, and causes confusion - should transforms also use those weights, or should they use something else, D98898? So go back to status quo by making LikelyBranchWeight/UnlikelyBranchWeight internal again, and fixing all the code that used it directly, which currently is only clang codegen, thankfully, to emit proper @llvm.expect intrinsics instead.	2021-03-21 22:50:21 +03:00
Fangrui Song	2288a75d9e	[Driver] Linux.cpp: add -internal-isystem lib/../$triple/include With this change, for `#include <ar.h>`, `clang --target=aarch64-linux-gnu` will read `/usr/lib/gcc/aarch64-linux-gnu/10/../../../../aarch64-linux-gnu/include/ar.h` (on Debian gcc->gcc-cross) instead of `/usr/include/ar.h`. Some glibc headers (e.g. gnu/stubs.h) are different across architectures.	2021-03-21 00:56:03 -07:00
Fangrui Song	0ad0c476ef	[Driver] Gnu.cpp: remove unneeded -L detection hack for -mx32 Removing the hack actually improves our compatibility with gcc -mx32.	2021-03-20 20:12:45 -07:00
Fangrui Song	06d6b1471e	[Driver] Gnu.cpp: remove unneeded -L lib/gcc/$triple/$version/../../../$triple After path resolution, it duplicates a subsequent -L entry. The entry below (lib/gcc/$triple/$version/../../../../$OSLibDir) usually does not exist (e.g. Arch Linux; Debian cross gcc). When it exists, it typically just has ld.so (e.g. Debian native gcc) which cannot cause collision. Removing the -L (similar to reordering it) is therefore justified.	2021-03-20 18:50:14 -07:00
Fangrui Song	1fe1e996e9	[test] Delete "-internal-isystem" "/usr/local/include"	2021-03-20 15:24:02 -07:00
Fangrui Song	f628ba0b55	[test] Fix Driver/gcc-toolchain.cpp if CLANG_DEFAULT_RTLIB is compiler-rt	2021-03-20 13:24:49 -07:00
Fangrui Song	e92faa77b4	[test] Fix Driver/gcc-toolchain.cpp if CLANG_DEFAULT_CXX_STDLIB is libc++	2021-03-20 11:06:44 -07:00
Fangrui Song	dc3b438c8f	Revert "Revert "[Driver] Drop obsoleted Ubuntu 11.04 gcc detection"" This reverts commit `243333ef3e`.	2021-03-20 09:57:05 -07:00
David Zarzycki	243333ef3e	Revert "[Driver] Drop obsoleted Ubuntu 11.04 gcc detection" This reverts commit `bdf39e6b0e`. The change is failing on Fedora 33 (x86-64).	2021-03-20 07:29:01 -04:00
Fangrui Song	bed9933a46	[Driver][test] Fix gcc-toolchain.cpp on non-x86_64	2021-03-19 23:50:22 -07:00
Fangrui Song	bdf39e6b0e	[Driver] Drop obsoleted Ubuntu 11.04 gcc detection It has a very broken gcc installation path (usr/lib/i386-linux-gnu/gcc/i686-linux-gnu).	2021-03-19 23:23:28 -07:00
Fangrui Song	28d58d8fe2	[Driver] Stop searching other prefixes once a GCC installation is found in one prefix so that when --sysroot is specified, the detected GCC installation will not be overridden by another from /usr which happens to have a larger version. This behavior is particularly inconvenient when the system has a larger version GCC while the user wants to try out an older sysroot. Delete some tests from linux-ld.c which overlap with cross-linux.c	2021-03-19 20:35:59 -07:00
Fangrui Song	f9cac39930	[Driver] Delete compatibility aliases -mpie-copy-relocations and -mno-pie-copy-relocations They should be unused everywhere.	2021-03-19 17:47:30 -07:00
Fangrui Song	4c2da86410	[Driver] Suppress GCC detection under -B In GCC, if `-B $prefix` is specified, `$prefix` is used to find executable files and startup files. `$prefix/include` is added as an include search directory. Clang overloads -B with GCC installation detection semantics which make the behavior less predictable (due to the "largest GCC version wins" rule) and interact poorly with --gcc-toolchain (--gcc-toolchain can be overridden by -B). * `clang++ foo.cpp` detects GCC installation under `/usr`. * `clang++ --gcc-toolchain=Inputs foo.cpp` detects GCC installation under `Inputs`. * `clang++ -BA --gcc-toolchain=B foo.cpp` detects GCC installation under A and B and the larger version wins. With this patch, only B is used for detection. * `clang++ -BA foo.cpp` detects GCC installation under `A` and `/usr`, and the larger GCC version wins. With this patch `A` is not used for detection. This patch changes -B to drop the GCC detection semantics. Its executable searching semantics are preserved. --gcc-toolchain is the recommended option to specify the GCC installation detection directory. ( Note: Clang detects GCC installation in various target dependent directories. `$sysroot/usr` (sysroot defaults to "") is a common directory used by most targets. Such a directory is expected to contain something like `lib{,32,64}/gcc{,-cross}/$triple`. Clang will then construct library/include paths from the directory. ) Differential Revision: https://reviews.llvm.org/D97993	2021-03-19 15:42:18 -07:00
Benjamin Kramer	19d2c65ddd	[CodeGen] Don't crash on for loops with cond variables and no increment This looks like an oversight from `a875721d8a`, creating IR that refers to `for.inc` even if it doesn't exist. Differential Revision: https://reviews.llvm.org/D98980	2021-03-19 20:43:52 +01:00
Markus Böck	aafc3f7be8	[Driver] Add -print-runtime-dir This patch adds a new command line option to clang which outputs the directory containing clang's runtime libraries to stdout. The primary use case for this command line flag is for build systems using clang-cl. Build systems using clang-cl invoke the linker, that is either link or lld-link in this case, directly instead of invoking the compiler for the linking process as is common with the other drivers. This leads to issues when runtime libraries of clang, such as sanitizers or profiling, have to be linked in, as the compiler cannot communicate the link directory to the linker. Using this flag, build systems would be capable of getting the directory containing all of clang's runtime libraries and adding it to the linker path. Differential Revision: https://reviews.llvm.org/D98868	2021-03-19 17:48:03 +01:00
Maxim Kuvyrkov	2049fe5890	[WoA][MSVC] Use default linker setting in MSVC-compatible driver [take 2] At the moment "link.exe" is hard-coded as default linker in MSVC.cpp, so there's no way to use LLD as default linker for MSVC driver. This patch adds checking of CLANG_DEFAULT_LINKER to MSVC.cpp and updates unit-tests that expect link.exe linker to explicitly select it via -fuse-ld=link, so that buildbots and other builds that set -DCLANG_DEFAULT_LINKER=foobar don't fail these tests. This is a squash of - https://reviews.llvm.org/D98493 (MSVC.cpp change) and - https://reviews.llvm.org/D98862 (unit-tests change) Reviewed By: maxim-kuvyrkov Differential Revision: https://reviews.llvm.org/D98935	2021-03-19 13:38:03 +00:00
Aaron Ballman	fa4e72971e	Automate common diagnostic checking for statement attributes Clang currently automates a fair amount of diagnostic checking for declaration attributes based on the declarations in Attr.td. It checks for things like subject appertainment, number of arguments, language options, etc. This patch uses the same machinery to perform diagnostic checking on statement attributes.	2021-03-19 08:35:38 -04:00
Hongtao Yu	fc1812a0ad	[UniqueLinkageName] Use consistent checks when mangling symbol linkage name and debug linkage name. C functions may be declared and defined in different prototypes like below. This patch unifies the checks for mangling names in symbol linkage name emission and debug linkage name emission so that the two names are consistent. static int go(int); static int go(a) int a; { return a; } Test Plan: Differential Revision: https://reviews.llvm.org/D98799	2021-03-18 22:11:16 -07:00
Zequan Wu	1c740b29fa	[clang-cl] make -ffile-compilation-dir a CoreOption. Let clang-cl accept the `-ffile-compilation-dir` flag. Differential Revision: https://reviews.llvm.org/D98887	2021-03-18 13:20:47 -07:00
Thomas Lively	f5764a8654	[WebAssembly] Finalize SIMD names and opcodes Updates the names (e.g. widen => extend, saturate => sat) and opcodes of all SIMD instructions to match the finalized SIMD spec. Deliberately does not change the public interface in wasm_simd128.h yet; that will require more care. Depends on D98466. Differential Revision: https://reviews.llvm.org/D98676	2021-03-18 11:21:25 -07:00
Thomas Lively	2f2ae08da9	[WebAssembly] Remove experimental SIMD instructions Removes the instruction definitions, intrinsics, and builtins for qfma/qfms, signselect, and prefetch instructions, which were not included in the final WebAssembly SIMD spec. Depends on D98457. Differential Revision: https://reviews.llvm.org/D98466	2021-03-18 11:21:24 -07:00
Thomas Lively	8638c897f4	[WebAssembly] Remove unimplemented-simd target feature Now that the WebAssembly SIMD specification is finalized and engines are generally up-to-date, there is no need for a separate target feature for gating SIMD instructions that engines have not implemented. With this change, v128.const is now enabled by default with the simd128 target feature. Differential Revision: https://reviews.llvm.org/D98457	2021-03-18 10:23:12 -07:00
Mircea Trofin	92ccc6cb17	Reapply "[NPM][CGSCC] FunctionAnalysisManagerCGSCCProxy: do not clear immutable function passes" This reverts commit `11b70b9e3a`. The bot failure was due to ArgumentPromotion deleting functions without deleting their analyses. This was separately fixed in `4b1c807`.	2021-03-18 09:44:34 -07:00
Mike Rice	c2f8e158f5	[OPENMP51]Support for the 'destroy' clause with interop variable. Added basic parsing/sema/serialization support to extend the existing 'destroy' clause for use with the 'interop' directive. Differential Revision: https://reviews.llvm.org/D98834	2021-03-18 09:12:56 -07:00
Sven van Haastregt	c5c4a88a84	[OpenCL] Remove spurious atomic_fetch tablegen builtins The `int` and `long` versions of these builtins already provide the necessary overloads for `intptr_t` and `uintptr_t` arguments, as `ASTContext` defines `atomic_(u)intptr_t` in terms of the `int` or `long` types. Prior to this patch, calls to those builtins with particular argument types resulted in call-is-ambiguous errors. Differential Revision: https://reviews.llvm.org/D98520	2021-03-18 12:17:12 +00:00
Thomas Preud'homme	e5cd5b352f	[test] Fix variable definition in acle_sve_ld1.sh Clang test acle_sve_ld1.sh is missing the colon in one of the string variable definitions separating the variable name from the regex. This leads the substitution block to be parsed as a numeric variable use. Reviewed By: sdesmalen Differential Revision: https://reviews.llvm.org/D98852	2021-03-18 12:15:45 +00:00
Elizabeth Andrews	d8b8f544d9	[Reland] "Do not apply calling conventions to MSVC entry points" This patch is a second attempt at fixing a link error for MSVC entry points when calling conventions are specified using a flag. Calling conventions specified using flags should not be applied to MSVC entry points. The default calling convention is set in this case. The default calling convention for MSVC entry points main and wmain is cdecl. For WinMain, wWinMain and DllMain, the default calling convention is stdcall on 32 bit Windows. Explicitly specified calling conventions are applied to MSVC entry points. For MinGW, the default calling convention for all MSVC entry points is cdecl. First attempt: `4cff1b40da` Revert of first attempt: `bebfc3b92d` Differential Revision: https://reviews.llvm.org/D97941	2021-03-18 04:26:47 -07:00
Valeriy Savchenko	4a7afc9a88	[-Wcalled-once-parameter] Fix false positives for cleanup attr Cleanup attribute allows users to attach destructor-like functions to variable declarations to be called whenever they leave the scope. The logic of such functions is not supported by Clang's CFG and is too hard to reason about. In order to avoid false positives in this situation, we assume that we didn't see ALL of the execution paths of the function and, thus, can warn only about multiple call violation. rdar://74441906 Differential Revision: https://reviews.llvm.org/D98694	2021-03-18 12:32:16 +03:00
Valeriy Savchenko	f1a7d5a7b0	[-Wcalled-once-parameter] Harden analysis in terms of block use This patch introduces a very simple inter-procedural analysis between blocks and enclosing functions. We always analyze blocks first (analysis is done as part of semantic analysis that goes side-by-side with the parsing process), and at the moment of reporting we don't know how that block will be actually used. This patch introduces new logic delaying reports of the "never called" warnings on blocks. If we are not sure that the block will be called exactly once, we shouldn't warn our users about that. Double calls, however, don't require such delays. While analyzing the enclosing function, we can actually decide what we should do with those warnings. Additionally, as a side effect, we can be more confident about blocks in such context and can treat them not as escapes, but as direct calls. rdar://74090107 Differential Revision: https://reviews.llvm.org/D98688	2021-03-18 12:12:18 +03:00
Artem Dergachev	c75b2261a0	[analyzer] Introduce common bug category "Unused code". This category is generic enough to hold a variety of checkers. Currently it contains the Dead Stores checker and an alpha unreachable code checker. Differential Revision: https://reviews.llvm.org/D98741	2021-03-17 20:58:27 -07:00
Zakk Chen	be947aded0	[RISCV][Clang] Add RVV vle/vse intrinsic functions. Add new field PermuteOperands to map the different operand order between the C/C++ API and the clang builtin. Reviewed By: craig.topper, rogfer01 Authored-by: Roger Ferrer Ibanez <rofirrim@gmail.com> Co-Authored-by: Hsiangkai Wang <kai.wang@sifive.com> Co-Authored-by: Zakk Chen <zakk.chen@sifive.com> Differential Revision: https://reviews.llvm.org/D98388	2021-03-17 20:31:25 -07:00
Zakk Chen	95c0125f2b	[Clang][RISCV] Add rvv vsetvl and vsetvlmax intrinsic functions. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D96843	2021-03-17 20:26:06 -07:00
Alex Lorenz	d672d5219a	Revert "[CodeGenModule] Set dso_local for Mach-O GlobalValue" This reverts commit `809a1e0ffd`. Mach-O doesn't support dso_local and this change broke XNU because of the use of dso_local. Differential Revision: https://reviews.llvm.org/D98458	2021-03-17 17:27:41 -07:00
Richard Smith	3315bd0beb	PR49619: Remove delayed call to noteFailed. This would assert if we hit the evaluation step limit between starting to delay the call and finishing. In any case, delaying the call was largely pointless as it doesn't really matter when we mark the evaluation as having had side effects.	2021-03-17 17:25:18 -07:00
Richard Smith	a875721d8a	PR49585: Emit the jump destination for a for loop 'continue' from within the scope of the condition variable. The condition variable is in scope in the loop increment, so we need to emit the jump destination from within the scope of the condition variable. For GCC compatibility (and compatibility with real-world 'FOR_EACH' macros), 'continue' is permitted in a statement expression within the condition of a for loop, though, so there are two cases here: * If the for loop has no condition variable, we can emit the jump destination before emitting the condition. * If the for loop has a condition variable, we must defer emitting the jump destination until after emitting the variable. We diagnose a 'continue' appearing in the initializer of the condition variable, because it would jump past the initializer into the scope of that variable. Reviewed By: rjmccall Differential Revision: https://reviews.llvm.org/D98816	2021-03-17 16:24:04 -07:00
Mike Rice	c615927c8e	[OPENMP51]Initial support for the use clause. Added basic parsing/sema/serialization support for the 'use' clause. Differential Revision: https://reviews.llvm.org/D98815	2021-03-17 15:46:14 -07:00
Thomas Preud'homme	2426b1fa66	[Test] Fix undef var in attr-speculative-load-hardening.c Fix use of undefined variable in CHECK-NOT directive in clang test CodeGen/attr-speculative-load-hardening.c. Reviewed By: kristof.beyls Differential Revision: https://reviews.llvm.org/D93347	2021-03-17 19:12:25 +00:00
Mike Rice	410f09af09	[OPENMP51]Initial support for the interop directive. Added basic parsing/sema/serialization support for interop directive. Support for the 'init' clause. Differential Revision: https://reviews.llvm.org/D98558	2021-03-17 09:42:07 -07:00
Aaron Ballman	7bafe336a1	Fixing a test case that was missed in `c165a99a1b`	2021-03-17 08:46:04 -04:00
Aaron Ballman	c165a99a1b	[SYCL] Rework the SYCL driver options SYCL compilations initiated by the driver will spawn off one or more frontend compilation jobs (one for device and one for host). This patch reworks the driver options to make upstreaming this from the downstream SYCL fork easier. This patch introduces a language option to identify host executions (SYCLIsHost) and a -cc1 frontend option to enable this mode. -fsycl and -fno-sycl become driver-only options that are rejected when passed to -cc1. This is because the frontend and beyond should be looking at whether the user is doing a device or host compilation specifically. Because the frontend should only ever be in one mode or the other, -fsycl-is-device and -fsycl-is-host are mutually exclusive options.	2021-03-17 08:27:19 -04:00
Aaron Ballman	ecfa874531	Update diagnostic groups for pre-compat warnings As a follow-up to D95691, add new diagnostic groups named pre-c++N-compat to replace the old diagnostic groups with the standards listed out explicitly. The old group names are retained for backwards compatibility.	2021-03-17 07:52:34 -04:00
Bradley Smith	cf0da91ba5	[AArch64][SVE/NEON] Add support for FROUNDEVEN for both NEON and fixed length SVE Previously NEON used a target specific intrinsic for frintn, given that the FROUNDEVEN ISD node now exists, move over to that instead and add codegen support for that node for both NEON and fixed length SVE. Differential Revision: https://reviews.llvm.org/D98487	2021-03-17 11:41:22 +00:00
Jay Foad	967b64beb4	[AMDGPU] Split dot2-insts feature Split out some of the instructions predicated on the dot2-insts target feature into a new dot7-insts, in preparation for subtargets that have some but not all of these instructions. NFCI. Differential Revision: https://reviews.llvm.org/D98717	2021-03-17 09:42:21 +00:00
Vassil Vassilev	0cb7e7ca0c	Make iteration over the DeclContext::lookup_result safe. The idiom: ``` DeclContext::lookup_result R = DeclContext::lookup(Name); for (auto D : R) {...} ``` is not safe when in the loop body we trigger deserialization from an AST file. The deserialization can insert new declarations in the StoredDeclsList whose underlying type is a vector. When the vector decides to reallocate its storage the pointer we hold becomes invalid. This patch replaces a SmallVector with a singly-linked list. The current approach stores a SmallVector<NamedDecl, 4> which is around 8 pointers. The linked list is 3, 5, or 7. We do better in terms of memory usage for small cases (and worse in terms of locality -- the linked list entries won't be near each other, but will be near their corresponding declarations, and we were going to fetch those memory pages anyway). For larger cases: the vector uses a doubling strategy for reallocation, so will generally be between half-full and full. Let's say it's 75% full on average, so there's N * 4/3 + 4 pointers' worth of space allocated currently and will be 2N pointers with the linked list. So we break even when there are N=6 entries and slightly lose in terms of memory usage after that. We suspect that's still a win on average. Thanks to @rsmith! Differential revision: https://reviews.llvm.org/D91524	2021-03-17 08:59:04 +00:00
Valeriy Savchenko	c86dacd1a4	[-Wcalled-once-parameter] Let escapes overwrite MaybeCalled states This commit makes escapes symmetrical, meaning that having escape before and after the branching, where parameter is not called on one of the paths, will have the same effect. Differential Revision: https://reviews.llvm.org/D98622	2021-03-17 11:12:55 +03:00
Bing1 Yu	320b72e9cd	[X86][AMX] Rename amx-bf16 intrinsic according to correct naming convention __tile_tdpbf16ps should be renamed to __tile_dpbf16ps Reviewed By: pengfei Differential Revision: https://reviews.llvm.org/D98685	2021-03-17 11:22:52 +08:00
Giorgis Georgakoudis	a80a33e8b5	[Utils] Support lit-like substitutions in update_cc_test_checks Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D98712	2021-03-16 10:36:22 -07:00
Fangrui Song	6ab8927931	[RISCV] Support clang -fpatchable-function-entry && GNU function attribute 'patchable_function_entry' Similar to D72215 (AArch64) and D72220 (x86). ``` % clang -target riscv32 -march=rv64g -c -fpatchable-function-entry=2 a.c && llvm-objdump -dr a.o ... 0000000000000000 <main>: 0: 13 00 00 00 nop 4: 13 00 00 00 nop % clang -target riscv32 -march=rv64gc -c -fpatchable-function-entry=2 a.c && llvm-objdump -dr a.o ... 00000002 <main>: 2: 01 00 nop 4: 01 00 nop ``` Recently the mainline kernel started to use -fpatchable-function-entry=8 for riscv (https://git.kernel.org/linus/afc76b8b80112189b6f11e67e19cf58301944814). Differential Revision: https://reviews.llvm.org/D98610	2021-03-16 10:02:35 -07:00
Sam McCall	128ce70eef	[CodeCompletion] Avoid spurious signature help for init-list args Somewhat surprisingly, signature help is emitted as a side-effect of computing the expected type of a function argument. The reason is that both actions require enumerating the possible function signatures and running partial overload resolution, and doing this twice would be wasteful and complicated. Change #1: document this, it's subtle :-) However, sometimes we need to compute the expected type without having reached the code completion cursor yet - in particular to allow completion of designators. `eb4ab3358c` did this but introduced a regression - it emits signature help in the wrong location as a side-effect. Change #2: only emit signature help if the code completion cursor was reached. Currently there is PP.isCodeCompletionReached(), but we can't use it because it's set after running code completion. It'd be nice to set this implicitly when the completion token is lexed, but ConsumeCodeCompletionToken() makes this complicated. Change #3: call cutOffParsing() first when seeing a completion token. After this, the names of the Sema::Produce*SignatureHelp() functions are even more confusing, as they only sometimes do that. I don't want to rename them in this patch as it's another large mechanical change, but we should soon. Change #4: prepare to rename ProduceSignatureHelp() to GuessArgumentType() etc. Differential Revision: https://reviews.llvm.org/D98488	2021-03-16 12:46:40 +01:00
Pushpinder Singh	fc12a64ecc	[OpenMP][AMDGPU] Skip backend and assemble phases for amdgcn Remove emit-llvm-bc from addClangTargetOptions as it conflicts with -E for save-temps. AMDGCN does not yet support linking object files so backend and assemble actions are skipped, leaving LLVM IR as the output format. Reviewed By: JonChesterfield, ronlieb Differential Revision: https://reviews.llvm.org/D96769	2021-03-16 04:58:14 +00:00
Amy Huang	f5352dd9da	Emit inline implementation of __builtin_wmemchr on MSVCRT platforms. The MSVC runtime library doesn't have a definition for wmemchr, so provide an inline implementation. Differential Revision: https://reviews.llvm.org/D98472	2021-03-15 15:30:55 -07:00
diggerlin	d1f1bff81b	[AIX][XCOFF] Fix the test cases that failed on AIX because -mignore-xcoff-visibility is enabled by default. Summary: Because we enable -mignore-xcoff-visibility by default on AIX when no -fvisibility option is passed to clang, some test cases fail on AIX. To disable -mignore-xcoff-visibility, we need to add -fvisibility=default to those test cases. Reviewers: hubert.reinterpretcast daltenty Differential Revision: https://reviews.llvm.org/D98660	2021-03-15 17:33:02 -04:00
Jonas Paulsson	9cfd301ec8	[SystemZ] Test for isinf and isfinite in testFPKind(). Recognize BI__builtin_isinf and BI__builtin_isfinite (and a few other opcodes for finite) in testFPKind() and handle with TDC. Review: Ulrich Weigand. Differential Revision: https://reviews.llvm.org/D97901	2021-03-15 15:02:39 -06:00
Stefan Pintilie	86f2a3d178	[PowerPC] Add __PCREL__ when PC Relative is enabled. This patch adds the `__PCREL__` define when PC Relative addressing is enabled. Reviewed By: nemanjai, #powerpc Differential Revision: https://reviews.llvm.org/D98546	2021-03-15 15:13:02 -05:00
Markus Böck	af2796c76d	[test] Add ability to get error messages from CMake for errc substitution Visual Studio's implementation of the C++ Standard Library does not use strerror to produce a message for std::error_code, unlike other standard libraries such as libstdc++ or libc++ that might be used. This patch adds a CMake script that, by running a C++ program, gets the error messages for the POSIX error codes and passes them on to lit through an optional config parameter. If the config parameter is not set, or getting the messages failed, due to say a cross compiling configuration without an emulator, it will fall back to using Python's strerror functions. Differential Revision: https://reviews.llvm.org/D98278	2021-03-15 20:56:08 +01:00
Stelios Ioannou	ab86edbc88	[AArch64] Implement __rndr, __rndrrs intrinsics This patch implements the __rndr and __rndrrs intrinsics to provide access to the random number instructions introduced in Armv8.5-A. They are only defined for the AArch64 execution state and are available when __ARM_FEATURE_RNG is defined. These intrinsics store the random number in their pointer argument and return a status code indicating whether the generation succeeded. The difference between __rndr and __rndrrs is that the latter intrinsic reseeds the random number generator. The instructions write the NZCV flags indicating the success of the operation, which we can then read with a CSET. [1] https://developer.arm.com/docs/101028/latest/data-processing-intrinsics [2] https://bugs.llvm.org/show_bug.cgi?id=47838 Differential Revision: https://reviews.llvm.org/D98264 Change-Id: I8f92e7bf5b450e5da3e59943b53482edf0df6efc	2021-03-15 17:51:48 +00:00
serge-sans-paille	4aa510be78	Allow __ieee128 as an alias to __float128 on ppc This matches gcc behavior. Differential Revision: https://reviews.llvm.org/D97846	2021-03-15 18:28:26 +01:00
Luke Drummond	fcfd3fda71	[OpenCL] Respect calling convention for builtin `__translate_sampler_initializer` has a calling convention of `spir_func`, but clang generated calls to it using the default CC. Instruction Combining was lowering these mismatching calling conventions to `store i1* undef`, which itself was subsequently lowered to a trap instruction by simplifyCFG, resulting in runtime `SIGILL`. There are arguably two bugs here, but whether there's any wisdom in converting an obviously invalid call into a runtime crash over aborting with a sensible error message will require further discussion. So for now it's enough to set the right calling convention on the runtime helper. Reviewed By: svenh, bader Differential Revision: https://reviews.llvm.org/D98411	2021-03-15 17:26:51 +00:00
Melanie Blower	33b1f3f42c	[clang][patch] Solve PR49479, File scope fp pragma should propagate to functions nested in struct, and initialization expressions Previously, the CurFPFeatures state was set to command line settings before semantic analysis of the nested member functions and initialization expressions, that's not correct, it should use the pragma state which is in effect at the lexical position. Reviewed By: Erich Keane, Aaron Ballman Differential Revision: https://reviews.llvm.org/D98211	2021-03-15 12:15:20 -04:00
Thomas Preud'homme	f60b35340f	Stop trapping on sNaN in __builtin_isinf __builtin_isinf currently generates a floating-point compare operation which triggers a trap when faced with a signaling NaN in StrictFP mode. This commit uses integer operations instead to not generate any trap in such a case. Reviewed By: mibintc Differential Revision: https://reviews.llvm.org/D97125	2021-03-15 15:38:08 +00:00
David Green	2b3c813143	[Clang][ARM] Reenable arm_acle.c test. This test was apparently disabled in `6fcd4e080f`, without any sign of how it was going to be reenabled. This patch rewrites the test to use update_cc_test_checks, with midend optimizations other than mem2reg disabled. The first attempt of this patch in `5ae949a927` failed on bots even though it worked locally. I've attempted to adjust the RUN lines and made the test AArch64/ARM specific. Differential Revision: https://reviews.llvm.org/D98510	2021-03-14 10:59:24 +00:00
Giorgis Georgakoudis	1ce846be04	Replace func name with regex for update test scripts The patch adds an argument to update test scripts, such as update_cc_test_checks, for replacing a function name matching a regex. This functionality is needed to match generated function signatures that include file hashes. Example: The function signature for the following function: `__omp_offloading_50_b84c41e__Z9ftemplateIiET_i_l30_worker` with `--replace-function-regex "__omp_offloading_[0-9]+_[a-z0-9]+_(.*)"` will become: `CHECK-LABEL: @{{__omp_offloading_[0-9]+_[a-z0-9]+__Z9ftemplateIiET_i_l30_worker}}(` Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D97107	2021-03-12 17:37:09 -08:00
Giorgis Georgakoudis	9f9a4dfda7	Revert "Replace func name with regex for update test scripts" This reverts commit `5eaf70afb5`.	2021-03-12 17:20:00 -08:00
Giorgis Georgakoudis	5eaf70afb5	Replace func name with regex for update test scripts The patch adds an argument to update test scripts, such as update_cc_test_checks, for replacing a function name matching a regex. This functionality is needed to match generated function signatures that include file hashes. Example: The function signature for the following function: `__omp_offloading_50_b84c41e__Z9ftemplateIiET_i_l30_worker` with `--replace-function-regex "__omp_offloading_[0-9]+_[a-z0-9]+_(.*)"` will become: `CHECK-LABEL: @{{__omp_offloading_[0-9]+_[a-z0-9]+__Z9ftemplateIiET_i_l30_worker}}(` Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D97107	2021-03-12 17:00:42 -08:00
Matheus Izvekov	d4a8c7359b	[clang] Fix ICE on invalid type parameters for concepts See PR48593. Constraints with invalid type parameters were causing a null pointer dereference. Signed-off-by: Matheus Izvekov <mizvekov@gmail.com> Reviewed By: rsmith Differential Revision: https://reviews.llvm.org/D98095	2021-03-13 01:23:02 +01:00
Matheus Izvekov	c9fd92d573	[clang] Improve diagnostics on implicitly deleted defaulted comparisons This patch just makes the error message clearer by reinforcing the cause was a lack of viable three-way comparison function for the complete object. Signed-off-by: Matheus Izvekov <mizvekov@gmail.com> Reviewed By: rsmith Differential Revision: https://reviews.llvm.org/D97990	2021-03-13 01:13:52 +01:00
Amy Huang	d7cd208f08	[DebugInfo] Add an attribute to force type info to be emitted for types that are required to be complete. This was motivated by the fact that constructor type homing (debug info optimization that we want to turn on by default) drops some libc++ types, so an attribute would allow us to override constructor homing and emit them anyway. I'm currently looking into the particular libc++ issue, but even if we do fix that, this issue might come up elsewhere and it might be nice to have this. As I've implemented it now, the attribute isn't specific to the constructor homing optimization and overrides all of the debug info optimizations. Open to discussion about naming, specifics on what the attribute should do, etc. Differential Revision: https://reviews.llvm.org/D97411	2021-03-12 12:30:01 -08:00
Anastasia Stulova	eed88e91f3	[OpenCL] Use spir target for CIndex tests for OpenCL. This fixes failing bots. Patch by azabaznov (Anton Zabaznov)! Differential Revision: https://reviews.llvm.org/D98539	2021-03-12 20:11:26 +00:00
Nico Weber	d7b7e2026b	Revert "[Clang][ARM] Reenable arm_acle.c test." This reverts commit `5ae949a927`. Test fails everywhere.	2021-03-12 14:37:37 -05:00
David Green	5ae949a927	[Clang][ARM] Reenable arm_acle.c test. This test was apparently disabled in `6fcd4e080f`, without any sign of how it was going to be reenabled. This patch rewrites the test to use update_cc_test_checks, with midend optimizations other than mem2reg disabled.	2021-03-12 19:21:21 +00:00
Nemanja Ivanovic	b5fae4b9b2	[PowerPC] Add more missing overloads to altivec.h We are missing more predicate forms for 'vector double' and some tests. This adds the missing overloads and completes the set of test cases for them.	2021-03-12 10:51:57 -06:00
Valeriy Savchenko	6dc1523508	[analyzer][solver] Prevent infeasible states (PR49490) This patch fixes the situation when our knowledge of disequalities can help us figuring out that some assumption is infeasible, but the solver still produces a state with inconsistent constraints. Additionally, this patch adds a couple of assertions to catch this type of problems easier. Differential Revision: https://reviews.llvm.org/D98341	2021-03-12 15:56:48 +03:00
Hans Wennborg	f50aef745c	Revert "[InstrProfiling] Don't generate __llvm_profile_runtime_user" This broke the check-profile tests on Mac, see comment on the code review. > This is no longer needed, we can add __llvm_profile_runtime directly > to llvm.compiler.used or llvm.used to achieve the same effect. > > Differential Revision: https://reviews.llvm.org/D98325 This reverts commit `c7712087cb`. Also reverting the dependent follow-up commit: Revert "[InstrProfiling] Generate runtime hook for ELF platforms" > When using -fprofile-list to selectively apply instrumentation only > to certain files or functions, we may end up with a binary that doesn't > have any counters in the case where no files were selected. However, > because on Linux and Fuchsia, we pass -u__llvm_profile_runtime, the > runtime would still be pulled in and incur some non-trivial overhead, > especially in the case when the continuous or runtime counter relocation > mode is being used. A better way would be to pull in the profile runtime > only when needed by declaring the __llvm_profile_runtime symbol in the > translation unit only when needed. > > This approach was already used prior to `9a041a7522`, but we changed it > to always generate the __llvm_profile_runtime due to a TAPI limitation. > Since TAPI is only used on Mach-O platforms, we could use the early > emission of __llvm_profile_runtime there, and on other platforms we > could change back to the earlier approach where the symbol is generated > later only when needed. We can stop passing -u__llvm_profile_runtime to > the linker on Linux and Fuchsia since the generated undefined symbol in > each translation unit that needed it serves the same purpose. > > Differential Revision: https://reviews.llvm.org/D98061 This reverts commit `87fd09b25f`.	2021-03-12 13:53:46 +01:00
Aaron Ballman	e448310059	Add support for digit separators in C2x. WG14 adopted N2626 at the meetings this week. This commit adds support for using ' as a digit separator in a numeric literal which is compatible with the C++ feature.	2021-03-12 07:21:03 -05:00
Anton Zabaznov	840643bbe1	[OpenCL] Refactor diagnostic for OpenCL extension/feature There is no need to check for enabled pragma for core or optional core features, thus this check is removed Reviewed By: Anastasia Differential Revision: https://reviews.llvm.org/D97058	2021-03-12 11:43:53 +03:00
Johannes Doerfert	49ed3032ff	Revert "[OpenMP] Do not propagate match extensions to nested contexts" Two tests failed for some reason, need to investigate: https://lab.llvm.org/buildbot/#/builders/109/builds/10399 This reverts commit `ad9e98b8ef`.	2021-03-11 23:48:36 -06:00
Johannes Doerfert	0fe0d114e4	Revert "[OpenMP] Introduce the `disable_selector_propagation` variant selector trait" Need to revert `ad9e98b8ef` which this commit depends on. This reverts commit f771ef7b5f0ed260d00931cd50e6fe462edbacaf.	2021-03-11 23:48:35 -06:00
Johannes Doerfert	b2642456ab	[OpenMP] Introduce the `disable_selector_propagation` variant selector trait Nested `omp [begin\|end] declare variant` inherit the selectors from surrounding `omp (begin\|end) declare variant` constructs. To stop such propagation the user can add the `disable_selector_propagation` to the `extension` set in the `implementation` selector. Reviewed By: tianshilei1992 Differential Revision: https://reviews.llvm.org/D95765	2021-03-11 23:31:25 -06:00
Johannes Doerfert	ad9e98b8ef	[OpenMP] Do not propagate match extensions to nested contexts If we have nested declare variant context, it doesn't make sense to inherit the match extension from the parent. Instead, just skip it. Reviewed By: JonChesterfield Differential Revision: https://reviews.llvm.org/D95764	2021-03-11 23:31:21 -06:00
Johannes Doerfert	cd1bd6e587	[Utils] Check for more global information in update_test_checks This allows to check for various globals (metadata/attributes/...) and also resolves problems with globals (metadata/attributes/...) being reused across different prefixes. Reviewed By: sstefan1 Differential Revision: https://reviews.llvm.org/D94741	2021-03-11 23:31:16 -06:00
Sriraman Tallam	cdb42a4cc4	Disable unique linkage suffixes for global vars until demanglers can be fixed. D96109 added support for unique internal linkage names for both internal linkage functions and global variables. There was a lot of discussion on how to get the demangling right for functions but I completely missed the point that demanglers do not support suffixes for global vars. For example: $ c++filt _ZL3foo foo $ c++filt _ZL3foo.uniq.123 _ZL3foo.uniq.123 The demangling for functions works as expected. I am not sure of the impact of this. I don't understand how debuggers and other tools depend on the correctness of global variable demangling so I am pre-emptively disabling it until we can get the demangling support added. Importantly, uniquifying global variables is not needed right now as we do not do profile attribution to global vars based on sampling. It was added for completeness and so this feature is not exactly missed. Differential Revision: https://reviews.llvm.org/D98392	2021-03-11 20:59:30 -08:00
Mircea Trofin	11b70b9e3a	Revert "[NPM][CGSCC] FunctionAnalysisManagerCGSCCProxy: do not clear immutable function passes" This reverts commit `5eaeb0fa67`. It appears there are analyses that assume clearing - example: https://lab.llvm.org/buildbot#builders/36/builds/5964	2021-03-11 18:31:19 -08:00
Mircea Trofin	5eaeb0fa67	[NPM][CGSCC] FunctionAnalysisManagerCGSCCProxy: do not clear immutable function passes Check with the analysis result by calling invalidate instead of clear on the analysis manager. Differential Revision: https://reviews.llvm.org/D98440	2021-03-11 18:15:28 -08:00
Florian Hahn	c92ec0dd92	[Matrix] Add support for matrix-by-scalar division. This patch extends the matrix spec to allow matrix-by-scalar division. Originally support for `/` was left out to avoid ambiguity for the matrix-matrix version of `/`, which could either be elementwise or specified as matrix multiplication M1 * (1/M2). For the matrix-scalar version, no ambiguity exists; `*` is also an elementwise operation in that case. Matrix-by-scalar division is commonly supported by systems including Matlab, Mathematica or NumPy. Reviewed By: rjmccall Differential Revision: https://reviews.llvm.org/D97857	2021-03-11 22:21:23 +00:00
Petr Hosek	87fd09b25f	[InstrProfiling] Generate runtime hook for ELF platforms When using -fprofile-list to selectively apply instrumentation only to certain files or functions, we may end up with a binary that doesn't have any counters in the case where no files were selected. However, because on Linux and Fuchsia, we pass -u__llvm_profile_runtime, the runtime would still be pulled in and incur some non-trivial overhead, especially in the case when the continuous or runtime counter relocation mode is being used. A better way would be to pull in the profile runtime only when needed by declaring the __llvm_profile_runtime symbol in the translation unit only when needed. This approach was already used prior to `9a041a7522`, but we changed it to always generate the __llvm_profile_runtime due to a TAPI limitation. Since TAPI is only used on Mach-O platforms, we could use the early emission of __llvm_profile_runtime there, and on other platforms we could change back to the earlier approach where the symbol is generated later only when needed. We can stop passing -u__llvm_profile_runtime to the linker on Linux and Fuchsia since the generated undefined symbol in each translation unit that needed it serves the same purpose. Differential Revision: https://reviews.llvm.org/D98061	2021-03-11 12:29:01 -08:00
Joseph Huber	807466ef28	[OpenMP] Restore backwards compatibility for libomptarget Summary: The changes introduced in D87946 changed the API for libomptarget functions. `__kmpc_push_target_tripcount` was a function in Clang 11.x but was not given a backward-compatible interface. This change will require people using Clang 13.x or 12.x to recompile their offloading programs. Reviewed By: jdoerfert cchen Differential Revision: https://reviews.llvm.org/D98358	2021-03-11 09:52:11 -05:00
Nathan James	cb559c8d5e	[Sema] Add some basic lambda capture fix-its Adds fix-its when users forget to explicitly capture variables or this in lambdas Addresses https://github.com/clangd/clangd/issues/697 Reviewed By: kbobyrev Differential Revision: https://reviews.llvm.org/D96975	2021-03-11 13:46:25 +00:00
Olivier Goffart	5baea05601	[SEH] Fix capture of this in lambda functions Commit `1b04bdc2f3` added support for capturing the 'this' pointer in a SEH context (__finally or __except), but the case in which the 'this' pointer is part of a lambda capture was not handled properly. Differential Revision: https://reviews.llvm.org/D97687	2021-03-11 09:12:42 +01:00
Zakk Chen	d6a0560bf2	[Clang][RISCV] Add custom TableGen backend for riscv-vector intrinsics. Demonstrate how to generate vadd/vfadd intrinsic functions 1. add -gen-riscv-vector-builtins for clang builtins. 2. add -gen-riscv-vector-builtin-codegen for clang codegen. 3. add -gen-riscv-vector-header for riscv_vector.h. It also generates ifdef directives with extension checking, based on D94403. 4. add -gen-riscv-vector-generic-header for riscv_vector_generic.h. Generate overloading version Header for generic api. https://github.com/riscv/rvv-intrinsic-doc/blob/master/rvv-intrinsic-rfc.md#c11-generic-interface 5. update tblgen doc for riscv related options. riscv_vector.td also defines some unused type transformers for vadd, because I think it could demonstrate how type transformers work and we need them for the whole intrinsic functions implementation in the future. Authored-by: Roger Ferrer Ibanez <rofirrim@gmail.com> Co-Authored-by: Zakk Chen <zakk.chen@sifive.com> Reviewed By: jrtc27, craig.topper, HsiangKai, Jim, Paul-C-Anagnostopoulos Differential Revision: https://reviews.llvm.org/D95016	2021-03-10 18:43:43 -08:00
Leonard Chan	70af0bf6fe	[clang][Driver] Expose -fexperimental-relative-c++-abi-vtables flag Initially, this flag was meant to only be used through cc1 and not directly through the clang driver. However, we accidentally ended up using this flag as a driver flag already for selecting multilibs within the fuchsia toolchain. We're currently in an awkward state where it's only accepted as a driver flag when targeting Fuchsia, and all other instances it can only be added via -Xclang. Since we're ready to use this in Fuchsia, we can just expose this to the driver for simplicity. Differential Revision: https://reviews.llvm.org/D98375	2021-03-10 16:28:40 -08:00
Giorgis Georgakoudis	ecf68972fd	Revert "Replace func name with regex in update_cc_test_checks" This reverts commit `bf58d6a1f9`. Breaks tests, fix	2021-03-10 15:05:35 -08:00
zoecarver	a89ac0dd18	Update __is_unsigned builtin to match the Standard. Updates __is_unsigned to have the same behavior as the standard specifies. This is in line with `511dbd8`, which applied the same change to __is_signed. Refs D67897. Differential Revision: https://reviews.llvm.org/D98104	2021-03-10 15:00:26 -08:00
Giorgis Georgakoudis	bf58d6a1f9	Replace func name with regex in update_cc_test_checks The patch adds an argument to update_cc_test_checks for replacing a function name matching a regex. This functionality is needed to match generated function signatures that include file hashes. Example: The function signature for the following function: `__omp_offloading_50_b84c41e__Z9ftemplateIiET_i_l30_worker` with `--replace-function-regex "__omp_offloading_[0-9]+_[a-z0-9]+_(.*)"` will become: `CHECK-LABEL: @{{__omp_offloading_[0-9]+_[a-z0-9]+__Z9ftemplateIiET_i_l30_worker}}(` Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D97107	2021-03-10 12:57:35 -08:00
Giorgis Georgakoudis	a2abe2259c	Run non-filechecked commands in update_cc_test_checks.py Some tests in clang require running non-filechecked commands to generate the actual filecheck input. For example, tests for openmp offloading require generating the host bc without any checking, before running the clang command to actually generate the filechecked IR of the target device. This patch enables `update_cc_test_checks.py` to run non-filechecked run lines in-place. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D97068	2021-03-10 12:25:35 -08:00
Arthur Eubanks	c8227f06b3	[clang] Don't assert in EmitAggregateCopy on trivial_abi types Fixes PR42961. Reviewed By: rnk Differential Revision: https://reviews.llvm.org/D97872	2021-03-10 10:16:06 -08:00
Jingu Kang	25951c5ab8	[AArch64] Add missing intrinsics for scalar FP rounding Differential Revision: https://reviews.llvm.org/D98269	2021-03-10 13:22:29 +00:00
Balazs Benics	a94ac467c2	[analyzer][CTU][NFC] Fix "Add an extra regression test" As thakis reported, I will replace `rm -r` by `rm -rf`. I hope it fixes the build bot.	2021-03-10 13:07:49 +01:00
Adam Balogh	bcc662484a	[analyzer] Crash fix for alpha.cplusplus.IteratorRange If the non-iterator side of an iterator operation `+`, `+=`, `-` or `-=` is `UndefinedVal`, an assertion is triggered. This small fix prevents this. Patch by Adam Balogh. Reviewed By: NoQ Differential Revision: https://reviews.llvm.org/D85424	2021-03-10 12:42:24 +01:00
Balazs Benics	0e0ea9ffb8	[analyzer][CTU][NFC] Add an extra regression test Before `bc713f6a004723d1325bc16e1efc32d0ac82f939` landed, the analyzer crashed on this reduced example. It seems important to have both `ctu` and `-analyzer-opt-analyze-headers` enabled in the example. This test file ensures that no regression happens in the future in this regard. Reviewed By: martong, NoQ Differential Revision: https://reviews.llvm.org/D96586	2021-03-10 12:42:24 +01:00
Balazs Benics	0dc0e2a9ab	[analyzer][NFC] Add more tests for ArrayBoundCheckerV2 According to a Bugzilla ticket (https://bugs.llvm.org/show_bug.cgi?id=45148), ArrayBoundCheckerV2 produces a false-positive report. This patch adds a test demonstrating the current //flawed// behavior. Also adds several similar test cases just to be on the safe side. Reviewed By: martong Differential Revision: https://reviews.llvm.org/D86870	2021-03-10 12:42:23 +01:00
Sven van Haastregt	6f912a2cd4	[OpenCL] Set calling convention for -fdeclare-opencl-builtins IR produced using TableGen builtin function declarations (`fdeclare-opencl-builtins.cl`) did not have the target's calling convention applied to builtin calls. Fix this, and update the codegen test to check that IR produced using opencl-c.h and `-fdeclare-opencl-builtins` is identical with respect to the builtin calls. Differential Revision: https://reviews.llvm.org/D98039	2021-03-10 10:03:57 +00:00
Valeriy Savchenko	59112eacb9	[-Wcompletion-handler] Extend list of detected conventions Update convention detection to accommodate changes from: https://github.com/DougGregor/swift-evolution/blob/concurrency-objc/proposals/NNNN-concurrency-objc.md#asynchronous-completion-handler-methods Differential Revision: https://reviews.llvm.org/D98251	2021-03-10 10:43:19 +03:00
Fangrui Song	9d117e7b2a	Define __GCC_HAVE_DWARF2_CFI_ASM if applicable In -fno-exceptions -fno-asynchronous-unwind-tables -g0 mode, GCC does not emit `.cfi_` directives. ``` % diff <(gcc -fno-asynchronous-unwind-tables -dM -E a.c) <(gcc -dM -E a.c) 130a131 > #define __GCC_HAVE_DWARF2_CFI_ASM 1 ``` This macro is useful because code can decide whether inline asm should include `.cfi_` directives. `.cfi_*` directives without `.cfi_startproc` can cause assembler errors (integrated assembler: `this directive must appear between .cfi_startproc and .cfi_endproc directives`). Differential Revision: https://reviews.llvm.org/D97743	2021-03-09 22:21:36 -08:00
Ryan Prichard	a478b0a199	[Android] Default to --rtlib=compiler-rt By default, the driver uses the compiler-rt builtins and links with -l:libunwind.a. Restore the previous behavior by passing --rtlib=libgcc. Reviewed By: danalbert Differential Revision: https://reviews.llvm.org/D96404	2021-03-09 18:09:53 -08:00
Richard Smith	a892b0015e	PR49465: Disallow constant evaluation of a call to operator delete(nullptr). The only time we would consider allowing this is inside a call to std::allocator<T>::deallocate, whose contract does not permit deletion of null pointers.	2021-03-09 15:06:06 -08:00
Alex Lorenz	234f3211a3	[clang][driver] Support Darwin SDK names with an optional prefix in their name rdar://74017977	2021-03-09 14:57:58 -08:00
Alex Lorenz	2de0a18a89	[clang][ObjC] allow the use of NSAttributedString * return type with format_arg attribute This is useful for APIs that want to produce an attributed NSString as a result of some formatting API call.	2021-03-09 13:36:57 -08:00
Fangrui Song	b4948c27d2	Revert D97743 "Define __GCC_HAVE_DWARF2_CFI_ASM if applicable" This reverts commit `c11ff4bbad` & `df67d35269`. Trying to make the change to the driver to avoid round-trip issues.	2021-03-09 12:14:12 -08:00
Fangrui Song	df67d35269	[test] Fix debug-info-macro.c	2021-03-09 12:04:51 -08:00
Fangrui Song	c11ff4bbad	Define __GCC_HAVE_DWARF2_CFI_ASM if applicable In -fno-exceptions -fno-asynchronous-unwind-tables -g0 mode, GCC does not emit `.cfi_` directives. ``` % diff <(gcc -fno-asynchronous-unwind-tables -dM -E a.c) <(gcc -dM -E a.c) 130a131 > #define __GCC_HAVE_DWARF2_CFI_ASM 1 ``` This macro is useful because code can decide whether inline asm should include `.cfi_` directives. `.cfi_*` directives without `.cfi_startproc` can cause assembler errors (integrated assembler: `this directive must appear between .cfi_startproc and .cfi_endproc directives`). Differential Revision: https://reviews.llvm.org/D97743	2021-03-09 10:52:26 -08:00
Adam Czachorowski	4e1c487004	[clang] Fix crash when creating deduction guide. We used to trigger assertion when transforming c-tor with unparsed default argument. Now we ignore such constructors for this purpose. Differential Revision: https://reviews.llvm.org/D97965	2021-03-09 16:57:56 +01:00
Anton Bikineev	4f8e299785	[Sema] Fix diagnostics for one-byte length modifier In case a char-literal of type int (C/ObjectiveC) corresponds to a format specifier with the %hh length modifier, don't treat the literal as of type char for issuing diagnostics, as otherwise this results in: printf("%hhd", 'e'); warning: format specifies type 'char' but the argument has type 'char'. Differential revision: https://reviews.llvm.org/D97951	2021-03-09 16:56:20 +01:00
diggerlin	46d4d1fea4	[AIX] do not emit visibility attribute into IR when there is -mignore-xcoff-visibility SUMMARY: In the patch https://reviews.llvm.org/D87451 "add new option -mignore-xcoff-visibility" we stated that "The option -mignore-xcoff-visibility has no effect on visibility attribute when compile with -emit-llvm option to generated LLVM IR." In this patch we let -mignore-xcoff-visibility take effect when generating IR too. The new feature only works on AIX. Reviewer: Jason Liu Differential Revision: https://reviews.llvm.org/D89986	2021-03-09 10:38:00 -05:00
Florian Hahn	fc8d3766d7	[ExtVectorType] Support conditional select operator for C++. This patch implements the conditional select operator for ext_vector_types in C++. It does so by using the same semantics as for C. D71463 added support for the conditional select operator for VectorType in C++. Unfortunately the semantics between ext_vector_type in C are different to VectorType in C++. Select for ext_vector_type is based on the MSB of the condition vector, whereas for VectorType it is `!= 0`. This unfortunately means that the behavior is inconsistent between ExtVectorType and VectorType, but I think using the C semantics for ExtVectorType in C++ as well should be less surprising for users. Reviewed By: erichkeane, aaron.ballman Differential Revision: https://reviews.llvm.org/D98055	2021-03-09 13:08:52 +00:00
Sven van Haastregt	13c77f2046	[OpenCL] Fix builtins that require multiple extensions Builtins that require multiple extensions, such as certain `write_imagef` forms, were not exposed because of the Sema check not splitting the extension string. Differential Revision: https://reviews.llvm.org/D97930	2021-03-09 11:37:26 +00:00
Tomas Matheson	7e5cea5b50	[Clang][Sema] Warn when function argument is less aligned than parameter See https://bugs.llvm.org/show_bug.cgi?id=42154. GCC's __attribute__((align)) can reduce the alignment of a type when applied to a typedef. However, functions which take a pointer or reference to the original type are compiled assuming the original alignment. Therefore when any such function is passed an object of the new, less-aligned type, an alignment fault can occur. In particular, this applies to the constructor, which is defined for the original type and called for the less-aligned object. This change adds a warning whenever a pointer or reference to an object is passed to a function that was defined for a more-aligned type. The calls to ASTContext::getTypeAlignInChars seem to change the order in which record layouts are evaluated, which caused changes to the output of -fdump-record-layouts. As such some tests needed to be updated: * Use CHECK-LABEL rather than counting the number of "Dumping AST Record Layout" headers. * Check for end of line in labels, so that struct B1 doesn't match struct B etc. * Add --strict-whitespace, since the whitespace shows meaningful structure. * The order in which record layouts are printed has changed in some cases. * clang-format for regions changed Differential Revision: https://reviews.llvm.org/D97187	2021-03-09 10:37:32 +00:00
Jon Roelofs	a24644bb1c	Revert "Run non-filechecked commands in update_cc_test_checks.py" This reverts commit `60d4c73b30`. The new test is broken on macos hosts. Discussion here: https://reviews.llvm.org/D97068#2611269 https://reviews.llvm.org/D97068#2612675 ... revert to green.	2021-03-08 17:26:24 -08:00
Min-Yih Hsu	5509748f2c	[cfe][driver][M68k](8/8) Clang driver support Add M68k-specific toolchain and driver configurations / options. Authors: myhsu, m4yers, glaubitz Differential Revision: https://reviews.llvm.org/D88394	2021-03-08 12:30:57 -08:00
Shilei Tian	c41ae246ac	[OpenMP][Clang][NVPTX] Only build one bitcode library for each SM In D97003, CUDA 9.2 is the minimum requirement for OpenMP offloading on NVPTX target. We don't need to have macros in source code to select right functions based on CUDA version. We don't need to compile multiple bitcode libraries of different CUDA versions for each SM. We don't need to worry about future compatibility with newer CUDA version. `-target-feature +ptx61` is used in this patch, which corresponds to the highest PTX version that CUDA 9.2 can support. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D97198	2021-03-08 12:03:04 -05:00
Tim Northover	c4542005da	AArch64/MacOS: switch default CPU to apple-a13. The DevKits had A12 processors, but they're all gone now and real hardware has an A13.	2021-03-08 15:47:05 +00:00
Giorgis Georgakoudis	60d4c73b30	Run non-filechecked commands in update_cc_test_checks.py Some tests in clang require running non-filechecked commands to generate the actual filecheck input. For example, tests for openmp offloading require generating the host bc without any checking, before running the clang command to actually generate the filechecked IR of the target device. This patch enables `update_cc_test_checks.py` to run non-filechecked run lines in-place. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D97068	2021-03-08 07:18:01 -08:00
Ahsan Saghir	acce401068	[PowerPC] Change target data layout for 16-byte stack alignment This changes the target data layout to make stack align to 16 bytes on Power10. Before this change, stack was being aligned to 32 bytes. Reviewed By: #powerpc, nemanjai Differential Revision: https://reviews.llvm.org/D96265	2021-03-08 08:13:08 -06:00
Saurabh Jha	63851a701e	[Matrix] Implement += and -= for MatrixType. Make sure CompLHSTy is set correctly for += and -= and matrix type operands. Bugzilla ticket is here https://bugs.llvm.org/show_bug.cgi?id=46164 Patch by Saurabh Jha <saurabh.jhaa@gmail.com> Reviewed By: fhahn Differential Revision: https://reviews.llvm.org/D98075	2021-03-08 09:32:11 +00:00
Freddy Ye	5f9489b754	[X86] Refine "Support -march=alderlake" Refine "Support -march=alderlake" Compared with tremont, it includes 25 more new features. They are adx, aes, avx, avx2, avxvnni, bmi, bmi2, cldemote, f16c, fma, hreset, invpcid, kl, lzcnt, movdir64b, movdiri, pclmulqdq, pconfig, pku, serialize, shstk, vaes, vpclmulqdq, waitpkg, widekl. Reviewed By: pengfei Differential Revision: https://reviews.llvm.org/D97832	2021-03-08 13:17:18 +08:00
Petr Hosek	7514f1a312	[Driver] Pass --unwindlib=platform to tests that check unwinder There are two additional cases that were missed in D98131. Differential Revision: https://reviews.llvm.org/D98158	2021-03-07 17:28:34 -08:00
Petr Hosek	41476d89b8	[Driver] Pass --unwindlib=platform to tests that check unwinder This addresses an issue which was revealed by D98022. Differential Revision: https://reviews.llvm.org/D98131	2021-03-06 21:44:26 -08:00
Yaxun (Sam) Liu	34d1a5c7b1	[HIP] Support Spack packages Spack is a package management tool extensively used by HPC community. As ROCm packages are built by Spack by HPC community, we need to teach clang driver to detect ROCm installation built by Spack. Reviewed by: Artem Belevich Differential Revision: https://reviews.llvm.org/D97340	2021-03-06 08:41:37 -05:00
Jay Foad	99682bc039	Revert "Revert "[AMDGPU] Restore the s_memtime instruction in gfx1030"" This reverts commit `e58d68fcd0`. This reinstates commit `fc28f600e5` with a fix to initialize HasShaderCyclesRegister. See https://reviews.llvm.org/D97928.	2021-03-06 09:00:01 +00:00
Martin Storsjö	ebe6d3be0f	[clang] Don't default to a specifically shared libunwind on mingw with a g++ driver For MinGW targets, we distinguish between an explicitly shared unwinder library (requested via -shared-libgcc), an explicitly static one (requested via -static-libgcc or -static) and the default case (which just passes -lunwind to the linker, which will pick either shared or static depending on what's available, with the normal linker logic). This makes the implicit default case (as added in D79995) actually work as it was intended, when using the g++ driver (which is the main usecase for libunwind as far as I know). Differential Revision: https://reviews.llvm.org/D98023	2021-03-06 08:50:46 +02:00
Mitch Phillips	e58d68fcd0	Revert "[AMDGPU] Restore the s_memtime instruction in gfx1030" Broke the ASan/MSan buildbots. See more comments in the original patch, https://reviews.llvm.org/D97928. Build failure at http://lab.llvm.org:8011/#/builders/5/builds/5327 This reverts commit `fc28f600e5`.	2021-03-05 18:24:59 -08:00
Matheus Izvekov	71e6e82746	[clang] Fix constrained decltype(auto) deduction Prior to this fix, constrained decltype(auto) behaves exactly the same as constrained regular auto. This fixes it so it deduces like decltype(auto). Signed-off-by: Matheus Izvekov <mizvekov@gmail.com> Reviewed By: rsmith Differential Revision: https://reviews.llvm.org/D98087	2021-03-05 18:20:09 -08:00
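As background for 71e6e82746, the sketch below shows how unconstrained `auto` and `decltype(auto)` deduce differently from the same expression; the fix makes the constrained form of `decltype(auto)` follow the second rule rather than the first. The names `counter`, `counter_ref`, `by_value`, and `by_reference` are made up for illustration, and the example deliberately avoids concepts so that it compiles as C++14.

```cpp
#include <type_traits>

int counter = 0;
int &counter_ref() { return counter; }

// Plain `auto` drops the reference: the function returns a copy (int).
auto by_value() { return counter_ref(); }

// `decltype(auto)` keeps the type of the returned expression: int&.
decltype(auto) by_reference() { return counter_ref(); }

static_assert(std::is_same<decltype(by_value()), int>::value,
              "auto deduces int");
static_assert(std::is_same<decltype(by_reference()), int &>::value,
              "decltype(auto) deduces int&");
```

Because `by_reference()` yields an lvalue, assigning through it modifies `counter`, whereas `by_value()` only ever hands back a copy.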
Nemanja Ivanovic	f4ad7a1a15	[PowerPC] Add missing double precision vec_all overloads to altivec.h We somehow missed vec_all_nlt, vec_all_nle and vec_all_numeric overloads for double precision vectors when VSX is enabled.	2021-03-05 18:42:12 -06:00
Richard Smith	abbe42d8b5	PR49260: Improve diagnostics for no matching 'operator new'. Fix duplicate diagnostic for an over-aligned allocation with no matching function, and add custom diagnostic for the case where the non-allocating placement new was intended but <new> was not included.	2021-03-05 15:53:10 -08:00
Sriraman Tallam	78d0e91865	Refactor -funique-internal-linkage-names implementation. The option -funique-internal-linkage-names was added in D73307 and D78243 as a LLVM early pass to insert a unique suffix to internal linkage functions and vars. The unique suffix was the hash of the module path. However, we found that this can be done more cleanly in clang early and the fixes that need to be done later can be completely avoided. The fixes in particular are trying to modify the DW_AT_linkage_name and finding the right place to insert the pass. This patch resurrects the original implementation proposed in D73307 which was reviewed and then ditched in favor of the pass based approach. Differential Revision: https://reviews.llvm.org/D96109	2021-03-05 13:32:17 -08:00
PremAnand Rao	c2de5aff1a	[OpenMP] Handle non-function context before checking for diagnostic emission Ensure that we are in a function declaration context before checking the diagnostic emission status, to avoid dereferencing a NULL function declaration. Differential Revision: https://reviews.llvm.org/D97573	2021-03-05 12:37:49 -08:00
Jay Foad	fc28f600e5	[AMDGPU] Restore the s_memtime instruction in gfx1030 gfx1030 added a new way to implement readcyclecounter using the SHADER_CYCLES hardware register, but the s_memtime instruction still exists, so the MC layer should still accept it and the llvm.amdgcn.s.memtime intrinsic should still work. Differential Revision: https://reviews.llvm.org/D97928	2021-03-05 20:19:11 +00:00
Chen Zheng	afa76fe67a	[XCOFF][DWARF] set default DWARF version to 3. Reviewed By: jsji Differential Revision: https://reviews.llvm.org/D98010	2021-03-05 09:21:57 -05:00
Yaxun (Sam) Liu	5b3fc7180c	[HIP] do not use -munsafe-fp-atomics by default A bug was introduced when adding -munsafe-fp-atomics. By default it should be off. Reviewed by: Artem Belevich Differential Revision: https://reviews.llvm.org/D97967	2021-03-05 08:46:58 -05:00
Yaxun (Sam) Liu	258ecf5f33	[HIP] do not use -mconstructor-aliases for device Like nvptx and some other targets, -mconstructor-aliases does not work well with amdgpu, therefore we disable it in the same approach. Reviewed by: Artem Belevich Differential Revision: https://reviews.llvm.org/D97959	2021-03-05 08:46:58 -05:00
Sven van Haastregt	f0686569cc	[OpenCL] Fix `mix` builtin overloads `mix` is subtly different from `clamp`: in the overloads where the last argument is a scalar, the second argument should be a gentype for `mix`. As scalars can be implicitly converted to vectors, this cannot be caught in the Sema test. Hence adding a CodeGen test, where we can verify the types using the mangled name.	2021-03-05 13:43:30 +00:00
Jingu Kang	9b302513f6	[AArch64] Add missing intrinsics for vrnd	2021-03-05 11:26:12 +00:00
Michael Kruse	b119120673	[clang][OpenMP] Use OpenMPIRBuilder for workshare loops. Initial support for using the OpenMPIRBuilder by clang to generate loops using the OpenMPIRBuilder. This initial support is intentionally limited to: * Only the worksharing-loop directive. * Recognizes only the nowait clause. * No loop nests with more than one loop. * Untested with templates, exceptions. * Semantic checking left to the existing infrastructure. This patch introduces a new AST node, OMPCanonicalLoop, which becomes the parent of any loop that has to adhere to the restrictions as specified by the OpenMP standard. These restrictions allow OMPCanonicalLoop to provide the following additional information that depends on base language semantics: * The distance function: How many loop iterations there will be before entering the loop nest. * The loop variable function: Conversion from a logical iteration number to the loop variable. These allow the OpenMPIRBuilder to act solely using logical iteration numbers without needing to be concerned with iterator semantics between calling the distance function and determining what the value of the loop variable ought to be. Any OpenMP logic should be done by the OpenMPIRBuilder such that it can be reused by the MLIR OpenMP dialect and thus by flang. The distance and loop variable function are implemented using lambdas (or more exactly: CapturedStmt because lambda implementation is more intertwined with the parser). It is up to the OpenMPIRBuilder how they are called which depends on what is done with the loop. By default, these are emitted as outlined functions but we might think about emitting them inline as the OpenMPRuntime does. For compatibility with the current OpenMP implementation, even though not necessary for the OpenMPIRBuilder, OMPCanonicalLoop can still be nested within OMPLoopDirectives' CapturedStmt.
Although OMPCanonicalLoop's are not currently generated when the OpenMPIRBuilder is not enabled, these can just be skipped when not using the OpenMPIRBuilder in case we don't want to make the AST dependent on the EnableOMPBuilder setting. Loop nests with more than one loop require support by the OpenMPIRBuilder (D93268). A simple implementation of non-rectangular loop nests would add another lambda function that returns whether a loop iteration of the rectangular overapproximation is also within its non-rectangular subset. Reviewed By: jdenny Differential Revision: https://reviews.llvm.org/D94973	2021-03-04 22:52:59 -06:00
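The two helper functions that b119120673 attaches to a canonical loop can be illustrated for the simple case `for (i = lb; i < ub; i += step)` with a positive step. This is only a sketch of the idea: `CanonicalLoop`, `loop_distance`, and `loop_variable` are hypothetical names, and clang emits the real helpers from CapturedStmts rather than from plain functions like these.

```cpp
#include <cstdint>

// Hypothetical description of `for (i = lb; i < ub; i += step)`, step > 0.
struct CanonicalLoop {
  std::int64_t lb;
  std::int64_t ub;
  std::int64_t step;
};

// Distance function: how many iterations the loop will run, computed
// before the loop is entered.
std::uint64_t loop_distance(const CanonicalLoop &l) {
  if (l.ub <= l.lb)
    return 0;
  return static_cast<std::uint64_t>((l.ub - l.lb + l.step - 1) / l.step);
}

// Loop variable function: maps a logical iteration number (0, 1, 2, ...)
// to the value the user-visible loop variable has in that iteration.
std::int64_t loop_variable(const CanonicalLoop &l, std::uint64_t logical) {
  return l.lb + static_cast<std::int64_t>(logical) * l.step;
}
```

With these two pieces a scheduler can work purely on logical iteration numbers in the range `[0, loop_distance)` and recover the loop variable only when a chunk is actually executed.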
Heejin Ahn	561abd83ff	[WebAssembly] Disable uses of __clang_call_terminate Background: Wasm EH, while using Windows EH (catchpad/cleanuppad based) IR, uses Itanium-based libraries and ABIs with some modifications. `__clang_call_terminate` is a wrapper generated in Clang's Itanium C++ ABI implementation. It contains this code, in C-style pseudocode: ``` void __clang_call_terminate(void *exn) { __cxa_begin_catch(exn); std::terminate(); } ``` So this function is a wrapper to call `__cxa_begin_catch` on the exception pointer before termination. In Itanium ABI, this function is called when another exception is thrown while processing an exception. The pointer for this second, violating exception is passed as the argument of this `__clang_call_terminate`, which calls `__cxa_begin_catch` with that pointer and calls `std::terminate` to terminate the program. The spec (https://libcxxabi.llvm.org/spec.html) for `__cxa_begin_catch` says, ``` When the personality routine encounters a termination condition, it will call __cxa_begin_catch() to mark the exception as handled and then call terminate(), which shall not return to its caller. ``` In wasm EH's Clang implementation, this function is called from cleanuppads that terminate the program, which we also call terminate pads. Cleanuppads normally don't access the thrown exception and the wasm backend converts them to `catch_all` blocks. But because we need the exception pointer in this cleanuppad, we generate `wasm.get.exception` intrinsic (which will eventually be lowered to `catch` instruction) as we do in the catchpads. But because terminate pads are cleanup pads and should run even when a foreign exception is thrown, what we have been doing is: 1. In `WebAssemblyLateEHPrepare::ensureSingleBBTermPads()`, we make sure terminate pads are in this simple shape: ``` %exn = catch call @__clang_call_terminate(%exn) unreachable ``` 2.
In `WebAssemblyHandleEHTerminatePads` pass at the end of the pipeline, we attach a `catch_all` to terminate pads, so they will be in this form: ``` %exn = catch call @__clang_call_terminate(%exn) unreachable catch_all call @std::terminate() unreachable ``` In `catch_all` part, we don't have the exception pointer, so we call `std::terminate()` directly. The reason we ran HandleEHTerminatePads at the end of the pipeline, separate from LateEHPrepare, was it was convenient to assume there was only a single `catch` part per `try` during CFGSort and CFGStackify. --- Problem: While it thinks terminate pads could have been possibly split or calls to `__clang_call_terminate` could have been duplicated, `WebAssemblyLateEHPrepare::ensureSingleBBTermPads()` assumes terminate pads contain no more than calls to `__clang_call_terminate` and `unreachable` instruction. I assumed that because in LLVM very limited forms of transformations are done to catchpads and cleanuppads to maintain the scoping structure. But it turned out to be incorrect; passes can merge cleanuppads into one, including terminate pads, as long as the new code has a correct scoping structure. One pass that does this I observed was `SimplifyCFG`, but there can be more. After this transformation, a single cleanuppad can contain any number of other instructions with the call to `__clang_call_terminate` and can span many BBs. It wouldn't be practical to duplicate all these BBs within the cleanuppad to generate the equivalent `catch_all` blocks, only with calls to `__clang_call_terminate` replaced by calls to `std::terminate`. Unless we do more complicated transformation to split those calls to `__clang_call_terminate` into a separate cleanuppad, it is tricky to solve. --- Solution (?): This CL just disables the generation and use of `__clang_call_terminate` and calls `std::terminate()` directly in its place. 
The possible downside of this approach can be, because the Itanium ABI intended to "mark" the violating exception handled, we don't do that anymore. What `__cxa_begin_catch` actually does is increment the exception's handler count and decrement the uncaught exception count, which in my opinion do not matter much given that we are about to terminate the program anyway. Also it does not affect info like stack traces that can be possibly shown to developers. And while we use a variant of Itanium EH ABI, we can make some deviations if we choose to; we are already different in that in the current version of the EH spec we don't support two-phase unwinding. We can possibly consider a more complicated transformation later to reenable this, but I don't think that has high priority. Changes in this CL contain: - In Clang, we don't generate a call to `wasm.get.exception()` intrinsic and `__clang_call_terminate` function in terminate pads anymore; we simply generate calls to `std::terminate()`, which is the default implementation of `CGCXXABI::emitTerminateForUnexpectedException`. - Remove `WebAssembly::ensureSingleBBTermPads()` function and `WebAssemblyHandleEHTerminatePads` pass, because terminate pads are already `catch_all` now (because they don't need the exception pointer) and we don't need these transformations anymore. - Change tests to use `std::terminate` directly. Also removes tests that tested `LateEHPrepare::ensureSingleBBTermPads` and `HandleEHTerminatePads` pass. - Drive-by fix: Add some function attributes to EH intrinsic declarations Fixes https://github.com/emscripten-core/emscripten/issues/13582. Reviewed By: dschuff, tlively Differential Revision: https://reviews.llvm.org/D97834	2021-03-04 14:26:35 -08:00
Reid Kleckner	1c2e7d200d	[MS] Fix crash involving gnu stmt exprs and inalloca Use a WeakTrackingVH to cope with the stmt emission logic that cleans up unreachable blocks. This invalidates the reference to the deferred replacement placeholder. Cope with it. Fixes PR25102 (from 2015!)	2021-03-04 13:57:46 -08:00
Gui Andrade	10264a1b21	Introduce noundef attribute at call sites for stricter poison analysis This change adds a new IR noundef attribute, which denotes when a function call argument or return val may never contain uninitialized bits. In MemorySanitizer, this attribute enables optimizations which decrease instrumented code size by up to 17% (measured with an instrumented build of clang). I'll introduce the change allowing msan to take advantage of this information in a separate patch. Differential Revision: https://reviews.llvm.org/D81678	2021-03-04 12:15:12 -08:00
Zequan Wu	9783e20988	Revert "Revert "[Coverage] Emit gap region between statements if first statements contains terminate statements."" Reland with update on test case ContinuousSyncmode/basic.c. This reverts commit `fe5c2c3ca6`.	2021-03-04 11:52:43 -08:00
Akira Hatanaka	1900503595	[ObjC][ARC] Use operand bundle 'clang.arc.attachedcall' instead of explicitly emitting retainRV or claimRV calls in the IR This reapplies `ed4718eccb`, which was reverted because it was causing a miscompile. The bug that was causing the miscompile has been fixed in `75805dce5f`. Original commit message: Background: This fixes a longstanding problem where llvm breaks ARC's autorelease optimization (see the link below) by separating calls from the marker instructions or retainRV/claimRV calls. The backend changes are in https://reviews.llvm.org/D92569. https://clang.llvm.org/docs/AutomaticReferenceCounting.html#arc-runtime-objc-autoreleasereturnvalue What this patch does to fix the problem: - The front-end adds operand bundle "clang.arc.attachedcall" to calls, which indicates the call is implicitly followed by a marker instruction and an implicit retainRV/claimRV call that consumes the call result. In addition, it emits a call to @llvm.objc.clang.arc.noop.use, which consumes the call result, to prevent the middle-end passes from changing the return type of the called function. This is currently done only when the target is arm64 and the optimization level is higher than -O0. - ARC optimizer temporarily emits retainRV/claimRV calls after the calls with the operand bundle in the IR and removes the inserted calls after processing the function. - ARC contract pass emits retainRV/claimRV calls after the call with the operand bundle. It doesn't remove the operand bundle on the call since the backend needs it to emit the marker instruction. The retainRV and claimRV calls are emitted late in the pipeline to prevent optimization passes from transforming the IR in a way that makes it harder for the ARC middle-end passes to figure out the def-use relationship between the call and the retainRV/claimRV calls (which is the cause of PR31925). 
- The function inliner removes an autoreleaseRV call in the callee if nothing in the callee prevents it from being paired up with the retainRV/claimRV call in the caller. It then inserts a release call if claimRV is attached to the call since autoreleaseRV+claimRV is equivalent to a release. If it cannot find an autoreleaseRV call, it tries to transfer the operand bundle to a function call in the callee. This is important since the ARC optimizer can remove the autoreleaseRV returning the callee result, which makes it impossible to pair it up with the retainRV/claimRV call in the caller. If that fails, it simply emits a retain call in the IR if retainRV is attached to the call and does nothing if claimRV is attached to it. - SCCP refrains from replacing the return value of a call with a constant value if the call has the operand bundle. This ensures the call always has at least one user (the call to @llvm.objc.clang.arc.noop.use). - This patch also fixes a bug in replaceUsesOfNonProtoConstant where multiple operand bundles of the same kind were being added to a call. Future work: - Use the operand bundle on x86-64. - Fix the auto upgrader to convert call+retainRV/claimRV pairs into calls with the operand bundles. rdar://71443534 Differential Revision: https://reviews.llvm.org/D92808	2021-03-04 11:22:30 -08:00
Christopher Di Bella	9830901b34	[clang] removes check against integral-to-pointer conversion... ... unless it's a literal D94640 was a bit too aggressive in its analysis, considering integers representing valid addresses as invalid. This change rolls back some of the check, so that only the most obvious case is still flagged. Before: ```cpp free((void*)1000); // literal converted to `void*`: warning good free((void*)an_int); // `int` object converted to `void*`: warning might // be a false positive ``` After ```cpp free((void*)1000); // literal converted to `void*`: warning good free((void*)an_int); // doesn't warn ``` Differential Revision: https://reviews.llvm.org/D97512	2021-03-04 17:00:54 +00:00
Alexey Bataev	711179b581	[OPENMP]Fix PR48759: "fatal error" when compile with preprocessed file. If the file in the line directive does not exist on the system, we need to use the original file to get its file id. Differential Revision: https://reviews.llvm.org/D97945	2021-03-04 07:26:57 -08:00
Gabor Marton	2e90fc2c40	[AST][PCH][ASTImporter] Fix UB caused by uninited SwitchStmt member The SwitchStmt::FirstCase member is not initialized when the AST is built by the ASTStmtReader. See the below code of ASTStmtReader::VisitSwitchStmt in the case where the for loop does not have any iterations: ``` // ... more code ... SwitchCase *PrevSC = nullptr; for (auto E = Record.size(); Record.getIdx() != E; ) { SwitchCase *SC = Record.getSwitchCaseWithID(Record.readInt()); if (PrevSC) PrevSC->setNextSwitchCase(SC); else S->setSwitchCaseList(SC); // Sets FirstCase !!! PrevSC = SC; } } // return ``` Later, in ASTNodeImporter::VisitSwitchStmt, we have a condition that depends on this uninited value: ``` for (SwitchCase *SC = S->getSwitchCaseList(); SC != nullptr; SC = SC->getNextSwitchCase()) { // ... more code ... } ``` This is clearly UB. This causes non-deterministic crashes when ClangSA analyzes some code with CTU. See the below report by valgrind (the whole valgrind output is attached): ``` ==31019== Conditional jump or move depends on uninitialised value(s) ==31019== at 0x12ED1983: clang::ASTNodeImporter::VisitSwitchStmt(clang::SwitchStmt) (ASTImporter.cpp:6195) ==31019== by 0x12F1D509: clang::StmtVisitorBase<std::add_pointer, clang::ASTNodeImporter, llvm::Expected<clang::Stmt>>::Visit(clang::Stmt) (StmtNodes.inc:591) ==31019== by 0x12EE4FDF: clang::ASTImporter::Import(clang::Stmt) (ASTImporter.cpp:8484) ==31019== by 0x12F09498: llvm::Expected<clang::Stmt> clang::ASTNodeImporter::import<clang::Stmt>(clang::Stmt) (ASTImporter.cpp:164) ==31019== by 0x12F3A1F5: llvm::Error clang::ASTNodeImporter::ImportArrayChecked<clang::Stmt, clang::Stmt>(clang::Stmt, clang::Stmt, clang::Stmt) (ASTImporter.cpp:653) ==31019== by 0x12F13152: llvm::Error clang::ASTNodeImporter::ImportContainerChecked<llvm::iterator_range<clang::Stmt>, llvm::SmallVector<clang::Stmt, 8u> >(llvm::iterator_range<clang::Stmt*> const&, llvm::SmallVector<clang::Stmt, 8u>&) (ASTImporter.cpp:669) ==31019== by
0x12ED099F: clang::ASTNodeImporter::VisitCompoundStmt(clang::CompoundStmt) (ASTImporter.cpp:6077) ==31019== by 0x12F1CC2D: clang::StmtVisitorBase<std::add_pointer, clang::ASTNodeImporter, llvm::Expected<clang::Stmt>>::Visit(clang::Stmt) (StmtNodes.inc:73) ==31019== by 0x12EE4FDF: clang::ASTImporter::Import(clang::Stmt) (ASTImporter.cpp:8484) ==31019== by 0x12F09498: llvm::Expected<clang::Stmt> clang::ASTNodeImporter::import<clang::Stmt>(clang::Stmt) (ASTImporter.cpp:164) ==31019== by 0x12F13275: clang::Stmt* clang::ASTNodeImporter::importChecked<clang::Stmt>(llvm::Error&, clang::Stmt const&) (ASTImporter.cpp:197) ==31019== by 0x12ED0CE6: clang::ASTNodeImporter::VisitCaseStmt(clang::CaseStmt*) (ASTImporter.cpp:6098) ``` Differential Revision: https://reviews.llvm.org/D97849	2021-03-04 15:10:04 +01:00
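The bug class in 2e90fc2c40 is a list head that is only assigned inside a loop which may run zero times. One common way to rule it out, shown here as a sketch and not necessarily how the patch itself fixes `SwitchStmt`, is a default member initializer. `CaseNode`, `SwitchNode`, and `count_cases` are hypothetical stand-ins for the clang classes.

```cpp
#include <cstddef>

// Hypothetical stand-in for SwitchCase: a singly linked list node.
struct CaseNode {
  CaseNode *next = nullptr;
};

// Hypothetical stand-in for SwitchStmt. The default member initializer
// guarantees a well-defined head even if no case is ever attached.
struct SwitchNode {
  CaseNode *first_case = nullptr;
};

// Walks the case list the same way the importer loop quoted above does.
std::size_t count_cases(const SwitchNode &s) {
  std::size_t n = 0;
  for (const CaseNode *c = s.first_case; c != nullptr; c = c->next)
    ++n;
  return n;
}
```

With the initializer in place, traversing a switch that has no cases terminates immediately instead of reading an indeterminate pointer.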
Nico Weber	fe5c2c3ca6	Revert "[Coverage] Emit gap region between statements if first statements contains terminate statements." This reverts commit `2d7374a0c6`. Breaks ContinuousSyncMode/basic.c in check-profile on macOS.	2021-03-04 08:53:30 -05:00
Thomas Preud'homme	52bfe6605a	Add __builtin_isnan(__fp16) testcase Reviewed By: rjmccall Differential Revision: https://reviews.llvm.org/D97777	2021-03-04 13:03:48 +00:00
Thomas Preud'homme	6d6e7132f9	Revert "Add __builtin_isnan(__fp16) testcase" This reverts commit `e77b5c40d5` because it fails without `1b6eb56aa0`.	2021-03-04 12:18:03 +00:00
Thomas Preud'homme	b7aeece47c	Revert "Stop trapping on sNaN in __builtin_isinf" This reverts commit `1b6eb56aa0` because the invert logic for isfinite is incorrect.	2021-03-04 12:07:35 +00:00
Wang, Pengfei	e7e67c930a	Add Windows ehcont section support (/guard:ehcont). Add option /guard:ehcont Reviewed By: rnk Differential Revision: https://reviews.llvm.org/D96709	2021-03-04 11:47:29 +08:00
Fangrui Song	584cb67d2d	[IRSymTab] Set FB_used on llvm.compiler.used symbols IR symbol table does not parse inline asm. A symbol only referenced by inline asm is not in the IR symbol table, so LTO does not know that the definition (in another translation unit) is referenced and may internalize it, even if that definition has `__attribute__((used))` (which lowers to `llvm.compiler.used` on ELF targets since D97446). ``` // cabac.c __attribute__((used)) const uint8_t ff_h264_cabac_tables[...] = {...}; // h264_cabac.c asm("lea ff_h264_cabac_tables(%rip), %0" : ...); ``` `__attribute__((used))` is the recommended way to tell the compiler there may be inline asm references, so the usage is perfectly fine. This patch conservatively sets the `FB_used` bit on `llvm.compiler.used` symbols to work around the IR symbol table limitation. Note: before D97446, Clang never emitted symbols in the `llvm.compiler.used` list, so this change does not punish any Clang emitted global object. Without the patch, `ff_h264_cabac_tables` may be assigned to a non-external partition and get internalized. Then we will get a linker error because the `cabac.c` definition is not exposed. Differential Revision: https://reviews.llvm.org/D97755	2021-03-03 16:22:30 -08:00
Steven Wan	0b274ed499	[AIX] Update default arch on AIX On AIX, the default arch level should match the minimum supported arch level of the OS version. Differential Revision: https://reviews.llvm.org/D97823	2021-03-03 19:07:43 -05:00
Zequan Wu	2d7374a0c6	[Coverage] Emit gap region between statements if first statements contains terminate statements. Differential Revision: https://reviews.llvm.org/D97101	2021-03-03 11:25:49 -08:00
David Tenty	66799bf0e2	[AIX][clang][driver] Restrict /usr/lib to internal library search paths Adding it to the general filepaths results in it being added to the linker arguments. The AIX linker always looks in this path anyway and adds it as a default library path component. Adding this duplicate explicitly results in duplicate entries in path in the loader section of executables and messes up tools like CMake that parse the default library flags. Reviewed By: ZarkoCA Differential Revision: https://reviews.llvm.org/D97574	2021-03-03 10:48:35 -05:00
Daniel McIntosh	9403b59a7d	[test] Fix apparent typo in clang/test/Driver/std.c Currently the test on line 3 is identical to the test on line 1. Looking at the rest of the file (particularly the use of FOVERRIDE as the check-prefix), I think it's pretty clear that this line was supposed to use `-ftrigraphs` instead of `-trigraphs`. Reviewed By: thakis Differential Revision: https://reviews.llvm.org/D97796	2021-03-03 10:31:47 -05:00
Melanie Blower	cc3d25be01	[clang][patch] To solve PR26413, x86 interrupt routines may only call routines with no_saved_reg Reviewed By: Aaron Ballman Differential Revision: https://reviews.llvm.org/D97764	2021-03-03 10:11:13 -05:00
Aaron Ballman	b2bc0a3254	Implement P2173 for attributes on lambdas https://wg21.link/P2173 is making its way through WG21 currently and has not been formally adopted yet. This feature provides very useful functionality in that you can specify attributes on the various function declarations generated by a lambda expression, where the current C++ grammar only allows attributes which apply to the various function types so generated. This patch implements P2173 on the assumption that it will be adopted by WG21 with this syntax for C++23.	2021-03-03 10:05:39 -05:00
Anastasia Stulova	25ad188bfc	[OpenCL] Prevent adding extension pragma by default. This commit refactors extension support to allow specifying whether pragma is needed or not explicitly. For backward compatibility pragmas are set to required for all extensions that were added prior to this but not for OpenCL 3.0 features. Differential Revision: https://reviews.llvm.org/D97052	2021-03-03 15:02:21 +00:00
Hans Wennborg	0a5dd06718	Revert "[ObjC][ARC] Use operand bundle 'clang.arc.attachedcall' instead of explicitly emitting retainRV or claimRV calls in the IR" This caused miscompiles of Chromium tests for iOS due to clobbering of live registers. See discussion on the code review for details. > Background: > > This fixes a longstanding problem where llvm breaks ARC's autorelease > optimization (see the link below) by separating calls from the marker > instructions or retainRV/claimRV calls. The backend changes are in > https://reviews.llvm.org/D92569. > > https://clang.llvm.org/docs/AutomaticReferenceCounting.html#arc-runtime-objc-autoreleasereturnvalue > > What this patch does to fix the problem: > > - The front-end adds operand bundle "clang.arc.attachedcall" to calls, > which indicates the call is implicitly followed by a marker > instruction and an implicit retainRV/claimRV call that consumes the > call result. In addition, it emits a call to > @llvm.objc.clang.arc.noop.use, which consumes the call result, to > prevent the middle-end passes from changing the return type of the > called function. This is currently done only when the target is arm64 > and the optimization level is higher than -O0. > > - ARC optimizer temporarily emits retainRV/claimRV calls after the calls > with the operand bundle in the IR and removes the inserted calls after > processing the function. > > - ARC contract pass emits retainRV/claimRV calls after the call with the > operand bundle. It doesn't remove the operand bundle on the call since > the backend needs it to emit the marker instruction. The retainRV and > claimRV calls are emitted late in the pipeline to prevent optimization > passes from transforming the IR in a way that makes it harder for the > ARC middle-end passes to figure out the def-use relationship between > the call and the retainRV/claimRV calls (which is the cause of > PR31925).
> > - The function inliner removes an autoreleaseRV call in the callee if > nothing in the callee prevents it from being paired up with the > retainRV/claimRV call in the caller. It then inserts a release call if > claimRV is attached to the call since autoreleaseRV+claimRV is > equivalent to a release. If it cannot find an autoreleaseRV call, it > tries to transfer the operand bundle to a function call in the callee. > This is important since the ARC optimizer can remove the autoreleaseRV > returning the callee result, which makes it impossible to pair it up > with the retainRV/claimRV call in the caller. If that fails, it simply > emits a retain call in the IR if retainRV is attached to the call and > does nothing if claimRV is attached to it. > > - SCCP refrains from replacing the return value of a call with a > constant value if the call has the operand bundle. This ensures the > call always has at least one user (the call to > @llvm.objc.clang.arc.noop.use). > > - This patch also fixes a bug in replaceUsesOfNonProtoConstant where > multiple operand bundles of the same kind were being added to a call. > > Future work: > > - Use the operand bundle on x86-64. > > - Fix the auto upgrader to convert call+retainRV/claimRV pairs into > calls with the operand bundles. > > rdar://71443534 > > Differential Revision: https://reviews.llvm.org/D92808 This reverts commit `ed4718eccb`.	2021-03-03 15:51:40 +01:00
Aaron Ballman	8da090381d	Improve static_assert/_Static_assert diagnostics Our diagnostics relating to static assertions were a bit confused. For instance, when in MS compatibility mode in C (where we accept static_assert even without including <assert.h>), we would fail to warn the user that they were using the wrong spelling (even in pedantic mode), we were missing a compatibility warning about using _Static_assert in earlier standards modes, diagnostics for the optional message were not reflected in C as they were in C++, etc.	2021-03-03 08:48:27 -05:00
JinGu Kang	394a4d0433	[AArch64] Add missing intrinsics for vcls Differential Revision: https://reviews.llvm.org/D97775	2021-03-03 10:17:56 +00:00
Thomas Preud'homme	e77b5c40d5	Add __builtin_isnan(__fp16) testcase Reviewed By: rjmccall Differential Revision: https://reviews.llvm.org/D97777	2021-03-02 21:01:51 +00:00
Jez Ng	18fa1d380d	[clang+lld] Pass -platform_version args to ld64.lld Fix regression where we aren't passing `-platform_version` to new ld64.lld after {D95204}. Most of the changes were originally in D95204, but I backed them out due to test failures on builds which have `CLANG_DEFAULT_LINKER=lld`. The tests are properly updated in this diff. Reviewed By: #lld-macho, thakis Differential Revision: https://reviews.llvm.org/D97741	2021-03-02 12:52:54 -05:00
Thomas Preud'homme	1b6eb56aa0	Stop trapping on sNaN in __builtin_isinf __builtin_isinf currently generates a floating-point compare operation which triggers a trap when faced with a signaling NaN in StrictFP mode. This commit uses integer operations instead to not generate any trap in such a case. Reviewed By: mibintc Differential Revision: https://reviews.llvm.org/D97125	2021-03-02 15:54:56 +00:00
Alexey Bataev	0caf736d7e	[OPENMP50]Mapping of the subcomponents with the 'default' mappers. If the mapped structure has data members, which have 'default' mappers, need to map these members individually using their 'default' mappers. Differential Revision: https://reviews.llvm.org/D92195	2021-03-02 07:11:06 -08:00
Tim Northover	888c5c24ca	AArch64: report fp16 arithmetic is present for apple-a11 CPU. AArch64.td got it right, but the target-parser dropped it, leading to missing feature flags in Clang.	2021-03-02 15:07:18 +00:00
Ed Maste	462cf39a5c	[Driver] Fix -gz=zlib options for linker also on FreeBSD `ccb4124a41` fixed translating -gz=zlib to --compress-debug-sections for linker invocation for several ToolChains, but omitted FreeBSD. Differential Revision: https://reviews.llvm.org/D97752	2021-03-02 08:44:24 -05:00
Richard Smith	9e2579dbf4	Fix infinite recursion during IR emission if a constant-initialized lifetime-extended temporary object's initializer refers back to the same object. `GetAddrOfGlobalTemporary` previously tried to emit the initializer of a global temporary before updating the global temporary map. Emitting the initializer could recurse back into `GetAddrOfGlobalTemporary` for the same temporary, resulting in an infinite recursion. Reviewed By: rjmccall Differential Revision: https://reviews.llvm.org/D97733	2021-03-01 22:19:21 -08:00
Nemanja Ivanovic	1ff93618e5	[PowerPC] Add missing overloads of vec_promote to altivec.h The VSX-only overloads (for 8-byte element vectors) are missing. Add the missing overloads and convert element numbering to modulo arithmetic to match GCC and XLC.	2021-03-01 21:40:30 -06:00
Yaxun (Sam) Liu	9ecbb34e1d	Fix test cxx-call-kernel.cpp Only test it with x86 since other target may have an ABI making it difficult to test. Change-Id: I85423c8bbbbbb8f24cb3ea4cb64a408069b4d61c	2021-03-01 17:10:53 -05:00
Yaxun (Sam) Liu	5cf2a37f12	[HIP] Emit kernel symbol Currently clang uses stub function to launch kernel. This is inconvenient to interop with C++ programs since the stub function has different name as kernel, which is required by ROCm debugger. This patch emits a variable symbol which has the same name as the kernel and uses it to register and launch the kernel. This allows C++ program to launch a kernel by using the original kernel name. Reviewed by: Artem Belevich Differential Revision: https://reviews.llvm.org/D86376	2021-03-01 16:31:40 -05:00
Richard Smith	564f5b0734	Revert "[c++20] Mark class type NTTPs as done and start defining the feature test macro." Some of the parts of this work were reverted; stop defining the feature test macro for now. This reverts commit `b4c63ef6dd`.	2021-03-01 12:53:35 -08:00
Jez Ng	922de2574c	[lld-macho] Partial revert of D95204 Trying to unbreak https://lab.llvm.org/buildbot/#/builders/57/builds/4753 I'm not able to repro the failures locally so... here's hoping	2021-03-01 11:29:42 -08:00
Fangrui Song	d942a82a07	Make -f[no-]split-dwarf-inlining CC1 default align with driver default (no inlining) This makes CC1 and driver defaults consistent. In addition, for more common cases (-g is specified without -gsplit-dwarf), users will not see -fno-split-dwarf-inlining in CC1 options. Verified that the below is still true: * `clang -g` => `splitDebugInlining: false` in DICompileUnit * `clang -g -gsplit-dwarf` => `splitDebugInlining: false` in DICompileUnit * `clang -g -gsplit-dwarf -fsplit-dwarf-inlining` => no `splitDebugInlining: false` Reviewed By: dblaikie Differential Revision: https://reviews.llvm.org/D97706	2021-03-01 10:55:19 -08:00
Yonghong Song	283db5f083	BPF: fix enum value 0 issue for __builtin_preserve_enum_value() Lorenz Bauer reported that the following code will have compilation error for bpf target: enum e { TWO }; bpf_core_enum_value_exists(enum e, TWO); The clang emitted the following error message: __builtin_preserve_enum_value argument 1 invalid In SemaChecking, an expression like "(enum NAME)1" will have cast kind CK_IntegralToPointer, but "(enum NAME)0" will have cast kind CK_NullToPointer. Current implementation only permits CK_IntegralToPointer, missing enum value 0 case. This patch permits CK_NullToPointer cast kind and the above test case can pass now. Differential Revision: https://reviews.llvm.org/D97659	2021-03-01 10:23:24 -08:00
Sean Fertile	3f40dbbbc7	[PowerPC][AIX] Enable passing vectors in variadic functions. Differential Revision: https://reviews.llvm.org/D97474	2021-03-01 13:08:28 -05:00
Arthur Eubanks	040c1b49d7	Move EntryExitInstrumentation pass location This seems to be more of a Clang thing rather than a generic LLVM thing, so this moves it out of LLVM pipelines and as Clang extension hooks into LLVM pipelines. Move the post-inline EEInstrumentation out of the backend pipeline and into a late pass, similar to other sanitizer passes. It doesn't fit into the codegen pipeline. Also fix up EntryExitInstrumentation not running at -O0 under the new PM. PR49143 Reviewed By: hans Differential Revision: https://reviews.llvm.org/D97608	2021-03-01 10:08:10 -08:00
Jez Ng	415c0cd698	[lld-macho] Switch default to new Darwin backend The new Darwin backend for LLD is now able to link reasonably large real-world programs on x86_64. For instance, we have achieved self-hosting for the X86_64 target, where all LLD tests pass when building lld with itself on macOS. As such, we would like to make it the default back-end. The new port is now named `ld64.lld`, and the old port remains accessible as `ld64.lld.darwinold`. This [announcement email][1] has some context. (But note that, unlike what the email says, we are no longer doing this as part of the LLVM 12 branch cut -- instead we will go into LLVM 13.) Numerous mechanical test changes were required to make this change; in the interest of creating something that's reviewable on Phabricator, I've split out the boring changes into a separate diff (D95905). I plan to merge its contents with those in this diff before landing. (@gkm made the original draft of this diff, and he has agreed to let me take over.) [1]: https://lists.llvm.org/pipermail/llvm-dev/2021-January/147665.html Reviewed By: #lld-macho, thakis Differential Revision: https://reviews.llvm.org/D95204	2021-03-01 12:30:10 -05:00
Nico Weber	83feaa36ad	[clang-cl] make -f(no-)ident a CoreOption Clang emits the compiler version string into debug information by default for both dwarf and codeview. That makes compiler output needlessly compiler-version-dependent, which makes e.g. comparing object file outputs during a bisect hard. So it's nice if there's an easy way to turn this off. (On ELF, this flag also controls the .comment section, but that part is ELF-only. The debug-info bit isn't.) Differential Revision: https://reviews.llvm.org/D97695	2021-03-01 11:53:51 -05:00
Olivier Goffart	1b04bdc2f3	[SEH] capture 'this' Simply make sure that the CodeGenFunction::CXXThisValue and CXXABIThisValue are correctly initialized to the recovered value. For lambda capture, we also need to make sure to fill the LambdaCaptureFields Differential Revision: https://reviews.llvm.org/D97534	2021-03-01 11:57:35 +01:00
Benjamin Kramer	965f24d4db	[Driver] Don't litter the source directory in test	2021-03-01 11:20:13 +01:00
Gabor Horvath	dd6738d93d	[clang][Lifetimes] Fix false positive warning from BUG 49342 Differential Revision: https://reviews.llvm.org/D97605	2021-02-27 08:09:57 -08:00
Fangrui Song	2e2ee4300d	[test] Add -triple x86_64 to attr-retain.cpp	2021-02-26 19:35:53 -08:00
Fangrui Song	a0c1cd642d	[test] Add -triple x86_64 to attr-retain.c	2021-02-26 17:26:26 -08:00
Fangrui Song	8afdacba9d	Add GNU attribute 'retain' For ELF targets, GCC 11 will set SHF_GNU_RETAIN on the section of a `__attribute__((retain))` function/variable to prevent linker garbage collection. (See AttrDocs.td for the linker support). This patch adds `retain` functions/variables to the `llvm.used` list, which has the desired linker GC semantics. Note: `retain` does not imply `used`, so an unused function/variable can be dropped by Sema. Before 'retain' was introduced, previous ELF solutions require inline asm or linker tricks, e.g. `asm volatile(".reloc 0, R_X86_64_NONE, target");` (architecture dependent) or define a non-local symbol in the section and use `ld -u`. There was no elegant source-level solution. With D97448, `__attribute__((retain))` will set `SHF_GNU_RETAIN` on ELF targets. Differential Revision: https://reviews.llvm.org/D97447	2021-02-26 16:37:50 -08:00
Vladimir Vereschaka	155c49e087	[Driver] Print process statistics report on CC_PRINT_PROC_STAT env variable. Added supporting CC_PRINT_PROC_STAT and CC_PRINT_PROC_STAT_FILE environment variables to trigger clang driver reporting the process statistics into specified file (alternate for -fproc-stat-report option). Differential Revision: https://reviews.llvm.org/D97094	2021-02-26 16:16:00 -08:00
Matheus Izvekov	4a8530fc30	[clang] implicitly delete space ship operator with function pointers See bug #48856 Definitions of classes with member function pointers and default spaceship operator were getting accepted with no diagnostic on release build, and triggering assert on builds with runtime checks enabled. Diagnostics were only produced when actually comparing instances of such classes. This patch makes it so Spaceship and Less operators are not considered as builtin operator candidates for function pointers, producing equivalent diagnostics for the cases where pointers to member function and pointers to data members are used instead. Reviewed By: rsmith Differential Revision: https://reviews.llvm.org/D95409	2021-02-26 16:03:01 -08:00
Fangrui Song	28cb620321	Change some addUsedGlobal to addUsedOrCompilerUsedGlobal A global value in the `llvm.used` list does not have GC root semantics on ELF targets. This will be changed in a subsequent backend patch. Change some `llvm.used` in the ELF code path to use `llvm.compiler.used` to prevent undesired GC root semantics. Change one extern "C" alias (due to `__attribute__((used))` in extern "C") to use `llvm.compiler.used` on all targets. GNU ld has a rule "`__start_/__stop_` references from a live input section retain the associated C identifier name sections", which LLD may drop entirely (currently refined to exclude SHF_LINK_ORDER/SHF_GROUP) in a future release (the rule makes it clumsy to GC metadata sections; D96914 added a way to try the potential future behavior). For `llvm.used` global values defined in a C identifier name section, keep using `llvm.used` so that the future LLD change will not affect them. rnk kindly categorized the changes: ``` ObjC/blocks: this wants GC root semantics, since ObjC mainly runs on Mac. MS C++ ABI stuff: wants GC root semantics, no change OpenMP: unsure, but GC root semantics probably don't hurt CodeGenModule: affected in this patch to not use GC root semantics so that __attribute__((used)) behavior remains the same on ELF, plus two other minor use cases that don't want GC semantics Coverage: Probably want GC root semantics CGExpr.cpp: refers to LTO, wants GC root CGDeclCXX.cpp: one is MS ABI specific, so yes GC root, one is some other C++ init functionality, which should form GC roots (C++ initializers can have side effects and must run) CGDecl.cpp: Changed in this patch for __attribute__((used)) ``` Differential Revision: https://reviews.llvm.org/D97446	2021-02-26 10:42:07 -08:00
Petr Hosek	bf6380c096	[Driver] Don't pass -ffile-compilation-dir through to cc1 This is a driver only flag so it has to be expanded when invoking cc1. Differential Revision: https://reviews.llvm.org/D97528	2021-02-25 23:03:54 -08:00
Petr Hosek	8459b8ef39	[Driver] Rename -fprofile-{prefix-map,compilation-dir} to -fcoverage-{prefix-map,compilation-dir} These flags affect coverage mapping (-fcoverage-mapping), not -fprofile-[instr-]generate so it makes more sense to use the -fcoverage-* prefix. Differential Revision: https://reviews.llvm.org/D97434	2021-02-25 21:40:12 -08:00
Petr Hosek	9e56a093ee	[Driver] Create -ffile-compilation-dir alias We introduce -ffile-compilation-dir shorthand to avoid having to set -fdebug-compilation-dir and -fprofile-compilation-dir separately. This is similar to -ffile-prefix-map. Differential Revision: https://reviews.llvm.org/D97433	2021-02-25 21:20:10 -08:00
Justin Lebar	c90dac27e9	[clang] Print 32 candidates on the first failure, with -fshow-overloads=best. Previously, -fshow-overloads=best always showed 4 candidates. The problem is, when this isn't enough, you're kind of up a creek; the only option available is to recompile with different flags. This can be quite expensive! With this change, we try to strike a compromise. The first error with more than 4 candidates will show up to 32 candidates. All further errors continue to show only 4 candidates. The hope is that this way, users will have some chance of making forward progress, without facing unbounded amounts of error spam. Differential Revision: https://reviews.llvm.org/D95754	2021-02-25 17:45:19 -08:00
Zequan Wu	4500f0a732	[Clang][Attributes] Allow not_tail_called attribute to be applied to virtual function. It would be beneficial to allow not_tail_called attribute to be applied to virtual functions. I don't see any drawback of allowing this. Differential Revision: https://reviews.llvm.org/D96832	2021-02-25 14:58:18 -08:00
Nicolas Guillemot	3573a90b8a	[PM] Show the pass argument in pre/post-pass IR dumps This patch adds each pass' pass argument in the header for IR dumps. For example: Before: ``` * IR Dump Before InstructionSelect * ``` After: ``` * IR Dump Before InstructionSelect (instruction-select) * ``` The goal is to make it easier to know what argument to pass to command line options like `debug-only` or `run-pass` to further investigate a given pass.	2021-02-25 14:02:00 -08:00
Dan Liew	7b1d2a2891	[NFC] Switch to auto marshalling infrastructure for `-fsanitize-address-destructor-kind=` flag. This change simplifies `clang/lib/Frontend/CompilerInvocation.cpp` because we no longer need to manually parse the flag and set codegen options in the frontend. However, we still need to manually parse the flag in the driver because: * The marshalling infrastructure doesn't operate there. * We need to do some platform specific checks in the driver that will likely never be supported by any kind of marshalling infrastructure. rdar://71609176 Differential Revision: https://reviews.llvm.org/D97327	2021-02-25 13:24:50 -08:00
Akira Hatanaka	ec4408ad69	[CodeGen] Call ConvertTypeForMem instead of ConvertType This fixes a crash that occurs when the type passed to the method is `_Bool`. rdar://74493389	2021-02-25 12:11:18 -08:00
Dan Liew	fdce098b49	[Clang][ASan] Teach Clang to not emit ASan module destructors when compiling with `-mkernel` or `-fapple-kext`. rdar://71609176 Differential Revision: https://reviews.llvm.org/D96573	2021-02-25 12:02:21 -08:00
Dan Liew	5d64dd8e3c	[Clang][ASan] Introduce `-fsanitize-address-destructor-kind=` driver & frontend option. The new `-fsanitize-address-destructor-kind=` option allows control over how module destructors are emitted by ASan. The new option is consumed by both the driver and the frontend and is propagated into codegen options by the frontend. Both the legacy and new pass manager code have been updated to consume the new option from the codegen options. It would be nice if the new utility functions (`AsanDtorKindToString` and `AsanDtorKindFromString`) could live in LLVM instead of Clang so they could be consumed by other language frontends. Unfortunately that doesn't work because the clang driver doesn't link against the LLVM instrumentation library. rdar://71609176 Differential Revision: https://reviews.llvm.org/D96572	2021-02-25 12:02:21 -08:00
Christopher Di Bella	4f395db86b	adds more checks to -Wfree-nonheap-object This commit adds checks for the following: * labels * block expressions * random integers cast to `void` function pointers cast to `void*` Differential Revision: https://reviews.llvm.org/D94640	2021-02-25 19:25:00 +00:00
Jon Roelofs	7f6e331645	Support `#pragma clang section` directives on MachO targets rdar://59560986 Differential Revision: https://reviews.llvm.org/D97233	2021-02-25 09:30:10 -08:00
Stanislav Mekhanoshin	502b3bfc6a	[AMDGPU] require s-memtime-inst for __builtin_amdgcn_s_memtime Differential Revision: https://reviews.llvm.org/D97420	2021-02-25 08:31:59 -08:00
Albion Fung	3b7104a2f2	Fix a test case that should check whether or not it is passed into lld This test case was causing a PowerPC buildbot to fail as it happened to be named lld-multistage, which matches with the original regex and therefore fails the check-not. This should better represent the desired check. Differential Revision: https://reviews.llvm.org/D97423	2021-02-25 10:32:32 -05:00
Timm Bäder	2cc58463ca	[clang][sema] Ignore xor-used-as-pow if both sides are macros This happens in codebases a lot, which use xor where both sides are macros. Using xor in that case is not the common error-prone 2^6 code that the warning was introduced for. Don't diagnose such a use of xor. Differential Revision: https://reviews.llvm.org/D97445	2021-02-25 16:31:07 +01:00
Harmen Stoppels	a54f160b3a	Prefer /usr/bin/env xxx over /usr/bin/xxx where xxx = perl, python, awk Allow users to use a non-system version of perl, python and awk, which is useful in certain package managers. Reviewed By: JDevlieghere, MaskRay Differential Revision: https://reviews.llvm.org/D95119	2021-02-25 11:32:27 +01:00
Jan Svoboda	d748908fa0	[clang][cli] Round-trip the whole CompilerInvocation Finally, this patch moves from round-tripping one `CompilerInvocation` at a time to round-tripping the invocation as a whole. This patch includes only the code required to make round-tripping the whole invocation work. More cleanups will be done in a follow-up patch. Depends on D96847, D97041 & D97042. Reviewed By: dexonsmith Differential Revision: https://reviews.llvm.org/D96280	2021-02-25 11:02:49 +01:00
Pushpinder Singh	99951aa68d	OpenMP: Fix object clobbering issue when using save-temps There are two preconditions to reproduce the issue, 1. Use -save-temps option 2. Provide the -o option with name equal to the input file name without the file extension. For e.g. clang a.c -o a With the -o specified, the AssembleJobAction after OffloadWrapperJobAction will produce the object file with same name as host code object file. Due to this clash, the OffloadWrapperAction overwrites the initial host object file, which results in lld error. This also fixes the `multiple definition of __dummy.omp_offloading.entry'` issue in D96769 . Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D97273	2021-02-25 00:50:51 -05:00
Liu, Chen3	4bc7c8631a	[X86] Support amx-bf16 intrinsic. Adding support for intrinsics of AMX-BF16. This patch also fixes a bug where AMX-INT8 instructions were selected with the wrong predicate. Differential Revision: https://reviews.llvm.org/D97358	2021-02-25 09:06:48 +08:00
Yaxun (Sam) Liu	47acdec1dd	[CUDA][HIP] Support accessing static device variable in host code for -fgpu-rdc For -fgpu-rdc mode, static device vars in different TU's may have the same name. To support accessing file-scope static device variables in host code, we need to give them a distinct name and external linkage. This can be done by postfixing each static device variable with a distinct CUID (Compilation Unit ID) hash. Since the static device variables have different name across compilation units, now we let them have external linkage so that they can be looked up by the runtime. Reviewed by: Artem Belevich, and Jon Chesterfield Differential Revision: https://reviews.llvm.org/D85223	2021-02-24 18:23:45 -05:00
Markus Böck	9f1b832331	Reland "[Driver][Windows] Support per-target runtimes dir layout for profile instr generate" This relands commit rG7f9d5d6e444c which was reverted in rGab5b00ada9e7 Differential Revision: https://reviews.llvm.org/D96638	2021-02-24 23:40:20 +01:00
Anastasia Stulova	abbdb5639c	[OpenCL] Allow taking address of functions as an extension. When '__cl_clang_function_pointers' extension is enabled the parser should allow obtaining the function address. This fixes PR49264! Differential Revision: https://reviews.llvm.org/D97203	2021-02-24 12:32:02 +00:00
Sven van Haastregt	0344aea6ea	[OpenCL] Add ndrange builtin functions to TableGen Also ensure all kernel enqueue functions have CL 2.0 as minimum version. Differential Revision: https://reviews.llvm.org/D97060	2021-02-24 09:27:36 +00:00
Sven van Haastregt	85eb12eefd	[OpenCL] Add declarations with enum/typedef args Add the remaining missing builtin function declarations that have enum or typedef argument or return types. Differential Revision: https://reviews.llvm.org/D96860	2021-02-24 09:27:35 +00:00
Vitaly Buka	8560c2d426	[ThinLTO, NewPM] Run OptimizerLastEPCallbacks from buildThinLTOPreLinkDefaultPipeline -O1 and above don't call the real optimizer pipeline in ThinLTO PreLink. Also, clang can't add PostLink OptimizerLastEPCallbacks for in-process ThinLTO. This results in missing sanitizer passes with ThinLTO. A simple working solution is to just call OptimizerLastEPCallbacks at the end of buildThinLTOPreLinkDefaultPipeline. Differential Revision: https://reviews.llvm.org/D96320	2021-02-23 22:14:41 -08:00
Dávid Bolvanský	053dc95839	Reduce the number of attributes attached to each function Patch takes advantage of the implicit default behavior to reduce the number of attributes, which in turns reduces compilation time. Reviewed By: serge-sans-paille Differential Revision: https://reviews.llvm.org/D97116	2021-02-24 07:08:44 +01:00
Yaxun (Sam) Liu	a3ce7f5cd2	[HIP] Fix managed variable linkage Currently managed variables are emitted as undefined symbols, which causes difficulty for diagnosing undefined symbols for non-managed variables. This patch transforms managed variables in device compilation so that they can be emitted as normal variables. Reviewed by: Artem Belevich Differential Revision: https://reviews.llvm.org/D96195	2021-02-23 22:34:45 -05:00
Nico Weber	ab5b00ada9	Revert "[Driver][Windows] Support per-target runtimes dir layout for profile instr generate" This reverts commit `7f9d5d6e44`. Breaks check-clang everywhere, see https://reviews.llvm.org/D96638#2583608	2021-02-23 20:38:39 -05:00
Hsiangkai Wang	1a35a1b074	[RISCV] Add vadd with mask and without mask builtin. Demonstrate how to add RISC-V V builtins and lower them to IR intrinsics for V extension. Authored-by: Roger Ferrer Ibanez <rofirrim@gmail.com> Co-Authored-by: Hsiangkai Wang <kai.wang@sifive.com> Differential Revision: https://reviews.llvm.org/D93446	2021-02-24 07:57:31 +08:00
David Crook	039f79c78c	[SEMA] Added warn_decl_shadow support for structured bindings https://bugs.llvm.org/show_bug.cgi?id=40858 CheckShadow is now called for each binding in the structured binding to make sure it does not shadow any other variable in scope. This does use a custom implementation of getShadowedDeclaration, though, because a BindingDecl is not a VarDecl. Added a few unit tests for this. In theory, all the other shadow unit tests should be duplicated for the structured binding variables too, but it is probably not worth it as they use common code. The MyTuple and std interface code has been copied from live-bindings-test.cpp Reviewed By: rsmith Differential Revision: https://reviews.llvm.org/D96147	2021-02-23 13:37:05 -08:00
zero9178	7f9d5d6e44	[Driver][Windows] Support per-target runtimes dir layout for profile instr generate When targeting an MSVC triple, --dependant-libs with the name of the clang runtime library for profiling is added to the command line args. In its current implementation, clang_rt.profile-<ARCH> is chosen as the name. When building a distribution using LLVM_ENABLE_PER_TARGET_RUNTIME_DIR this fails, due to the runtime file names not having an architecture suffix in the filename. This patch refactors getCompilerRT and getCompilerRTBasename to always consider per-target runtime directories. getCompilerRTBasename now simply returns the filename component of the path found by getCompilerRT. Differential Revision: https://reviews.llvm.org/D96638	2021-02-23 22:35:19 +01:00
Joe Ellis	1b1b30cf0f	[clang][SVE] Don't warn on vector to sizeless builtin implicit conversion This commit prevents warnings from -Wconversion when a clang vector type is implicitly converted to a sizeless builtin type -- for example, when implicitly converting a fixed-predicate to a scalable predicate. The code below: 1 #include <arm_sve.h> 2 3 #define N __ARM_FEATURE_SVE_BITS 4 #define FIXED_ATTR __attribute__((arm_sve_vector_bits (N))) 5 typedef svbool_t fixed_svbool_t FIXED_ATTR; 6 7 inline fixed_svbool_t foo(fixed_svbool_t p) { 8 return svnot_z(svptrue_b64(), p); 9 } would previously raise this warning: warning: implicit conversion turns vector to scalar: \ 'fixed_svbool_t' (vector of 8 'unsigned char' values) to 'svbool_t' \ (aka '__SVBool_t') [-Wconversion] Note that many cases of these implicit conversions were already permitted because many functions inside arm_sve.h are spawned via preprocessor macros, and the call to isInSystemMacro would cover us in this case. This commit fixes the remaining cases. Differential Revision: https://reviews.llvm.org/D97053	2021-02-23 13:40:58 +00:00
Liu, Chen3	f8b9035aae	[X86] Support amx-int8 intrinsic. Adding support for intrinsics of TDPBSUD/TDPBUSD/TDPBUUD. Differential Revision: https://reviews.llvm.org/D97259	2021-02-23 17:08:05 +08:00
James Y Knight	e8617f2f18	DebugInfo: Emit "LocalToUnit" flag on local member function decls. Follow-up to `fe2dcd89ac`. Update test per review comments, restoring the "D" type to its original state, and adding new "L" type. (Sorry, this was intended to be included in the prior commit) Differential Revision: https://reviews.llvm.org/D96044	2021-02-22 18:47:15 -05:00
James Y Knight	fe2dcd89ac	DebugInfo: Emit "LocalToUnit" flag on local member function decls. Previously, the definition was so-marked, but the declaration was not. This resulted in LLVM's dwarf emission treating the function as being external, and incorrectly emitting DW_AT_external. Differential Revision: https://reviews.llvm.org/D96044	2021-02-22 17:55:25 -05:00
Shafik Yaghmour	50542d504d	Modify TypePrinter to differentiate between anonymous struct and unnamed struct Currently TypePrinter lumps anonymous classes and unnamed classes in one group "anonymous" this is not correct and can be confusing in some contexts. Differential Revision: https://reviews.llvm.org/D96807	2021-02-22 14:16:43 -08:00
Nathan James	5616c5b866	[clang] Tweaked fixit for static assert with no message If a static assert has a message as the right side of an and condition, suggest a fix it of replacing the '&&' to ','. `static_assert(cond && "Failed Cond")` -> `static_assert(cond, "Failed cond")` This use case comes up when lazily replacing asserts with static asserts. Reviewed By: aaron.ballman Differential Revision: https://reviews.llvm.org/D89065	2021-02-22 17:43:53 +00:00
Fangrui Song	bccdf6b232	Improve diagnostic for ignored GNU 'used' attribute Differential Revision: https://reviews.llvm.org/D97161	2021-02-22 09:18:13 -08:00
Shilei Tian	76151acf89	[Clang][OpenMP] Require CUDA 9.2+ for OpenMP offloading on NVPTX target In the current implementation of `deviceRTLs`, we're using some functions that are CUDA version dependent (if CUDA_VERSION < 9, it is one; otherwise, it is another one). As a result, we have to compile one bitcode library for each CUDA version supported. A worse problem is forward compatibility. If a new CUDA version is released, we have to update the CMake file as well. CUDA 9.2 was released three years ago. Instead of using various weird tricks to make `deviceRTLs` work with different CUDA versions and still have forward compatibility, we can simply drop support for CUDA 9.1 or lower. It has at least two benefits: - We don't need to generate bitcode libraries for each CUDA version; - Clang driver doesn't need to search for the bitcode lib based on CUDA version. We can claim that starting from LLVM 12, OpenMP offloading on NVPTX target requires CUDA 9.2+. Reviewed By: jdoerfert, JonChesterfield Differential Revision: https://reviews.llvm.org/D97003	2021-02-22 11:00:33 -05:00
Anastasia Stulova	cf3ef15a6e	[OpenCL] Add builtin declarations by default. This change enables the builtin function declarations in clang driver by default using the Tablegen solution along with the implicit include of 'opencl-c-base.h' header. A new flag '-cl-no-stdinc' disabling all default declarations and header includes is added. If any other mechanisms were used to include the declarations (e.g. with -Xclang -finclude-default-header) and the new default approach is not sufficient, the `-cl-no-stdinc` flag has to be used with clang to activate the old behavior. Tags: #clang Differential Revision: https://reviews.llvm.org/D96515	2021-02-22 12:24:16 +00:00
Ryan Santhiraraja	2c25efcbd3	[AArch64] Adding SHA3 Intrinsics support This patch adds the following SHA3 Intrinsics: vsha512hq_u64, vsha512h2q_u64, vsha512su0q_u64, vsha512su1q_u64 veor3q_u8 veor3q_u16 veor3q_u32 veor3q_u64 veor3q_s8 veor3q_s16 veor3q_s32 veor3q_s64 vrax1q_u64 vxarq_u64 vbcaxq_u8 vbcaxq_u16 vbcaxq_u32 vbcaxq_u64 vbcaxq_s8 vbcaxq_s16 vbcaxq_s32 vbcaxq_s64 Note need to include +sha3 and +crypto when building from the front-end Reviewed By: DavidSpickett Differential Revision: https://reviews.llvm.org/D96381	2021-02-22 12:09:20 +00:00
Balazs Benics	38b185832e	[analyzer][CTU] API for CTU macro expansions Removes `CrossTranslationUnitContext::getImportedFromSourceLocation` Removes the corresponding unit-test segment. Introduces the `CrossTranslationUnitContext::getMacroExpansionContextForSourceLocation` which will return the macro expansion context for an imported TU. Also adds a few implementation FIXME notes where applicable, since this feature is not implemented yet. This fact is also noted as Doxygen comments. Uplifts a few CTU LIT test to match the current incomplete behavior. It is a regression to some extent since now we don't expand any macros in imported TUs. At least we don't crash anymore. Note that the introduced function is already covered by LIT tests. Eg.: Analysis/plist-macros-with-expansion-ctu.c Reviewed By: balazske, Szelethus Differential Revision: https://reviews.llvm.org/D94673	2021-02-22 11:12:22 +01:00
Balazs Benics	170c67d5b8	[analyzer] Use the MacroExpansionContext for macro expansions in plists Removes the obsolete ad-hoc macro expansions during bugreport constructions. It will skip the macro expansion if the expansion happened in an imported TU. Also removes the expected plist file, while expanding matching context for the tests. Adds a previously crashing `plist-macros-with-expansion.c` testfile. Temporarily marks `plist-macros-with-expansion-ctu.c ` to `XFAIL`. Reviewed By: xazax.hun, Szelethus Differential Revision: https://reviews.llvm.org/D93224	2021-02-22 11:12:18 +01:00
Jan Svoboda	820e0c49fc	[clang][cli] Pass '-Wspir-compat' to cc1 from driver This patch moves the creation of the '-Wspir-compat' argument from cc1 to the driver. Without this change, generating command line arguments from `CompilerInvocation` cannot be done reliably: there's no way to distinguish whether '-Wspir-compat' was passed to cc1 on the command line (should be generated), or if it was created within `CompilerInvocation::CreateFromArgs` (should not be generated). This is also in line with how other '-W' flags are handled. (This was introduced in D21567.) Reviewed By: Anastasia Differential Revision: https://reviews.llvm.org/D97041	2021-02-22 09:54:44 +01:00
Brad Smith	b42d57a100	[clang][Driver][OpenBSD] libcxx also requires pthread	2021-02-20 20:53:25 -05:00
Shilei Tian	33d660939d	[Clang][OpenMP] Update driver test case for OpenMP offload to use sm_35 `sm_35` is the minimum requirement for OpenMP offloading on NVPTX device. Current driver test case is using `sm_20`. D97003 is going to switch the minimum CUDA version to 9.2, which only supports `sm_30+`. This patch makes step for the change. Reviewed By: JonChesterfield Differential Revision: https://reviews.llvm.org/D97120	2021-02-20 15:14:13 -05:00
Daan De Meyer	7dd42ecfa2	clang: Exclude efi_main from -Wmissing-prototypes When compiling UEFI applications, the main function is named efi_main() instead of main(). Let's exclude efi_main() from -Wmissing-prototypes as well to avoid warnings when working on UEFI applications. Differential Revision: https://reviews.llvm.org/D95746	2021-02-20 20:00:50 +00:00
Dávid Bolvanský	501b4fe4ed	Fixed failing test	2021-02-20 07:11:42 +01:00
Dávid Bolvanský	ee51c42e00	Reduce the number of attributes attached to each function This takes advantage of the implicit default behavior to reduce the number of attributes.	2021-02-20 06:57:47 +01:00
Dávid Bolvanský	cd54c57919	Reland "[Libcalls, Attrs] Annotate libcalls with noundef" Fixed Clang tests.	2021-02-20 06:18:48 +01:00
Petr Hosek	3275b18f89	[Coverage] Normalize compilation dir as well This matches debug info behavior. Differential Revision: https://reviews.llvm.org/D97001	2021-02-19 15:29:03 -08:00
Christopher Tetreault	55448ab540	[AArch64] Adding Neon Polynomial vadd Intrinsics This patch adds the following intrinsics: vadd_p8 vadd_p16 vadd_p64 vaddq_p8 vaddq_p16 vaddq_p64 vaddq_p128 Reviewed By: t.p.northover, DavidSpickett, ctetreau Differential Revision: https://reviews.llvm.org/D96825	2021-02-19 14:48:12 -08:00
Teresa Johnson	0923a60ea7	[clang] Emit type metadata on available_externally vtables for WPD When WPD is enabled, via WholeProgramVTables, emit type metadata for available_externally vtables. Additionally, add the vtables to the llvm.compiler.used global so that they are not prematurely eliminated (before *LTO analysis). This is needed to avoid devirtualizing calls to a function overriding a class defined in a header file but with a strong definition in a shared library. Without type metadata on the available_externally vtables from the header, the WPD analysis never sees what a derived class is overriding. Even if the available_externally base class functions are pure virtual, because shared library definitions are already treated conservatively (committed patches D91583, D96721, and D96722) we will not devirtualize, which would be unsafe since the library might contain overrides that aren't visible to the LTO unit. An example is std::error_category, which is overridden in LLVM and causing failures after a self build with WPD enabled, because libstdc++ contains hidden overrides of the virtual base class methods. Differential Revision: https://reviews.llvm.org/D96919	2021-02-19 12:42:34 -08:00
Artem Belevich	1a368ae3b7	[CUDA] fix builtin constraints for PTX 7.2 This fixes build issues w/ CUDA-11 introduced by https://reviews.llvm.org/D95974 Reviewed By: yaxunl Differential Revision: https://reviews.llvm.org/D97009	2021-02-19 09:57:21 -08:00
Nikita Popov	71a8e4e7d6	[MemCopyOpt] Enable MemorySSA by default This enables use of MemorySSA instead of MemDep in MemCpyOpt. To allow this without significant compile-time impact, the MemCpyOpt pass is moved directly before DSE (in the cases where this was not already the case), which allows us to reuse the existing MemorySSA analysis. Unlike the MemDep-based implementation, the MemorySSA-based MemCpyOpt can also perform simple optimizations across basic blocks. Differential Revision: https://reviews.llvm.org/D94376	2021-02-19 18:06:25 +01:00
Sjoerd Meijer	260f90bb3d	[AArch64] Add some missing Neoverse features This enables AES fusion and the post RA scheduler for the Neoverse cores. And while we are at it, also for the A55 that we had missed earlier. Differential Revision: https://reviews.llvm.org/D96866	2021-02-19 09:18:35 +00:00
Yaxun (Sam) Liu	51ade31e67	[HIP] Support device sanitizer Add option -fgpu-sanitize to enable sanitizer for AMDGPU target. Since it is experimental, it is off by default. Reviewed by: Artem Belevich Differential Revision: https://reviews.llvm.org/D96835	2021-02-18 23:30:25 -05:00
Richard Smith	bdf6fbc939	PR49239: Don't take shortcuts when constant evaluating in 'warn on UB' mode. We use that mode when evaluating ICEs in C, and those shortcuts could result in ICE evaluation producing the wrong answer, specifically if we evaluate a statement-expression as part of evaluating the ICE.	2021-02-18 18:31:08 -08:00
Shafik Yaghmour	9068dab1fd	Revert "Modify TypePrinter to differentiate between anonymous struct and unnamed struct" I missed clangd test suite and may need some time to get those working, so reverting for now. This reverts commit `ecb90b5545`.	2021-02-18 18:17:24 -08:00
Shafik Yaghmour	ecb90b5545	Modify TypePrinter to differentiate between anonymous struct and unnamed struct Currently TypePrinter lumps anonymous classes and unnamed classes in one group "anonymous" this is not correct and can be confusing in some contexts. Differential Revision: https://reviews.llvm.org/D96807	2021-02-18 17:44:45 -08:00
Richard Smith	3cd70fc59d	Detect diagnostic groups that are defined in multiple 'def's. Remove the three such groups that we've accumulated. These were causing duplicated output to appear in the generated diagnostic reference.	2021-02-18 17:19:01 -08:00
Petr Hosek	5fbd1a333a	[Coverage] Store compilation dir separately in coverage mapping We currently always store absolute filenames in coverage mapping. This is problematic for several reasons. It poses a problem for distributed compilation as source location might vary across machines. We are also duplicating the path prefix potentially wasting space. This change modifies how we store filenames in coverage mapping. Rather than absolute paths, it stores the compilation directory and file paths as given to the compiler, either relative or absolute. Later when reading the coverage mapping information, we recombine relative paths with the working directory. This approach is similar to handling of DW_AT_comp_dir in DWARF. Finally, we also provide a new option, -fprofile-compilation-dir akin to -fdebug-compilation-dir which can be used to manually override the compilation directory which is useful in distributed compilation cases. Differential Revision: https://reviews.llvm.org/D95753	2021-02-18 14:34:39 -08:00
Petr Hosek	fbf8b957fd	Revert "[Coverage] Store compilation dir separately in coverage mapping" This reverts commit `97ec8fa5bb` since the test is failing on some bots.	2021-02-18 12:50:24 -08:00
Pengxuan Zheng	0ec32f1326	Revert "[AArch64] Adding Neon Polynomial vadd Intrinsics" Revert the patch due to buildbot failures. This reverts commit `d9645059c5`.	2021-02-18 12:38:16 -08:00
Petr Hosek	97ec8fa5bb	[Coverage] Store compilation dir separately in coverage mapping We currently always store absolute filenames in coverage mapping. This is problematic for several reasons. It poses a problem for distributed compilation as source location might vary across machines. We are also duplicating the path prefix potentially wasting space. This change modifies how we store filenames in coverage mapping. Rather than absolute paths, it stores the compilation directory and file paths as given to the compiler, either relative or absolute. Later when reading the coverage mapping information, we recombine relative paths with the working directory. This approach is similar to handling of DW_AT_comp_dir in DWARF. Finally, we also provide a new option, -fprofile-compilation-dir akin to -fdebug-compilation-dir which can be used to manually override the compilation directory which is useful in distributed compilation cases. Differential Revision: https://reviews.llvm.org/D95753	2021-02-18 12:27:42 -08:00
Zequan Wu	d83511dd26	[Coverage] Emit gap region after conditions when macro is present.	2021-02-18 11:41:04 -08:00
Pengxuan Zheng	d9645059c5	[AArch64] Adding Neon Polynomial vadd Intrinsics This patch adds the following intrinsics: vadd_p8 vadd_p16 vadd_p64 vaddq_p8 vaddq_p16 vaddq_p64 vaddq_p128 Reviewed By: t.p.northover, DavidSpickett Differential Revision: https://reviews.llvm.org/D96825	2021-02-18 11:33:24 -08:00
Jonas Paulsson	e57bd1ff4f	[CFE, SystemZ] New target hook testFPKind() for checks of FP values. The recent commit `00a6254` "Stop traping on sNaN in builtin_isnan" changed the lowering in constrained FP mode of builtin_isnan from an FP comparison to integer operations to avoid trapping. SystemZ has a special instruction "Test Data Class" which is the preferred way to do this check. This patch adds a new target hook "testFPKind()" that lets SystemZ emit the s390_tdc intrinsic instead. testFPKind() takes the BuiltinID as an argument and is expected to soon handle more opcodes than just 'builtin_isnan'. Review: Thomas Preud'homme, Ulrich Weigand Differential Revision: https://reviews.llvm.org/D96568	2021-02-18 12:36:46 -06:00
Akira Hatanaka	b87a120820	[ObjC] Encode pointers to C++ classes as "^v" if the encoded string would otherwise include template specialization types This helps reduce the size of the encoded C++ type strings in the binary. This is enabled by default only on Darwin, but can be enabled/disabled via command line options. rdar://63288571 Differential Revision: https://reviews.llvm.org/D96816	2021-02-18 09:38:26 -08:00
Jeroen Dobbelaere	46757ccb49	[clang] functions with the 'const' or 'pure' attribute must always return. As described in * https://gcc.gnu.org/onlinedocs/gcc/Common-Function-Attributes.html#index-pure-function-attribute * https://gcc.gnu.org/onlinedocs/gcc/Common-Function-Attributes.html#index-const-function-attribute An `__attribute__((pure))` function must always return, as well as an `__attribute__((const))` function. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D96960	2021-02-18 17:29:46 +01:00
Ties Stuij	5f7715d878	Pass the cmdline aapcs bitfield options to cc1 The following commits added commandline arguments to control following the Arm Procedure Call Standard for certain volatile bitfield operations: - https://reviews.llvm.org/D67399 - https://reviews.llvm.org/D72932 This commit fixes the oversight that these args weren't passed from the driver to cc1 if appropriate. Where appropriate means: - `-faapcs-bitfield-width`: is the default, so won't be passed - `-fno-aapcs-bitfield-width`: should be passed - `-faapcs-bitfield-load`: should be passed Differential Revision: https://reviews.llvm.org/D96784	2021-02-18 15:41:20 +00:00
Stefan Pintilie	b80357d46e	[PowerPC] Add option for ROP Protection Added -mrop-protection for Power PC to turn on codegen that provides some protection from ROP attacks. The option is off by default and can be turned on for Power 8, Power 9 and Power 10. This patch is for the option only. The feature will be implemented by a later patch. Reviewed By: amyk Differential Revision: https://reviews.llvm.org/D96512	2021-02-18 12:15:50 +00:00
Vitaly Buka	3afc8161b0	[NFC] Simplify msan test	2021-02-17 22:10:42 -08:00
Igor Kudrin	a0c9ec1f5e	[Driver] Honor "-gdwarf-N" at any position for assembler sources This fixes an issue when "-gdwarf-N" switch was ignored if it was given before another debug option. Differential Revision: https://reviews.llvm.org/D96865	2021-02-18 10:36:42 +07:00
Hsiangkai Wang	766ee1096f	[Clang][RISCV] Define RISC-V V builtin types Add the types for the RISC-V V extension builtins. These types will be used by the RISC-V V intrinsics which require types of the form <vscale x 1 x i64>(LMUL=1 element size=64) or <vscale x 4 x i32>(LMUL=2 element size=32), etc. The vector_size attribute does not work for us as it doesn't create a scalable vector type. We want these types to be opaque and have no operators defined for them. We want them to be sizeless. This makes them similar to the ARM SVE builtin types. But we will have quite a bit more types. This patch adds around 60. Later patches will add another 230 or so types representing tuples of these types similar to the x2/x3/x4 types in ARM SVE. But with extra complexity that these types are combined with the LMUL concept that is unique to RISCV. For more background see this RFC http://lists.llvm.org/pipermail/llvm-dev/2020-October/145850.html Authored-by: Roger Ferrer Ibanez <roger.ferrer@bsc.es> Co-Authored-by: Hsiangkai Wang <kai.wang@sifive.com> Differential Revision: https://reviews.llvm.org/D92715	2021-02-18 10:17:31 +08:00
Joerg Sonnenberger	2628e91461	[NetBSD] Use cortex-a8 as default CPU for ARMv7 This matches the platform default for GCC. It primarily matters when the integrated assembler is not used as there is no default CPU defined for ARMv7-A and GNU as is upset with -mcpu=generic.	2021-02-18 01:53:04 +01:00
Heejin Ahn	0b5d2b0efd	[WebAssembly] Remove dependency of reference types from EH The new spec does not have `exnref` so EH does not have dependency of the reference types proposal anymore. Reviewed By: dschuff Differential Revision: https://reviews.llvm.org/D96903	2021-02-17 16:10:59 -08:00
Stanislav Mekhanoshin	a8d9d50762	[AMDGPU] gfx90a support Differential Revision: https://reviews.llvm.org/D96906	2021-02-17 16:01:32 -08:00
Fangrui Song	0c2bb6b446	[Driver] Clean up some Separate form options Drop the `Separate` form of `-fmodule-name X`, `-fprofile-remapping-file X`, and `-frewrite-map-file X`. To the best of my knowledge they are not used. Their conventional Joined forms (`-fFOO=`) should be used instead. `-fdebug-compilation-dir X` is used in several places, e.g. chromium/infra/goma. It is also advertised in http://blog.llvm.org/2019/11/deterministic-builds-with-clang-and-lld.html So we keep it but make the EQ form canonical and the Separate form an alias. Differential Revision: https://reviews.llvm.org/D96886	2021-02-17 13:49:41 -08:00
Sriraman Tallam	e741916330	Basic block sections should not enable function sections implicitly. Basic block sections enable function sections implicitly; this is not needed and is inefficient with the "=list" option. We had basic block sections enable function sections implicitly in clang. This is particularly inefficient with the "=list" option as it places functions that do not have any basic block sections in separate sections. This causes unnecessary object file overhead for large applications. This patch disables this implicit behavior. It only creates function sections for those functions that require basic block sections. This patch is the second of two patches and this patch removes the implicit enabling of function sections with basic block sections in clang. Differential Revision: https://reviews.llvm.org/D93876	2021-02-17 12:37:50 -08:00
Sven van Haastregt	23d65aa446	[OpenCL] Support enum and typedef args in TableGen BIFs Add enum and typedef argument support to `-fdeclare-opencl-builtins`, which was the last major missing feature. Adding the remaining missing builtins is left as future work. Differential Revision: https://reviews.llvm.org/D96051	2021-02-17 14:17:43 +00:00
Igor Kudrin	72eee60b24	[Driver] Support -gdwarf64 for assembly files The option was added in D90507 for C/C++ source files. This patch adds support for assembly files. Differential Revision: https://reviews.llvm.org/D96783	2021-02-17 17:03:34 +07:00
Igor Kudrin	aa84289629	[DebugInfo] Keep the DWARF64 flag in the module metadata This allows the option to affect the LTO output. Module::Max helps to generate debug info for all modules in the same format. Differential Revision: https://reviews.llvm.org/D96597	2021-02-17 17:03:34 +07:00
Anton Zabaznov	e1a64aa66c	[OpenCL] Create VoidPtrTy with generic AS in C++ for OpenCL mode This change affects 'SemaOpenCLCXX/newdelete.cl' test, thus the patch contains adjustments in types validation of operators new and delete Reviewed By: Anastasia Differential Revision: https://reviews.llvm.org/D96178	2021-02-17 12:18:46 +03:00
Balázs Kéri	085dcc8217	[clang][Frontend] Fix a crash in DiagnosticRenderer. Displaying the problem range could crash if the begin and end of a range is in different files or macros. After the change such range is displayed only as the beginning location. There is a bug for this problem: https://bugs.llvm.org/show_bug.cgi?id=46540 Reviewed By: steakhal Differential Revision: https://reviews.llvm.org/D95860	2021-02-17 09:02:49 +01:00
Alexey Bataev	60d71a286b	[OPENMP50]Allow overlapping mapping in target constructs. OpenMP 5.0 removed many restrictions on overlapped mapped items compared to OpenMP 4.5. The patch restricts the checks for overlapped data mappings to OpenMP 4.5 and earlier, and reorders mapping of the arguments so that present and alloc mappings are processed first and then all others. Differential Revision: https://reviews.llvm.org/D86119	2021-02-16 14:42:08 -08:00
Yang Fan	fbee4a0c79	[C++20] [P1825] More implicit moves Implement all of P1825R0: - implicitly movable entity can be an rvalue reference to non-volatile automatic object. - operand of throw-expression can be a function or catch-clause parameter (support for function parameter has already been implemented). - in the first overload resolution, the selected function need not be a constructor. - in the first overload resolution, the first parameter of the selected function need not be an rvalue reference to the object's type. This patch also removes the diagnostic `-Wreturn-std-move-in-c++11`. Differential Revision: https://reviews.llvm.org/D88220	2021-02-16 17:24:20 -05:00
Michael Kruse	6c05005238	[OpenMP] Implement '#pragma omp tile', by Michael Kruse (@Meinersbur). The tile directive is in OpenMP's Technical Report 8 and foreseeably will be part of the upcoming OpenMP 5.1 standard. This implementation is based on an AST transformation providing a de-sugared loop nest. This makes it simple to forward the de-sugared transformation to loop associated directives taking the tiled loops. In contrast to other loop associated directives, the OMPTileDirective does not use CapturedStmts. Letting loop associated directives consume loops from different capture context would be difficult. A significant amount of code generation logic is taking place in the Sema class. Eventually, I would prefer if these would move into the CodeGen component such that we could make use of the OpenMPIRBuilder, together with flang. Only expressions converting between the language's iteration variable and the logical iteration space need to take place in the semantic analyzer: Getting the number of iterations (e.g. the overload resolution of `std::distance`) and converting the logical iteration number to the iteration variable (e.g. overload resolution of `iteration + .omp.iv`). In clang, only CXXForRangeStmt is also represented by its de-sugared components. However, OpenMP loops are not defined as syntactic sugar. Starting with an AST-based approach allows us to gradually move generated AST statements into CodeGen, instead of all at once. I would also like to refactor `checkOpenMPLoop` into its functionalities in a follow-up. In this patch it is used twice. Once for checking proper nesting and emitting diagnostics, and additionally for deriving the logical iteration space per-loop (instead of for the loop nest). Differential Revision: https://reviews.llvm.org/D76342	2021-02-16 09:45:07 -08:00
serge-sans-paille	3c8bf29f14	Reduce the number of attributes attached to each function This takes advantage of the implicit default behavior to reduce the number of attributes, which in turns reduces compilation time. I've observed -3% in instruction count when compiling sqlite3 amalgamation with -O0 Differential Revision: https://reviews.llvm.org/D96400	2021-02-16 16:19:54 +01:00
Jan Svoboda	32389346ed	[clang][cli] Generate -f[no-]finite-loops arguments This patch generates the `-f[no-]finite-loops` arguments from `CompilerInvocation` (added in D96419), fixing test failures of Clang built with `-DCLANG_ROUND_TRIP_CC1_ARGS=ON`. Reviewed By: fhahn Differential Revision: https://reviews.llvm.org/D96761	2021-02-16 14:39:20 +01:00
Johannes Doerfert	1dd66e6111	[OpenMP] Delay more diagnostics of potentially non-emitted code Even code in target and declare target regions might not be emitted. With this patch we delay more diagnostics and use laziness and linkage to determine if a function is emitted (for the device). Note that we still eagerly emit diagnostics for target regions, unfortunately, see the TODO for the reason. This hopefully fixes PR48933. Reviewed By: JonChesterfield Differential Revision: https://reviews.llvm.org/D95928	2021-02-15 13:17:05 -06:00
Johannes Doerfert	f9286b434b	[OpenMP] Attribute target diagnostics properly Type errors in function declarations were not (always) diagnosed prior to this patch. Furthermore, certain remarks did not get associated properly which caused them to be emitted multiple times. Reviewed By: JonChesterfield Differential Revision: https://reviews.llvm.org/D95912	2021-02-15 13:16:55 -06:00
Johannes Doerfert	3b2f19d0bc	[OpenMP][NFC] Pre-commit test changes regarding PR48933 This will highlight the effective changes in subsequent commits. Reviewed By: ABataev Differential Revision: https://reviews.llvm.org/D95903	2021-02-15 13:16:44 -06:00
Valeriy Savchenko	6f21adac6d	[analyzer][NFC] Fix test failures for builds w/o assertions	2021-02-15 16:38:15 +03:00
Deep Majumder	21daada950	[analyzer] Fix static_cast on pointer-to-member handling This commit fixes bug #48739. The bug was caused by the way static_casts on pointer-to-member caused the CXXBaseSpecifier list of a MemberToPointer to grow instead of shrink. The list is now grown by implicit casts and corresponding entries are removed by static_casts. No-op static_casts cause no effect. Reviewed By: vsavchenko Differential Revision: https://reviews.llvm.org/D95877	2021-02-15 11:44:37 +03:00
Wang, Pengfei	61da20575d	[X86] Convert fmin/fmax _mm_reduce_* intrinsics to emit llvm.reduction intrinsics (PR47506) This is a follow up of D92940. We have successfully converted fadd/fmul _mm_reduce_* intrinsics to llvm.reduction + reassoc flag. We can do the same approach for fmin/fmax too, i.e. llvm.reduction + nnan flag. Reviewed By: spatel Differential Revision: https://reviews.llvm.org/D93179	2021-02-15 08:52:06 +08:00
Malhar	74ddacd30d	[Clang] Ensure vector predication loop metadata is always emitted when pragma is specified. This patch ensures that vector predication and vectorization width pragmas work together correctly/as expected. Specifically, this patch fixes the issue that when vectorization_width > 1, the vector predication behaviour (this would matter if it has NOT been disabled explicitly by a pragma) was getting ignored, which was incorrect. The fix here removes the dependence of vector predication on the vectorization width. The loop metadata corresponding to clang loop pragma vectorize_predicate is always emitted, if the pragma is specified, even if vectorization is disabled by vectorize_width(1) or vectorize(disable) since the option is also used for interleaving by the LoopVectorize pass. Reviewed By: dmgreen, Meinersbur Differential Revision: https://reviews.llvm.org/D94779	2021-02-13 17:35:54 -06:00
Fangrui Song	39db16e75b	[test] Make ELF tests less reliant on the lexicographical order of non-local symbols	2021-02-13 01:01:06 -08:00
Artur Gainullin	ff50b121e3	[SYCL] Ignore file-scope asm during device-side SYCL compilation. Reviewed By: bader, eandrews Differential Revision: https://reviews.llvm.org/D96538	2021-02-12 17:00:45 -08:00
Jonas Paulsson	b3ac5b84cd	[SystemZ] Fix vecintrin.h to not emit alignment hints in vec_xl/vec_xst. vec_xl() and vec_xst() should not emit alignment hints since they take a scalar pointer and also add a byte offset if passed. This patch uses memcpy to achieve the desired result. Review: Ulrich Weigand Differential Revision: https://reviews.llvm.org/D96471	2021-02-12 18:26:36 -06:00
Florian Hahn	51bf4c0e6d	[clang] Add -ffinite-loops & -fno-finite-loops options. This patch adds 2 new options to control when Clang adds `mustprogress`: 1. -ffinite-loops: assume all loops are finite; mustprogress is added to all loops, regardless of the selected language standard. 2. -fno-finite-loops: assume no loop is finite; mustprogress is not added to any loop or function. We could add mustprogress to functions without loops, but we would have to detect that in Clang, which is probably not worth it. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D96419	2021-02-12 19:25:49 +00:00
Amy Huang	3fe465fb2c	Revert "[DebugInfo] Add an attribute to force type info to be emitted for" Didn't mean to commit this. This reverts commit `1b5c2915a2`.	2021-02-12 10:18:17 -08:00
Amy Huang	1b5c2915a2	[DebugInfo] Add an attribute to force type info to be emitted for class types. The goal is to provide a way to bypass constructor homing when emitting class definitions and force class definitions in the debug info. Not sure about the wording of the attribute, or whether it should be specific to classes with constructors	2021-02-12 10:16:49 -08:00
Akira Hatanaka	ed4718eccb	[ObjC][ARC] Use operand bundle 'clang.arc.attachedcall' instead of explicitly emitting retainRV or claimRV calls in the IR Background: This fixes a longstanding problem where llvm breaks ARC's autorelease optimization (see the link below) by separating calls from the marker instructions or retainRV/claimRV calls. The backend changes are in https://reviews.llvm.org/D92569. https://clang.llvm.org/docs/AutomaticReferenceCounting.html#arc-runtime-objc-autoreleasereturnvalue What this patch does to fix the problem: - The front-end adds operand bundle "clang.arc.attachedcall" to calls, which indicates the call is implicitly followed by a marker instruction and an implicit retainRV/claimRV call that consumes the call result. In addition, it emits a call to @llvm.objc.clang.arc.noop.use, which consumes the call result, to prevent the middle-end passes from changing the return type of the called function. This is currently done only when the target is arm64 and the optimization level is higher than -O0. - ARC optimizer temporarily emits retainRV/claimRV calls after the calls with the operand bundle in the IR and removes the inserted calls after processing the function. - ARC contract pass emits retainRV/claimRV calls after the call with the operand bundle. It doesn't remove the operand bundle on the call since the backend needs it to emit the marker instruction. The retainRV and claimRV calls are emitted late in the pipeline to prevent optimization passes from transforming the IR in a way that makes it harder for the ARC middle-end passes to figure out the def-use relationship between the call and the retainRV/claimRV calls (which is the cause of PR31925). - The function inliner removes an autoreleaseRV call in the callee if nothing in the callee prevents it from being paired up with the retainRV/claimRV call in the caller. It then inserts a release call if claimRV is attached to the call since autoreleaseRV+claimRV is equivalent to a release. If it cannot find an autoreleaseRV call, it tries to transfer the operand bundle to a function call in the callee. This is important since the ARC optimizer can remove the autoreleaseRV returning the callee result, which makes it impossible to pair it up with the retainRV/claimRV call in the caller. If that fails, it simply emits a retain call in the IR if retainRV is attached to the call and does nothing if claimRV is attached to it. - SCCP refrains from replacing the return value of a call with a constant value if the call has the operand bundle. This ensures the call always has at least one user (the call to @llvm.objc.clang.arc.noop.use). - This patch also fixes a bug in replaceUsesOfNonProtoConstant where multiple operand bundles of the same kind were being added to a call. Future work: - Use the operand bundle on x86-64. - Fix the auto upgrader to convert call+retainRV/claimRV pairs into calls with the operand bundles. rdar://71443534 Differential Revision: https://reviews.llvm.org/D92808	2021-02-12 09:51:57 -08:00
Florian Hahn	fb4d8fe807	[clang] Update mustprogress tests. This unifies the positive and negative tests in a single file and manually adjusts the check lines to check for differences surgically.	2021-02-12 16:53:51 +00:00
Yaxun (Sam) Liu	053e61d54e	Relands "[HIP] Change default --gpu-max-threads-per-block value to 1024" This reverts commit `e384e94fbe`.	2021-02-12 10:53:59 -05:00
Pushpinder Singh	79401b43ce	[OpenMP][AMDGPU] Add support for linking libomptarget bitcode This patch uses the existing logic of CUDA for searching libomptarget and extracts it to a common method. Reviewed By: JonChesterfield, tianshilei1992 Differential Revision: https://reviews.llvm.org/D96248	2021-02-12 00:42:41 -05:00
Vitaly Buka	686b65f85f	[Msan, NewPM] Reduce size of msan binaries EarlyCSEPass called after msan reduces code size by about 10%. Similar optimization exists for legacy pass manager in addGeneralOptsForMemorySanitizer. Reviewed By: eugenis Differential Revision: https://reviews.llvm.org/D96406	2021-02-11 16:07:18 -08:00
James Y Knight	8043d5a964	NFC: update clang tests to check ordering and alignment for atomicrmw/cmpxchg. The ability to specify alignment was recently added, and it's an important property which we should ensure is set as expected by Clang. (Especially before making further changes to Clang's code in this area.) But, because it's on the end of the lines, the existing tests all ignore it. Therefore, update all the tests to also verify the expected alignment for atomicrmw and cmpxchg. While I was in there, I also updated uses of 'load atomic' and 'store atomic', and added the memory ordering, where that was missing.	2021-02-11 17:35:09 -05:00
Hafiz Abid Qadeer	60bed4ab57	Replace deprecated %T in 2 tests. In D91442, @MaskRay commented about a failure. This commit does the following to address his comments: 1. Replace %T with %t as former is deprecated. 2. Add an explicit --sysroot argument in a test. Some tests were failing when gcc-10-riscv64-linux-gnu is installed on test machine. This was happening because the test was checking a case when --gcc-toolchain is not provided. But if --sysroot was also not provided then code could pick a toolchain installed in /usr. So to make the test more robust, I have provided an explicit --sysroot argument. Its value has been chosen to match the existing patterns. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D93023	2021-02-11 22:21:21 +00:00
Pengxuan Zheng	61cca0f2e5	[AArch64] Adding Neon Sm3 & Sm4 Intrinsics This adds SM3 and SM4 Intrinsics support for AArch64, specifically: vsm3ss1q_u32 vsm3tt1aq_u32 vsm3tt1bq_u32 vsm3tt2aq_u32 vsm3tt2bq_u32 vsm3partw1q_u32 vsm3partw2q_u32 vsm4eq_u32 vsm4ekeyq_u32 Reviewed By: labrinea Differential Revision: https://reviews.llvm.org/D95655	2021-02-11 14:20:20 -08:00
Douglas Yung	7b4832648a	NFCI. With the move to the new pass manager by default, sanitize-coverage.c is now passing on ARM. This change removes the XFAIL from the original test and duplicates the test into sanitize-coverage-old-pm.c which uses the old pass manager and has the corresponding XFAIL. This should fix the XPASS from this and similar runs: http://lab.llvm.org:8011/#/builders/60/builds/1875	2021-02-11 13:18:18 -08:00
Nick Desaulniers	a680bc3a31	[clang][Arm] Fix handling of -Wa,-implicit-it= Similar to D95872, this flag can be set for the assembler directly. Move validation code into a reusable helper function. Link: https://bugs.llvm.org/show_bug.cgi?id=49023 Link: https://github.com/ClangBuiltLinux/linux/issues/1270 Reported-by: Arnd Bergmann <arnd@kernel.org> Signed-off-by: Nick Desaulniers <ndesaulniers@google.com> Reviewed By: DavidSpickett Differential Revision: https://reviews.llvm.org/D96285	2021-02-11 10:51:25 -08:00
Stella Stamenova	ed98676fa4	Support multi-configuration generators correctly in several config files Multi-configuration generators (such as Visual Studio and Xcode) allow the specification of a build flavor at build time instead of config time, so the lit configuration files need to support that - and they do for the most part. There are several places that had one of two issues (or both!): 1) Paths had %(build_mode)s set up, but then not configured, resulting in values that would not work correctly e.g. D:/llvm-build/%(build_mode)s/bin/dsymutil.exe 2) Paths did not have %(build_mode)s set up, but instead contained $(Configuration) (which is the value for Visual Studio at configuration time, for Xcode they would have had the equivalent) e.g. "D:/llvm-build/$(Configuration)/lib". This seems to indicate that we still have a lot of fragility in the configurations, but also that a number of these paths are never used (at least on Windows) since the errors appear to have been there a while. This patch fixes the configurations and it has been tested with Ninja and Visual Studio to generate the correct paths. We should consider removing some of these settings altogether. Reviewed By: JDevlieghere, mehdi_amini Differential Revision: https://reviews.llvm.org/D96427	2021-02-11 09:32:20 -08:00
Aaron Ballman	059a335ee9	Store the calculated constant expression value into the ConstantExpr object With https://reviews.llvm.org/D63376, we began storing the APValue directly into the ConstantExpr object so that we could reuse the calculated value later. However, it missed a case when not in C++11 mode but the expression is known to be constant.	2021-02-11 10:18:16 -05:00
Valeriy Savchenko	81a9707723	[Attr] Apply GNU-style attributes to expression statements Before this commit, expression statements could not be annotated with statement attributes. Whenever the parser found an attribute, it unconditionally assumed that it was followed by a declaration. This not only doesn't allow expression statements to have attributes, but also produces spurious error diagnostics. In order to maintain all previously compiled code, we still assume that GNU attributes are followed by declarations unless ALL of those are statement attributes. And even in this case we are not forcing the parser to think that it should parse a statement, but rather let it proceed as if no attributes were found. Differential Revision: https://reviews.llvm.org/D93630	2021-02-11 16:44:41 +03:00
Aaron Ballman	81bc1365d8	Correct swift_bridge duplicate attribute warning logic The swift_bridge attribute warns when the attribute is applied multiple times to the same declaration. However, it warns about the arguments being different to the attribute without ever checking if the arguments actually are different. If the arguments are different, diagnose, otherwise silently accept the code. Either way, drop the duplicated attribute.	2021-02-11 07:11:27 -05:00
Haojian Wu	6c47eafb39	[clang][index] report references from UnresolvedLookupExpr. Fix https://github.com/clangd/clangd/issues/675 Differential Revision: https://reviews.llvm.org/D96262	2021-02-11 11:08:26 +01:00
Sam McCall	5c55d3747b	[CodeComplete] Member completion: heuristically resolve some dependent base exprs Today, inside a template, you can get completion for: Foo<T> t; t.^ t has dependent type Foo<T>, and we use the primary template to find its members. However we also want this to work: t.foo.bar().^ The type of t.foo.bar() is DependentTy, so we attempt to resolve using similar heuristics (e.g. primary template). Differential Revision: https://reviews.llvm.org/D96376	2021-02-11 11:03:40 +01:00
Sven van Haastregt	0b448854da	[OpenCL] Add cl_khr_subgroup_extended_types to TableGen BIFs Add the builtin functions brought by the cl_khr_subgroup_extended_types extension to `-fdeclare-opencl-builtins`. Differential Revision: https://reviews.llvm.org/D96279	2021-02-11 09:32:42 +00:00
Vitaly Buka	b6051f52ac	[Clang, NewPM] Add KMSan support Depends on D96320. Reviewed By: eugenis Differential Revision: https://reviews.llvm.org/D96328	2021-02-10 14:07:49 -08:00
Vitaly Buka	228f00bd75	[NFC] Simplify test Redundant check-prefixes are needed for follow-up patches.	2021-02-10 13:57:36 -08:00
Erik Pilkington	1e8afba6f1	[clang] Add support for attribute 'swift_async_error' This attribute specifies how an error is represented for a swift async method. rdar://71941280 Differential revision: https://reviews.llvm.org/D96175	2021-02-10 13:18:13 -05:00
Paul Robinson	5ea2d4fa48	Avoid conflicts between debug-info and pseudo-probe profiling After D93264, using both -fdebug-info-for-profiling and -fpseudo-probe-for-profiling will cause the compiler to crash. Diagnose these conflicting options in the driver. Also, the existing CodeGen test was using the driver when it should be running cc1. Differential Revision: https://reviews.llvm.org/D96354	2021-02-10 07:09:18 -08:00
Nico Weber	c6a1b16db7	clang: try to fix Driver/undefined-libs.cpp on non-linux	2021-02-10 09:45:04 -05:00
Timm Bäder	6f9db455a5	[clang][NFC] Fix undefined-libs tests Not all platforms accept -stdlib or -rtlib. Instead of complaining about the wrong argument to these options, clang complains about the option itself being present. Pass an appropriate -target to the clang invocations.	2021-02-10 15:01:09 +01:00
Sven van Haastregt	a7d01772ac	[OpenCL] Add cl_khr_subgroup_clustered_reduce to TableGen BIFs Add the builtin functions brought by the cl_khr_subgroup_clustered_reduce extension to `-fdeclare-opencl-builtins`.	2021-02-10 09:44:52 +00:00
Sven van Haastregt	9ae99a0de8	[OpenCL] Add cl_khr_subgroup_non_uniform_arithmetic to TableGen BIFs Add the builtin functions brought by the cl_khr_subgroup_non_uniform_arithmetic extension to `-fdeclare-opencl-builtins`. Differential Revision: https://reviews.llvm.org/D95951	2021-02-10 09:44:39 +00:00
Artem Dergachev	ddb01010b2	Revert "[analyzer] RetainCountChecker: Add a suppression for OSSymbols." This reverts commit `3500cc8d89`. This old commit was made over a completely false premise. OSSymbols aren't different from other OSObjects and we shouldn't treat them differently for the purposes of static analysis.	2021-02-09 23:44:33 -08:00
Timm Bäder	a6439b5208	[clang][driver] Only warn once about invalid library values Since ToolChain::GetCXXStdlibType() is a simple getter that might emit the "invalid library name in argument" warning, it can conceivably be called several times while initializing the build pipeline. Before this patch, a simple 'clang++ -stdlib=foo ./test.cpp' would print the warning twice, -rtlib=foo would print 6 times. Change this and always only print the warning once. Keep the rest of the semantics of the functions. Differential Revision: https://reviews.llvm.org/D95915	2021-02-10 06:19:52 +01:00
Richard Smith	d5d8c529ab	PR48545: Access check the inherited constructor, not the inheriting constructor. We got this wrong only when forming a CXXTemporaryObjectExpr, which caused the bug to only appear for certain syntactic forms.	2021-02-09 13:27:55 -08:00
Nico Weber	de1966e542	Revert "[ObjC][ARC] Use operand bundle 'clang.arc.rv' instead of explicitly" This reverts commit `4a64d8fe39`. Makes clang crash when building trivial iOS programs, see comment after https://reviews.llvm.org/D92808#2551401	2021-02-09 11:06:32 -05:00
Anastasia Stulova	79b222c39f	[OpenCL] Fix types with signed prefix in arginfo metadata. Signed prefix is removed and the single word spelling is printed for the scalar types. Tags: #clang Differential Revision: https://reviews.llvm.org/D96161	2021-02-09 15:13:19 +00:00
Wang, Pengfei	dd2460ed5d	[X86] Always assign reassoc flag for intrinsics reduce_add/mul_ps/pd. Intrinsics reduce_add/mul_ps/pd assume that the elements in the vector are reassociable. So we need to always assign the reassoc flag when we call _mm_reduce_* intrinsics. Reviewed By: spatel Differential Revision: https://reviews.llvm.org/D96231	2021-02-09 21:14:06 +08:00
Vitaly Buka	03c6a6d9ef	[NFC,Clang] Add more Asan Driver tests	2021-02-09 03:08:00 -08:00
Vitaly Buka	4ddf7562d5	[NFC,Clang] Add SanCov Driver tests	2021-02-09 03:08:00 -08:00
Vitaly Buka	dde9f0fa98	[NFC,Clang] Add LTO Driver MSan,KMsan tests	2021-02-09 03:08:00 -08:00
Vitaly Buka	9ff678f614	[NFC,Clang] Add LTO Driver DFsan tests	2021-02-09 03:08:00 -08:00
Vitaly Buka	ea891099f2	[NFC,Clang] Add LTO Driver Tsan tests	2021-02-09 03:08:00 -08:00
Valeriy Savchenko	2f994d4ee9	[-Wcompletion-handler][NFC] Remove unexpected warnings on Windows	2021-02-09 13:50:11 +03:00
Valeriy Savchenko	d1522d349f	[-Wcompletion-handler] Support checks with builtins It is very common to check callbacks and completion handlers for null. This patch supports such checks using built-in functions: * __builtin_expect * __builtin_expect_with_probability * __builtin_unpredictable rdar://73455388 Differential Revision: https://reviews.llvm.org/D96268	2021-02-09 11:32:24 +03:00
Yaxun (Sam) Liu	98c21289f1	[CUDA][HIP] Add -fuse-cuid This patch added a distinct CUID for each input file, which is represented by InputAction. clang initially creates an InputAction for each input file for the host compilation. In CUDA/HIP action builder, each InputAction is given a CUID and cloned for each GPU arch, and the CUID is also cloned. In this way, we guarantee the corresponding device and host compilation for the same file share the same CUID. On the other hand, different compilation units have different CUIDs. -fuse-cuid=random|hash|none is added to control the method to generate CUID. The default is hash. -cuid=X is also added to specify CUID explicitly, which overrides -fuse-cuid. Reviewed by: Artem Belevich Differential Revision: https://reviews.llvm.org/D95007	2021-02-08 22:26:12 -05:00
Richard Smith	21e8bb8325	PR48606: The lifetime of a constexpr heap allocation always started during the same evaluation. It looks like the only case for which this matters is determining whether mutable subobjects of a heap allocation can be modified during constant evaluation.	2021-02-08 17:58:05 -08:00
Richard Smith	c945dc4a50	PR48587: is_constant_evaluated() should not evaluate to true during a variable's destruction if it didn't do so during construction. The standard doesn't give any guidance as to what to do here, but this approach seems reasonable and conservative, and has been proposed to the standard committee.	2021-02-08 17:34:40 -08:00
Yaxun (Sam) Liu	52f312c69e	Fix failure in cuda-external-tools.cu -fgpu-rdc is output in different order	2021-02-08 19:27:43 -05:00
Argyrios Kyrtzidis	a8cb39bab0	Make sure a module file with errors produced via '-fallow-pcm-with-compiler-errors' can be loaded when using implicit modules A module with errors would be marked as out-of-date, then the `compilerModule` action would produce it, but due to the error it would be treated as failure and the resulting PCM would not get used. rdar://74087062 Differential Revision: https://reviews.llvm.org/D96246	2021-02-08 16:10:39 -08:00
Yaxun (Sam) Liu	1dab94f9ed	[CUDA][HIP] Pass -fgpu-rdc to host clang -cc1 Currently -fgpu-rdc is not passed to host clang -cc1. This causes issue because -fgpu-rdc affects shadow variable linkage in host compilation. Reviewed by: Artem Belevich Differential Revision: https://reviews.llvm.org/D96105	2021-02-08 19:08:20 -05:00
Fangrui Song	87dbdd2e3b	[FileCheck] Default --allow-unused-prefixes to false Link: https://lists.llvm.org/pipermail/llvm-dev/2020-October/146162.html "[RFC] FileCheck: (dis)allowing unused prefixes" If a downstream project using lit needs time for transition, add the following to `lit.local.cfg`: ``` from lit.llvm.subst import ToolSubst fc = ToolSubst('FileCheck', unresolved='fatal') config.substitutions.insert(0, (fc.regex, 'FileCheck --allow-unused-prefixes')) ``` Differential Revision: https://reviews.llvm.org/D95849	2021-02-08 13:37:04 -08:00
Xiangling Liao	6b1e2fc893	[FE] Manipulate the first byte of guard variable type in both load and store operation As Itanium ABI[http://itanium-cxx-abi.github.io/cxx-abi/abi.html#once-ctor] points out: "The size of the guard variable is 64 bits. The first byte (i.e. the byte at the address of the full variable) shall contain the value 0 prior to initialization of the associated variable, and 1 after initialization is complete." Differential Revision: https://reviews.llvm.org/D95822	2021-02-08 11:14:34 -05:00
Anastasia Stulova	ecc8ac3f08	[OpenCL] Fix pipe type printing in arg info metadata Pipe element type spelling for arg info metadata should follow the same behavior as normal type spelling. We should only use the canonical type spelling in the base type field. This patch also removed duplication in type handling. Tags: #clang Differential Revision: https://reviews.llvm.org/D96151	2021-02-08 16:05:13 +00:00
einvbri	9083d0a40d	Revert "[Sema] Fix -Warray-bounds false negative when casting an out-of-bounds array item" This reverts commit `e48f444751`. thakis noticed false reports, so reverting this change for now until those can be sorted out. See https://reviews.llvm.org/D71714	2021-02-08 06:38:31 -06:00
Kadir Cetinkaya	f743184911	[clang][CodeComplete] Fix crash on ParenListExprs Fixes https://github.com/clangd/clangd/issues/676. Differential Revision: https://reviews.llvm.org/D95935	2021-02-08 13:16:49 +01:00
Jan Svoboda	e22677bbdb	Reapply "[clang][cli] Report result of ParseLangArgs" This reverts commit `6039f821` and reapplies `bff6d9bb`. Clang's Index/implicit-attrs.m test invokes c-index-test with -fobjc-arc. This flag is not compatible with -fobjc-runtime=gcc, which gets implied on Linux. The original commit uncovered this by correctly reporting issues when parsing -cc1 command line. This commit fixes the test to explicitly provide ObjectiveC runtime compatible with ARC.	2021-02-08 13:14:43 +01:00
Jan Svoboda	c1b482e726	[clang][index] Mark file as C++ in parse-all-comments test `CompilerInvocation::CreateFromArgs` doesn't always report command line parsing failures through the return value. Sometimes, errors are only reported via diagnostics. Some clients like `c-index-test` only check the return value and don't check the state of `DiagnosticsEngine`. If we were to start returning the correct return value from `CreateFromArgs`, this index test starts to fail, because it specifies `-std=c++11` for a C input, which is invalid. This patch fixes that issue by adding forgotten `-x c++` argument. Reviewed By: dexonsmith Differential Revision: https://reviews.llvm.org/D95879	2021-02-08 09:42:44 +01:00
Sam Clegg	38a285885d	[clang][emscripten] Add builtin define for __EMSCRIPTEN_PTHREADS__ Currently the emscripten frontend driver injects this when building with thread support. Moving this into the clang driver itself makes the emscripten python driver less magical. Differential Revision: https://reviews.llvm.org/D96171	2021-02-05 13:53:05 -08:00
Petr Hosek	9fd9b5a9c9	Don't emit coverage mapping for excluded functions When a function or a file is excluded using -fprofile-list= option, don't emit coverage mapping as doing so confuses users since those functions would always have zero count. This also reduces the binary size considerably in cases where only a few functions or files are being instrumented. Differential Revision: https://reviews.llvm.org/D96000	2021-02-05 13:03:57 -08:00
Yaxun (Sam) Liu	b008ea304d	[CUDA][HIP] Fix device variable linkage For -fgpu-rdc, shadow variables should not be internalized, otherwise they cannot be accessed by other TUs. This is necessary because the shadow variable of external device variables are always emitted as undefined symbols, which need to resolve to a global symbols. Managed variables need to be emitted as undefined symbols in device compilations. Reviewed by: Artem Belevich Differential Revision: https://reviews.llvm.org/D95901	2021-02-05 15:11:12 -05:00
Thomas Preud'homme	00a62547da	Stop trapping on sNaN in __builtin_isnan __builtin_isnan currently generates a floating-point compare operation which triggers a trap when faced with a signaling NaN in StrictFP mode. This commit uses integer operations instead to not generate any trap in such a case. Reviewed By: kpn Differential Revision: https://reviews.llvm.org/D95948	2021-02-05 18:28:48 +00:00
Michael Liao	01bf529db2	Recommit of `a2fdf9d4d7`. - The failures are all cc1-based tests due to the missing `-aux-triple` option, which is always prepared by the driver in CUDA/HIP compilation. - Add extra check on the missing aux-targetinfo to prevent crashing. [hip][cuda] Enable extended lambda support on Windows. - On Windows, extended lambda has extra issues because the numbering schemes differ between the host compilation (Microsoft C++ ABI) and the device compilation (Itanium C++ ABI). An additional device-side lambda number is required per lambda for the host compilation to correctly mangle the device-side lambda name. - A hybrid numbering context `MSHIPNumberingContext` is introduced to number a lambda for both host- and device-compilations. Reviewed By: rnk Differential Revision: https://reviews.llvm.org/D69322 This reverts commit `4874ff0241`.	2021-02-05 11:27:30 -05:00
Anton Zabaznov	d88c55ab95	[OpenCL] Add macro definitions of OpenCL C 3.0 features This patch adds the ability to define OpenCL C 3.0 feature macros via a command line option or target setting. Reviewed By: Anastasia Differential Revision: https://reviews.llvm.org/D95776	2021-02-05 18:42:25 +03:00
Akira Hatanaka	4a64d8fe39	[ObjC][ARC] Use operand bundle 'clang.arc.rv' instead of explicitly emitting retainRV or claimRV calls in the IR This reapplies `3fe3946d9a` without the changes made to lib/IR/AutoUpgrade.cpp, which was violating layering. Original commit message: Background: This patch makes changes to the front-end and middle-end that are needed to fix a longstanding problem where llvm breaks ARC's autorelease optimization (see the link below) by separating calls from the marker instructions or retainRV/claimRV calls. The backend changes are in https://reviews.llvm.org/D92569. https://clang.llvm.org/docs/AutomaticReferenceCounting.html#arc-runtime-objc-autoreleasereturnvalue What this patch does to fix the problem: - The front-end adds operand bundle "clang.arc.rv" to calls, which indicates the call is implicitly followed by a marker instruction and an implicit retainRV/claimRV call that consumes the call result. In addition, it emits a call to @llvm.objc.clang.arc.noop.use, which consumes the call result, to prevent the middle-end passes from changing the return type of the called function. This is currently done only when the target is arm64 and the optimization level is higher than -O0. - ARC optimizer temporarily emits retainRV/claimRV calls after the calls with the operand bundle in the IR and removes the inserted calls after processing the function. - ARC contract pass emits retainRV/claimRV calls after the call with the operand bundle. It doesn't remove the operand bundle on the call since the backend needs it to emit the marker instruction. The retainRV and claimRV calls are emitted late in the pipeline to prevent optimization passes from transforming the IR in a way that makes it harder for the ARC middle-end passes to figure out the def-use relationship between the call and the retainRV/claimRV calls (which is the cause of PR31925). - The function inliner removes an autoreleaseRV call in the callee if nothing in the callee prevents it from being paired up with the retainRV/claimRV call in the caller. It then inserts a release call if the call is annotated with claimRV since autoreleaseRV+claimRV is equivalent to a release. If it cannot find an autoreleaseRV call, it tries to transfer the operand bundle to a function call in the callee. This is important since ARC optimizer can remove the autoreleaseRV returning the callee result, which makes it impossible to pair it up with the retainRV/claimRV call in the caller. If that fails, it simply emits a retain call in the IR if the implicit call is a call to retainRV and does nothing if it's a call to claimRV. Future work: - Use the operand bundle on x86-64. - Fix the auto upgrader to convert call+retainRV/claimRV pairs into calls annotated with the operand bundles. rdar://71443534 Differential Revision: https://reviews.llvm.org/D92808	2021-02-05 06:09:42 -08:00
Akira Hatanaka	2fbbb18c1d	Revert "[ObjC][ARC] Use operand bundle 'clang.arc.rv' instead of explicitly" This reverts commit `3fe3946d9a`. The commit violates layering by including a header from Analysis in lib/IR/AutoUpgrade.cpp.	2021-02-05 06:00:05 -08:00
Akira Hatanaka	3fe3946d9a	[ObjC][ARC] Use operand bundle 'clang.arc.rv' instead of explicitly emitting retainRV or claimRV calls in the IR Background: This patch makes changes to the front-end and middle-end that are needed to fix a longstanding problem where llvm breaks ARC's autorelease optimization (see the link below) by separating calls from the marker instructions or retainRV/claimRV calls. The backend changes are in https://reviews.llvm.org/D92569. https://clang.llvm.org/docs/AutomaticReferenceCounting.html#arc-runtime-objc-autoreleasereturnvalue What this patch does to fix the problem: - The front-end adds operand bundle "clang.arc.rv" to calls, which indicates the call is implicitly followed by a marker instruction and an implicit retainRV/claimRV call that consumes the call result. In addition, it emits a call to @llvm.objc.clang.arc.noop.use, which consumes the call result, to prevent the middle-end passes from changing the return type of the called function. This is currently done only when the target is arm64 and the optimization level is higher than -O0. - ARC optimizer temporarily emits retainRV/claimRV calls after the calls with the operand bundle in the IR and removes the inserted calls after processing the function. - ARC contract pass emits retainRV/claimRV calls after the call with the operand bundle. It doesn't remove the operand bundle on the call since the backend needs it to emit the marker instruction. The retainRV and claimRV calls are emitted late in the pipeline to prevent optimization passes from transforming the IR in a way that makes it harder for the ARC middle-end passes to figure out the def-use relationship between the call and the retainRV/claimRV calls (which is the cause of PR31925). - The function inliner removes an autoreleaseRV call in the callee if nothing in the callee prevents it from being paired up with the retainRV/claimRV call in the caller. It then inserts a release call if the call is annotated with claimRV since autoreleaseRV+claimRV is equivalent to a release. If it cannot find an autoreleaseRV call, it tries to transfer the operand bundle to a function call in the callee. This is important since ARC optimizer can remove the autoreleaseRV returning the callee result, which makes it impossible to pair it up with the retainRV/claimRV call in the caller. If that fails, it simply emits a retain call in the IR if the implicit call is a call to retainRV and does nothing if it's a call to claimRV. Future work: - Use the operand bundle on x86-64. - Fix the auto upgrader to convert call+retainRV/claimRV pairs into calls annotated with the operand bundles. rdar://71443534 Differential Revision: https://reviews.llvm.org/D92808	2021-02-05 05:55:18 -08:00
Qiu Chaofan	447dc856b2	Revert "[PowerPC] [Clang] Enable float128 feature on P9 by default" Commit `6bf29dbb` enables float128 feature by default for Power9 targets. But float128 may cause build failure in libcxx testing. Revert this commit first to unblock LLVM 12 release.	2021-02-05 20:33:56 +08:00
Aaron Ballman	45ccfd9c9d	Treat opencl_unroll_hint subject errors as semantic rather than parse errors The attribute definition claimed the attribute was inheritable (which only applies to declaration attributes) and not a statement attribute. Further, it treats subject appertainment errors as being parse errors rather than semantic errors, which leads to us accepting invalid code. For instance, we currently fail to reject: void foo() { int i = 1000; __attribute__((nomerge, opencl_unroll_hint(8))) if (i) { foo(); } } This addresses the issues by clarifying that opencl_unroll_hint is a statement attribute and handles its appertainment checks in the semantic layer instead of the parsing layer. This changes the output of the diagnostic text to be more consistent with other appertainment errors.	2021-02-05 07:20:41 -05:00
Dan Gohman	95da64da23	[WebAssembly] Use single-threaded mode when -matomics isn't enabled. When the -matomics feature is not enabled, disable POSIXThreads mode and set the thread model to Single, so that we don't predefine macros like `__STDCPP_THREADS__`. Differential Revision: https://reviews.llvm.org/D96091	2021-02-04 18:16:48 -08:00
Zequan Wu	96fb49c3ff	[AST] Update LVal before evaluating lambda decl fields. Differential Revision: https://reviews.llvm.org/D96092	2021-02-04 17:01:09 -08:00
Yaxun (Sam) Liu	e355110040	[CUDA][HIP] Fix checking dependent initializer Defer constant checking of dependent initializer to template instantiation since it cannot be done for dependent values. Reviewed by: Artem Belevich Differential Revision: https://reviews.llvm.org/D95840	2021-02-04 18:04:54 -05:00
Sam McCall	eb4ab3358c	[CodeComplete] Guess type for designated initializers This enables: - completion in { .x.^ } - completion in { .x = { .^ } } - type-based ranking of candidates for { .x = ^ } Differential Revision: https://reviews.llvm.org/D96058	2021-02-04 22:14:49 +01:00
Richard Smith	fcb90cbd3b	Fix miscomputation of dependence for elaborated types that are explicitly qualified as members of the current instantiation. Despite the nested name specifier being fully-dependent in this case, the elaborated type might only be instantiation-dependent, because the type is a member of the current instantiation.	2021-02-04 13:14:15 -08:00
Aaron Ballman	cd2f65b71a	Correct some confused diagnostic terminology Attributes accept arguments, not parameters, so we should report that the duplicate attribute arguments don't match.	2021-02-04 15:52:07 -05:00
David Spickett	1d51c699b9	[clang][Arm] Fix handling of -Wa,-march= This fixes Bugzilla #48894 for Arm, where it was reported that -Wa,-march was not being handled by the integrated assembler. This was previously fixed for -Wa,-mthumb by parsing the argument in ToolChain::ComputeLLVMTriple instead of CollectArgsForIntegratedAssembler. It has to be done in the former because the Triple is read only by the time we get to the latter. Previously only mcpu would work via -Wa but only because "-target-cpu" is its own option to cc1, which we were able to modify. Target architecture is part of "-target-triple". This change applies the same workaround to -march and cleans up handling of -Wa,-mcpu at the same time. There were some places where we were not using the last instance of an argument. The existing -Wa,-mthumb code was doing this correctly, so I've just added tests to confirm that. Now the same rules will apply to -Wa,-march/-mcpu as would if you just passed them to the compiler: * -Wa/-Xassembler options only apply to assembly files. * Architecture derived from mcpu beats any march options. * When there are multiple mcpu or multiple march, the last one wins. * If there is a compiler option and an assembler option of the same type, we prefer the one that fits the input type. * If there is an applicable mcpu option but it is overruled by an march, the cpu value is still used for the "-target-cpu" cc1 option. Reviewed By: nickdesaulniers Differential Revision: https://reviews.llvm.org/D95872	2021-02-04 16:36:15 +00:00
Krzysztof Parzyszek	bc097f645e	[Hexagon] Add clang builtin definitions for Hexagon V68	2021-02-04 09:54:52 -06:00
Anastasia Stulova	0c65993be1	[OpenCL] Fix default address space in template argument deduction. When deducing a reference type for forwarding references prevent adding default address space of a template argument if it is given. This got reported in PR48896 because in OpenCL all parameters are in private address space and therefore when we initialize a forwarding reference with a parameter we should just inherit the address space from it i.e. keep __private instead of __generic. Tags: #clang Differential Revision: https://reviews.llvm.org/D95624	2021-02-04 13:51:53 +00:00
Nico Weber	4874ff0241	Revert "[hip][cuda] Enable extended lambda support on Windows." This reverts commit `a2fdf9d4d7`. Slightly speculative, seeing several cuda tests fail on this Windows bot: http://45.33.8.238/win/32620/step_7.txt	2021-02-04 07:10:46 -05:00
Hans Wennborg	6625680a58	[clang-cl] Remove the /fallback option As discussed in https://lists.llvm.org/pipermail/cfe-dev/2021-January/067524.html It doesn't appear to be used, isn't really maintained, and adds some complexity to the code. Let's remove it. Differential revision: https://reviews.llvm.org/D95876	2021-02-04 10:33:16 +01:00
Jan Svoboda	225ccf0c50	[clang][cli] Command line round-trip for HeaderSearch options This patch implements generation of remaining header search arguments. It's done manually in C++ as opposed to TableGen, because we need the flexibility and don't anticipate reuse. This patch also tests the generation of header search options via a round-trip. This way, the code gets exercised whenever Clang is built and tested in asserts mode. All `check-clang` tests pass. Reviewed By: dexonsmith Differential Revision: https://reviews.llvm.org/D94472	2021-02-04 10:18:34 +01:00
Richard Smith	3b9de993c9	Give this test a target triple.	2021-02-03 23:38:52 -08:00
Richard Smith	cde8d2fddb	Fix miscompile when performing template instantiation of non-dependent doubly-nested implicit CXXConstructExprs. Ensure that we transform the parameter initializer using TransformInitializer rather than TransformExpr so that we properly strip down and rebuild the initialization, including any necessary CXXBindTemporaryExprs. Otherwise we can end up forgetting to destroy temporary objects used to construct a constructor parameter.	2021-02-03 23:38:02 -08:00
Michael Liao	a2fdf9d4d7	[hip][cuda] Enable extended lambda support on Windows. - On Windows, extended lambda has extra issues because the numbering schemes differ between the host compilation (Microsoft C++ ABI) and the device compilation (Itanium C++ ABI). An additional device-side lambda number is required per lambda for the host compilation to correctly mangle the device-side lambda name. - A hybrid numbering context `MSHIPNumberingContext` is introduced to number a lambda for both host- and device-compilations. Reviewed By: rnk Differential Revision: https://reviews.llvm.org/D69322	2021-02-04 01:38:29 -05:00
Ben Barham	a2c1054c30	[ASTReader] Always rebuild a cached module that has errors A module in the cache with an error should just be a cache miss. If allowing errors (with -fallow-pcm-with-compiler-errors), a rebuild is needed so that the appropriate diagnostics are output and in case search paths have changed. If not allowing errors, the module was built allowing errors and thus should be rebuilt regardless. Reviewed By: akyrtzi Differential Revision: https://reviews.llvm.org/D95989	2021-02-03 22:06:46 -08:00
Akira Hatanaka	aade0ec23b	Fix the guaranteed alignment of memory returned by malloc/new on Darwin The guaranteed alignment is 16 bytes on Darwin. rdar://73431623 Differential Revision: https://reviews.llvm.org/D95910	2021-02-03 19:40:51 -08:00
Shilei Tian	0f0ce3c12e	[OpenMP][NVPTX] Take functions in `deviceRTLs` as `convergent` OpenMP device compiler (similar to other SPMD compilers) assumes that functions are convergent by default to avoid invalid transformations, such as the bug (https://bugs.llvm.org/show_bug.cgi?id=49021). Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D95971	2021-02-03 20:58:12 -05:00
Richard Smith	1f06f41993	PR44325 (and duplicates): don't issue -Wzero-as-null-pointer-constant when rewriting 'a < b' as '(a <=> b) < 0'. It's pretty common for comparison category types to use a pointer or pointer-to-member type as their '0' parameter.	2021-02-03 14:58:53 -08:00
Richard Smith	b15cbaf5a0	PR49020: Diagnose brace elision in designated initializers in C++. This is a corner of the differences between C99 designators and C++20 designators that we'd previously overlooked. As with other such cases, this continues to be permitted as an extension and allowed by default, behind the -Wc99-designators warning flag, except in cases where it leads to a conformance difference (such as in overload resolution and in a SFINAE context).	2021-02-03 14:36:49 -08:00
Zequan Wu	4dc08cc3aa	[Coverage] Propagate counter to condition of conditional operator Clang usually propagates counter mapping region for conditions of `if`, `while`, `for`, etc from parent counter. We should do the same for condition of conditional operator. Differential Revision: https://reviews.llvm.org/D95918	2021-02-03 13:33:22 -08:00
Félix Cloutier	554cf3729e	[clang-tblgen] AnnotateAttr::printPretty has spurious comma when no variadic argument is specified rdar://73742471 Differential Revision: https://reviews.llvm.org/D95695	2021-02-03 11:41:38 -08:00
Kevin P. Neal	81b69879c9	[FPEnv][X86] Platform builtins edition: clang should get from the AST the metadata for constrained FP builtins Currently clang is not correctly retrieving from the AST the metadata for constrained FP builtins. This patch fixes that for the X86 specific builtins. Differential Revision: https://reviews.llvm.org/D94614	2021-02-03 11:49:17 -05:00
Juneyoung Lee	06829034ca	Revert "[ConstantFold] Fold more operations to poison" This reverts commit `53040a968d` due to its bad interaction with select i1 -> and/or i1 transformation. This fixes: https://bugs.llvm.org/show_bug.cgi?id=49005 https://bugs.llvm.org/show_bug.cgi?id=48435	2021-02-04 00:24:02 +09:00
Abhina Sreeskantharajan	e59d336e75	[test] Use host platform specific error message substitution in lit tests - continued On z/OS, other error messages are not matched correctly in lit tests. ``` EDC5121I Invalid argument. EDC5111I Permission denied. ``` This patch adds a lit substitution to fix it. Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D95808	2021-02-03 09:53:22 -05:00
Ilya Mirsky	e48f444751	[Sema] Fix -Warray-bounds false negative when casting an out-of-bounds array item Patch by Ilya Mirsky! Fixes: http://llvm.org/PR44343 Reviewed By: rsmith Differential Revision: https://reviews.llvm.org/D71714	2021-02-03 07:50:50 -06:00
Anastasia Stulova	e635feb15a	[OpenCL] Fix address space in binding of initializer lists to references Prevent materializing temporaries in the address space of the references they are bound to. The temporaries should always be in the same address space - private for OpenCL. Tags: #clang Differential Revision: https://reviews.llvm.org/D95608	2021-02-03 12:48:21 +00:00
Sven van Haastregt	9caf364d69	[OpenCL] Add cl_khr_subgroup_ballot to TableGen BIFs Add the builtin functions brought by the cl_khr_subgroup_ballot extension to `-fdeclare-opencl-builtins`. Also add placeholder comments for the other Extended Subgroup Functions from the OpenCL Extension Specification. Add a comment clarifying the scope of the test. Differential Revision: https://reviews.llvm.org/D95523	2021-02-03 10:23:49 +00:00
Ben Shi	d38973aa4d	[clang][AVR] Improve avr-ld command line options Reviewed By: dylanmckay, MaskRay Differential Revision: https://reviews.llvm.org/D93579	2021-02-03 18:23:01 +08:00
Pushpinder Singh	fcf03e7280	[OpenMP] Add OpenMP offloading toolchain for AMDGPU This patch adds AMDGPUOpenMPToolChain for supporting OpenMP offloading to AMD GPUs. Originally authored by Greg Rodgers Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D94961	2021-02-03 00:42:52 -05:00
Hongtao Yu	3d89b3cbec	[CSSPGO] Introducing distribution factor for pseudo probe. Sample re-annotation is required in LTO time to achieve a reasonable post-inline profile quality. However, we have seen that such LTO-time re-annotation degrades profile quality. This is mainly caused by preLTO code duplication that is done by passes such as loop unrolling, jump threading, indirect call promotion etc, where samples corresponding to a source location are aggregated multiple times due to the duplicates. In this change we are introducing a concept of distribution factor for pseudo probes so that samples can be distributed for duplicated probes scaled by a factor. We hope that optimizations duplicating code well-maintain the branch frequency information (BFI) based on which probe distribution factors are calculated. Distribution factors are updated at the end of preLTO pipeline to reflect an estimated portion of the real execution count. This change also introduces a pseudo probe verifier that can be run after each IR pass to detect duplicated pseudo probes. A saturated distribution factor stands for 1.0. A pseudo probe will carry a factor with the value ranged from 0.0 to 1.0. A 64-bit integral distribution factor field that represents [0.0, 1.0] is associated with each block probe. Unfortunately this cannot be done for callsite probes due to the size limitation of a 32-bit DWARF discriminator. A 7-bit distribution factor is used instead. Changes are also needed to the sample profile inliner to deal with prorated callsite counts. Call sites duplicated by PreLTO passes, when later on inlined in LTO time, should have the callees' probe prorated based on the Prelink-computed distribution factors. The distribution factors should also be taken into account when computing hotness for inline candidates. Also, indirect call promotion results in multiple callsites. The original samples should be distributed across them. This is fixed by adjusting the callsites' distribution factors. Reviewed By: wmi Differential Revision: https://reviews.llvm.org/D93264	2021-02-02 11:55:01 -08:00
Fangrui Song	74c94b5d9c	[test] Default clang/test to FileCheck --allow-unused-prefixes=false	2021-02-02 11:22:46 -08:00
Mike Rice	ca98c15f23	[OpenMP] Fix iterations calculation for dependent counters. The number of iterations calculation was failing in some cases with more than two collapsed loops. Now the LoopIterationSpace selected matches InitDependOnLC and CondDependOnLC. Differential Revision: https://reviews.llvm.org/D95834	2021-02-02 10:09:37 -08:00
Hongtao Yu	d3e2e3740d	[CSSPGO] Passing the clang driver switch -fpseudo-probe-for-profiling to the linker. As titled. Reviewed By: wmi, wenlei Differential Revision: https://reviews.llvm.org/D95271	2021-02-02 09:43:57 -08:00
Anastasia Stulova	844f01fc95	Fixed failing OpenCL test	2021-02-02 16:19:28 +00:00
Zarko Todorovski	eb3426a528	[AIX] Improve option processing for mabi=vec-extabi and mabi=vec=defaul Opening this revision to better address comments by @hubert.reinterpretcast in https://reviews.llvm.org/rGcaaaebcde462 Reviewed By: hubert.reinterpretcast Differential Revision: https://reviews.llvm.org/D95702	2021-02-02 10:59:21 -05:00
Anastasia Stulova	5bbf39704c	[OpenCL] Add diagnostics for references to functions Restrict use of references to functions as they can result in non-conforming behavior. Tags: #clang Differential Revision: https://reviews.llvm.org/D95442	2021-02-02 15:07:40 +00:00
Melanie Blower	9a5dc01e4b	[clang][PATCH][NFC] Correct test case related to review D95482	2021-02-02 07:06:43 -08:00
Ben Shi	9b0b435d79	[AVR][clang] Fix a bug in AVR toolchain search paths Reviewed By: dylanmckay, MaskRay Differential Revision: https://reviews.llvm.org/D95529	2021-02-02 22:45:52 +08:00
Nico Weber	f2b4cc91e0	Revert "[test] Default clang/test to FileCheck --allow-unused-prefixes=false" This reverts commit `80f539526e`. Many test failures on mac: http://45.33.8.238/macm1/2772/summary.html One on win: http://45.33.8.238/win/32442/summary.html	2021-02-02 07:38:44 -05:00
Sven van Haastregt	dc00c96b2d	[OpenCL] Change extension handling for -fdeclare-opencl-builtins Until now, the `-fdeclare-opencl-builtins` option behaved differently compared to inclusion of `opencl-c.h`: builtins that are part of an extension were only available if the extension was enabled using the corresponding pragma. Builtins that belong to an extension are guarded using a preprocessor macro (that is named after the extension) in `opencl-c.h`. Align the behaviour of `-fdeclare-opencl-builtins` with this. Co-authored-by: Anastasia Stulova Differential Revision: https://reviews.llvm.org/D95616	2021-02-02 11:15:29 +00:00
Hans Wennborg	0479c53b6c	[dllimport] Honor always_inline when deciding whether a dllimport function should be available for inlining (PR48925) Normally, Clang will not make dllimport functions available for inlining if they reference non-imported symbols, as this can lead to confusing link errors. But if the function is marked always_inline, the user presumably knows what they're doing and the attribute should be honored. Differential revision: https://reviews.llvm.org/D95673	2021-02-02 10:28:32 +01:00
Fangrui Song	80f539526e	[test] Default clang/test to FileCheck --allow-unused-prefixes=false	2021-02-01 22:02:59 -08:00
Nathan Hawes	ecb00a7762	[VFS] Add support to RedirectingFileSystem for mapping a virtual directory to one in the external FS. Previously file entries in the -ivfsoverlay yaml could map to a file in the external file system, but directories had to list their contents in the form of other file entries or directories. Allowing directory entries to map to a directory in the external file system makes it possible to present an external directory's contents in a different location and (in combination with the 'fallthrough' option) overlay one directory's contents on top of another. rdar://problem/72485443 Differential Revision: https://reviews.llvm.org/D94844	2021-02-02 14:56:17 +10:00
Fangrui Song	98768bab19	[test] Fix unused FileCheck prefixes in clang/test/Modules	2021-02-01 19:46:23 -08:00
Stanislav Mekhanoshin	8e661d3d9c	[AMDGPU] Set s-memtime-inst feature from clang Differential Revision: https://reviews.llvm.org/D95733	2021-02-01 14:20:43 -08:00
Melanie Blower	08d46d5059	[clang][PATCH] Fix bug 48848 assertion related to recoverFromMSUnqualifiedLookup Reviewed By: rnk Differential Revision: https://reviews.llvm.org/D95482	2021-02-01 10:56:47 -08:00
Mircea Trofin	c4d6f2707a	[NFC] Disallow unused prefixes under clang/test/Driver Differential Revision: https://reviews.llvm.org/D95660	2021-02-01 10:34:38 -08:00
James Y Knight	20b1c1300c	Fix test in "CFG: Create scope for non-compound range-for body." The constant 4 is sometimes printed as "4L", or "4LL", in CFG dump output, depending on platform; accept all variants. Amends commit `8f670d5b6d`.	2021-01-31 19:56:26 -05:00
James Y Knight	8f670d5b6d	CFG: Create scope for non-compound range-for body. Previously, it was omitting the destructor call from the CFG, which could result in incorrect diagnostics.	2021-01-31 18:43:00 -05:00
Luís Marques	2de4f19ecd	[LSan][RISCV] Enable LSan for RISCV64 Fixes the broken RISCV64 implementation of `internal_clone` and adds RISCV64 support for LSan. Differential Revision: https://reviews.llvm.org/D92403	2021-01-31 21:53:25 +00:00
Hsiangkai Wang	282aca10ae	[RISCV] Update the version number to v0.10 for vector. v0.10 is tagged in V specification. Update the version to v0.10. Differential Revision: https://reviews.llvm.org/D95680	2021-01-30 07:20:05 +08:00
Petr Hosek	0217f1c7a3	Make the profile-filter.c test compatible with 32-bit systems This addresses PR48930. Differential Revision: https://reviews.llvm.org/D95658	2021-01-29 09:58:32 -08:00
Pavel Iliin	c5e7e649d5	[AArch64][Clang][Linux] Enable out-of-line atomics by default. Generate outline atomics if compiling for armv8-a non-LSE AArch64 Linux (including Android) targets to use LSE instructions, if they are available, at runtime. Library support is checked by clang driver which doesn't enable outline atomics if no proper libraries (libgcc >= 9.3.1 or compiler-rt) found. Differential Revision: https://reviews.llvm.org/D93585	2021-01-29 17:44:45 +00:00
Nico Weber	1608ba0946	Revert "Disable rosegment for old Android versions." This reverts commit `fae16fc0ee`. Breaks building compiler-rt android runtimes with trunk clang but older NDK, see discussion on https://reviews.llvm.org/D95166	2021-01-29 11:20:48 -05:00
Nico Weber	d087d805ac	clang-cl: Accept /std:c11, /std:c17 flags clang-cl already defaults to C17 for .c files, but no harm in accepting these flags. Fixes PR48185. Differential Revision: https://reviews.llvm.org/D95575	2021-01-29 09:59:00 -05:00
Nico Weber	82847436e9	clang-cl: Invent a /winsysroot concept On non-Windows platforms, --sysroot can be used to make the compiler use a single, hermetic directory for all header and library files. This is useful, but difficult to do on Windows. After D95472 it's possible to achieve this with two flags: out/gn/bin/clang-cl win.c -fuse-ld=lld \ /vctoolsdir path/to/VC/Tools/MSVC/14.26.28801 \ /winsdkdir path/to/win_sdk But that's still cumbersome: It requires two flags instead of one, and it requires writing down the (changing) VC/Tools/MSVC version. This adds a new `/winsysroot <dir>` flag that's effectively an alias to these two flags. With this, building against a hermetic Windows toolchain only needs: out/gn/bin/clang-cl win.c -fuse-ld=lld /winsysroot path `/winsysroot <dir>` is the same as adding /vctoolsdir <dir>/VC/Tools/MSVC/<vctoolsver> /winsdkdir <dir>/Windows Kits/<winsdkmajorversion> `<vctoolsver>` is taken from `/vctoolsversion` if passed, or else it's the name of the directory in `<dir>/VC/Tools/MSVC` that's the highest numeric tuple. `<winsdkmajorversion>` is the major version in /winsdkversion if passed, else it's the name of the directory in `<dir>/Windows Kits` that's the highest number. So `/winsysroot <path>` requires this subfolder structure: path/ VC/ Tools/ MSVC/ 14.26.28801 (or another number) include/ ... Windows Kits/ 10/ Include/ 10.0.19041.0/ (or another number) um/ ... Lib/ 10.0.19041.0/ (or another number) um/ x64/ ... ... Differential Revision: https://reviews.llvm.org/D95534	2021-01-29 09:47:00 -05:00
Haojian Wu	e90e455d2a	[Syntax] Add syntax-tree-dump in clang-check. This is useful to experiment/develop syntax trees. Reviewed By: sammccall Differential Revision: https://reviews.llvm.org/D95526	2021-01-29 14:10:27 +01:00
Abhina Sreeskantharajan	42a21778f6	[test] Use host platform specific error message substitution in lit tests On z/OS, the following error message is not matched correctly in lit tests. ``` EDC5129I No such file or directory. ``` This patch uses a lit config substitution to check for platform specific error messages. Reviewed By: muiez, jhenderson Differential Revision: https://reviews.llvm.org/D95246	2021-01-29 07:16:30 -05:00
Thomas Preud'homme	305ac81e1d	Fix macos target assumption in test Clang test Driver/macos-apple-silicon-slice-link-libs-darwin-only.cpp assumes the target is darwin when the host is darwin which is not necessarily the case, causing the test to fail when it is not. This commit adds a -triple argument to the clang invocation to ensure the target is darwin. Reviewed By: hans Differential Revision: https://reviews.llvm.org/D94396	2021-01-29 10:22:04 +00:00
Amy Huang	7ef79bb8e2	Fix typo in "[DebugInfo][CodeView] Use <lambda_n> as the display name for lambdas." (Committed in `d5f5deee9e`)	2021-01-28 19:03:41 -08:00
Amy Huang	d5f5deee9e	Reland "[DebugInfo][CodeView] Use <lambda_n> as the display name for lambdas" with fix to test case and stringrefs. Currently (for codeview) lambdas have a string like `<lambda_0>` in their mangled name, and don't have any display name. This change uses the `<lambda_0>` as the display name, which helps distinguish between lambdas in -gline-tables-only, since there are no linkage names there. It also changes how we display lambda names; previously we used `<unnamed-tag>`; now it will show `<lambda_0>`. I added a function to the mangling context code to create this string; for Itanium it just returns an empty string. Bug: https://bugs.llvm.org/show_bug.cgi?id=48432 Reviewed By: rnk Differential Revision: https://reviews.llvm.org/D95187 This reverts `9b21d4b943`	2021-01-28 18:44:48 -08:00
Amy Huang	9b21d4b943	Revert "[DebugInfo][CodeView] Use <lambda_n> as the display name for lambdas." for test failures. This reverts commit `d73564c510`.	2021-01-28 16:41:26 -08:00
Amy Huang	d73564c510	[DebugInfo][CodeView] Use <lambda_n> as the display name for lambdas. Currently (for codeview) lambdas have a string like `<lambda_0>` in their mangled name, and don't have any display name. This change uses the `<lambda_0>` as the display name, which helps distinguish between lambdas in -gline-tables-only, since there are no linkage names there. It also changes how we display lambda names; previously we used `<unnamed-tag>`; now it will show `<lambda_0>`. I added a function to the mangling context code to create this string; for Itanium it just returns an empty string. Bug: https://bugs.llvm.org/show_bug.cgi?id=48432 Reviewed By: rnk Differential Revision: https://reviews.llvm.org/D95187	2021-01-28 16:30:38 -08:00
Thomas Lively	4b68b64dcc	[WebAssembly] Prototype i8x16 to i32x4 widening instructions As proposed in https://github.com/WebAssembly/simd/pull/395 and matching the opcodes used in V8: https://chromium-review.googlesource.com/c/v8/v8/+/2617385/4/src/wasm/wasm-opcodes.h Differential Revision: https://reviews.llvm.org/D95557	2021-01-28 10:59:32 -08:00
Mircea Trofin	cfcc1110d7	[NFC] Disallow unused prefixes under clang/test/CodeGenCXX The only test that needed change had 'QUAL' as an unused prefix. The rest of the changes are to simplify the prefix lists. Differential Revision: https://reviews.llvm.org/D95499	2021-01-28 09:47:21 -08:00
Hans Wennborg	0024efc69e	Relax test expectations in debug-info-gline-tables-only-codeview.cpp To make it pass also on 32-bit Windows, see PR48920.	2021-01-28 14:40:11 +01:00
Sven van Haastregt	526c42e76c	[OpenCL] Hide sampler-less read_image builtins before CL1.2 Ensure sampler-less image read functions are not available with `-fdeclare-opencl-builtins` before OpenCL 1.2.	2021-01-28 11:14:19 +00:00
Tomas Matheson	01b9e613c2	[Clang][Codegen] Truncate initializers of union bitfield members If an initial value is given for a bitfield that does not fit in the bitfield, the value should be truncated. Constant folding for expressions did not account for this truncation in the case of union member functions, despite a warning being emitted. In some contexts, evaluation of expressions was not enabled unless C++11, ROPI or RWPI was enabled. Differential Revision: https://reviews.llvm.org/D93101	2021-01-28 09:19:19 +00:00
Nico Weber	764a7a2155	clang: Fix static_assert in a few contexts in microsoft mode Follow-up to D17444. Fixes PR48904. See bug for details. Differential Revision: https://reviews.llvm.org/D95559	2021-01-27 18:15:25 -05:00
James Y Knight	a7246ba02a	Itanium Mangling: In 'enable_if', omit X/E around <expr-primary>. The Clang enable_if extension is mangled as an <extended-qualifier>, which is supposed to contain <template-args>. However, we were unconditionally emitting X/E around its arguments, neglecting the fact that <expr-primary> should be emitted directly without the surrounding X/E. Differential Revision: https://reviews.llvm.org/D95488	2021-01-27 16:46:52 -05:00
James Y Knight	8ca33605ff	Itanium Mangling: Fix handling of <expr-primary> in <template-arg>. Previously, we were emitting an extraneous X .. E in <template-arg> around an <expr-primary> if the template argument was constructed from an expression (rather than an already-evaluated literal value). In such a case, we would then e.g. emit 'XLi0EE' instead of 'Li0E'. We had one special-case for DeclRefExpr expressions, in particular, to omit them the mangled-name without the surrounding X/E. However, unfortunately, that special case also triggered for ParmVarDecl (a subtype of VarDecl), and _incorrectly_ emitted 'L_Z .. E' instead of the proper 'Xfp_E'. This change causes mangleExpression itself to be responsible for emitting X/E around non-primary expressions, which removes the special-case, and corrects both these problems. Differential Revision: https://reviews.llvm.org/D95487	2021-01-27 16:46:52 -05:00
James Y Knight	9c7aeaebb3	Itanium Mangling: Mangle `__alignof__` differently than `alignof`. The two operations have acted differently since Clang 8, but were unfortunately mangled the same. The new mangling uses new "vendor extended expression" syntax proposed in https://github.com/itanium-cxx-abi/cxx-abi/issues/112 GCC had the same mangling problem, https://gcc.gnu.org/PR88115, and will hopefully be switching to the same mangling as implemented here. Additionally, fix the mangling of `__uuidof` to use the new extension syntax, instead of its previous nonstandard special-case. Adjusts the demangler accordingly. Differential Revision: https://reviews.llvm.org/D93922	2021-01-27 16:46:51 -05:00
Richard Smith	5dfa37a761	Don't allow __VA_OPT__ to be detected by #ifdef. More study has discovered this to not actually be useful: because current C++20 implementations reject `#ifdef __VA_OPT__`, this can't really be used as a feature-test mechanism. And it's not too hard to detect __VA_OPT__ without this, for example: #define THIRD_ARG(a, b, c, ...) c #define HAS_VA_OPT(...) THIRD_ARG(__VA_OPT__(,), 1, 0, ) #if HAS_VA_OPT(?) Partially reverts `0436ec2128`.	2021-01-27 13:34:15 -08:00
Aaron Ballman	5d3dca24aa	Ignore unknown attribute warnings in this test We're testing the parsing behavior, not the actual attributes used, and the attribute name cannot be elided for __declspec attributes.	2021-01-27 15:45:35 -05:00
Richard Smith	0436ec2128	Permit __VA_OPT__ in all language modes and allow it to be detected with #ifdef. These changes are intended to give code a path to move away from the GNU ,##__VA_ARGS__ extension, which is non-conforming in some situations and which we'd like to disable in our conforming mode in those cases.	2021-01-27 12:34:43 -08:00
Aaron Ballman	9f2c7effd7	Parse different attribute syntaxes in arbitrary order In Clang today, we parse the different attribute syntaxes (__attribute__, __declspec, and [[]]) in a fairly rigid order. This leads to confusion for users when they guess the order incorrectly, and leads to bug reports like PR24559 or necessitates changes like D94788. This patch adds a helper function to allow us to more easily parse attributes in arbitrary order, and then updates all of the places where we would parse two or more different syntaxes in a rigid order to use the helper method. The patch does not attempt to handle Microsoft attributes ([]) because those are ambiguous with other code constructs and we don't have any attributes that use the syntax.	2021-01-27 15:30:15 -05:00
Reid Kleckner	61a66e4b5e	Revert "Suppress non-conforming GNU paste extension in all standard-conforming modes" This reverts commit `f4537935dc`. This reverts commit `b43c26d036`. This GNU and MSVC extension turns out to be very popular. Most projects are not using C++20, so cannot use the new __VA_OPT__ feature to be standards conformant. The other workaround, using -std=gnu*, enables too many language extensions and isn't viable. Until there is a way for users to get the behavior provided by the `, ## __VA_ARGS__` extension in the -std=c++17 and earlier language modes, we need to revert this.	2021-01-27 10:59:57 -08:00
Fangrui Song	3e80686186	[test] Fix clang/test/CodeGen tests	2021-01-27 10:55:27 -08:00
Freddy Ye	1edb76cc91	[X86] merge "={eax}" and "~{eax}" into "=&eax" for MSInlineASM Reviewed By: pengfei Differential Revision: https://reviews.llvm.org/D94466	2021-01-27 22:54:17 +08:00
Nico Weber	c0fc38ce15	Try to fix cl-options.c on bots where the default triple is non-x86 non-arm llvmArchToWindowsSDKArch() returns "" for non-intel non-arm archs. We're checking for "/fake/lib/" which is followed by the result of that function -- but if that returns an empty string, then that trailing slash isn't there. As a fix, just explicitly pass a triple that's intel or arm (I randomly chose aarch64). Since the test runs with -###, that arch doesn't have to be in LLVM_TARGETS_TO_BUILD.	2021-01-27 09:19:25 -05:00
Nico Weber	a5d85cbec5	clang-cl: Add /winsdkdir and /winsdkversion flags These do for the Windows SDK path what D85998 did for %VCToolsInstallDir% with /vctoolsdir: Offer a way to set them with an explicit commandline switch. With this (and /vctoolsdir), it's possible to compile and link against hermetic vctools and winsdk directories with: out/gn/bin/clang-cl win.c -fuse-ld=lld \ /vctoolsdir path/to/VC/Tools/MSVC/14.26.28801 \ /winsdkdir path/to/win_sdk compared to a long list of -imsvc and /link /libpath: flags. While here: - Change the case of the "Include" folder inside the windows sdk from "include" to "Include" to match on-disk case. Since the Windows file system is case-insensitive this isn't a behavior change, it's just a bit cleaner. - Add libpath tests to the /vctoolsdir - Add a FIXME about reading env vars for win sdk and ucrt sdk if these flags aren't present, to match the VCToolsInstallDir logic We should also cache all these computed paths in the driver instead of computing them every time they're queried, but that's for a future patch. It'd also be nice to invent a /winsysroot: flag that sets both /vctoolsdir: and /winsdkdir: to some well-known subdirectory. That's for a future patch as well. Differential Revision: https://reviews.llvm.org/D95472	2021-01-27 06:37:51 -05:00
Sven van Haastregt	79c727328b	[clang] Fix signedness in vector bitcast evaluation The included test case triggered a sign assertion on the result in `Success()`. This was caused by the APSInt created for a bitcast having its signedness bit inverted. The second APSInt constructor argument is `isUnsigned`, so invert the result of `isSignedIntegerType`. Relanding this patch after reverting. The test case had to be updated to be insensitive to 32/64-bit extractelement indices. Differential Revision: https://reviews.llvm.org/D95135	2021-01-27 09:30:26 +00:00
Duncan P. N. Exon Smith	e4871c1e2e	Rename clang/test/Frontend/output-{failures,paths}.c, NFC A follow up patch will add a few success cases here; rename it to `output-paths.c` instead of `output-failures.c`.	2021-01-26 19:26:24 -08:00
Petr Hosek	bb9eb19829	Support for instrumenting only selected files or functions This change implements support for applying profile instrumentation only to selected files or functions. The implementation uses the sanitizer special case list format to select which files and functions to instrument, and relies on the new noprofile IR attribute to exclude functions from instrumentation. Differential Revision: https://reviews.llvm.org/D94820	2021-01-26 17:13:34 -08:00
Dan Albert	fae16fc0ee	Disable rosegment for old Android versions. The unwinder used by the crash handler on versions of Android prior to API 29 did not correctly handle binaries built with rosegment, which is enabled by default for LLD. Android only supports LLD, so it's not an issue that this flag is not accepted by other linkers. Reviewed By: srhines Differential Revision: https://reviews.llvm.org/D95166	2021-01-26 16:15:45 -08:00
Fangrui Song	34b60d8a56	Add -fbinutils-version= to gate ELF features on the specified binutils version There are two use cases. Assembler We have accrued some code gated on MCAsmInfo::useIntegratedAssembler(). Some features are supported by latest GNU as, but we have to use MCAsmInfo::useIntegratedAs() because the newer versions have not been widely adopted (e.g. SHF_LINK_ORDER 'o' and 'unique' linkage in 2.35, --compress-debug-sections= in 2.26). Linker We want to use features supported only by LLD or very new GNU ld, or don't want to work around older GNU ld. We currently can't represent that "we don't care about old GNU ld". You can find such workarounds in a few other places, e.g. Mips/MipsAsmprinter.cpp PowerPC/PPCTOCRegDeps.cpp X86/X86MCInstrLower.cpp AArch64 TLS workaround for R_AARCH64_TLSLD_MOVW_DTPREL_* (PR ld/18276), R_AARCH64_TLSLE_LDST8_TPREL_LO12 (https://bugs.llvm.org/show_bug.cgi?id=36727 https://sourceware.org/bugzilla/show_bug.cgi?id=22969) Mixed SHF_LINK_ORDER and non-SHF_LINK_ORDER components (supported by LLD in D84001; GNU ld feature request https://sourceware.org/bugzilla/show_bug.cgi?id=16833 may take a while before available). This feature allows to garbage collect some unused sections (e.g. fragmented .gcc_except_table). This patch adds `-fbinutils-version=` to clang and `-binutils-version` to llc. It changes one codegen place in SHF_MERGE to demonstrate its usage. `-fbinutils-version=2.35` means the produced object file does not care about GNU ld<2.35 compatibility. When `-fno-integrated-as` is specified, the produced assembly can be consumed by GNU as>=2.35, but older versions may not work. `-fbinutils-version=none` means that we can use all ELF features, regardless of GNU as/ld support. Both clang and llc need `parseBinutilsVersion`. Such command line parsing is usually implemented in `llvm/lib/CodeGen/CommandFlags.cpp` (LLVMCodeGen), however, ClangCodeGen does not depend on LLVMCodeGen. So I add `parseBinutilsVersion` to `llvm/lib/Target/TargetMachine.cpp` (LLVMTarget). Differential Revision: https://reviews.llvm.org/D85474	2021-01-26 12:28:23 -08:00
Petr Hosek	1e634f3952	Revert "Support for instrumenting only selected files or functions" This reverts commit `4edf35f11a` because the test fails on Windows bots.	2021-01-26 12:25:28 -08:00
Fangrui Song	189f311130	CGDebugInfo CreatedLimitedType: Drop file/line for RecordType with invalid location For Clang synthesized `__va_list_tag` (`CreateX86_64ABIBuiltinVaListDecl`), its DW_AT_decl_file/DW_AT_decl_line are arbitrarily set from `CurLoc`. In a stage 2 `-DCMAKE_BUILD_TYPE=Debug` clang build, I observe that in driver.cpp, DW_AT_decl_file/DW_AT_decl_line may be set to an `#include` line (the transitively included file uses va_arg (`__builtin_va_arg`)). This seems arbitrary. Drop that. Reviewed By: #debug-info, dblaikie Differential Revision: https://reviews.llvm.org/D94735	2021-01-26 11:53:25 -08:00
Petr Hosek	4edf35f11a	Support for instrumenting only selected files or functions This change implements support for applying profile instrumentation only to selected files or functions. The implementation uses the sanitizer special case list format to select which files and functions to instrument, and relies on the new noprofile IR attribute to exclude functions from instrumentation. Differential Revision: https://reviews.llvm.org/D94820	2021-01-26 11:11:39 -08:00
Shilei Tian	7c03f7d7d0	[OpenMP][deviceRTLs] Build the deviceRTLs with OpenMP instead of target dependent language From this patch (plus some landed patches), `deviceRTLs` is taken as a regular OpenMP program with just `declare target` regions. In this way, ideally, `deviceRTLs` can be written in OpenMP directly. No CUDA, no HIP anymore. (Well, AMD is still working on getting it to work. For now AMDGCN still uses the original way to compile) However, some target specific functions are still required, but they're no longer written in target specific language. For example, CUDA parts have all been refined by replacing CUDA intrinsics and builtins with LLVM/Clang/NVVM intrinsics. Here is a list of changes in this patch. 1. For NVPTX, `DEVICE` is defined empty in order to make the common parts still work with AMDGCN. Later once AMDGCN is also available, we will completely remove `DEVICE` or probably some other macros. 2. Shared variable is implemented with OpenMP allocator, which is defined in `allocator.h`. Again, this feature is not available on AMDGCN, so two macros are redefined properly. 3. CUDA header `cuda.h` is dropped in the source code. In order to deal with code difference in various CUDA versions, we build one bitcode library for each supported CUDA version. For each CUDA version, the highest PTX version it supports will be used, just as what we currently use for CUDA compilation. 4. Correspondingly, compiler driver is also updated to support CUDA version encoded in the name of bitcode library. Now the bitcode library for NVPTX is named as `libomptarget-nvptx-cuda_[cuda_version]-sm_[sm_number].bc`, such as `libomptarget-nvptx-cuda_80-sm_20.bc`. With this change, there are also multiple features to be expected in the near future: 1. CUDA will be completely dropped when compiling OpenMP. By that time, we will also build bitcode libraries for all supported SM, multiplied by all supported CUDA versions. 2. Atomic operations used in `deviceRTLs` can be replaced by `omp atomic` if OpenMP 5.1 feature is fully supported. For now, the IR generated is totally wrong. 3. Target specific parts will be wrapped into `declare variant` with `isa` selector if it can work properly. No target specific macro is needed anymore. 4. (Maybe more...) Reviewed By: JonChesterfield Differential Revision: https://reviews.llvm.org/D94745	2021-01-26 12:28:47 -05:00
Mircea Trofin	0c0d009a88	[NFC] Disallow unused prefixes under clang/test/CodeGen Differential Revision: https://reviews.llvm.org/D95417	2021-01-26 08:05:45 -08:00
Zarko Todorovski	028d7a3668	Remove requirement for -maltivec to be used when using -mabi=vec-extabi or -mabi=vec-default when not using vector code The previous implementation required that `-maltivec` be specified when using either `-mabi=vec-extabi` or `-mabi=vec-default`; this patch removes that requirement. Reviewed By: cebowleratibm Differential Revision: https://reviews.llvm.org/D94986	2021-01-26 07:58:01 -05:00
Johannes Doerfert	bd756286d2	[OpenMP][FIX] Enforce a function boundary for a new data environment Whenever we enter a new OpenMP data environment we want to enter a function to simplify reasoning. Later we probably want to remove the entire specialization wrt. the if clause and pass the result to the runtime, for now this should fix PR48686. Reviewed By: ABataev Differential Revision: https://reviews.llvm.org/D94315	2021-01-25 22:43:37 -06:00
Hsiangkai Wang	f19849a07b	[RISCV] Update V extension to v1.0-draft 08a0b464. Differential Revision: https://reviews.llvm.org/D94583	2021-01-26 12:02:43 +08:00
Mircea Trofin	91b61abafb	[NFC] Disallow unused prefixes in clang/test/Analysis Differential Revision: https://reviews.llvm.org/D95249	2021-01-25 15:53:00 -08:00
Leonard Chan	c0e94e9974	[clang][Fuchsia] Add relative-vtables + asan multilibs We're choosing to take an opt-in approach for landing Relative VTables, so we'll need asan-equivalent multilibs with relative vtables enabled. Afterwards, we can just flip the switch in our build. Differential Revision: https://reviews.llvm.org/D95253	2021-01-25 15:24:16 -08:00
Harald van Dijk	b43c26d036	Restore GNU , ## __VA_ARGS__ behavior in MSVC mode As noted in D91913, MSVC implements the GNU behavior for , ## __VA_ARGS__ as well. Do the same when `-fms-compatibility` is used. Reviewed By: rsmith Differential Revision: https://reviews.llvm.org/D95392	2021-01-25 22:34:49 +00:00
Richard Smith	925ae8c790	Revert "[ObjC][ARC] Annotate calls with attributes instead of emitting retainRV" This reverts commit `53176c1680`, which introduced a layering violation. LLVM's IR library can't include headers from Analysis.	2021-01-25 13:53:38 -08:00
Albertas Vyšniauskas	60bf5826cf	[clang-format] PR16518 Add flag to suppress empty line insertion before access modifier Add new option called InsertEmptyLineBeforeAccessModifier. An empty line before an access modifier is inserted if this option is set to true (which is the default value, because clang-format always inserts empty lines before access modifiers); otherwise empty lines are removed. Fixes issue #16518. Differential Revision: https://reviews.llvm.org/D93846	2021-01-25 21:02:41 +01:00
Akira Hatanaka	53176c1680	[ObjC][ARC] Annotate calls with attributes instead of emitting retainRV or claimRV calls in the IR Background: This patch makes changes to the front-end and middle-end that are needed to fix a longstanding problem where llvm breaks ARC's autorelease optimization (see the link below) by separating calls from the marker instructions or retainRV/claimRV calls. The backend changes are in https://reviews.llvm.org/D92569. https://clang.llvm.org/docs/AutomaticReferenceCounting.html#arc-runtime-objc-autoreleasereturnvalue What this patch does to fix the problem: - The front-end annotates calls with attribute "clang.arc.rv"="retain" or "clang.arc.rv"="claim", which indicates the call is implicitly followed by a marker instruction and a retainRV/claimRV call that consumes the call result. This is currently done only when the target is arm64 and the optimization level is higher than -O0. - ARC optimizer temporarily emits retainRV/claimRV calls after the annotated calls in the IR and removes the inserted calls after processing the function. - ARC contract pass emits retainRV/claimRV calls after the annotated calls. It doesn't remove the attribute on the call since the backend needs it to emit the marker instruction. The retainRV/claimRV calls are emitted late in the pipeline to prevent optimization passes from transforming the IR in a way that makes it harder for the ARC middle-end passes to figure out the def-use relationship between the call and the retainRV/claimRV calls (which is the cause of PR31925). - The function inliner removes the autoreleaseRV call in the callee that returns the result if nothing in the callee prevents it from being paired up with the calls annotated with "clang.arc.rv"="retain/claim" in the caller. If the call is annotated with "claim", a release call is inserted since autoreleaseRV+claimRV is equivalent to a release. 
If it cannot find an autoreleaseRV call, it tries to transfer the attributes to a function call in the callee. This is important since ARC optimizer can remove the autoreleaseRV call returning the callee result, which makes it impossible to pair it up with the retainRV or claimRV call in the caller. If that fails, it simply emits a retain call in the IR if the call is annotated with "retain" and does nothing if it's annotated with "claim". - This patch teaches dead argument elimination pass not to change the return type of a function if any of the calls to the function are annotated with attribute "clang.arc.rv". This is necessary since the pass can incorrectly determine nothing in the IR uses the function return, which can happen since the front-end no longer explicitly emits retainRV/claimRV calls in the IR, and change its return type to 'void'. Future work: - Use the attribute on x86-64. - Fix the auto upgrader to convert call+retainRV/claimRV pairs into calls annotated with the attributes. rdar://71443534 Differential Revision: https://reviews.llvm.org/D92808	2021-01-25 11:57:08 -08:00
Keith Smiley	c3324450b2	[clang] Add -fprofile-prefix-map This flag allows you to re-write absolute paths in coverage data analogous to -fdebug-prefix-map. This flag is also implied by -ffile-prefix-map.	2021-01-25 10:14:04 -08:00
Erik Pilkington	c4355670b4	[Sema] Fix an assertion failure in -Wcompletion-handler NamedDecl::getName() was being called on a constructor.	2021-01-25 13:02:02 -05:00
Anton Zabaznov	e123cd674c	[OpenCL] Refactor of targets OpenCL option settings Currently, there is some refactoring needed in existing interface of OpenCL option settings to support OpenCL C 3.0. The problem is that OpenCL extensions and features are not only determined by the target platform but also by the OpenCL version. Also, there are core extensions/features which are supported unconditionally in specific OpenCL C version. In fact, these rules are not being followed for all targets. For example, there are some targets (as nvptx and r600) which don't support OpenCL C 2.0 core features (nvptx.languageOptsOpenCL.cl, r600.languageOptsOpenCL.cl). After the change there will be explicit differentiation between optional core and core OpenCL features which allows giving diagnostics if target doesn't support any of necessary core features for specific OpenCL version. This patch also eliminates `OpenCLOptions` instance duplication from `TargetOptions`. `OpenCLOptions` instance should take place in `Sema` as it's going to be modified during parsing. Removing this duplication will also allow to generally simplify `OpenCLOptions` class for parsing purposes. Reviewed By: Anastasia Differential Revision: https://reviews.llvm.org/D92277	2021-01-25 19:50:23 +03:00
Adam Czachorowski	d462aa5a61	[clang] Fix a nullptr dereference bug on invalid code When working with invalid code, we would try to dereference a nullptr while deducing template arguments in some dependent code operating on a lambda with invalid return type. Differential Revision: https://reviews.llvm.org/D95145	2021-01-25 15:02:25 +01:00
Abhina Sreeskantharajan	978444d531	Revert "[SystemZ][z/OS] Fix No such file or directory expression error" This reverts commit `06f8a49693`.	2021-01-25 08:29:38 -05:00
Abhina Sreeskantharajan	84851a274e	Revert "[SystemZ][z/OS] Fix No such file or directory expression error matching in lit tests - continued" This reverts commit `520b5ecf85`.	2021-01-25 08:29:38 -05:00
Sven van Haastregt	b16fb1ffc3	Revert "[clang] Fix signedness in vector bitcast evaluation" This reverts commit `14947cd047` because it broke clang-cmake-armv7-quick.	2021-01-25 12:43:30 +00:00
Sven van Haastregt	14947cd047	[clang] Fix signedness in vector bitcast evaluation The included test case triggered a sign assertion on the result in `Success()`. This was caused by the APSInt created for a bitcast having its signedness bit inverted. The second APSInt constructor argument is `isUnsigned`, so invert the result of `isSignedIntegerType`. Differential Revision: https://reviews.llvm.org/D95135	2021-01-25 12:01:42 +00:00
Simon Cook	666815d61b	[RISCV] Implement new architecture extension macros This adds support for the new architecture extension test macros as defined in the C-API Document: https://github.com/riscv/riscv-c-api-doc/blob/master/riscv-c-api.md Extension versions have been taken from what are used in RISCVTargetStreamer for ratified extensions, and the -march parser for experimental extensions. Differential Revision: https://reviews.llvm.org/D94403	2021-01-25 08:58:46 +00:00
Haojian Wu	c6bd6607bf	Fix a build-bot failure. The test ms-lookup-template-base-classes.cpp added in `d972d4c749` is failing on some buildbots that don't include x86. This patch should fix that (following the patterns in the test directory).	2021-01-25 09:46:29 +01:00
Ben Shi	01d9f13c3a	Revert "[clang][AVR] Improve avr-ld command line options" This reverts commit `89a5147e5a`.	2021-01-25 16:33:58 +08:00
Ben Shi	89a5147e5a	[clang][AVR] Improve avr-ld command line options	2021-01-25 12:01:26 +08:00
Harald van Dijk	f4537935dc	Suppress non-conforming GNU paste extension in all standard-conforming modes The GNU token paste extension that removes the comma in , ## __VA_ARGS__ conflicts with C99/C++11's requirements when a variadic macro has no named parameters: according to the standard, an invocation as FOO() gives it a single empty argument, and concatenation of anything with an empty argument is well-defined. For this reason, the GNU extension was already disabled in C99 standard-conforming mode. It was not yet disabled in C++11 standard-conforming mode. The associated comment suggested that GCC keeps this extension enabled in C90/C++03 standard-conforming mode, but it actually does not, so rather than adding a check for C++ language version, this change simply removes the check for C language version. Reviewed By: rsmith Differential Revision: https://reviews.llvm.org/D91913	2021-01-25 00:56:45 +00:00
Simon Cook	afd483e57d	[RISCV] Add support for Zvamo/Zvlsseg to driver Differential Revision: https://reviews.llvm.org/D94930	2021-01-24 22:07:56 +00:00
Shilei Tian	5ad038aafa	[Clang][OpenMP][NVPTX] Replace `libomptarget-nvptx-path` with `libomptarget-nvptx-bc-path` D94700 removed the static library so we no longer need to pass `-llibomptarget-nvptx` to `nvlink`. Since the bitcode library is the only device runtime for now, instead of emitting a warning when it is not found, an error should be raised. We also set a new option `libomptarget-nvptx-bc-path` to let user choose which bitcode library is being used. Reviewed By: JonChesterfield Differential Revision: https://reviews.llvm.org/D95161	2021-01-23 14:42:38 -05:00
Jeroen Dobbelaere	2b9a834c43	[InlineFunction] Use llvm.experimental.noalias.scope.decl for noalias arguments. Insert a llvm.experimental.noalias.scope.decl intrinsic that identifies where a noalias argument was inlined. This patch includes some refactorings from D90104. Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D93040	2021-01-23 12:10:57 +01:00
George Koehler	018984ae68	[PowerPC] Fix va_arg in C++, Objective-C on 32-bit ELF targets In the PPC32 SVR4 ABI, a va_list has copies of registers from the function call. va_arg looked in the wrong registers for (the pointer representation of) an object in Objective-C, and for some types in C++. Fix va_arg to look in the general-purpose registers, not the floating-point registers. Also fix va_arg for some C++ types, like a member function pointer, that are aggregates for the ABI. Anthony Richardby found the problem in Objective-C. Eli Friedman suggested part of this fix. Fixes https://bugs.llvm.org/show_bug.cgi?id=47921 Reviewed By: efriedma, nemanjai Differential Revision: https://reviews.llvm.org/D90329	2021-01-23 00:13:36 -05:00
Richard Smith	e92be7cd9f	PR47682: Merge the DeclContext of a merged FunctionDecl before we inherit default arguments. When a function is declared with a qualified name, its eventual semantic DeclContext may differ from the scope specified by the qualifier if it redeclares a function in an inline namespace. In this case, we need to update the DeclContext to be that of the previous declaration, and we need to do so before we decide whether to inherit default arguments from that previous declaration, because we only inherit default arguments from declarations in the same scope.	2021-01-22 15:46:41 -08:00
Craig Topper	20f2e32d2c	[RISCV] Update B extension version to 0.93. Reviewed By: asb, frasercrmck Differential Revision: https://reviews.llvm.org/D95002	2021-01-22 12:49:10 -08:00
Craig Topper	4e6ad11bc6	[RISCV] Add Zba feature and move add.uw and slli.uw to it. Still need to add SH*ADD instructions. Reviewed By: asb, frasercrmck Differential Revision: https://reviews.llvm.org/D94617	2021-01-22 12:49:10 -08:00
Abhina Sreeskantharajan	520b5ecf85	[SystemZ][z/OS] Fix No such file or directory expression error matching in lit tests - continued This is a continuation of https://reviews.llvm.org/D94239. I missed some other spellings of the same error. Reviewed By: muiez Differential Revision: https://reviews.llvm.org/D95246	2021-01-22 13:54:25 -05:00
Yaxun (Sam) Liu	622eaa4a4c	[HIP] Support __managed__ attribute This patch implements codegen for __managed__ variable attribute for HIP. Diagnostics will be added later. Differential Revision: https://reviews.llvm.org/D94814	2021-01-22 11:43:58 -05:00
Haojian Wu	d972d4c749	Revert "[clang] Suppress "follow-up" diagnostics on recovery call expressions." This reverts commit `efa9aaad70` and adds a crash test. The commit caused a crash in CodeGen with -fms-compatibility, see https://bugs.llvm.org/show_bug.cgi?id=48690.	2021-01-22 13:04:37 +01:00
Argyrios Kyrtzidis	b0e89906f5	[ASTReader] Allow controlling separately whether validation should be disabled for a PCH vs a module file This addresses an issue with how the PCH preamble works, specifically: 1. When using a PCH/preamble the module hash changes and a different cache directory is used 2. When the preamble is used, PCH & PCM validation is disabled. Due to the combination of #1 and #2, reparsing with preamble enabled can end up loading a stale module file before a header change and using it without updating it because validation is disabled and it doesn’t check that the header has changed and the module file is out-of-date. rdar://72611253 Differential Revision: https://reviews.llvm.org/D95159	2021-01-21 20:45:54 -08:00
Akira Hatanaka	3d349ed7e1	[CodeGen][ObjC] Fix broken IR generated when there is a nil receiver check This patch fixes a bug in emitARCOperationAfterCall where it inserts the fall-back call after a bitcast instruction and then replaces the bitcast's operand with the result of the fall-back call. The generated IR without this patch looks like this: msgSend.call: ; preds = %entry %call = call i8* bitcast (i8* (i8, i8, ...)* @objc_msgSend br label %msgSend.cont msgSend.null-receiver: ; preds = %entry call void @llvm.objc.release(i8* %4) br label %msgSend.cont msgSend.cont: %8 = phi i8* [ %call, %msgSend.call ], [ null, %msgSend.null-receiver ] %9 = bitcast i8* %10 to %0* %10 = call i8* @llvm.objc.retain(i8* %8) Notice that `%9 = bitcast i8* %10 to %0*` is taking operand %10 which is defined after it. To fix the bug, this patch modifies the insert point to point to the bitcast instruction so that the fall-back call is inserted before the bitcast. In addition, it teaches the function to look at phi instructions that are generated when there is a check for a null receiver and insert the retainRV/claimRV instruction right after the call instead of inserting a fall-back call right after the phi instruction. rdar://73360225 Differential Revision: https://reviews.llvm.org/D95181	2021-01-21 17:38:46 -08:00
Jon Roelofs	1deee5cacb	Fix crash when emitting NullReturn guards for functions returning BOOL CodeGenModule::EmitNullConstant() creates constants with their "in memory" type, not their "in vregs" type. The one place where this difference matters is when the type is _Bool, as that is an i1 when in vregs and an i8 in memory. Fixes: rdar://73361264	2021-01-21 14:29:36 -08:00
Nikita Popov	65fd034b95	[FunctionAttrs] Infer willreturn for functions without loops If a function doesn't contain loops and does not call non-willreturn functions, then it is willreturn. Loops are detected by checking for backedges in the function. We don't attempt to handle finite loops at this point. Differential Revision: https://reviews.llvm.org/D94633	2021-01-21 20:29:33 +01:00
Artem Belevich	127091bfd5	[CUDA] Normalize handling of defaulted dtor. Defaulted destructor was treated inconsistently, compared to other compiler-generated functions. When Sema::IdentifyCUDATarget() got called on just-created dtor which didn't have implicit __host__ __device__ attributes applied yet, it would treat it as a host function. That happened to (sometimes) hide the error when dtor referred to host-only functions. Even when we had identified defaulted dtor as a HD function, we still treated it inconsistently during selection of usual deallocators, where we did not allow referring to wrong-side functions, while it is allowed for other HD functions. This change brings handling of defaulted dtors in line with other HD functions. Differential Revision: https://reviews.llvm.org/D94732	2021-01-21 10:48:07 -08:00
Joseph Huber	e4eaf9d820	[OpenMP] Add support for mapping names in mapper API Summary: The custom mapper API did not previously support the mapping names added previously. This means they were not present if a user requested debugging information while using the mapper functions. This adds basic support for passing the mapped names to the runtime library. Reviewers: jdoerfert Differential Revision: https://reviews.llvm.org/D94806	2021-01-21 09:26:44 -05:00


44094 Commits