llvm-project

Commit Graph

Author	SHA1	Message	Date
Fangrui Song	aed6a1b137	Add tests for clang -fno-zero-initialized-in-bss and llc -nozero-initialized-in-bss And rename the CC1 option.	2020-07-04 23:26:57 -07:00
Kai Luo	68e07da3e5	[clang][PowerPC] Enable -fstack-clash-protection option for ppc64 Differential Revision: https://reviews.llvm.org/D81355	2020-07-05 03:43:56 +00:00
Kazushi (Jam) Marukawa	804d968744	[VE] Rename VE toolchain source files Summary: Rename VE.cpp and VE.h to VEToolchain.cpp and VEToolchain.h respectively in order to avoid link warning message. Linker warns that VE.cpp.o and Arch/VE.cpp.o have the same name. Reviewers: simoll, k-ishizaka Reviewed By: simoll Subscribers: mgorny, cfe-commits Tags: #llvm, #ve, #clang Differential Revision: https://reviews.llvm.org/D82968	2020-07-02 18:45:16 +09:00
Daniel Kiss	070acb1d1e	[Driver][ARM] parse version of arm/thumb architecture correctly Summary: If you execute the following commandline multiple times, the behavior was not always the same: clang++ --target=thumbv7em-none-windows-eabi-coff -march=armv7-m -mcpu=cortex-m7 -o temp.obj -c -x c++ empty.cpp Most of the time the compilation succeeded, but sometimes clang reported this error: clang++: error: the target architecture 'thumbv7em' is not supported by the target 'thumbv7em-none-windows-eabi' The cause of the inconsistent behavior was the uninitialized variable Version. With these commandline arguments, the variable Version was not set by getAsInteger(), because it cannot parse a number from the substring "7em" (of "thumbv7em"). To get a consistent behaviour, it's enough to initialize the variable Version to zero. Zero is smaller than 7, so the comparison will be true. Then the command always fails with the error message seen above. By using consumeInteger() instead of getAsInteger() we get 7 from the substring "7em" and the command does not fail. Reviewers: compnerd, danielkiss Reviewed By: danielkiss Subscribers: danielkiss, kristof.beyls, cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D75453	2020-07-01 12:13:52 +02:00
Alex Lorenz	6792a60778	[darwin][driver] pass the minimum supported OS version to the linker if it's newer than the target version This change ensures that the arm64-apple-macOS slice is linked for macOS 11 even if the deployment target is earlier than macOS 11.	2020-06-29 19:03:57 -07:00
James Y Knight	4772b99dff	Clang Driver: refactor support for writing response files to be specified at Command creation, rather than as part of the Tool. This resolves the hack I just added to allow Darwin toolchain to vary its level of support based on `-mlinker-version=`. The change preserves the _current_ settings for response-file support. Some tools look likely to be declaring that they don't support response files in error, however I kept them as-is in order for this change to be a simple refactoring. Differential Revision: https://reviews.llvm.org/D82782	2020-06-29 18:27:02 -04:00
James Y Knight	381df1653c	Clang Driver: Use Apple ld64's new @response-file support. In XCode 12, ld64 got support for @files, in addition to the old -filelist mechanism. Response files allow passing all command-line arguments to the linker via a file, rather than just filenames, and is therefore preferred. Because of the way response-file support is currently implemented as part of the Tool class in Clang, this change requires an ugly backdoor function to access Args. A follow-up commit fixes this, but I've ordered this change first, for easier backportability. I've added no tests here, because unfortunately, there don't appear to be _any_ response-file emission automated tests, and I don't see an obvious way to add them. I've tested that this change works as expected locally. Differential Revision: https://reviews.llvm.org/D82777	2020-06-29 18:26:53 -04:00
Alex Lorenz	f7a14514ee	[darwin][driver] isMacosxVersionLT should check against the minimum supported OS version This change ensures that the Darwin driver doesn't add unsupported libraries to the link invocation when linking the Apple Silicon macOS slice. rdar://61011136 Differential Revision: https://reviews.llvm.org/D82696	2020-06-29 12:21:54 -07:00
Hsiangkai Wang	d698ff92a5	[RISCV] Support experimental v extensions. This follows the design as discussed on the mailing lists in the following RFC: http://lists.llvm.org/pipermail/llvm-dev/2020-January/138364.html Support for the vector 'v' extension v0.8. Differential revision: https://reviews.llvm.org/D81188	2020-06-28 00:54:07 +08:00
David Zarzycki	dab859d1bf	Reland: [clang driver] Move default module cache from system temporary directory This fixes a unit test. Otherwise here is the original commit: 1) Shared writable directories like /tmp are a security problem. 2) Systems provide dedicated cache directories these days anyway. 3) This also refines LLVM's cache_directory() on Darwin platforms to use the Darwin per-user cache directory. Reviewers: compnerd, aprantl, jakehehrlich, espindola, respindola, ilya-biryukov, pcc, sammccall Reviewed By: compnerd, sammccall Subscribers: hiraditya, llvm-commits, cfe-commits Tags: #clang, #llvm Differential Revision: https://reviews.llvm.org/D82362	2020-06-27 05:35:15 -04:00
Alex Lorenz	2b00cacb28	[darwin][driver] NFC, split addStartObjectFileArgs into multiple functions	2020-06-26 17:15:37 -07:00
Alex Lorenz	253988f0f4	[darwin][driver] Do not link with libarclite when building for Apple Silicon macOS	2020-06-26 17:15:37 -07:00
Nico Weber	4d5c448943	Revert "[clang driver] Move default module cache from system temporary directory" This reverts commit `bb26838cef`. Breaks Support.CacheDirectoryNoEnv, Support.CacheDirectoryWithEnv in SupportTests (part of check-llvm) on macOS.	2020-06-26 13:25:45 -04:00
David Zarzycki	bb26838cef	[clang driver] Move default module cache from system temporary directory 1) Shared writable directories like /tmp are a security problem. 2) Systems provide dedicated cache directories these days anyway. 3) This also refines LLVM's cache_directory() on Darwin platforms to use the Darwin per-user cache directory. Reviewers: compnerd, aprantl, jakehehrlich, espindola, respindola, ilya-biryukov, pcc, sammccall Reviewed By: compnerd, sammccall Subscribers: hiraditya, llvm-commits, cfe-commits Tags: #clang, #llvm Differential Revision: https://reviews.llvm.org/D82362	2020-06-26 07:46:03 -04:00
Yaxun (Sam) Liu	8013ce4490	[HIP] Add missing options for lto Add -mcpu, -mattr, -mllvm, and -save-temps options for lto when necessary. Differential Revision: https://reviews.llvm.org/D82506	2020-06-26 00:26:05 -04:00
Yaxun (Sam) Liu	ed398c3ca4	[NFC] Extract unifyTargetFeatures Differential Revision: https://reviews.llvm.org/D82579	2020-06-25 23:17:08 -04:00
Wouter van Oortmerssen	b9a539c010	[WebAssembly] Adding 64-bit versions of __stack_pointer and other globals We have 6 globals, all of which except for __table_base are 64-bit under wasm64. Differential Revision: https://reviews.llvm.org/D82130	2020-06-25 15:52:44 -07:00
Alexey Bataev	32ea3397be	[OPENMP]Dynamic globalization for parallel target regions. Summary: Added support for dynamic memory allocation for globalized variables in case if execution of target regions in parallel is required. Reviewers: jdoerfert Subscribers: jholewinski, yaxunl, guansong, sstefan1, cfe-commits, caomhin Tags: #clang Differential Revision: https://reviews.llvm.org/D82324	2020-06-25 08:25:24 -04:00
Kazushi (Jam) Marukawa	96d4ccf00c	[VE] Clang toolchain for VE Summary: This patch enables compilation of C code for the VE target with Clang. Differential Revision: https://reviews.llvm.org/D79411	2020-06-24 10:12:09 +02:00
Alex Lorenz	565603cc94	[clang][driver] set macOS as the target OS for -arch arm64 when clang is running on an Apple Silicon mac This change allows users to use `-arch arm64` to build for mac when running it on Apple Silicon mac without explicit `-target` option. Differential Revision: https://reviews.llvm.org/D82428	2020-06-23 21:08:11 -07:00
Alex Lorenz	f724ce0d73	[clang][driver] allow macOS 11 OS version in the driver	2020-06-23 15:14:26 -07:00
Dylan McKay	15b9dc46c6	Revert "[AVR] Explicitly set the address of the data section when invoking the linker" This reverts commit `ede6005e70`. Ayke suggests this value varies chip-by-chip, and thus it is not safe to hardcode to 0x800100. Proper logic for this linker parameter will have to be wired up in a follow up patch.	2020-06-23 22:23:05 +12:00
Aaron En Ye Shi	4bafb0adcf	Add Statically Linked Libraries Add GNU Static Lib Tool, which supports the --emit-static-lib flag. For HIP, a static library archive will be created and consist of HIP Fat Binary host object with the device images embedded. Using llvm-ar to create the static archive. Also, delete existing output file to ensure a new archive is created each time. Reviewers: yaxunl, tra, rjmccall, echristo Subscribers: echristo, JonChesterfield, scchan, msearles Differential Revision: https://reviews.llvm.org/D78759	2020-06-22 19:48:49 +00:00
Aaron En Ye Shi	77df5a8283	[HIP] Move HIP Linking Logic into HIP ToolChain This patch is a follow up on https://reviews.llvm.org/D78759. Extract the HIP Linker script from generic GNU linker, and move it into HIP ToolChain. Update OffloadActionBuilder Link actions feature to apply device linking and host linking actions separately. Using MC Directives, embed the device images and define symbols. Reviewers: JonChesterfield, yaxunl Subscribers: tra, echristo, jdoerfert, msearles, scchan Differential Revision: https://reviews.llvm.org/D81963	2020-06-22 19:48:48 +00:00
Yaxun (Sam) Liu	2580635bd2	Let HIP default include respect -nogpuinc and -nogpulib	2020-06-22 08:39:12 -04:00
Luboš Luňák	a45f713c67	add option to instantiate templates already in the PCH Add -fpch-instantiate-templates which makes template instantiations be performed already in the PCH instead of it being done in every single file that uses the PCH (but every single file will still do it as well in order to handle its own instantiations). I can see 20-30% build time saved with the few tests I've tried. The change may reorder compiler output and also generated code, but should be generally safe and produce functionally identical code. There are some rare cases that do not compile with it, such as test/PCH/pch-instantiate-templates-forward-decl.cpp. If template instantiation bailed out instead of reporting the error, these instantiations could even be postponed, which would make them work. Enable this by default for clang-cl. MSVC creates PCHs by compiling them using an empty .cpp file, which means templates are instantiated while building the PCH and so the .h needs to be self-contained, making test/PCH/pch-instantiate-templates-forward-decl.cpp to fail with MSVC anyway. So the option being enabled for clang-cl matches this. Differential Revision: https://reviews.llvm.org/D69585	2020-06-21 17:05:52 +02:00
Xiangling Liao	3f2e61c1fe	[AIX] Default AIX to using -fno-use-cxa-atexit On AIX, we use __atexit to register dtor functions rather than __cxa_atexit. So a driver change is needed to default AIX to using -fno-use-cxa-atexit. Windows platform does not uses __cxa_atexit either. Following its precedent, we remove the assertion for when -fuse-cxa-atexit is specified by the user, do not produce a message and silently default to -fno-use-cxa-atexit behavior. Differential Revision: https://reviews.llvm.org/D82136	2020-06-19 08:27:07 -04:00
Kristof Beyls	c113b59ef5	[AArch64] Add clang command line support for -mharden-sls= The accepted options to -mharden-sls= are: * all: enable all mitigations against Straight Line Speculation that are implemented. * none: disable all mitigations against Straight Line Speculation. * retbr: enable the mitigation against Straight Line Speculation for RET and BR instructions. * blr: enable the mitigation against Straight Line Speculation for BLR instructions. Differential Revision: https://reviews.llvm.org/D81404	2020-06-19 07:31:48 +01:00
Dylan McKay	01741d6dbe	[AVR] Remove duplicate specification of lib directory Reviewers: dylanmckay Reviewed By: dylanmckay Subscribers: Jim, cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D77334 This was originally committed in `03b0831144` but I missed the commit attribution. Patch by Dennis van der Schagt.	2020-06-19 17:39:18 +12:00
Dylan McKay	aeaa09ec10	Revert "[AVR] Remove duplicate specification of lib directory" This reverts commit `03b0831144`. I forgot to attribute the commit originally so I am recommitting in a subsequent patch.	2020-06-19 17:37:15 +12:00
Dylan McKay	03b0831144	[AVR] Remove duplicate specification of lib directory Reviewers: dylanmckay Reviewed By: dylanmckay Subscribers: Jim, cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D77334	2020-06-19 17:35:09 +12:00
Yaxun (Sam) Liu	c830d517b4	[HIP] Enable -amdgpu-internalize-symbols Enable -amdgpu-internalize-symbols to eliminate unused functions and global variables for whole program to speed up compilation and improve performance. For -fno-gpu-rdc, -amdgpu-internalize-symbols is passed to clang -cc1. For -fgpu-rdc, -amdgpu-internalize-symbols is passed to lld. Differential Revision: https://reviews.llvm.org/D81959	2020-06-18 16:34:37 -04:00
Yaxun (Sam) Liu	92d8ad02e9	[HIP] Fix rocm not found on rocm3.5 Currently rocm detector expects device library bitcodes named as .bc instead of .amdgcn.bc. However in rocm3.5 the device library bitcodes are named as *.amdgcn.bc, which causes rocm3.5 not detected. This patch fixes that. Differential Revision: https://reviews.llvm.org/D81713	2020-06-18 08:40:09 -04:00
Ryan Prichard	6c4ce20226	[Driver] Search computed sysroot for libc++ header paths Summary: The Android NDK's clang driver is used with an Android -target setting, and the driver automatically finds the Android sysroot at a path relative to the driver. The sysroot has the libc++ headers in it. Remove Hurd::computeSysRoot as it is equivalent to the new ToolChain::computeSysRoot method. Fixes PR46213. Reviewers: srhines, danalbert, #libc, kristina Reviewed By: srhines, danalbert Subscribers: ldionne, sthibaul, asb, rbar, johnrusso, simoncook, sabuasal, niosHD, jrtc27, MaskRay, zzheng, edward-jones, rogfer01, MartinMosbeck, brucehoult, the_o, PkmX, jocewei, Jim, lenary, s.egerton, pzheng, sameer.abuasal, apazos, luismarques, cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D81622	2020-06-17 16:17:37 -07:00
Yuanfang Chen	2956cc50f3	[Clang][Driver] Remove gold linker support for PS4 toolchain Reviewers: probinson Subscribers: cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D81970	2020-06-17 09:45:14 -07:00
Martin Storsjö	e3fd9dc973	[clang] Enable -mms-bitfields by default for mingw targets This matches GCC, which enabled -mms-bitfields by default for mingw targets in 4.7 [1]. [1] https://www.gnu.org/software/gcc/gcc-4.7/changes.html Differential Revision: https://reviews.llvm.org/D81795	2020-06-17 09:37:07 +03:00
Martin Storsjö	beeed368b6	[clang] [MinGW] Link kernel32 once after the last instance of msvcrt The msvcrt library isn't a pure import library; it does contain regular object files with wrappers/fallbacks, and these can require linking against kernel32. This only makes a difference when linking with ld.bfd, as lld always searches all static libraries. This matches a similar change made recently in gcc in https://gcc.gnu.org/git/?p=gcc.git;a=commitdiff;h=850533ab160ef40eccfd039e1e3b138cf26e76b8, although clang adds --start-group --end-group around these libraries if -static is specified, which gcc doesn't. But try to match gcc's linking order in any case, for consistency. Differential Revision: https://reviews.llvm.org/D80880	2020-06-17 09:37:07 +03:00
Yuanfang Chen	8d4a806ef0	Revert "remove gold linker" This reverts commit `719c87edc5`. Checked in by accident. Sorry.	2020-06-16 13:05:53 -07:00
Yuanfang Chen	719c87edc5	remove gold linker	2020-06-16 13:03:31 -07:00
Ilya Leoshkevich	ccd8b7b103	[LSan] Enable for SystemZ Summary: Add runtime support, adjust the tests and enable LSan. Reviewers: vitalybuka, eugenis, uweigand, jonpa Reviewed By: uweigand Subscribers: mgorny, cfe-commits, #sanitizers Tags: #clang, #sanitizers Differential Revision: https://reviews.llvm.org/D78644	2020-06-16 13:45:29 +02:00
Yaxun (Sam) Liu	6752786d65	[HIP] Do not use llvm-link/opt/llc for -fgpu-rdc This patch is a follow up on https://reviews.llvm.org/D81627. In addition to default -fno-gpu-rdc case, this patches let HIP toolchain not use llvm-link/opt/llc to link device code for -fgpu-rdc case. Instead, uses standard lto. This will eliminate some redundant optimizations and speed up the compilation/linking. Differential Revision: https://reviews.llvm.org/D81861	2020-06-15 21:09:18 -04:00
Yaxun (Sam) Liu	e8090d83fd	[HIP] Do not call opt/llc for -fno-gpu-rdc Currently HIP toolchain calls clang to emit bitcode then calls opt/llc for device compilation for the default -fno-gpu-rdc case, which is unnecessary since clang is able to compile a single source file to ISA. This patch fixes the HIP action builder and toolchain so that the default -fno-gpu-rdc can be done like a canonical toolchain, i.e. one clang -cc1 invocation to compile source code to ISA. This can avoid unnecessary processes to speed up the compilation, and avoid redundant LLVM passes which are performed in clang -cc1 and opt. Differential Revision: https://reviews.llvm.org/D81627	2020-06-15 18:55:01 -04:00
Artem Belevich	d700237f1a	[CUDA,HIP] Use VFS for SDK detection. It's useful for using clang from tools that may need need to provide SDK files from non-standard locations. Clang CLI only provides a way to specify VFS for include files, so there's no good way to test this yet. Differential Revision: https://reviews.llvm.org/D81771	2020-06-15 12:54:44 -07:00
Sterling Augustine	f07b3d41e7	Add #includes so that ROCm.h is compilable stand-alone. Summary: ROCm.h had been getting the declarations for various data structures by being #included next to them, rather than #includeing them itself. This change fixes that by explicitly including the appropriate headers. Subscribers: cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D81432	2020-06-08 14:16:30 -07:00
Jian Cai	4db2b70248	Add a flag to debug automatic variable initialization Summary: Add -ftrivial-auto-var-init-stop-after= to limit the number of times stack variables are initialized when -ftrivial-auto-var-init= is used to initialize stack variables to zero or a pattern. This flag can be used to bisect uninitialized uses of a stack variable exposed by automatic variable initialization, such as http://crrev.com/c/2020401. Reviewers: jfb, vitalybuka, kcc, glider, rsmith, rjmccall, pcc, eugenis, vlad.tsyrklevich Reviewed By: jfb Subscribers: phosek, hubert.reinterpretcast, srhines, MaskRay, george.burgess.iv, dexonsmith, inglorion, gbiv, llozano, manojgupta, cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D77168	2020-06-08 12:30:56 -07:00
Fangrui Song	336e1f03d1	[Driver] Omit -mthread-model posix which is the CC1 default	2020-06-07 12:27:11 -07:00
Yaxun (Sam) Liu	8422bc9efc	recommit "[HIP] Add default header and include path" recommit `11d06b9511` with fix for lit tests.	2020-06-06 14:21:22 -04:00
Jonas Paulsson	515bfc66ea	[SystemZ] Implement -fstack-clash-protection Probing of allocated stack space is now done when this option is passed. The purpose is to protect against the stack clash attack (see https://www.qualys.com/2017/06/19/stack-clash/stack-clash.txt). Review: Ulrich Weigand Differential Revision: https://reviews.llvm.org/D78717	2020-06-06 18:38:36 +02:00
Nico Weber	2920348063	Revert "recommit "[HIP] Add default header and include path"" This reverts commit `1fa43e0b34`. Still breaks tests on several bots, see https://reviews.llvm.org/D81176	2020-06-05 21:50:04 -04:00
Yaxun (Sam) Liu	1fa43e0b34	recommit "[HIP] Add default header and include path" recommit `11d06b9511` with fix for lit tests.	2020-06-05 20:41:15 -04:00
stevewan	61cd264068	[PowerPC] Do not special case Darwin on PowerPC in target cpu handling Summary: This patch removes the special handling for Darwin on PowerPC in the default target cpu handling, because Darwin is no longer supported on the PowerPC platform. Reviewers: hubert.reinterpretcast, daltenty Reviewed By: hubert.reinterpretcast Subscribers: wuzish, nemanjai, shchenz, steven.zhang, cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D81115	2020-06-05 19:55:28 -04:00
Yaxun (Sam) Liu	8a8c6913a9	Revert "[HIP] Add default header and include path" This reverts commit `11d06b9511`.	2020-06-05 15:42:57 -04:00
Yaxun (Sam) Liu	11d06b9511	[HIP] Add default header and include path To support std::complex and some other standard C/C++ functions in HIP device code, they need to be forced to be __host__ __device__ functions by pragmas. This is done by some clang standard C++ wrapper headers which are shared between cuda-clang and hip-Clang. For these standard C++ wapper headers to work properly, specific include path order has to be enforced: clang C++ wrapper include path standard C++ include path clang include path Also, these C++ wrapper headers require device version of some standard C/C++ functions must be declared before including them. This needs to be done by including a default header which declares or defines these device functions. The default header is always included before any other headers are included by users. This patch adds the the default header and include path for HIP. Differential Revision: https://reviews.llvm.org/D81176	2020-06-05 12:44:57 -04:00
Petr Hosek	b16ed493dd	[Fuchsia] Rely on linker switch rather than dead code ref for profile runtime Follow the model used on Linux, where the clang driver passes the linker a -u switch to force the profile runtime to be linked in, rather than having every TU emit a dead function with a reference. Differential Revision: https://reviews.llvm.org/D79835	2020-06-04 15:47:05 -07:00
Petr Hosek	e1ab90001a	Revert "[Fuchsia] Rely on linker switch rather than dead code ref for profile runtime" This reverts commit `d510542174` since it broke several bots.	2020-06-04 15:44:10 -07:00
Petr Hosek	d510542174	[Fuchsia] Rely on linker switch rather than dead code ref for profile runtime Follow the model used on Linux, where the clang driver passes the linker a -u switch to force the profile runtime to be linked in, rather than having every TU emit a dead function with a reference. Patch By: mcgrathr Differential Revision: https://reviews.llvm.org/D79835	2020-06-04 14:25:19 -07:00
Dan Gohman	d496437a0b	[WebAssembly] Add support for -mexec-model=reactor This adds a -mexec-model= command-line flag. The default value is "command" which is no change from the current behavior. The other option is "reactor" which enables the WASI Reactor ABI: https://github.com/WebAssembly/WASI/blob/master/design/application-abi.md Differential Revision: https://reviews.llvm.org/D62922	2020-06-03 14:02:47 -07:00
Steven Wan	ba4afe6f7a	[AIX] Change the default target CPU to power4 for AIX on Power Summary: This patch changes the AIX default target CPU to power4 since this is the the lowest arch for the lowest OS level supported. Reviewers: hubert.reinterpretcast, cebowleratibm, daltenty Reviewed By: hubert.reinterpretcast Subscribers: cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D80835	2020-06-03 13:50:26 -04:00
Shengchen Kan	ac47588bc4	[Driver] Add negative option for -fkeep-static-consts	2020-06-03 14:59:14 +08:00
Nick Desaulniers	8eda71616f	[Clang][A32/T32][Linux] -O1 implies -fomit-frame-pointer Summary: An upgrade of LLVM for CrOS [0] containing [1] triggered a bunch of errors related to writing to reserved registers for a Linux kernel's arm64 compat vdso (which is a aarch32 image). After a discussion on LKML [2], it was determined that -f{no-}omit-frame-pointer was not being specified. Comparing GCC and Clang [3], it becomes apparent that GCC defaults to omitting the frame pointer implicitly when optimizations are enabled, and Clang does not. ie. setting -O1 (or above) implies -fomit-frame-pointer. Clang was defaulting to -fno-omit-frame-pointer implicitly unless -fomit-frame-pointer was set explicitly. Why this becomes a problem is that the Linux kernel's arm64 compat vdso contains code that uses r7. r7 is used sometimes for the frame pointer (for example, when targeting thumb (-mthumb)). See useR7AsFramePointer() in llvm/llvm-project/llvm/lib/Target/ARM/ARMSubtarget.h. This is mostly for legacy/compatibility reasons, and the 2019 Q4 revision of the ARM AAPCS looks to standardize r11 as the frame pointer for aarch32, though this is not yet implemented in LLVM. Users that are reliant on the implicit value if unspecified when optimizations are enabled should explicitly choose -fomit-frame-pointer (new behavior) or -fno-omit-frame-pointer (old behavior). [0] https://bugs.chromium.org/p/chromium/issues/detail?id=1084372 [1] https://reviews.llvm.org/D76848 [2] https://lore.kernel.org/lkml/20200526173117.155339-1-ndesaulniers@google.com/ [3] https://godbolt.org/z/0oY39t Reviewers: kristof.beyls, psmith, danalbert, srhines, MaskRay, ostannard, efriedma Reviewed By: psmith, danalbert, srhines, MaskRay, efriedma Subscribers: efriedma, olista01, MaskRay, vhscampos, cfe-commits, llvm-commits, manojgupta, llozano, glider, hctim, eugenis, pcc, peter.smith, srhines Tags: #clang, #llvm Differential Revision: https://reviews.llvm.org/D80828	2020-06-02 15:54:14 -07:00
Sriraman Tallam	e0bca46b08	Options for Basic Block Sections, enabled in D68063 and D73674. This patch adds clang options: -fbasic-block-sections={all,<filename>,labels,none} and -funique-basic-block-section-names. LLVM Support for basic block sections is already enabled. + -fbasic-block-sections={all, <file>, labels, none} : Enables/Disables basic block sections for all or a subset of basic blocks. "labels" only enables basic block symbols. + -funique-basic-block-section-names: Enables unique section names for basic block sections, disabled by default. Differential Revision: https://reviews.llvm.org/D68049	2020-06-02 00:23:32 -07:00
Fangrui Song	92448fd23d	[Driver] Simplify Linux::addProfileRTLibs	2020-05-31 17:15:14 -07:00
Hubert Tong	c15d5d12c6	[Driver] NFC: Use Twine temp to replace std::string local This patch replaces a `std::string` local used for a concatentation with a `Twine` where the string was being passed into call.	2020-05-31 16:38:10 -04:00
Mateusz Mikuła	ab4d02cf26	[clang] [MinGW] Fix libunwind extension Differential Revision: https://reviews.llvm.org/D79995	2020-05-29 15:23:14 +03:00
Shengchen Kan	4954449995	[Driver][X86] Support branch align options with LTO Summary: Before this patch, we use two different ways to pass options to align branch depending on whether LTO is enabled. For example, `-mbranches-within-32B-boundaries` w/o LTO and `-Wl,-plugin-opt=-x86-branches-within-32B-boundaries` w/ LTO. It's inconvenient, so this patch unifies the way: we only need to pass options like `-mbranches-within-32B-boundaries` to align branches, no matter LTO is enabled or not. Differential Revision: https://reviews.llvm.org/D80289	2020-05-28 13:17:14 +08:00
Lei Huang	2368bf52cd	[PowerPC] Add support for -mcpu=pwr10 in both clang and llvm Summary: This patch simply adds support for the new CPU in anticipation of Power10. There isn't really any functionality added so there are no associated test cases at this time. Reviewers: stefanp, nemanjai, amyk, hfinkel, power-llvm-team, #powerpc Reviewed By: stefanp, nemanjai, amyk, #powerpc Subscribers: NeHuang, steven.zhang, hiraditya, llvm-commits, wuzish, shchenz, cfe-commits, kbarton, echristo Tags: #clang, #powerpc, #llvm Differential Revision: https://reviews.llvm.org/D80020	2020-05-27 13:14:25 -05:00
Lei Huang	559845f8fe	Revert "[PowerPC] Add support for -mcpu=pwr10 in both clang and llvm" This reverts commit `7eb666b155`.	2020-05-27 09:40:21 -05:00
Lei Huang	7eb666b155	[PowerPC] Add support for -mcpu=pwr10 in both clang and llvm Summary: This patch simply adds support for the new CPU in anticipation of Power10. There isn't really any functionality added so there are no associated test cases at this time. Reviewers: stefanp, nemanjai, amyk, hfinkel, power-llvm-team, #powerpc Reviewed By: stefanp, nemanjai, amyk, #powerpc Subscribers: NeHuang, steven.zhang, hiraditya, llvm-commits, wuzish, shchenz, cfe-commits, kbarton, echristo Tags: #clang, #powerpc, #llvm Differential Revision: https://reviews.llvm.org/D80020	2020-05-26 13:48:22 -05:00
Fangrui Song	9d55e4ee13	Make explicit -fno-semantic-interposition (in -fpic mode) infer dso_local -fno-semantic-interposition is currently the CC1 default. (The opposite disables some interprocedural optimizations.) However, it does not infer dso_local: on most targets accesses to ExternalLinkage functions/variables defined in the current module still need PLT/GOT. This patch makes explicit -fno-semantic-interposition infer dso_local, so that PLT/GOT can be eliminated if targets implement local aliases for AsmPrinter::getSymbolPreferLocal (currently only x86). Currently we check whether the module flag "SemanticInterposition" is 0. If yes, infer dso_local. In the future, we can infer dso_local unless "SemanticInterposition" is 1: frontends other than clang will also benefit from the optimization if they don't bother setting the flag. (There will be risks if they do want ELF interposition: they need to set "SemanticInterposition" to 1.)	2020-05-25 20:48:18 -07:00
stevewan	5bf2409a4e	[AIX] Add '-bcdtors:all:0:s' to linker to gather static init functions Summary: On AIX, add '-bcdtors:all:0:s' to the linker implicitly through the driver so that we can collect all static constructor and destructor functions. Reviewers: hubert.reinterpretcast, Xiangling_L, ZarkoCA, daltenty Reviewed By: hubert.reinterpretcast Subscribers: cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D80415	2020-05-25 13:44:15 -04:00
Eli Friedman	9292ece995	[clang driver] Spell "--export-dynamic-symbol" with two dashes. This doesn't make a difference for linkers that support the option, but it improves the error message from older linkers that don't support it.	2020-05-23 15:46:28 -07:00
Matt Arsenault	1d96dca949	HIP: Try to deal with more llvm package layouts The various HIP builds are all inconsistent. The default llvm install goes to ${INSTALL_PREFIX}/bin/clang, but the rocm packaging scripts move this under ${INSTALL_PREFIX}/llvm/bin/clang. Some other builds further pollute this with ${INSTALL_PREFIX}/bin/x86_64/clang. These should really be consolidated, but try to handle them for now.	2020-05-23 13:28:24 -04:00
Ben Dunbobbin	9d23b5025d	[PS4] Enable relaxed relocations by default PS4 supports these relocations types; so, we want to enable them by default. Differential Revision: https://reviews.llvm.org/D79980	2020-05-21 23:05:44 +01:00
Dylan McKay	ede6005e70	[AVR] Explicitly set the address of the data section when invoking the linker This is required to get avr-gdb correctly showing values at the right addresses. This problem was discovered by using debug symbols in an external program to lookup values in an AVR simulator.	2020-05-18 02:24:51 +12:00
Fangrui Song	3841ed4104	[Driver] Render -T for Gnu.cpp clang -T a.lds a.c currently does not render -T.	2020-05-16 23:54:31 -07:00
Yvan Roux	0e4827aa4e	[ARM][MachineOutliner] Add Machine Outliner support for ARM. Enables Machine Outlining for ARM and Thumb2 modes. This is the first patch of the series which adds all the basic logic for the support, and only handles tail-calls and thunks. The outliner can be turned on by using clang -moutline option or -mllvm -enable-machine-outliner one (like AArch64). Differential Revision: https://reviews.llvm.org/D76066	2020-05-15 08:44:23 +02:00
Volodymyr Sapsai	6a3469f58d	[ObjC] Add compatibility mode for type checking of qualified id block parameters. Commit `73152a2ec2` fixed type checking for blocks with qualified id parameters. But there are existing APIs in Apple SDKs relying on the old type checking behavior. Specifically, these are APIs using NSItemProviderCompletionHandler in Foundation/NSItemProvider.h. To keep existing code working and to allow developers to use affected APIs introduce a compatibility mode that enables the previous and the fixed type checking. This mode is enabled only on Darwin platforms. Reviewed By: jyknight, ahatanak Differential Revision: https://reviews.llvm.org/D79511	2020-05-14 12:08:19 -07:00
Fangrui Song	5ecb514146	[Driver] Pass -plugin-opt=O2 for -Os -Oz and -plugin-opt=O1 for -Og Fixes PR42445 (compiler driver options -Os -Oz translate to -plugin-opt=Os (Oz) which are not recognized by LLVMgold.so or LLD). The optimization level mapping matches CompilerInvocation.cpp:getOptimizationLevel() and SpeedLevel of PassBuilder::OptimizationLevel::O. -plugin-opt=O affects the way we construct regular LTO/ThinLTO pass manager pipeline. Reviewed By: pcc Differential Revision: https://reviews.llvm.org/D79919	2020-05-14 10:37:33 -07:00
Matt Arsenault	235fb7dc24	AMDGPU/OpenCL: Accept -nostdlib in place of -nogpulib -nogpulib makes sense when there is a host (where -nostdlib would apply) and offload target. Accept nostdlib when there is no offload target as an alias.	2020-05-14 12:33:31 -04:00
Matt Arsenault	14e1845711	HIP: Merge builtin library handling Merge with the new --rocm-path handling used for OpenCL. This looks for a usable set of device libraries upfront, rather than giving a generic "no such file or directory error". If any of the required bitcode libraries are missing, this will now produce a "cannot find ROCm installation." error. This differs from the existing hip specific flags by pointing to a rocm root install instead of a single directory with bitcode files. This tries to maintain compatibility with the existing the --hip-device-lib and --hip-device-lib-path flags, as well as the HIP_DEVICE_LIB_PATH environment variable, or at least the range of uses with testcases. The existing range of uses and behavior doesn't entirely make sense to me, so some of the untested edge cases change behavior. Currently the two path forms seem to have the double purpose of a search path for an arbitrary --hip-device-lib, and for finding the stock set of libraries. Since the stock set of libraries This also changes the behavior when multiple paths are specified, and only takes the last one (and the environment variable only handles a single path). If --hip-device-lib is used, it now only treats --hip-device-lib-path as the search path for it, and does not attempt to find the rocm installation. If not, --hip-device-lib-path and the environment variable are used as the directory to search instead of the rocm root based path. This should also automatically fix handling of the options to use wave64.	2020-05-12 09:50:22 -04:00
Matt Arsenault	123bee602a	AMDGPU: Search for new ROCm bitcode library structure The current install situation is a mess, but I'm working on fixing it. Search for the target layout instead of one of the N options that exist today.	2020-05-12 09:41:07 -04:00
Zola Bridges	379e68a763	[clang][SLH] Add __has_feature(speculative_load_hardening) SLH doesn't support asm goto and is unlikely to ever support it. Users of asm goto need a way to choose whether to use asm goto or fallback to an SLH compatible code path when SLH is enabled. This feature flag will give users this ability. Tested via unit test Reviewed By: mattdr Differential Revision: https://reviews.llvm.org/D79733	2020-05-11 13:37:12 -07:00
Scott Constable	e97a3e5d9d	[X86] Add a Pass that builds a Condensed CFG for Load Value Injection (LVI) Gadgets Adds a new data structure, ImmutableGraph, and uses RDF to find LVI gadgets and add them to a MachineGadgetGraph. More specifically, a new X86 machine pass finds Load Value Injection (LVI) gadgets consisting of a load from memory (i.e., SOURCE), and any operation that may transmit the value loaded from memory over a covert channel, or use the value loaded from memory to determine a branch/call target (i.e., SINK). Also adds a new target feature to X86: +lvi-load-hardening The feature can be added via the clang CLI using -mlvi-hardening. Differential Revision: https://reviews.llvm.org/D75936	2020-05-11 13:08:35 -07:00
Florian Hahn	1065869195	[Matrix] Add matrix type to Clang. This patch adds a matrix type to Clang as described in the draft specification in clang/docs/MatrixSupport.rst. It introduces a new option -fenable-matrix, which can be used to enable the matrix support. The patch adds new MatrixType and DependentSizedMatrixType types along with the plumbing required. Loads of and stores to pointers to matrix values are lowered to memory operations on 1-D IR arrays. After loading, the loaded values are cast to a vector. This ensures matrix values use the alignment of the element type, instead of LLVM's large vector alignment. The operators and builtins described in the draft spec will will be added in follow-up patches. Reviewers: martong, rsmith, Bigcheese, anemet, dexonsmith, rjmccall, aaron.ballman Reviewed By: rjmccall Differential Revision: https://reviews.llvm.org/D72281	2020-05-11 18:55:45 +01:00
Petr Hosek	5b02be0b97	[Clang] Pass -z max-page-size to linker for Fuchsia Currently all Fuchsia ABIs use a 4k page size, departing from the recommended page sizes in the respective psABI documents. Differential Revision: https://reviews.llvm.org/D79667	2020-05-09 13:44:20 -07:00
Petr Hosek	c8fbcb1e78	[Clang] Pass --pack-dyn-relocs=relr to lld for Fuchsia The compact format is fully supported on Fuchsia and is the preferred default. Patch By: mcgrathr Differential Revision: https://reviews.llvm.org/D79665	2020-05-09 13:42:19 -07:00
Fangrui Song	6bf0ad78dc	[Driver] Don't pass -u__llvm_profile_runtime for clang -fprofile-arcs a.o clang --coverage a.o # InstrProfilingRuntime.cpp.o not linked in clang --fprofile-arcs a.o # InstrProfilingRuntime.cpp.o unexpectedly linked in Fix --fprofile-arcs.	2020-05-08 23:36:29 -07:00
Fangrui Song	9a11174287	[Driver] Add -fno-test-coverage	2020-05-08 17:01:53 -07:00
Fangrui Song	0d4a33ba61	[Driver] Don't warn -Wunused-command-line-argument for --coverage -ftest-coverage -fprofile-arcs	2020-05-08 16:31:15 -07:00
Fangrui Song	e1815eb2e1	[Driver] Reorganize --coverage -ftest-coverage -fprofile-arcs related tests And fix a comment about __llvm_profile_runtime	2020-05-08 16:06:33 -07:00
Hubert Tong	b116ded57d	[AIX] Avoid structor alias; die before bad alias codegen Summary: `AsmPrinter::emitGlobalIndirectSymbol` is dependent on `MCStreamer::emitAssignment` to produce `.set` directives for alias symbols; however, the `.set` pseudo-op on AIX is documented as not usable with external relocatable terms or expressions, which limits its applicability in generating alias symbols. Disable generating aliases on AIX until a different implementation strategy is available. Reviewers: cebowleratibm, jasonliu, sfertile, daltenty, DiggerLin Reviewed By: jasonliu Differential Revision: https://reviews.llvm.org/D79044	2020-05-08 16:51:34 -04:00
Sriraman Tallam	e8147ad822	Uniuqe Names for Internal Linkage Symbols. This is a standalone patch and this would help Propeller do a better job of code layout as it can accurately attribute the profiles to the right internal linkage function. This also helps SampledFDO/AutoFDO correctly associate sampled profiles to the right internal function. Currently, if there is more than one internal symbol foo, their profiles are aggregated by SampledFDO. This patch adds a new clang option, -funique-internal-funcnames, to generate unique names for functions with internal linkage. This patch appends the md5 hash of the module name to the function symbol as a best effort to generate a unique name for symbols with internal linkage. Differential Revision: https://reviews.llvm.org/D73307	2020-05-07 18:18:37 -07:00
Calixte Denizet	bec223a9bc	[profile] Don't crash when forking in several threads Summary: When forking in several threads, the counters were written out in using the same global static variables (see GCDAProfiling.c): that leads to crashes. So when there is a fork, the counters are resetted in the child process and they will be dumped at exit using the interprocess file locking. When there is an exec, the counters are written out and in case of failures they're resetted. Reviewers: jfb, vsk, marco-c, serge-sans-paille Reviewed By: marco-c, serge-sans-paille Subscribers: llvm-commits, serge-sans-paille, dmajor, cfe-commits, hiraditya, dexonsmith, #sanitizers, marco-c, sylvestre.ledru Tags: #sanitizers, #clang, #llvm Differential Revision: https://reviews.llvm.org/D78477	2020-05-07 14:13:11 +02:00
Samuel Thibault	c298e5a022	[Gnu toolchain] Move GCC multilib/multiarch paths support from Linux to Gnu Summary: The current code for GNU/Linux is actually completely generic, and can be moved to Gnu, so it can benefit GNU/Hurd and GNU/kFreeBSD Reviewers: kristina, sammccall, lebedev.ri, MaskRay, arsenm, phosek Reviewed By: MaskRay, phosek Subscribers: wdng, ormris, emaste, arichardson, krytarowski, cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D73845	2020-05-01 12:12:27 +02:00
Martin Storsjö	a0e53de472	[clang] [MinGW] Add the compiler rt libdirs to the search path This matches what is done for MSVC in `b8000c0ce8`. Since that commit, compiler rt sanitizer libraries aren't linked to with absolute path on windows, but using their basenames, requiring the libdirs to be passed to the linker. This fixes undefined behaviour sanitizer on MinGW after `b8000c0ce8`. Differential Revision: https://reviews.llvm.org/D79076	2020-04-29 20:35:50 +03:00
Reid Kleckner	b8000c0ce8	[Windows] Autolink with basenames and add libdir to libpath Prior to this change, for a few compiler-rt libraries such as ubsan and the profile library, Clang would embed "-defaultlib:path/to/rt-arch.lib" into the .drective section of every object compiled with -finstr-profile-generate or -fsanitize=ubsan as appropriate. These paths assume that the link step will run from the same working directory as the compile step. There is also evidence that sometimes the paths become absolute, such as when clang is run from a different drive letter from the current working directory. This is fragile, and I'd like to get away from having paths embedded in the object if possible. Long ago it was suggested that we use this for ASan, and apparently I felt the same way back then: https://reviews.llvm.org/D4428#56536 This is also consistent with how all other autolinking usage works for PS4, Mac, and Windows: they all use basenames, not paths. To keep things working for people using the standard GCC driver workflow, the driver now adds the resource directory to the linker library search path when it calls the linker. This is enough to make check-ubsan pass, and seems like a generally good thing. Users that invoke the linker directly (most clang-cl users) will have to add clang's resource library directory to their linker search path in their build system. I'm not sure where I can document this. Ideally I'd also do it in the MSBuild files, but I can't figure out where they go. I'd like to start with this for now. Reviewed By: hans Differential Revision: https://reviews.llvm.org/D65543	2020-04-28 11:36:21 -07:00
Francis Visoiu Mistrih	e770153865	[AArch64] Add support for -ffixed-x30 Add support for reserving LR in: * the driver through `-ffixed-x30` * cc1 through `-target-feature +reserve-x30` * the backend through `-mattr=+reserve-x30` * a subtarget feature `reserve-x30` the same way we're doing for the other registers.	2020-04-28 08:48:28 -07:00
Samuel Thibault	6395980370	[Driver] Move GCC multilib/multiarch paths support from Linux.cpp to Gnu.cpp The current code for GNU/Linux is actually completely generic, and can be moved to ToolChains/Gnu.cpp, so that it can benefit GNU/Hurd and GNU/kFreeBSD. Reviewed By: MaskRay, phosek Differential Revision: https://reviews.llvm.org/D73845	2020-04-24 09:33:19 -07:00
Luke Geeson	740a1dd050	[ARM] Armv8.6-a Matrix Mul cmd line support This patch upstreams support for the Armv8.6-a Matrix Multiplication Extension. A summary of the features can be found here: https://community.arm.com/developer/ip-products/processors/b/processors-ip-blog/posts/arm-architecture-developments-armv8-6-a This patch includes: - Command line options to enable these features with +i8mm, +f32mm, or f64mm Note: +f32mm and +f64mm are optional and so are not enabled by default This is part of a patch series, starting with BFloat16 support and the other components in the armv8.6a extension (in previous patches linked in phabricator) Based on work by: - Luke Geeson - Oliver Stannard - Luke Cheeseman Reviewers: t.p.northover, DavidSpickett Reviewed By: DavidSpickett Subscribers: DavidSpickett, ostannard, kristof.beyls, danielkiss, cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D77875	2020-04-24 15:54:06 +01:00
Yaxun (Sam) Liu	2da89df4e8	Fix minor bug in CommonArgs.cpp Change-Id: Ibe87b1633cc7516479bb08bf51b6860a1585a94f	2020-04-24 08:46:33 -04:00
Justin Hibbits	4ca2cad947	[PowerPC] Add clang -msvr4-struct-return for 32-bit ELF Summary: Change the default ABI to be compatible with GCC. For 32-bit ELF targets other than Linux, Clang now returns small structs in registers r3/r4. This affects FreeBSD, NetBSD, OpenBSD. There is no change for 32-bit Linux, where Clang continues to return all structs in memory. Add clang options -maix-struct-return (to return structs in memory) and -msvr4-struct-return (to return structs in registers) to be compatible with gcc. These options are only for PPC32; reject them on PPC64 and other targets. The options are like -fpcc-struct-return and -freg-struct-return for X86_32, and use similar code. To actually return a struct in registers, coerce it to an integer of the same size. LLVM may optimize the code to remove unnecessary accesses to memory, and will return i32 in r3 or i64 in r3:r4. Fixes PR#40736 Patch by George Koehler! Reviewed By: jhibbits, nemanjai Differential Revision: https://reviews.llvm.org/D73290	2020-04-21 20:17:25 -05:00
Stephan Dollberg	7b9c6c16c3	Also look for devtoolset-9 gcc toolchain devtoolset-9 has been out for a while so also look for it. Differential Revision: https://reviews.llvm.org/D77420	2020-04-16 11:17:39 +02:00
Matt Arsenault	3a61245050	clang/AMDGPU: Assume denormals are enabled for the default target. Since the default logic was based on having fast denormal/fma features, and the default target has no features, we assumed flushing by default. This fixes incorrectly assuming flushing in builds for "generic" IR libraries. The handling for no specified --cuda-gpu-arch in HIP is kind of broken. Somewhere else forces a default target of gfx803, which does not enable denormal handling by default. We don't see this default switching here, so you'll end up with a different denormal mode depending on whether you explicitly requested gfx803, or used it by default.	2020-04-15 09:17:26 -04:00
Matt Arsenault	dc89a3efb4	HIP: Fix handling of denormal mode I didn't realize HIP was a distinct offloading kind, so the subtarget was looking for -march, which isn't correct for HIP. We also have the possibility of different denormal defaults in the case of multiple offload targets, so we need to thread the JobAction through the target hook.	2020-04-13 11:48:45 -07:00
Matt Arsenault	4593e4131a	AMDGPU: Teach toolchain to link rocm device libs Currently the library is separately linked, but this isn't correct to implement fast math flags correctly. Each module should get the version of the library appropriate for its combination of fast math and related flags, with the attributes propagated into its functions and internalized. HIP already maintains the list of libraries, but this is not used for OpenCL. Unfortunately, HIP uses a separate --hip-device-lib argument, despite both languages using the same bitcode library. Eventually these two searches need to be merged. An additional problem is there are 3 different locations the libraries are installed, depending on which build is used. This also needs to be consolidated (or at least the search logic needs to deal with this unnecessary complexity).	2020-04-10 13:37:32 -04:00
Simon Cook	dd1ee6dc07	[RISCV] Support experimental/unratified extensions This adds support for enabling experimental/unratified RISC-V ISA extensions in the -march string in the case where an explicit version number has been declared, and the -menable-experimental-extensions flag has been provided. This follows the design as discussed on the mailing lists in the following RFC: http://lists.llvm.org/pipermail/llvm-dev/2020-January/138364.html Since the RISC-V toolchain definition currently rejects any extension with an explicit version number, the parsing logic has been tweaked to support this, and to allow standard extensions to have their versions checked in future patches. The bitmanip 'b' extension has been added as a first use of this support, it should easily extend to other as yet unratified extensions (such as the vector 'v' extension). Differential Revision: https://reviews.llvm.org/D73891	2020-04-09 18:04:22 +01:00
Shengchen Kan	792b10978d	[Driver][X86] Add -mpad-max-prefix-size Summary: The option `-mpad-max-prefix-size` performs some checking and delegate to MC option `-x86-pad-max-prefix-size`. This option is designed for eliminate NOPs when we need to align something by adding redundant prefixes to instructions, e.g. it can be used along with `-malign-branch`, `-malign-branch-boundary` to prefix padding branch. It has similar (but slightly different) effect as GAS's option `-malign-branch-prefix-size`, e.g. `-mpad-max-prefix-size` can also elminate NOPs emitted by align directive, so we use a different name here. I remove the option `-malign-branch-prefix-size` since is unimplemented and not needed. If we need to be compatible with GAS, we can make `-malign-branch-prefix-size` an alias for this option later. Reviewers: jyknight, reames, MaskRay, craig.topper, LuoYuanke Reviewed By: MaskRay, LuoYuanke Subscribers: annita.zhang, cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D77628	2020-04-09 19:34:12 +08:00
Fangrui Song	969b91af73	[Driver] Default arm-linux-androideabi to -z max-page-size=4096 Similar to D55029. The requirement arises when discussing increasing default max-page-size for lld ARM (D77330). For the record, the default max-page-size on the 3 commonly used linkers: * GNU ld since 2014 (https://sourceware.org/git/?p=binutils-gdb.git;a=commit;h=7572ca8989ead4c3425a1500bc241eaaeffa2c89) defaults to 65536 * GNU gold remains 4096 * lld<=10 uses 4096. lld from 11 onwards will use 65536 (D77330) Reviewed By: srhines, thieta Differential Revision: https://reviews.llvm.org/D77746	2020-04-08 12:05:28 -07:00
Artem Belevich	a9627b7ea7	[CUDA] Add partial support for recent CUDA versions. Generate PTX using newer versions of PTX and allow using sm_80 with CUDA-11. None of the new features of CUDA-10.2+ have been implemented yet, so using these versions will still produce a warning. Differential Revision: https://reviews.llvm.org/D77670	2020-04-08 11:19:44 -07:00
Artem Belevich	33386b20aa	[CUDA] Simplify GPU variant handling. NFC. Instead of hardcoding individual GPU mappings in multiple functions, keep them all in one table and use it to look up the mappings. We also don't care about 'virtual' architecture much, so the API is trimmed down down to a simpler GPU->Virtual arch name lookup. Differential Revision: https://reviews.llvm.org/D77665	2020-04-08 11:19:43 -07:00
Johannes Doerfert	f85ae058f5	[OpenMP] Provide math functions in OpenMP device code via OpenMP variants For OpenMP target regions to piggy back on the CUDA/AMDGPU/... implementation of math functions, we include the appropriate definitions inside of an `omp begin/end declare variant match(device={arch(nvptx)})` scope. This way, the vendor specific math functions will become specialized versions of the system math functions. When a system math function is called and specialized version is available the selection logic introduced in D75779 instead call the specialized version. In contrast to the code path we used so far, the system header is actually included. This means functions without specialized versions are available and so are macro definitions. This should address PR42061, PR42798, and PR42799. Reviewed By: ye-luo Differential Revision: https://reviews.llvm.org/D75788	2020-04-07 23:33:24 -05:00
Francis Visoiu Mistrih	9e6670b03c	[Driver] Only pass LTO remark arguments if the driver asks for it Previous fix missed a check to willEmitRemarks, causing remarks to always be enabled for LTO.	2020-04-07 14:11:47 -07:00
Sid Manning	aed2fdb167	[Hexagon] Update paths for linux/musl Update the sysroot expectation to match other targets and breakout linux/musl toolchain tests into a new file. Differential Revision: https://reviews.llvm.org/D77440	2020-04-07 13:45:52 -05:00
Michael Liao	c97be2c377	[hip] Remove `hip_pinned_shadow`. Summary: - Use `device_builtin_surface` and `device_builtin_texture` for surface/texture reference support. So far, both the host and device use the same reference type, which could be revised later when interface/implementation is stablized. Reviewers: yaxunl Subscribers: cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D77583	2020-04-07 09:51:49 -04:00
Sid Manning	2c5d6dfda9	[Hexagon] Make lld be the default linker for linux/musl When the target is hexagon-unknown-linux-musl select lld as the default linker. Differential Revision: https://reviews.llvm.org/D77498	2020-04-06 12:59:07 -05:00
Craig Topper	1d42c0db9a	Revert "[X86] Add a Pass that builds a Condensed CFG for Load Value Injection (LVI) Gadgets" This reverts commit `c74dd640fd`. Reverting to address coding standard issues raised in post-commit review.	2020-04-03 16:56:08 -07:00
Francis Visoiu Mistrih	ba8b3052b5	[Driver] Handle all optimization-record options for Darwin LTO clang with -flto does not handle -foptimization-record-path=<path> This dulicates the code from ToolChains/Clang.cpp with modifications to support everything in the same fashion.	2020-04-03 15:30:08 -07:00
Scott Constable	c74dd640fd	[X86] Add a Pass that builds a Condensed CFG for Load Value Injection (LVI) Gadgets Adds a new data structure, ImmutableGraph, and uses RDF to find LVI gadgets and add them to a MachineGadgetGraph. More specifically, a new X86 machine pass finds Load Value Injection (LVI) gadgets consisting of a load from memory (i.e., SOURCE), and any operation that may transmit the value loaded from memory over a covert channel, or use the value loaded from memory to determine a branch/call target (i.e., SINK). Also adds a new target feature to X86: +lvi-load-hardening The feature can be added via the clang CLI using -mlvi-hardening. Differential Revision: https://reviews.llvm.org/D75936	2020-04-03 13:02:04 -07:00
Scott Constable	5b519cf1fc	[X86] Add Indirect Thunk Support to X86 to mitigate Load Value Injection (LVI) This pass replaces each indirect call/jump with a direct call to a thunk that looks like: lfence jmpq *%r11 This ensures that if the value in register %r11 was loaded from memory, then the value in %r11 is (architecturally) correct prior to the jump. Also adds a new target feature to X86: +lvi-cfi ("cfi" meaning control-flow integrity) The feature can be added via clang CLI using -mlvi-cfi. This is an alternate implementation to https://reviews.llvm.org/D75934 That merges the thunk insertion functionality with the existing X86 retpoline code. Differential Revision: https://reviews.llvm.org/D76812	2020-04-03 00:34:39 -07:00
Matt Arsenault	4ea3650c21	HIP: Link correct denormal mode library This wasn't respecting the flush mode based on the default, and also wasn't correctly handling the explicit -fno-cuda-flush-denormals-to-zero overriding the mode.	2020-04-01 12:36:22 -04:00
Matt Arsenault	6593360ee7	AMDGPU: Fix consistently backwards logic for default denormal mode I forgot to squash this into `c9d65a48af`	2020-04-01 12:36:22 -04:00
Puyan Lotfi	e3033c0ce5	[llvm][clang][IFS] Enhancing the llvm-ifs yaml format for symbol lists. Prior to this change the clang interface stubs format resembled something ending with a symbol list like this: Symbols: a: { Type: Func } This was problematic because we didn't actually want a map format and also because we didn't like that an empty symbol list required "Symbols: {}". That is to say without the empty {} llvm-ifs would crash on an empty list. With this new format it is much more clear which field is the symbol name, and instead the [] that is used to express an empty symbol vector is optional, ie: Symbols: - { Name: a, Type: Func } or Symbols: [] or Symbols: This further diverges the format from existing llvm-elftapi. This is a good thing because although the format originally came from the same place, they are not the same in any way. Differential Revision: https://reviews.llvm.org/D76979	2020-04-01 10:49:06 -04:00
Fangrui Song	531b3aff30	[Frontend] Replace CC1 option -masm-verbose with -fno-verbose-asm Most OS✕target enable -fverbose-asm, so it makes sense to flip the CC1 option to reduce common command lines.	2020-03-31 22:33:55 -07:00
Fangrui Song	d0d076fed9	[Driver] Flip the CC1 default of -fdiagnostics-show-option The driver enables -fdiagnostics-show-option by default, so flip the CC1 default to reduce the lengths of common CC1 command lines. This change also makes ParseDiagnosticArgs() consistently enable -fdiagnostics-show-option by default.	2020-03-31 21:59:27 -07:00
Fangrui Song	3341dc7339	[Driver] Don't pass -fobjc-rumtime= for non-ObjC input	2020-03-31 17:50:37 -07:00
Fangrui Song	4805901930	[Driver] Don't pass -fmessage-length=0 to CC1 -fmessage-length=0 is common (unless the environment variable COLUMNS is set and exported. This simplifies a common CC1 command line.	2020-03-31 17:12:08 -07:00
Matt Arsenault	175e42303b	AMDGPU: Make HIPToolChain a subclass of AMDGPUToolChain This fixes some code duplication. This is also a step towards consolidating builtin library handling.	2020-03-31 18:22:46 -04:00
Matt Arsenault	c9d65a48af	HIP: Ensure new denormal mode attributes are set Apparently HIPToolChain does not subclass from AMDGPUToolChain, so this was not applying the new denormal attributes. I'm not sure why this doesn't subclass. Just copy the implementation for now.	2020-03-31 18:00:37 -04:00
Amara Emerson	7f1ea924c6	Add a new -fglobal-isel option and make -fexperimental-isel an alias for it. Since GlobalISel is maturing and is already on at -O0 for AArch64, it's not completely "experimental". Create a more appropriate driver flag and make the older option an alias for it. Differential Revision: https://reviews.llvm.org/D77103	2020-03-31 12:06:11 -07:00
Benjamin Kramer	e8f13f4f62	Replace std::string::find == 0 with StringRef::startswith This is both more readable and faster. Found by clang-tidy's abseil-string-find-startswith.	2020-03-31 21:01:09 +02:00
Nico Weber	c506adcdf2	Move CLANG_SYSTEMZ_DEFAULT_ARCH to config.h. Instead of using a global define; see comments on D75914. While here, port `9c9d88d8b1` to the GN build.	2020-03-30 14:16:17 -04:00
Florian Hahn	7899a111ea	Revert "[Darwin] Respect -fno-unroll-loops during LTO." As per post-commit comment at https://reviews.llvm.org/D76916, this should better be done at the TU level. This reverts commit `9ce198d6ed`.	2020-03-30 15:20:30 +01:00
Ulrich Weigand	9c9d88d8b1	[SystemZ] Allow configuring default CLANG_SYSTEMZ_ARCH On Ubuntu, we want to raise default CLANG_SYSTEMZ_ARCH to z13, thus allow configuring this via CMake. On Debian, we want to raise it to z196. Author: Dimitri John Ledkov Differential Revision: https://reviews.llvm.org/D75914	2020-03-30 14:20:48 +02:00
Florian Hahn	9ce198d6ed	[Darwin] Respect -fno-unroll-loops during LTO. Currently -fno-unroll-loops is ignored when doing LTO on Darwin. This patch adds a new -lto-no-unroll-loops option to the LTO code generator and forwards it to the linker if -fno-unroll-loops is passed. Reviewers: thegameg, steven_wu Reviewed By: thegameg Differential Revision: https://reviews.llvm.org/D76916	2020-03-27 22:19:03 +00:00
Paul Robinson	ee7510dc86	Fix a Diag call not to assume option spelling	2020-03-27 12:12:44 -07:00
Alexandre Ganea	09158252f7	[ThinLTO] Allow usage of all hardware threads in the system Before this patch, it wasn't possible to extend the ThinLTO threads to all SMT/CMT threads in the system. Only one thread per core was allowed, instructed by usage of llvm::heavyweight_hardware_concurrency() in the ThinLTO code. Any number passed to the LLD flag /opt:lldltojobs=..., or any other ThinLTO-specific flag, was previously interpreted in the context of llvm::heavyweight_hardware_concurrency(), which means SMT disabled. One can now say in LLD: /opt:lldltojobs=0 -- Use one std::thread / hardware core in the system (no SMT). Default value if flag not specified. /opt:lldltojobs=N -- Limit usage to N threads, regardless of usage of heavyweight_hardware_concurrency(). /opt:lldltojobs=all -- Use all hardware threads in the system. Equivalent to /opt:lldltojobs=$(nproc) on Linux and /opt:lldltojobs=%NUMBER_OF_PROCESSORS% on Windows. When an affinity mask is set for the process, threads will be created only for the cores selected by the mask. When N > number-of-hardware-threads-in-the-system, the threads in the thread pool will be dispatched equally on all CPU sockets (tested only on Windows). When N <= number-of-hardware-threads-on-a-CPU-socket, the threads will remain on the CPU socket where the process started (only on Windows). Differential Revision: https://reviews.llvm.org/D75153	2020-03-27 10:20:58 -04:00
Douglas Yung	5db37f3bca	Make PS4 use -fno-use-init-array only as the ABI does not support .init_array. Reviewed by Paul Robinson	2020-03-26 15:45:40 -07:00
Yaxun (Sam) Liu	2ae25647d1	[CUDA][HIP] Add -Xarch_device and -Xarch_host options The argument after -Xarch_device will be added to the arguments for CUDA/HIP device compilation and will be removed for host compilation. The argument after -Xarch_host will be added to the arguments for CUDA/HIP host compilation and will be removed for device compilation. Differential Revision: https://reviews.llvm.org/D76520	2020-03-24 10:13:05 -04:00
Momchil Velikov	080d046c91	[ARM][CMSE] Implement CMSE attributes This patch adds CMSE attributes `cmse_nonsecure_call` and `cmse_nonsecure_entry`. As usual, specification is available here: https://developer.arm.com/docs/ecm0359818/latest Patch by Javed Absar, Bradley Smith, David Green, Momchil Velikov, possibly others. Differential Revision: https://reviews.llvm.org/D71129	2020-03-24 10:21:26 +00:00
Yaxun (Sam) Liu	78957bab55	[NFC] Refactor handling of Xarch option Extract common code to a function. To prepare for adding an option for CUDA/HIP host and device only option. Differential Revision: https://reviews.llvm.org/D76455	2020-03-22 14:42:09 -04:00
Adrian Prantl	0e916bf9f5	Driver: Improve performance of getSDKName() The ".sdk" component is usually the last one in the -isysroot, so it makes more sense to scan from the back. Also, technically, someone could install Xcode into a directory ending with .sdk, which would break this heuristic. Differential Revision: https://reviews.llvm.org/D76097	2020-03-20 16:41:59 -07:00
Shiva Chen	fc3752665f	[RISCV] Passing small data limitation value to RISCV backend Passing small data limit to RISCVELFTargetObjectFile by module flag, So the backend can set small data section threshold by the value. The data will be put into the small data section if the data smaller than the threshold. Differential Revision: https://reviews.llvm.org/D57497	2020-03-20 11:03:51 +08:00
Yaxun (Sam) Liu	6f79f80e6e	[HIP] Fix duplicate clang -cc1 options on MSVC toolchain HIPToolChain::TranslateArgs call TranslateArgs of host toolchain with the input args to get a list of derived args called DAL, then go through the input args by itself and append them to DAL. This assumes that the host toolchain should not append any unchanged args to DAL, otherwise there will be duplicates since HIPToolChain will append it again. This works for GNU toolchain since it returns an empty list for DAL. However, MSVC toolchain will append unchanged args to DAL, which causes duplicate args. This patch let MSVC toolchain not append unchanged args for HIP offloading kind, which fixes this issue. Differential Revision: https://reviews.llvm.org/D76032	2020-03-18 14:48:04 -04:00
Ayke van Laethem	4add249205	[AVR] Add support for the -mdouble=x flag This flag is used by avr-gcc (starting with v10) to set the width of the double type. The double type is by default interpreted as a 32-bit floating point number in avr-gcc instead of a 64-bit floating point number as is common on other architectures. Starting with GCC 10, a new option has been added to control this behavior: https://gcc.gnu.org/wiki/avr-gcc#Deviations_from_the_Standard This commit keeps the default double at 32 bits but adds support for the -mdouble flag (-mdouble=32 and -mdouble=64) to control this behavior. Differential Revision: https://reviews.llvm.org/D76181	2020-03-17 13:21:03 +01:00
Dmitry Mikulin	fbb23c9714	Fix profiling options on PS4 target: - libclang_rt.profile should be added when -fcs-profile-generate is on thecommand line. - OPT_fno_profile_instr_generate was used as a negative for OPT_fprofile_generate. Fix it to use OPT_fno_profile_generate. Differential Revision: https://reviews.llvm.org/D75274	2020-03-16 16:52:47 -07:00
Sid Manning	6174fddbe3	[Hexagon] Enable init_arrays when target is linux-musl Differential Revision: https://reviews.llvm.org/D76079	2020-03-12 15:00:15 -05:00
Artem Belevich	eb2ba2ea95	[CUDA] Warn about unsupported CUDA SDK version only if it's used. This fixes an issue with clang issuing a warning about unknown CUDA SDK if it's detected during non-CUDA compilation. Differential Revision: https://reviews.llvm.org/D76030	2020-03-12 10:04:10 -07:00
Reid Kleckner	213aea4c58	Remove unused Endian.h includes, NFC Mainly avoids including Host.h everywhere: $ diff -u <(sort thedeps-before.txt) <(sort thedeps-after.txt) \ \| grep '^[-+] ' \| sort \| uniq -c \| sort -nr 3141 - /usr/local/google/home/rnk/llvm-project/llvm/include/llvm/Support/Host.h	2020-03-11 15:45:34 -07:00
Artem Belevich	0c06a389e5	[CUDA,clang-cl] Filter out unsupported arguments for device-side compilation. Device-side compilation does not support some features and we need to filter them out when command line options enable them for the host. We're already doing this in various places in the regular clang driver, but clang-cl mode constructs cc1 options independently and needs to implement the filtering, too. Differential Revision: https://reviews.llvm.org/D75310	2020-03-11 13:42:16 -07:00
Sjoerd Meijer	3d9a0445cc	Recommit #2 "[Driver] Default to -fno-common for all targets" After a first attempt to fix the test-suite failures, my first recommit caused the same failures again. I had updated CMakeList.txt files of tests that needed -fcommon, but it turns out that there are also Makefiles which are used by some bots, so I've updated these Makefiles now too. See the original commit message for more details on this change: `0a9fc9233e`	2020-03-09 19:57:03 +00:00

1 2 3 4 5 ...

1505 Commits