llvm-project

Commit Graph

Author	SHA1	Message	Date
Yaxun (Sam) Liu	ed398c3ca4	[NFC] Extract unifyTargetFeatures Differential Revision: https://reviews.llvm.org/D82579	2020-06-25 23:17:08 -04:00
Wouter van Oortmerssen	b9a539c010	[WebAssembly] Adding 64-bit versions of __stack_pointer and other globals We have 6 globals, all of which except for __table_base are 64-bit under wasm64. Differential Revision: https://reviews.llvm.org/D82130	2020-06-25 15:52:44 -07:00
Alexey Bataev	32ea3397be	[OPENMP]Dynamic globalization for parallel target regions. Summary: Added support for dynamic memory allocation for globalized variables in case if execution of target regions in parallel is required. Reviewers: jdoerfert Subscribers: jholewinski, yaxunl, guansong, sstefan1, cfe-commits, caomhin Tags: #clang Differential Revision: https://reviews.llvm.org/D82324	2020-06-25 08:25:24 -04:00
David Spickett	d6efc98116	Reland "[clang][Driver] Correct tool search path priority" This reverts commit `f570d58104`. The test was failing on MacOS if you set LLVM_DEFAULT_TARGET_TRIPLE. For example if you set it to "x86_64-apple-darwin" clang actually uses "x86_64-apple-darwin<version>". To fix this get default triple from clang itself during the test instead of substituting it in via lit.	2020-06-25 09:33:43 +01:00
Kazushi (Jam) Marukawa	96d4ccf00c	[VE] Clang toolchain for VE Summary: This patch enables compilation of C code for the VE target with Clang. Differential Revision: https://reviews.llvm.org/D79411	2020-06-24 10:12:09 +02:00
Alex Lorenz	565603cc94	[clang][driver] set macOS as the target OS for -arch arm64 when clang is running on an Apple Silicon mac This change allows users to use `-arch arm64` to build for mac when running it on Apple Silicon mac without explicit `-target` option. Differential Revision: https://reviews.llvm.org/D82428	2020-06-23 21:08:11 -07:00
Alex Lorenz	f724ce0d73	[clang][driver] allow macOS 11 OS version in the driver	2020-06-23 15:14:26 -07:00
Dylan McKay	15b9dc46c6	Revert "[AVR] Explicitly set the address of the data section when invoking the linker" This reverts commit `ede6005e70`. Ayke suggests this value varies chip-by-chip, and thus it is not safe to hardcode to 0x800100. Proper logic for this linker parameter will have to be wired up in a follow up patch.	2020-06-23 22:23:05 +12:00
Aaron En Ye Shi	4bafb0adcf	Add Statically Linked Libraries Add GNU Static Lib Tool, which supports the --emit-static-lib flag. For HIP, a static library archive will be created and consist of HIP Fat Binary host object with the device images embedded. Using llvm-ar to create the static archive. Also, delete existing output file to ensure a new archive is created each time. Reviewers: yaxunl, tra, rjmccall, echristo Subscribers: echristo, JonChesterfield, scchan, msearles Differential Revision: https://reviews.llvm.org/D78759	2020-06-22 19:48:49 +00:00
Aaron En Ye Shi	77df5a8283	[HIP] Move HIP Linking Logic into HIP ToolChain This patch is a follow up on https://reviews.llvm.org/D78759. Extract the HIP Linker script from generic GNU linker, and move it into HIP ToolChain. Update OffloadActionBuilder Link actions feature to apply device linking and host linking actions separately. Using MC Directives, embed the device images and define symbols. Reviewers: JonChesterfield, yaxunl Subscribers: tra, echristo, jdoerfert, msearles, scchan Differential Revision: https://reviews.llvm.org/D81963	2020-06-22 19:48:48 +00:00
David Spickett	f570d58104	Revert "[clang][Driver] Correct tool search path priority" Revert `028571d608` to investigate MacOS failure. (also the review link was incorrect, should be https://reviews.llvm.org/D79842)	2020-06-22 14:18:54 +01:00
Yaxun (Sam) Liu	2580635bd2	Let HIP default include respect -nogpuinc and -nogpulib	2020-06-22 08:39:12 -04:00
David Spickett	028571d608	[clang][Driver] Correct tool search path priority Summary: As seen in: https://bugs.llvm.org/show_bug.cgi?id=45693 When clang looks for a tool it has a set of possible names for it, in priority order. Previously it would look for these names in the program path. Then look for all the names in the PATH. This means that aarch64-none-elf-gcc on the PATH would lose to gcc in the program path. (which was /usr/bin in the bug's case) This changes that logic to search each name in both possible locations, then move to the next name. Which is more what you would expect to happen when using a non default triple. (-B prefixes maybe should follow this logic too, but are not changed in this patch) Subscribers: kristof.beyls, cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D79988	2020-06-22 09:41:13 +01:00
Luboš Luňák	a45f713c67	add option to instantiate templates already in the PCH Add -fpch-instantiate-templates which makes template instantiations be performed already in the PCH instead of it being done in every single file that uses the PCH (but every single file will still do it as well in order to handle its own instantiations). I can see 20-30% build time saved with the few tests I've tried. The change may reorder compiler output and also generated code, but should be generally safe and produce functionally identical code. There are some rare cases that do not compile with it, such as test/PCH/pch-instantiate-templates-forward-decl.cpp. If template instantiation bailed out instead of reporting the error, these instantiations could even be postponed, which would make them work. Enable this by default for clang-cl. MSVC creates PCHs by compiling them using an empty .cpp file, which means templates are instantiated while building the PCH and so the .h needs to be self-contained, making test/PCH/pch-instantiate-templates-forward-decl.cpp to fail with MSVC anyway. So the option being enabled for clang-cl matches this. Differential Revision: https://reviews.llvm.org/D69585	2020-06-21 17:05:52 +02:00
Fangrui Song	2a4317bfb3	[SanitizeCoverage] Rename -fsanitize-coverage-{white,black}list to -fsanitize-coverage-{allow,block}list Keep deprecated -fsanitize-coverage-{white,black}list as aliases for compatibility for now. Reviewed By: echristo Differential Revision: https://reviews.llvm.org/D82244	2020-06-19 22:22:47 -07:00
Xiangling Liao	3f2e61c1fe	[AIX] Default AIX to using -fno-use-cxa-atexit On AIX, we use __atexit to register dtor functions rather than __cxa_atexit. So a driver change is needed to default AIX to using -fno-use-cxa-atexit. Windows platform does not uses __cxa_atexit either. Following its precedent, we remove the assertion for when -fuse-cxa-atexit is specified by the user, do not produce a message and silently default to -fno-use-cxa-atexit behavior. Differential Revision: https://reviews.llvm.org/D82136	2020-06-19 08:27:07 -04:00
Kristof Beyls	c113b59ef5	[AArch64] Add clang command line support for -mharden-sls= The accepted options to -mharden-sls= are: * all: enable all mitigations against Straight Line Speculation that are implemented. * none: disable all mitigations against Straight Line Speculation. * retbr: enable the mitigation against Straight Line Speculation for RET and BR instructions. * blr: enable the mitigation against Straight Line Speculation for BLR instructions. Differential Revision: https://reviews.llvm.org/D81404	2020-06-19 07:31:48 +01:00
Dylan McKay	01741d6dbe	[AVR] Remove duplicate specification of lib directory Reviewers: dylanmckay Reviewed By: dylanmckay Subscribers: Jim, cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D77334 This was originally committed in `03b0831144` but I missed the commit attribution. Patch by Dennis van der Schagt.	2020-06-19 17:39:18 +12:00
Dylan McKay	aeaa09ec10	Revert "[AVR] Remove duplicate specification of lib directory" This reverts commit `03b0831144`. I forgot to attribute the commit originally so I am recommitting in a subsequent patch.	2020-06-19 17:37:15 +12:00
Dylan McKay	03b0831144	[AVR] Remove duplicate specification of lib directory Reviewers: dylanmckay Reviewed By: dylanmckay Subscribers: Jim, cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D77334	2020-06-19 17:35:09 +12:00
Yaxun (Sam) Liu	c830d517b4	[HIP] Enable -amdgpu-internalize-symbols Enable -amdgpu-internalize-symbols to eliminate unused functions and global variables for whole program to speed up compilation and improve performance. For -fno-gpu-rdc, -amdgpu-internalize-symbols is passed to clang -cc1. For -fgpu-rdc, -amdgpu-internalize-symbols is passed to lld. Differential Revision: https://reviews.llvm.org/D81959	2020-06-18 16:34:37 -04:00
Alexandre Ganea	a45409d885	[Clang] Move clang::Job::printArg to llvm::sys::printArg. NFCI. This patch is to support/simplify https://reviews.llvm.org/D80833	2020-06-18 09:17:13 -04:00
Yaxun (Sam) Liu	92d8ad02e9	[HIP] Fix rocm not found on rocm3.5 Currently rocm detector expects device library bitcodes named as .bc instead of .amdgcn.bc. However in rocm3.5 the device library bitcodes are named as *.amdgcn.bc, which causes rocm3.5 not detected. This patch fixes that. Differential Revision: https://reviews.llvm.org/D81713	2020-06-18 08:40:09 -04:00
Ryan Prichard	6c4ce20226	[Driver] Search computed sysroot for libc++ header paths Summary: The Android NDK's clang driver is used with an Android -target setting, and the driver automatically finds the Android sysroot at a path relative to the driver. The sysroot has the libc++ headers in it. Remove Hurd::computeSysRoot as it is equivalent to the new ToolChain::computeSysRoot method. Fixes PR46213. Reviewers: srhines, danalbert, #libc, kristina Reviewed By: srhines, danalbert Subscribers: ldionne, sthibaul, asb, rbar, johnrusso, simoncook, sabuasal, niosHD, jrtc27, MaskRay, zzheng, edward-jones, rogfer01, MartinMosbeck, brucehoult, the_o, PkmX, jocewei, Jim, lenary, s.egerton, pzheng, sameer.abuasal, apazos, luismarques, cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D81622	2020-06-17 16:17:37 -07:00
Fangrui Song	f39000b450	[Driver] Delete CC1 -fxray-function-index and clean up some tests	2020-06-17 13:00:22 -07:00
Ian Levesque	7c7c8e0da4	[xray] Option to omit the function index Summary: Add a flag to omit the xray_fn_idx to cut size overhead and relocations roughly in half at the cost of reduced performance for single function patching. Minor additions to compiler-rt support per-function patching without the index. Reviewers: dberris, MaskRay, johnislarry Subscribers: hiraditya, arphaman, cfe-commits, #sanitizers, llvm-commits Tags: #clang, #sanitizers, #llvm Differential Revision: https://reviews.llvm.org/D81995	2020-06-17 13:49:01 -04:00
Yuanfang Chen	2956cc50f3	[Clang][Driver] Remove gold linker support for PS4 toolchain Reviewers: probinson Subscribers: cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D81970	2020-06-17 09:45:14 -07:00
Martin Storsjö	e3fd9dc973	[clang] Enable -mms-bitfields by default for mingw targets This matches GCC, which enabled -mms-bitfields by default for mingw targets in 4.7 [1]. [1] https://www.gnu.org/software/gcc/gcc-4.7/changes.html Differential Revision: https://reviews.llvm.org/D81795	2020-06-17 09:37:07 +03:00
Martin Storsjö	beeed368b6	[clang] [MinGW] Link kernel32 once after the last instance of msvcrt The msvcrt library isn't a pure import library; it does contain regular object files with wrappers/fallbacks, and these can require linking against kernel32. This only makes a difference when linking with ld.bfd, as lld always searches all static libraries. This matches a similar change made recently in gcc in https://gcc.gnu.org/git/?p=gcc.git;a=commitdiff;h=850533ab160ef40eccfd039e1e3b138cf26e76b8, although clang adds --start-group --end-group around these libraries if -static is specified, which gcc doesn't. But try to match gcc's linking order in any case, for consistency. Differential Revision: https://reviews.llvm.org/D80880	2020-06-17 09:37:07 +03:00
Yuanfang Chen	8d4a806ef0	Revert "remove gold linker" This reverts commit `719c87edc5`. Checked in by accident. Sorry.	2020-06-16 13:05:53 -07:00
Yuanfang Chen	719c87edc5	remove gold linker	2020-06-16 13:03:31 -07:00
Tom Stellard	d970ab63e2	Driver: Accept multiple --config options if filenames are the same Summary: We're trying to use the --config options to pass distro specific options for Fedora via the CFLAGS variable. However, some projects end up using the CFLAGS variable multiple times in their command line, which leads to an error when --config is used. This patch resolves this issue by allowing more than one --config option on the command line as long as the file names are the same. Reviewers: sepavloff, hfinkel Reviewed By: sepavloff Subscribers: cfe-commits, llvm-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D81424	2020-06-16 12:23:57 -07:00
Ilya Leoshkevich	ccd8b7b103	[LSan] Enable for SystemZ Summary: Add runtime support, adjust the tests and enable LSan. Reviewers: vitalybuka, eugenis, uweigand, jonpa Reviewed By: uweigand Subscribers: mgorny, cfe-commits, #sanitizers Tags: #clang, #sanitizers Differential Revision: https://reviews.llvm.org/D78644	2020-06-16 13:45:29 +02:00
Yaxun (Sam) Liu	6752786d65	[HIP] Do not use llvm-link/opt/llc for -fgpu-rdc This patch is a follow up on https://reviews.llvm.org/D81627. In addition to default -fno-gpu-rdc case, this patches let HIP toolchain not use llvm-link/opt/llc to link device code for -fgpu-rdc case. Instead, uses standard lto. This will eliminate some redundant optimizations and speed up the compilation/linking. Differential Revision: https://reviews.llvm.org/D81861	2020-06-15 21:09:18 -04:00
Yaxun (Sam) Liu	e8090d83fd	[HIP] Do not call opt/llc for -fno-gpu-rdc Currently HIP toolchain calls clang to emit bitcode then calls opt/llc for device compilation for the default -fno-gpu-rdc case, which is unnecessary since clang is able to compile a single source file to ISA. This patch fixes the HIP action builder and toolchain so that the default -fno-gpu-rdc can be done like a canonical toolchain, i.e. one clang -cc1 invocation to compile source code to ISA. This can avoid unnecessary processes to speed up the compilation, and avoid redundant LLVM passes which are performed in clang -cc1 and opt. Differential Revision: https://reviews.llvm.org/D81627	2020-06-15 18:55:01 -04:00
Artem Belevich	d700237f1a	[CUDA,HIP] Use VFS for SDK detection. It's useful for using clang from tools that may need need to provide SDK files from non-standard locations. Clang CLI only provides a way to specify VFS for include files, so there's no good way to test this yet. Differential Revision: https://reviews.llvm.org/D81771	2020-06-15 12:54:44 -07:00
Michael Liao	8b6821a584	[hip] Fix device-only relocatable code compilation. Summary: - In HIP, just as the regular device-only compilation, the device-only relocatable code compilation should not involve offload bundle. - In addition, that device-only relocatable code compilation should have the similar 3 steps, namely preprocessor, compile, and backend, to the regular code generation with `-emit-llvm`. Reviewers: yaxunl, tra Subscribers: cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D81427	2020-06-10 14:10:41 -04:00
Sterling Augustine	f07b3d41e7	Add #includes so that ROCm.h is compilable stand-alone. Summary: ROCm.h had been getting the declarations for various data structures by being #included next to them, rather than #includeing them itself. This change fixes that by explicitly including the appropriate headers. Subscribers: cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D81432	2020-06-08 14:16:30 -07:00
Jian Cai	4db2b70248	Add a flag to debug automatic variable initialization Summary: Add -ftrivial-auto-var-init-stop-after= to limit the number of times stack variables are initialized when -ftrivial-auto-var-init= is used to initialize stack variables to zero or a pattern. This flag can be used to bisect uninitialized uses of a stack variable exposed by automatic variable initialization, such as http://crrev.com/c/2020401. Reviewers: jfb, vitalybuka, kcc, glider, rsmith, rjmccall, pcc, eugenis, vlad.tsyrklevich Reviewed By: jfb Subscribers: phosek, hubert.reinterpretcast, srhines, MaskRay, george.burgess.iv, dexonsmith, inglorion, gbiv, llozano, manojgupta, cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D77168	2020-06-08 12:30:56 -07:00
Sam McCall	9c3909556b	Recognize .hxx as a C++ header extension, like .hpp. Reviewers: kadircet Subscribers: cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D81366	2020-06-08 11:50:34 +02:00
Fangrui Song	336e1f03d1	[Driver] Omit -mthread-model posix which is the CC1 default	2020-06-07 12:27:11 -07:00
Benjamin Kramer	02e35832c3	[Driver] Simplify code. NFCI.	2020-06-07 20:18:14 +02:00
Benjamin Kramer	27e0077dcf	Try to make msvc crash less llvm-project\clang\lib\Driver\Types.cpp(44): fatal error C1001: An internal error has occurred in the compiler. (compiler file 'msc1.cpp', line 1518)	2020-06-07 18:07:07 +02:00
Benjamin Kramer	5a098086f9	Put compilation phases from Types.def into a bit set This avoids a global constructor and is a bit more efficient for "contained" queries. No functionality change intended.	2020-06-07 17:22:44 +02:00
Yaxun (Sam) Liu	8422bc9efc	recommit "[HIP] Add default header and include path" recommit `11d06b9511` with fix for lit tests.	2020-06-06 14:21:22 -04:00
Jonas Paulsson	515bfc66ea	[SystemZ] Implement -fstack-clash-protection Probing of allocated stack space is now done when this option is passed. The purpose is to protect against the stack clash attack (see https://www.qualys.com/2017/06/19/stack-clash/stack-clash.txt). Review: Ulrich Weigand Differential Revision: https://reviews.llvm.org/D78717	2020-06-06 18:38:36 +02:00
Nico Weber	2920348063	Revert "recommit "[HIP] Add default header and include path"" This reverts commit `1fa43e0b34`. Still breaks tests on several bots, see https://reviews.llvm.org/D81176	2020-06-05 21:50:04 -04:00
Yaxun (Sam) Liu	1fa43e0b34	recommit "[HIP] Add default header and include path" recommit `11d06b9511` with fix for lit tests.	2020-06-05 20:41:15 -04:00
stevewan	61cd264068	[PowerPC] Do not special case Darwin on PowerPC in target cpu handling Summary: This patch removes the special handling for Darwin on PowerPC in the default target cpu handling, because Darwin is no longer supported on the PowerPC platform. Reviewers: hubert.reinterpretcast, daltenty Reviewed By: hubert.reinterpretcast Subscribers: wuzish, nemanjai, shchenz, steven.zhang, cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D81115	2020-06-05 19:55:28 -04:00
Yaxun (Sam) Liu	8a8c6913a9	Revert "[HIP] Add default header and include path" This reverts commit `11d06b9511`.	2020-06-05 15:42:57 -04:00
Yaxun (Sam) Liu	11d06b9511	[HIP] Add default header and include path To support std::complex and some other standard C/C++ functions in HIP device code, they need to be forced to be __host__ __device__ functions by pragmas. This is done by some clang standard C++ wrapper headers which are shared between cuda-clang and hip-Clang. For these standard C++ wapper headers to work properly, specific include path order has to be enforced: clang C++ wrapper include path standard C++ include path clang include path Also, these C++ wrapper headers require device version of some standard C/C++ functions must be declared before including them. This needs to be done by including a default header which declares or defines these device functions. The default header is always included before any other headers are included by users. This patch adds the the default header and include path for HIP. Differential Revision: https://reviews.llvm.org/D81176	2020-06-05 12:44:57 -04:00
Petr Hosek	b16ed493dd	[Fuchsia] Rely on linker switch rather than dead code ref for profile runtime Follow the model used on Linux, where the clang driver passes the linker a -u switch to force the profile runtime to be linked in, rather than having every TU emit a dead function with a reference. Differential Revision: https://reviews.llvm.org/D79835	2020-06-04 15:47:05 -07:00
Petr Hosek	e1ab90001a	Revert "[Fuchsia] Rely on linker switch rather than dead code ref for profile runtime" This reverts commit `d510542174` since it broke several bots.	2020-06-04 15:44:10 -07:00
Petr Hosek	d510542174	[Fuchsia] Rely on linker switch rather than dead code ref for profile runtime Follow the model used on Linux, where the clang driver passes the linker a -u switch to force the profile runtime to be linked in, rather than having every TU emit a dead function with a reference. Patch By: mcgrathr Differential Revision: https://reviews.llvm.org/D79835	2020-06-04 14:25:19 -07:00
Dan Gohman	d496437a0b	[WebAssembly] Add support for -mexec-model=reactor This adds a -mexec-model= command-line flag. The default value is "command" which is no change from the current behavior. The other option is "reactor" which enables the WASI Reactor ABI: https://github.com/WebAssembly/WASI/blob/master/design/application-abi.md Differential Revision: https://reviews.llvm.org/D62922	2020-06-03 14:02:47 -07:00
Steven Wan	ba4afe6f7a	[AIX] Change the default target CPU to power4 for AIX on Power Summary: This patch changes the AIX default target CPU to power4 since this is the the lowest arch for the lowest OS level supported. Reviewers: hubert.reinterpretcast, cebowleratibm, daltenty Reviewed By: hubert.reinterpretcast Subscribers: cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D80835	2020-06-03 13:50:26 -04:00
Shengchen Kan	ac47588bc4	[Driver] Add negative option for -fkeep-static-consts	2020-06-03 14:59:14 +08:00
Nick Desaulniers	8eda71616f	[Clang][A32/T32][Linux] -O1 implies -fomit-frame-pointer Summary: An upgrade of LLVM for CrOS [0] containing [1] triggered a bunch of errors related to writing to reserved registers for a Linux kernel's arm64 compat vdso (which is a aarch32 image). After a discussion on LKML [2], it was determined that -f{no-}omit-frame-pointer was not being specified. Comparing GCC and Clang [3], it becomes apparent that GCC defaults to omitting the frame pointer implicitly when optimizations are enabled, and Clang does not. ie. setting -O1 (or above) implies -fomit-frame-pointer. Clang was defaulting to -fno-omit-frame-pointer implicitly unless -fomit-frame-pointer was set explicitly. Why this becomes a problem is that the Linux kernel's arm64 compat vdso contains code that uses r7. r7 is used sometimes for the frame pointer (for example, when targeting thumb (-mthumb)). See useR7AsFramePointer() in llvm/llvm-project/llvm/lib/Target/ARM/ARMSubtarget.h. This is mostly for legacy/compatibility reasons, and the 2019 Q4 revision of the ARM AAPCS looks to standardize r11 as the frame pointer for aarch32, though this is not yet implemented in LLVM. Users that are reliant on the implicit value if unspecified when optimizations are enabled should explicitly choose -fomit-frame-pointer (new behavior) or -fno-omit-frame-pointer (old behavior). [0] https://bugs.chromium.org/p/chromium/issues/detail?id=1084372 [1] https://reviews.llvm.org/D76848 [2] https://lore.kernel.org/lkml/20200526173117.155339-1-ndesaulniers@google.com/ [3] https://godbolt.org/z/0oY39t Reviewers: kristof.beyls, psmith, danalbert, srhines, MaskRay, ostannard, efriedma Reviewed By: psmith, danalbert, srhines, MaskRay, efriedma Subscribers: efriedma, olista01, MaskRay, vhscampos, cfe-commits, llvm-commits, manojgupta, llozano, glider, hctim, eugenis, pcc, peter.smith, srhines Tags: #clang, #llvm Differential Revision: https://reviews.llvm.org/D80828	2020-06-02 15:54:14 -07:00
Sriraman Tallam	e0bca46b08	Options for Basic Block Sections, enabled in D68063 and D73674. This patch adds clang options: -fbasic-block-sections={all,<filename>,labels,none} and -funique-basic-block-section-names. LLVM Support for basic block sections is already enabled. + -fbasic-block-sections={all, <file>, labels, none} : Enables/Disables basic block sections for all or a subset of basic blocks. "labels" only enables basic block symbols. + -funique-basic-block-section-names: Enables unique section names for basic block sections, disabled by default. Differential Revision: https://reviews.llvm.org/D68049	2020-06-02 00:23:32 -07:00
Fangrui Song	92448fd23d	[Driver] Simplify Linux::addProfileRTLibs	2020-05-31 17:15:14 -07:00
Hubert Tong	c15d5d12c6	[Driver] NFC: Use Twine temp to replace std::string local This patch replaces a `std::string` local used for a concatentation with a `Twine` where the string was being passed into call.	2020-05-31 16:38:10 -04:00
Mateusz Mikuła	ab4d02cf26	[clang] [MinGW] Fix libunwind extension Differential Revision: https://reviews.llvm.org/D79995	2020-05-29 15:23:14 +03:00
Martin Storsjö	ac1f7ab007	[clang] [Darwin] Add reverse mappings for aarch64/aarch64_32 to darwin arch names These are mapped in MachO::getMachOArchName already, but were missing in ToolChain::getDefaultUniversalArchName. Having these reverse mapped here fixes weird inconsistencies like -dumpmachine showing a target triple like "aarch64-apple-darwin", while "clang -target aarch64-apple-darwin" didn't use to work (ended up mapped as unknown-apple-ios). Differential Revision: https://reviews.llvm.org/D79117	2020-05-29 15:23:14 +03:00
Shengchen Kan	4954449995	[Driver][X86] Support branch align options with LTO Summary: Before this patch, we use two different ways to pass options to align branch depending on whether LTO is enabled. For example, `-mbranches-within-32B-boundaries` w/o LTO and `-Wl,-plugin-opt=-x86-branches-within-32B-boundaries` w/ LTO. It's inconvenient, so this patch unifies the way: we only need to pass options like `-mbranches-within-32B-boundaries` to align branches, no matter LTO is enabled or not. Differential Revision: https://reviews.llvm.org/D80289	2020-05-28 13:17:14 +08:00
Lei Huang	2368bf52cd	[PowerPC] Add support for -mcpu=pwr10 in both clang and llvm Summary: This patch simply adds support for the new CPU in anticipation of Power10. There isn't really any functionality added so there are no associated test cases at this time. Reviewers: stefanp, nemanjai, amyk, hfinkel, power-llvm-team, #powerpc Reviewed By: stefanp, nemanjai, amyk, #powerpc Subscribers: NeHuang, steven.zhang, hiraditya, llvm-commits, wuzish, shchenz, cfe-commits, kbarton, echristo Tags: #clang, #powerpc, #llvm Differential Revision: https://reviews.llvm.org/D80020	2020-05-27 13:14:25 -05:00
Fangrui Song	b9c6871a95	[Driver] Support -fsanitize=shadow-call-stack and cfi-icall on aarch64_be D80647 did not fix https://bugs.llvm.org/show_bug.cgi?id=46076 This is the fix.	2020-05-27 10:55:05 -07:00
Fangrui Song	a2a3e9f0a6	[Driver] Support -fsanitize=shadow-call-stack on aarch64_be Fixes https://bugs.llvm.org/show_bug.cgi?id=46076 Reviewed By: nickdesaulniers, pcc Differential Revision: https://reviews.llvm.org/D80647	2020-05-27 10:37:39 -07:00
Simon Pilgrim	b5b0087722	SpecialCaseList.h - reduce unnecessary includes to forward declarations. NFC. Remove Regex forward declaration as we already require the Regex.h include. Add missing VirtualFileSystem.h include to dependent source files.	2020-05-27 15:51:03 +01:00
Lei Huang	559845f8fe	Revert "[PowerPC] Add support for -mcpu=pwr10 in both clang and llvm" This reverts commit `7eb666b155`.	2020-05-27 09:40:21 -05:00
Marco Elver	14de6e29b1	[Clang][Driver] Add Bounds and Thread to SupportsCoverage list Summary: This permits combining -fsanitize-coverage with -fsanitize=bounds or -fsanitize=thread. Note that, GCC already supports combining these. Tested: - Add Clang end-to-end test checking IR is generated for both combinations of sanitizers. - Several previously failing TSAN tests now pass. Bugzilla: https://bugs.llvm.org/show_bug.cgi?id=45831 Reviewers: vitalybuka Reviewed By: vitalybuka Subscribers: #sanitizers, dvyukov, nickdesaulniers, cfe-commits Tags: #clang, #sanitizers Differential Revision: https://reviews.llvm.org/D79628	2020-05-26 13:36:21 -07:00
Lei Huang	7eb666b155	[PowerPC] Add support for -mcpu=pwr10 in both clang and llvm Summary: This patch simply adds support for the new CPU in anticipation of Power10. There isn't really any functionality added so there are no associated test cases at this time. Reviewers: stefanp, nemanjai, amyk, hfinkel, power-llvm-team, #powerpc Reviewed By: stefanp, nemanjai, amyk, #powerpc Subscribers: NeHuang, steven.zhang, hiraditya, llvm-commits, wuzish, shchenz, cfe-commits, kbarton, echristo Tags: #clang, #powerpc, #llvm Differential Revision: https://reviews.llvm.org/D80020	2020-05-26 13:48:22 -05:00
Fangrui Song	9d55e4ee13	Make explicit -fno-semantic-interposition (in -fpic mode) infer dso_local -fno-semantic-interposition is currently the CC1 default. (The opposite disables some interprocedural optimizations.) However, it does not infer dso_local: on most targets accesses to ExternalLinkage functions/variables defined in the current module still need PLT/GOT. This patch makes explicit -fno-semantic-interposition infer dso_local, so that PLT/GOT can be eliminated if targets implement local aliases for AsmPrinter::getSymbolPreferLocal (currently only x86). Currently we check whether the module flag "SemanticInterposition" is 0. If yes, infer dso_local. In the future, we can infer dso_local unless "SemanticInterposition" is 1: frontends other than clang will also benefit from the optimization if they don't bother setting the flag. (There will be risks if they do want ELF interposition: they need to set "SemanticInterposition" to 1.)	2020-05-25 20:48:18 -07:00
stevewan	5bf2409a4e	[AIX] Add '-bcdtors:all:0:s' to linker to gather static init functions Summary: On AIX, add '-bcdtors:all:0:s' to the linker implicitly through the driver so that we can collect all static constructor and destructor functions. Reviewers: hubert.reinterpretcast, Xiangling_L, ZarkoCA, daltenty Reviewed By: hubert.reinterpretcast Subscribers: cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D80415	2020-05-25 13:44:15 -04:00
Eli Friedman	9292ece995	[clang driver] Spell "--export-dynamic-symbol" with two dashes. This doesn't make a difference for linkers that support the option, but it improves the error message from older linkers that don't support it.	2020-05-23 15:46:28 -07:00
Matt Arsenault	1d96dca949	HIP: Try to deal with more llvm package layouts The various HIP builds are all inconsistent. The default llvm install goes to ${INSTALL_PREFIX}/bin/clang, but the rocm packaging scripts move this under ${INSTALL_PREFIX}/llvm/bin/clang. Some other builds further pollute this with ${INSTALL_PREFIX}/bin/x86_64/clang. These should really be consolidated, but try to handle them for now.	2020-05-23 13:28:24 -04:00
Ben Dunbobbin	9d23b5025d	[PS4] Enable relaxed relocations by default PS4 supports these relocations types; so, we want to enable them by default. Differential Revision: https://reviews.llvm.org/D79980	2020-05-21 23:05:44 +01:00
Sylvestre Ledru	dbb034947c	Add support of the next Ubuntu (Ubuntu 20.10 - Groovy Gorilla)	2020-05-19 08:30:25 +02:00
Dylan McKay	ede6005e70	[AVR] Explicitly set the address of the data section when invoking the linker This is required to get avr-gdb correctly showing values at the right addresses. This problem was discovered by using debug symbols in an external program to lookup values in an AVR simulator.	2020-05-18 02:24:51 +12:00
Fangrui Song	3841ed4104	[Driver] Render -T for Gnu.cpp clang -T a.lds a.c currently does not render -T.	2020-05-16 23:54:31 -07:00
Yvan Roux	0e4827aa4e	[ARM][MachineOutliner] Add Machine Outliner support for ARM. Enables Machine Outlining for ARM and Thumb2 modes. This is the first patch of the series which adds all the basic logic for the support, and only handles tail-calls and thunks. The outliner can be turned on by using clang -moutline option or -mllvm -enable-machine-outliner one (like AArch64). Differential Revision: https://reviews.llvm.org/D76066	2020-05-15 08:44:23 +02:00
Volodymyr Sapsai	6a3469f58d	[ObjC] Add compatibility mode for type checking of qualified id block parameters. Commit `73152a2ec2` fixed type checking for blocks with qualified id parameters. But there are existing APIs in Apple SDKs relying on the old type checking behavior. Specifically, these are APIs using NSItemProviderCompletionHandler in Foundation/NSItemProvider.h. To keep existing code working and to allow developers to use affected APIs introduce a compatibility mode that enables the previous and the fixed type checking. This mode is enabled only on Darwin platforms. Reviewed By: jyknight, ahatanak Differential Revision: https://reviews.llvm.org/D79511	2020-05-14 12:08:19 -07:00
Fangrui Song	5ecb514146	[Driver] Pass -plugin-opt=O2 for -Os -Oz and -plugin-opt=O1 for -Og Fixes PR42445 (compiler driver options -Os -Oz translate to -plugin-opt=Os (Oz) which are not recognized by LLVMgold.so or LLD). The optimization level mapping matches CompilerInvocation.cpp:getOptimizationLevel() and SpeedLevel of PassBuilder::OptimizationLevel::O. -plugin-opt=O affects the way we construct regular LTO/ThinLTO pass manager pipeline. Reviewed By: pcc Differential Revision: https://reviews.llvm.org/D79919	2020-05-14 10:37:33 -07:00
Matt Arsenault	235fb7dc24	AMDGPU/OpenCL: Accept -nostdlib in place of -nogpulib -nogpulib makes sense when there is a host (where -nostdlib would apply) and offload target. Accept nostdlib when there is no offload target as an alias.	2020-05-14 12:33:31 -04:00
Yaxun (Sam) Liu	cf2fb13932	Add -print-targets to print the registered targets Differential Revision: https://reviews.llvm.org/D79565	2020-05-13 11:34:22 -04:00
Matt Arsenault	14e1845711	HIP: Merge builtin library handling Merge with the new --rocm-path handling used for OpenCL. This looks for a usable set of device libraries upfront, rather than giving a generic "no such file or directory error". If any of the required bitcode libraries are missing, this will now produce a "cannot find ROCm installation." error. This differs from the existing hip specific flags by pointing to a rocm root install instead of a single directory with bitcode files. This tries to maintain compatibility with the existing the --hip-device-lib and --hip-device-lib-path flags, as well as the HIP_DEVICE_LIB_PATH environment variable, or at least the range of uses with testcases. The existing range of uses and behavior doesn't entirely make sense to me, so some of the untested edge cases change behavior. Currently the two path forms seem to have the double purpose of a search path for an arbitrary --hip-device-lib, and for finding the stock set of libraries. Since the stock set of libraries This also changes the behavior when multiple paths are specified, and only takes the last one (and the environment variable only handles a single path). If --hip-device-lib is used, it now only treats --hip-device-lib-path as the search path for it, and does not attempt to find the rocm installation. If not, --hip-device-lib-path and the environment variable are used as the directory to search instead of the rocm root based path. This should also automatically fix handling of the options to use wave64.	2020-05-12 09:50:22 -04:00
Matt Arsenault	123bee602a	AMDGPU: Search for new ROCm bitcode library structure The current install situation is a mess, but I'm working on fixing it. Search for the target layout instead of one of the N options that exist today.	2020-05-12 09:41:07 -04:00
Zola Bridges	379e68a763	[clang][SLH] Add __has_feature(speculative_load_hardening) SLH doesn't support asm goto and is unlikely to ever support it. Users of asm goto need a way to choose whether to use asm goto or fallback to an SLH compatible code path when SLH is enabled. This feature flag will give users this ability. Tested via unit test Reviewed By: mattdr Differential Revision: https://reviews.llvm.org/D79733	2020-05-11 13:37:12 -07:00
Scott Constable	e97a3e5d9d	[X86] Add a Pass that builds a Condensed CFG for Load Value Injection (LVI) Gadgets Adds a new data structure, ImmutableGraph, and uses RDF to find LVI gadgets and add them to a MachineGadgetGraph. More specifically, a new X86 machine pass finds Load Value Injection (LVI) gadgets consisting of a load from memory (i.e., SOURCE), and any operation that may transmit the value loaded from memory over a covert channel, or use the value loaded from memory to determine a branch/call target (i.e., SINK). Also adds a new target feature to X86: +lvi-load-hardening The feature can be added via the clang CLI using -mlvi-hardening. Differential Revision: https://reviews.llvm.org/D75936	2020-05-11 13:08:35 -07:00
Florian Hahn	1065869195	[Matrix] Add matrix type to Clang. This patch adds a matrix type to Clang as described in the draft specification in clang/docs/MatrixSupport.rst. It introduces a new option -fenable-matrix, which can be used to enable the matrix support. The patch adds new MatrixType and DependentSizedMatrixType types along with the plumbing required. Loads of and stores to pointers to matrix values are lowered to memory operations on 1-D IR arrays. After loading, the loaded values are cast to a vector. This ensures matrix values use the alignment of the element type, instead of LLVM's large vector alignment. The operators and builtins described in the draft spec will will be added in follow-up patches. Reviewers: martong, rsmith, Bigcheese, anemet, dexonsmith, rjmccall, aaron.ballman Reviewed By: rjmccall Differential Revision: https://reviews.llvm.org/D72281	2020-05-11 18:55:45 +01:00
Petr Hosek	5b02be0b97	[Clang] Pass -z max-page-size to linker for Fuchsia Currently all Fuchsia ABIs use a 4k page size, departing from the recommended page sizes in the respective psABI documents. Differential Revision: https://reviews.llvm.org/D79667	2020-05-09 13:44:20 -07:00
Petr Hosek	c8fbcb1e78	[Clang] Pass --pack-dyn-relocs=relr to lld for Fuchsia The compact format is fully supported on Fuchsia and is the preferred default. Patch By: mcgrathr Differential Revision: https://reviews.llvm.org/D79665	2020-05-09 13:42:19 -07:00
Fangrui Song	6bf0ad78dc	[Driver] Don't pass -u__llvm_profile_runtime for clang -fprofile-arcs a.o clang --coverage a.o # InstrProfilingRuntime.cpp.o not linked in clang --fprofile-arcs a.o # InstrProfilingRuntime.cpp.o unexpectedly linked in Fix --fprofile-arcs.	2020-05-08 23:36:29 -07:00
Fangrui Song	9a11174287	[Driver] Add -fno-test-coverage	2020-05-08 17:01:53 -07:00
Fangrui Song	0d4a33ba61	[Driver] Don't warn -Wunused-command-line-argument for --coverage -ftest-coverage -fprofile-arcs	2020-05-08 16:31:15 -07:00
Fangrui Song	e1815eb2e1	[Driver] Reorganize --coverage -ftest-coverage -fprofile-arcs related tests And fix a comment about __llvm_profile_runtime	2020-05-08 16:06:33 -07:00
Hubert Tong	b116ded57d	[AIX] Avoid structor alias; die before bad alias codegen Summary: `AsmPrinter::emitGlobalIndirectSymbol` is dependent on `MCStreamer::emitAssignment` to produce `.set` directives for alias symbols; however, the `.set` pseudo-op on AIX is documented as not usable with external relocatable terms or expressions, which limits its applicability in generating alias symbols. Disable generating aliases on AIX until a different implementation strategy is available. Reviewers: cebowleratibm, jasonliu, sfertile, daltenty, DiggerLin Reviewed By: jasonliu Differential Revision: https://reviews.llvm.org/D79044	2020-05-08 16:51:34 -04:00
Sriraman Tallam	e8147ad822	Uniuqe Names for Internal Linkage Symbols. This is a standalone patch and this would help Propeller do a better job of code layout as it can accurately attribute the profiles to the right internal linkage function. This also helps SampledFDO/AutoFDO correctly associate sampled profiles to the right internal function. Currently, if there is more than one internal symbol foo, their profiles are aggregated by SampledFDO. This patch adds a new clang option, -funique-internal-funcnames, to generate unique names for functions with internal linkage. This patch appends the md5 hash of the module name to the function symbol as a best effort to generate a unique name for symbols with internal linkage. Differential Revision: https://reviews.llvm.org/D73307	2020-05-07 18:18:37 -07:00
Evgenii Stepanov	b4aa71e1bd	Allow -fsanitize-minimal-runtime with memtag sanitizer. Summary: MemTag does not have any runtime at the moment, it's strictly code instrumentation. Reviewers: pcc Subscribers: cryptoad, cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D79522	2020-05-07 13:07:46 -07:00
Yaxun (Sam) Liu	e85bc0056e	Revert "Let clang print registered targets for --version" This reverts commit `66041187c9`. Will reland it under --print-target. Differential Revision: https://reviews.llvm.org/D79210	2020-05-07 11:01:42 -04:00
Calixte Denizet	bec223a9bc	[profile] Don't crash when forking in several threads Summary: When forking in several threads, the counters were written out in using the same global static variables (see GCDAProfiling.c): that leads to crashes. So when there is a fork, the counters are resetted in the child process and they will be dumped at exit using the interprocess file locking. When there is an exec, the counters are written out and in case of failures they're resetted. Reviewers: jfb, vsk, marco-c, serge-sans-paille Reviewed By: marco-c, serge-sans-paille Subscribers: llvm-commits, serge-sans-paille, dmajor, cfe-commits, hiraditya, dexonsmith, #sanitizers, marco-c, sylvestre.ledru Tags: #sanitizers, #clang, #llvm Differential Revision: https://reviews.llvm.org/D78477	2020-05-07 14:13:11 +02:00
Yaxun (Sam) Liu	66041187c9	Let clang print registered targets for --version We need a way to know supported targets by clang since people may use clang as assembler and they want to choose the clang which supports their target. This patch let clang print registered targets when --version option is passed to clang. Differential Revision: https://reviews.llvm.org/D79210	2020-05-04 12:26:18 -04:00
Samuel Thibault	c298e5a022	[Gnu toolchain] Move GCC multilib/multiarch paths support from Linux to Gnu Summary: The current code for GNU/Linux is actually completely generic, and can be moved to Gnu, so it can benefit GNU/Hurd and GNU/kFreeBSD Reviewers: kristina, sammccall, lebedev.ri, MaskRay, arsenm, phosek Reviewed By: MaskRay, phosek Subscribers: wdng, ormris, emaste, arichardson, krytarowski, cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D73845	2020-05-01 12:12:27 +02:00
Sylvain Audi	226489715c	[clang] Disable check for system sanitizer blacklists files if -fno-sanitizer-blacklist was specified This is to avoid checking for the validity of a file that is not used. This also contains a minor fix for the test, as the cfi sanitizer requires -flto and -fvisibility= arguments. Differential Revision: https://reviews.llvm.org/D79043	2020-04-30 16:04:50 -04:00
Martin Storsjö	a0e53de472	[clang] [MinGW] Add the compiler rt libdirs to the search path This matches what is done for MSVC in `b8000c0ce8`. Since that commit, compiler rt sanitizer libraries aren't linked to with absolute path on windows, but using their basenames, requiring the libdirs to be passed to the linker. This fixes undefined behaviour sanitizer on MinGW after `b8000c0ce8`. Differential Revision: https://reviews.llvm.org/D79076	2020-04-29 20:35:50 +03:00
Reid Kleckner	b8000c0ce8	[Windows] Autolink with basenames and add libdir to libpath Prior to this change, for a few compiler-rt libraries such as ubsan and the profile library, Clang would embed "-defaultlib:path/to/rt-arch.lib" into the .drective section of every object compiled with -finstr-profile-generate or -fsanitize=ubsan as appropriate. These paths assume that the link step will run from the same working directory as the compile step. There is also evidence that sometimes the paths become absolute, such as when clang is run from a different drive letter from the current working directory. This is fragile, and I'd like to get away from having paths embedded in the object if possible. Long ago it was suggested that we use this for ASan, and apparently I felt the same way back then: https://reviews.llvm.org/D4428#56536 This is also consistent with how all other autolinking usage works for PS4, Mac, and Windows: they all use basenames, not paths. To keep things working for people using the standard GCC driver workflow, the driver now adds the resource directory to the linker library search path when it calls the linker. This is enough to make check-ubsan pass, and seems like a generally good thing. Users that invoke the linker directly (most clang-cl users) will have to add clang's resource library directory to their linker search path in their build system. I'm not sure where I can document this. Ideally I'd also do it in the MSBuild files, but I can't figure out where they go. I'd like to start with this for now. Reviewed By: hans Differential Revision: https://reviews.llvm.org/D65543	2020-04-28 11:36:21 -07:00
Francis Visoiu Mistrih	e770153865	[AArch64] Add support for -ffixed-x30 Add support for reserving LR in: * the driver through `-ffixed-x30` * cc1 through `-target-feature +reserve-x30` * the backend through `-mattr=+reserve-x30` * a subtarget feature `reserve-x30` the same way we're doing for the other registers.	2020-04-28 08:48:28 -07:00
Samuel Thibault	6395980370	[Driver] Move GCC multilib/multiarch paths support from Linux.cpp to Gnu.cpp The current code for GNU/Linux is actually completely generic, and can be moved to ToolChains/Gnu.cpp, so that it can benefit GNU/Hurd and GNU/kFreeBSD. Reviewed By: MaskRay, phosek Differential Revision: https://reviews.llvm.org/D73845	2020-04-24 09:33:19 -07:00
Luke Geeson	740a1dd050	[ARM] Armv8.6-a Matrix Mul cmd line support This patch upstreams support for the Armv8.6-a Matrix Multiplication Extension. A summary of the features can be found here: https://community.arm.com/developer/ip-products/processors/b/processors-ip-blog/posts/arm-architecture-developments-armv8-6-a This patch includes: - Command line options to enable these features with +i8mm, +f32mm, or f64mm Note: +f32mm and +f64mm are optional and so are not enabled by default This is part of a patch series, starting with BFloat16 support and the other components in the armv8.6a extension (in previous patches linked in phabricator) Based on work by: - Luke Geeson - Oliver Stannard - Luke Cheeseman Reviewers: t.p.northover, DavidSpickett Reviewed By: DavidSpickett Subscribers: DavidSpickett, ostannard, kristof.beyls, danielkiss, cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D77875	2020-04-24 15:54:06 +01:00
Yaxun (Sam) Liu	2da89df4e8	Fix minor bug in CommonArgs.cpp Change-Id: Ibe87b1633cc7516479bb08bf51b6860a1585a94f	2020-04-24 08:46:33 -04:00
Justin Hibbits	4ca2cad947	[PowerPC] Add clang -msvr4-struct-return for 32-bit ELF Summary: Change the default ABI to be compatible with GCC. For 32-bit ELF targets other than Linux, Clang now returns small structs in registers r3/r4. This affects FreeBSD, NetBSD, OpenBSD. There is no change for 32-bit Linux, where Clang continues to return all structs in memory. Add clang options -maix-struct-return (to return structs in memory) and -msvr4-struct-return (to return structs in registers) to be compatible with gcc. These options are only for PPC32; reject them on PPC64 and other targets. The options are like -fpcc-struct-return and -freg-struct-return for X86_32, and use similar code. To actually return a struct in registers, coerce it to an integer of the same size. LLVM may optimize the code to remove unnecessary accesses to memory, and will return i32 in r3 or i64 in r3:r4. Fixes PR#40736 Patch by George Koehler! Reviewed By: jhibbits, nemanjai Differential Revision: https://reviews.llvm.org/D73290	2020-04-21 20:17:25 -05:00
Richard Smith	c8248dc3bb	Change deprecated -fsanitize-recover flag to apply to all sanitizers, not just UBSan. Summary: This flag has been deprecated, with an on-by-default warning encouraging users to explicitly specify whether they mean "all" or ubsan for 5 years (released in Clang 3.7). Change it to mean what we wanted and undeprecate it. Also make the argument to -fsanitize-trap optional, and likewise default it to 'all', and express the aliases for these flags in the .td file rather than in code. (Plus documentation updates for the above.) Reviewers: kcc Subscribers: cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D77753	2020-04-17 22:37:30 -07:00
Stephan Dollberg	7b9c6c16c3	Also look for devtoolset-9 gcc toolchain devtoolset-9 has been out for a while so also look for it. Differential Revision: https://reviews.llvm.org/D77420	2020-04-16 11:17:39 +02:00
Matt Arsenault	3a61245050	clang/AMDGPU: Assume denormals are enabled for the default target. Since the default logic was based on having fast denormal/fma features, and the default target has no features, we assumed flushing by default. This fixes incorrectly assuming flushing in builds for "generic" IR libraries. The handling for no specified --cuda-gpu-arch in HIP is kind of broken. Somewhere else forces a default target of gfx803, which does not enable denormal handling by default. We don't see this default switching here, so you'll end up with a different denormal mode depending on whether you explicitly requested gfx803, or used it by default.	2020-04-15 09:17:26 -04:00
Matt Arsenault	dc89a3efb4	HIP: Fix handling of denormal mode I didn't realize HIP was a distinct offloading kind, so the subtarget was looking for -march, which isn't correct for HIP. We also have the possibility of different denormal defaults in the case of multiple offload targets, so we need to thread the JobAction through the target hook.	2020-04-13 11:48:45 -07:00
Matt Morehouse	bef187c750	Implement `-fsanitize-coverage-whitelist` and `-fsanitize-coverage-blacklist` for clang Summary: This commit adds two command-line options to clang. These options let the user decide which functions will receive SanitizerCoverage instrumentation. This is most useful in the libFuzzer use case, where it enables targeted coverage-guided fuzzing. Patch by Yannis Juglaret of DGA-MI, Rennes, France libFuzzer tests its target against an evolving corpus, and relies on SanitizerCoverage instrumentation to collect the code coverage information that drives corpus evolution. Currently, libFuzzer collects such information for all functions of the target under test, and adds to the corpus every mutated sample that finds a new code coverage path in any function of the target. We propose instead to let the user specify which functions' code coverage information is relevant for building the upcoming fuzzing campaign's corpus. To this end, we add two new command line options for clang, enabling targeted coverage-guided fuzzing with libFuzzer. We see targeted coverage guided fuzzing as a simple way to leverage libFuzzer for big targets with thousands of functions or multiple dependencies. We publish this patch as work from DGA-MI of Rennes, France, with proper authorization from the hierarchy. Targeted coverage-guided fuzzing can accelerate bug finding for two reasons. First, the compiler will avoid costly instrumentation for non-relevant functions, accelerating fuzzer execution for each call to any of these functions. Second, the built fuzzer will produce and use a more accurate corpus, because it will not keep the samples that find new coverage paths in non-relevant functions. The two new command line options are `-fsanitize-coverage-whitelist` and `-fsanitize-coverage-blacklist`. They accept files in the same format as the existing `-fsanitize-blacklist` option <https://clang.llvm.org/docs/SanitizerSpecialCaseList.html#format>. The new options influence SanitizerCoverage so that it will only instrument a subset of the functions in the target. We explain these options in detail in `clang/docs/SanitizerCoverage.rst`. Consider now the woff2 fuzzing example from the libFuzzer tutorial <https://github.com/google/fuzzer-test-suite/blob/master/tutorial/libFuzzerTutorial.md>. We are aware that we cannot conclude much from this example because mutating compressed data is generally a bad idea, but let us use it anyway as an illustration for its simplicity. Let us use an empty blacklist together with one of the three following whitelists: ``` # (a) src:* fun:* # (b) src:SRC/* fun:* # (c) src:SRC/src/woff2_dec.cc fun:* ``` Running the built fuzzers shows how many instrumentation points the compiler adds, the fuzzer will output //XXX PCs//. Whitelist (a) is the instrument-everything whitelist, it produces 11912 instrumentation points. Whitelist (b) focuses coverage to instrument woff2 source code only, ignoring the dependency code for brotli (de)compression; it produces 3984 instrumented instrumentation points. Whitelist (c) focuses coverage to only instrument functions in the main file that deals with WOFF2 to TTF conversion, resulting in 1056 instrumentation points. For experimentation purposes, we ran each fuzzer approximately 100 times, single process, with the initial corpus provided in the tutorial. We let the fuzzer run until it either found the heap buffer overflow or went out of memory. On this simple example, whitelists (b) and (c) found the heap buffer overflow more reliably and 5x faster than whitelist (a). The average execution times when finding the heap buffer overflow were as follows: (a) 904 s, (b) 156 s, and (c) 176 s. We explain these results by the fact that WOFF2 to TTF conversion calls the brotli decompression algorithm's functions, which are mostly irrelevant for finding bugs in WOFF2 font reconstruction but nevertheless instrumented and used by whitelist (a) to guide fuzzing. This results in longer execution time for these functions and a partially irrelevant corpus. Contrary to whitelist (a), whitelists (b) and (c) will execute brotli-related functions without instrumentation overhead, and ignore new code paths found in them. This results in faster bug finding for WOFF2 font reconstruction. The results for whitelist (b) are similar to the ones for whitelist (c). Indeed, WOFF2 to TTF conversion calls functions that are mostly located in SRC/src/woff2_dec.cc. The 2892 extra instrumentation points allowed by whitelist (b) do not tamper with bug finding, even though they are mostly irrelevant, simply because most of these functions do not get called. We get a slightly faster average time for bug finding with whitelist (b), which might indicate that some of the extra instrumentation points are actually relevant, or might just be random noise. Reviewers: kcc, morehouse, vitalybuka Reviewed By: morehouse, vitalybuka Subscribers: pratyai, vitalybuka, eternalsakura, xwlin222, dende, srhines, kubamracek, #sanitizers, lebedev.ri, hiraditya, cfe-commits, llvm-commits Tags: #clang, #sanitizers, #llvm Differential Revision: https://reviews.llvm.org/D63616	2020-04-10 10:44:03 -07:00
Matt Arsenault	4593e4131a	AMDGPU: Teach toolchain to link rocm device libs Currently the library is separately linked, but this isn't correct to implement fast math flags correctly. Each module should get the version of the library appropriate for its combination of fast math and related flags, with the attributes propagated into its functions and internalized. HIP already maintains the list of libraries, but this is not used for OpenCL. Unfortunately, HIP uses a separate --hip-device-lib argument, despite both languages using the same bitcode library. Eventually these two searches need to be merged. An additional problem is there are 3 different locations the libraries are installed, depending on which build is used. This also needs to be consolidated (or at least the search logic needs to deal with this unnecessary complexity).	2020-04-10 13:37:32 -04:00
ADRA	6ed88afd78	[CUDA] Accept -x cu to indicate language is CUDA, transfer CUDA language flag to header-file arguments Summary: * accept -x cu to indicate language is CUDA * transfer CUDA language flag to header-file arguments Differential Revision: https://reviews.llvm.org/D77451	2020-04-09 13:08:41 -07:00
Simon Cook	dd1ee6dc07	[RISCV] Support experimental/unratified extensions This adds support for enabling experimental/unratified RISC-V ISA extensions in the -march string in the case where an explicit version number has been declared, and the -menable-experimental-extensions flag has been provided. This follows the design as discussed on the mailing lists in the following RFC: http://lists.llvm.org/pipermail/llvm-dev/2020-January/138364.html Since the RISC-V toolchain definition currently rejects any extension with an explicit version number, the parsing logic has been tweaked to support this, and to allow standard extensions to have their versions checked in future patches. The bitmanip 'b' extension has been added as a first use of this support, it should easily extend to other as yet unratified extensions (such as the vector 'v' extension). Differential Revision: https://reviews.llvm.org/D73891	2020-04-09 18:04:22 +01:00
Shengchen Kan	792b10978d	[Driver][X86] Add -mpad-max-prefix-size Summary: The option `-mpad-max-prefix-size` performs some checking and delegate to MC option `-x86-pad-max-prefix-size`. This option is designed for eliminate NOPs when we need to align something by adding redundant prefixes to instructions, e.g. it can be used along with `-malign-branch`, `-malign-branch-boundary` to prefix padding branch. It has similar (but slightly different) effect as GAS's option `-malign-branch-prefix-size`, e.g. `-mpad-max-prefix-size` can also elminate NOPs emitted by align directive, so we use a different name here. I remove the option `-malign-branch-prefix-size` since is unimplemented and not needed. If we need to be compatible with GAS, we can make `-malign-branch-prefix-size` an alias for this option later. Reviewers: jyknight, reames, MaskRay, craig.topper, LuoYuanke Reviewed By: MaskRay, LuoYuanke Subscribers: annita.zhang, cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D77628	2020-04-09 19:34:12 +08:00
Pratyai Mazumder	ced398fdc8	[SanitizerCoverage] Add -fsanitize-coverage=inline-bool-flag Reviewers: kcc, vitalybuka Reviewed By: vitalybuka Subscribers: cfe-commits, llvm-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D77637	2020-04-09 02:40:55 -07:00
Fangrui Song	969b91af73	[Driver] Default arm-linux-androideabi to -z max-page-size=4096 Similar to D55029. The requirement arises when discussing increasing default max-page-size for lld ARM (D77330). For the record, the default max-page-size on the 3 commonly used linkers: * GNU ld since 2014 (https://sourceware.org/git/?p=binutils-gdb.git;a=commit;h=7572ca8989ead4c3425a1500bc241eaaeffa2c89) defaults to 65536 * GNU gold remains 4096 * lld<=10 uses 4096. lld from 11 onwards will use 65536 (D77330) Reviewed By: srhines, thieta Differential Revision: https://reviews.llvm.org/D77746	2020-04-08 12:05:28 -07:00
Artem Belevich	a9627b7ea7	[CUDA] Add partial support for recent CUDA versions. Generate PTX using newer versions of PTX and allow using sm_80 with CUDA-11. None of the new features of CUDA-10.2+ have been implemented yet, so using these versions will still produce a warning. Differential Revision: https://reviews.llvm.org/D77670	2020-04-08 11:19:44 -07:00
Artem Belevich	33386b20aa	[CUDA] Simplify GPU variant handling. NFC. Instead of hardcoding individual GPU mappings in multiple functions, keep them all in one table and use it to look up the mappings. We also don't care about 'virtual' architecture much, so the API is trimmed down down to a simpler GPU->Virtual arch name lookup. Differential Revision: https://reviews.llvm.org/D77665	2020-04-08 11:19:43 -07:00
Johannes Doerfert	f85ae058f5	[OpenMP] Provide math functions in OpenMP device code via OpenMP variants For OpenMP target regions to piggy back on the CUDA/AMDGPU/... implementation of math functions, we include the appropriate definitions inside of an `omp begin/end declare variant match(device={arch(nvptx)})` scope. This way, the vendor specific math functions will become specialized versions of the system math functions. When a system math function is called and specialized version is available the selection logic introduced in D75779 instead call the specialized version. In contrast to the code path we used so far, the system header is actually included. This means functions without specialized versions are available and so are macro definitions. This should address PR42061, PR42798, and PR42799. Reviewed By: ye-luo Differential Revision: https://reviews.llvm.org/D75788	2020-04-07 23:33:24 -05:00
Francis Visoiu Mistrih	9e6670b03c	[Driver] Only pass LTO remark arguments if the driver asks for it Previous fix missed a check to willEmitRemarks, causing remarks to always be enabled for LTO.	2020-04-07 14:11:47 -07:00
Sid Manning	aed2fdb167	[Hexagon] Update paths for linux/musl Update the sysroot expectation to match other targets and breakout linux/musl toolchain tests into a new file. Differential Revision: https://reviews.llvm.org/D77440	2020-04-07 13:45:52 -05:00
Michael Liao	c97be2c377	[hip] Remove `hip_pinned_shadow`. Summary: - Use `device_builtin_surface` and `device_builtin_texture` for surface/texture reference support. So far, both the host and device use the same reference type, which could be revised later when interface/implementation is stablized. Reviewers: yaxunl Subscribers: cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D77583	2020-04-07 09:51:49 -04:00
Sid Manning	2c5d6dfda9	[Hexagon] Make lld be the default linker for linux/musl When the target is hexagon-unknown-linux-musl select lld as the default linker. Differential Revision: https://reviews.llvm.org/D77498	2020-04-06 12:59:07 -05:00
Craig Topper	1d42c0db9a	Revert "[X86] Add a Pass that builds a Condensed CFG for Load Value Injection (LVI) Gadgets" This reverts commit `c74dd640fd`. Reverting to address coding standard issues raised in post-commit review.	2020-04-03 16:56:08 -07:00
Francis Visoiu Mistrih	ba8b3052b5	[Driver] Handle all optimization-record options for Darwin LTO clang with -flto does not handle -foptimization-record-path=<path> This dulicates the code from ToolChains/Clang.cpp with modifications to support everything in the same fashion.	2020-04-03 15:30:08 -07:00
Scott Constable	c74dd640fd	[X86] Add a Pass that builds a Condensed CFG for Load Value Injection (LVI) Gadgets Adds a new data structure, ImmutableGraph, and uses RDF to find LVI gadgets and add them to a MachineGadgetGraph. More specifically, a new X86 machine pass finds Load Value Injection (LVI) gadgets consisting of a load from memory (i.e., SOURCE), and any operation that may transmit the value loaded from memory over a covert channel, or use the value loaded from memory to determine a branch/call target (i.e., SINK). Also adds a new target feature to X86: +lvi-load-hardening The feature can be added via the clang CLI using -mlvi-hardening. Differential Revision: https://reviews.llvm.org/D75936	2020-04-03 13:02:04 -07:00
Scott Constable	5b519cf1fc	[X86] Add Indirect Thunk Support to X86 to mitigate Load Value Injection (LVI) This pass replaces each indirect call/jump with a direct call to a thunk that looks like: lfence jmpq *%r11 This ensures that if the value in register %r11 was loaded from memory, then the value in %r11 is (architecturally) correct prior to the jump. Also adds a new target feature to X86: +lvi-cfi ("cfi" meaning control-flow integrity) The feature can be added via clang CLI using -mlvi-cfi. This is an alternate implementation to https://reviews.llvm.org/D75934 That merges the thunk insertion functionality with the existing X86 retpoline code. Differential Revision: https://reviews.llvm.org/D76812	2020-04-03 00:34:39 -07:00
Matt Arsenault	4ea3650c21	HIP: Link correct denormal mode library This wasn't respecting the flush mode based on the default, and also wasn't correctly handling the explicit -fno-cuda-flush-denormals-to-zero overriding the mode.	2020-04-01 12:36:22 -04:00
Matt Arsenault	6593360ee7	AMDGPU: Fix consistently backwards logic for default denormal mode I forgot to squash this into `c9d65a48af`	2020-04-01 12:36:22 -04:00
Puyan Lotfi	e3033c0ce5	[llvm][clang][IFS] Enhancing the llvm-ifs yaml format for symbol lists. Prior to this change the clang interface stubs format resembled something ending with a symbol list like this: Symbols: a: { Type: Func } This was problematic because we didn't actually want a map format and also because we didn't like that an empty symbol list required "Symbols: {}". That is to say without the empty {} llvm-ifs would crash on an empty list. With this new format it is much more clear which field is the symbol name, and instead the [] that is used to express an empty symbol vector is optional, ie: Symbols: - { Name: a, Type: Func } or Symbols: [] or Symbols: This further diverges the format from existing llvm-elftapi. This is a good thing because although the format originally came from the same place, they are not the same in any way. Differential Revision: https://reviews.llvm.org/D76979	2020-04-01 10:49:06 -04:00
Fangrui Song	531b3aff30	[Frontend] Replace CC1 option -masm-verbose with -fno-verbose-asm Most OS✕target enable -fverbose-asm, so it makes sense to flip the CC1 option to reduce common command lines.	2020-03-31 22:33:55 -07:00
Fangrui Song	d0d076fed9	[Driver] Flip the CC1 default of -fdiagnostics-show-option The driver enables -fdiagnostics-show-option by default, so flip the CC1 default to reduce the lengths of common CC1 command lines. This change also makes ParseDiagnosticArgs() consistently enable -fdiagnostics-show-option by default.	2020-03-31 21:59:27 -07:00
Fangrui Song	3341dc7339	[Driver] Don't pass -fobjc-rumtime= for non-ObjC input	2020-03-31 17:50:37 -07:00
Fangrui Song	4805901930	[Driver] Don't pass -fmessage-length=0 to CC1 -fmessage-length=0 is common (unless the environment variable COLUMNS is set and exported. This simplifies a common CC1 command line.	2020-03-31 17:12:08 -07:00
Matt Arsenault	175e42303b	AMDGPU: Make HIPToolChain a subclass of AMDGPUToolChain This fixes some code duplication. This is also a step towards consolidating builtin library handling.	2020-03-31 18:22:46 -04:00
Matt Arsenault	c9d65a48af	HIP: Ensure new denormal mode attributes are set Apparently HIPToolChain does not subclass from AMDGPUToolChain, so this was not applying the new denormal attributes. I'm not sure why this doesn't subclass. Just copy the implementation for now.	2020-03-31 18:00:37 -04:00
Amara Emerson	7f1ea924c6	Add a new -fglobal-isel option and make -fexperimental-isel an alias for it. Since GlobalISel is maturing and is already on at -O0 for AArch64, it's not completely "experimental". Create a more appropriate driver flag and make the older option an alias for it. Differential Revision: https://reviews.llvm.org/D77103	2020-03-31 12:06:11 -07:00
Benjamin Kramer	e8f13f4f62	Replace std::string::find == 0 with StringRef::startswith This is both more readable and faster. Found by clang-tidy's abseil-string-find-startswith.	2020-03-31 21:01:09 +02:00
Yaxun (Sam) Liu	764f54bb85	Rename options --cuda-gpu-arch and --no-cuda-gpu-arch Per discussion http://lists.llvm.org/pipermail/llvm-dev/2017-February/109930.html Rename -cuda-gpu-arch and --no-cuda-gpu-arch to --offload-arch and --no-offload-arch. The original options will be alias to the new options. Differential Revision: https://reviews.llvm.org/D76987	2020-03-30 20:29:50 -04:00
Nico Weber	c506adcdf2	Move CLANG_SYSTEMZ_DEFAULT_ARCH to config.h. Instead of using a global define; see comments on D75914. While here, port `9c9d88d8b1` to the GN build.	2020-03-30 14:16:17 -04:00
Florian Hahn	7899a111ea	Revert "[Darwin] Respect -fno-unroll-loops during LTO." As per post-commit comment at https://reviews.llvm.org/D76916, this should better be done at the TU level. This reverts commit `9ce198d6ed`.	2020-03-30 15:20:30 +01:00
Ulrich Weigand	9c9d88d8b1	[SystemZ] Allow configuring default CLANG_SYSTEMZ_ARCH On Ubuntu, we want to raise default CLANG_SYSTEMZ_ARCH to z13, thus allow configuring this via CMake. On Debian, we want to raise it to z196. Author: Dimitri John Ledkov Differential Revision: https://reviews.llvm.org/D75914	2020-03-30 14:20:48 +02:00
Florian Hahn	9ce198d6ed	[Darwin] Respect -fno-unroll-loops during LTO. Currently -fno-unroll-loops is ignored when doing LTO on Darwin. This patch adds a new -lto-no-unroll-loops option to the LTO code generator and forwards it to the linker if -fno-unroll-loops is passed. Reviewers: thegameg, steven_wu Reviewed By: thegameg Differential Revision: https://reviews.llvm.org/D76916	2020-03-27 22:19:03 +00:00
Paul Robinson	ee7510dc86	Fix a Diag call not to assume option spelling	2020-03-27 12:12:44 -07:00
Alexandre Ganea	09158252f7	[ThinLTO] Allow usage of all hardware threads in the system Before this patch, it wasn't possible to extend the ThinLTO threads to all SMT/CMT threads in the system. Only one thread per core was allowed, instructed by usage of llvm::heavyweight_hardware_concurrency() in the ThinLTO code. Any number passed to the LLD flag /opt:lldltojobs=..., or any other ThinLTO-specific flag, was previously interpreted in the context of llvm::heavyweight_hardware_concurrency(), which means SMT disabled. One can now say in LLD: /opt:lldltojobs=0 -- Use one std::thread / hardware core in the system (no SMT). Default value if flag not specified. /opt:lldltojobs=N -- Limit usage to N threads, regardless of usage of heavyweight_hardware_concurrency(). /opt:lldltojobs=all -- Use all hardware threads in the system. Equivalent to /opt:lldltojobs=$(nproc) on Linux and /opt:lldltojobs=%NUMBER_OF_PROCESSORS% on Windows. When an affinity mask is set for the process, threads will be created only for the cores selected by the mask. When N > number-of-hardware-threads-in-the-system, the threads in the thread pool will be dispatched equally on all CPU sockets (tested only on Windows). When N <= number-of-hardware-threads-on-a-CPU-socket, the threads will remain on the CPU socket where the process started (only on Windows). Differential Revision: https://reviews.llvm.org/D75153	2020-03-27 10:20:58 -04:00

1 2 3 4 5 ...

5721 Commits