llvm-project

Commit Graph

Author	SHA1	Message	Date
Nathan Froyd	31a3c5fb45	[clang] use string tables for static diagnostic descriptions Using a pointer for the description string in StaticDiagInfoRec causes several problems: 1. We don't need to use a whole pointer to represent the string; 2. The use of pointers incurs runtime relocations for those pointers; the relocations take up space on disk and represent runtime overhead; 3. The need to relocate data implies that, on some platforms, the entire array containing StaticDiagInfoRecs cannot be shared between processes. This patch changes the storage scheme for the diagnostic descriptions to avoid these problems. We instead generate (effectively) one large string and then StaticDiagInfoRec conceptually holds offsets into the string. We elected to also move the storage of those offsets into a separate array to further reduce the space required. On x86-64 Linux, this change removes about 120KB of relocations and moves about 60KB from the non-shareable .data.rel.ro section to shareable .rodata. (The array is about 80KB before this, but we eliminated 4 bytes/entry by using offsets rather than pointers.) We actually reap this benefit twice, because these tables show up in both libclang.so and libclang-cpp.so and we get the reduction in both places. Differential Revision: https://reviews.llvm.org/D81865	2020-09-24 10:54:28 -04:00
Yaxun (Sam) Liu	e39da8ab6a	Recommit "[CUDA][HIP] Defer overloading resolution diagnostics for host device functions" This recommits `7f1f89ec8d` and `40df06cdaf` after fixing memory sanitizer failure.	2020-09-24 08:44:37 -04:00
Yaxun (Sam) Liu	8e780a1653	Recommit [NFC] Refactor DiagnosticBuilder and PartialDiagnostic This recommits `829d14ee0a`. The patch was reverted due to a regression in some CUDA app which was thought to be caused by this patch. However, investigation showed that the regression was due to some other issues, therefore recommit this patch.	2020-09-23 16:55:00 -04:00
Abhina Sreeskantharajan	0fb97fd6a4	[SystemZ][z/OS] Set default wchar_t type for zOS Set the default wchar_t type on z/OS, and unsigned as the default. Reviewed By: hubert.reinterpretcast, fanbo-meng Differential Revision: https://reviews.llvm.org/D87624	2020-09-22 08:03:03 -04:00
Yaxun (Sam) Liu	829d14ee0a	Revert "[NFC] Refactor DiagnosticBuilder and PartialDiagnostic" This reverts commit `ee5519d323`.	2020-09-17 13:56:09 -04:00
Yaxun (Sam) Liu	772bd8a7d9	Revert "[CUDA][HIP] Defer overloading resolution diagnostics for host device functions" This reverts commit `7f1f89ec8d`. This reverts commit `40df06cdaf`.	2020-09-17 13:55:31 -04:00
Yaxun (Sam) Liu	40df06cdaf	[CUDA][HIP] Defer overloading resolution diagnostics for host device functions In CUDA/HIP a function may become implicit host device function by pragma or constexpr. A host device function is checked in both host and device compilation. However it may be emitted only on host or device side, therefore the diagnostics should be deferred until it is known to be emitted. Currently clang is only able to defer certain diagnostics. This causes false alarms and limits the usefulness of host device functions. This patch lets clang defer all overloading resolution diagnostics for host device functions. An option -fgpu-defer-diag is added to control this behavior. By default it is off. It is NFC for other languages. Differential Revision: https://reviews.llvm.org/D84364	2020-09-17 11:30:42 -04:00
Yaxun (Sam) Liu	ee5519d323	[NFC] Refactor DiagnosticBuilder and PartialDiagnostic PartialDiagnostic misses some functions compared to DiagnosticBuilder. This patch refactors DiagnosticBuilder and PartialDiagnostic, extracts the common functionality so that the streaming << operators are shared. Differential Revision: https://reviews.llvm.org/D84362	2020-09-16 17:35:28 -04:00
Fanbo Meng	2240ca0bd1	[SystemZ][z/OS] Set aligned allocation unavailable by default for z/OS Aligned allocation is not supported on z/OS. This patch sets -faligned-alloc-unavailable as default in z/OS toolchain. Reviewed By: abhina.sreeskantharajan, hubert.reinterpretcast Differential Revision: https://reviews.llvm.org/D87611	2020-09-16 14:49:03 -04:00
Jan Korous	ae726fecae	[SourceManager] Explicitly check for potential iterator underflow Differential Revision: https://reviews.llvm.org/D86231	2020-09-15 15:54:16 -07:00
Qiu Chaofan	8ecc8520bc	[FPEnv] [Clang] Enable constrained FP support for PowerPC `d4ce862f` introduced HasStrictFP to disable generating constrained FP operations for platforms lacking support. Since work for enabling constrained FP on PowerPC is almost done, we'd like to enable it. Reviewed By: kpn, steven.zhang Differential Revision: https://reviews.llvm.org/D87223	2020-09-12 00:39:52 +08:00
Rainer Orth	76e85ae268	[clang][Sparc] Default to -mcpu=v9 for Sparc V8 on Solaris As reported in Bug 42535, `clang` doesn't inline atomic ops on 32-bit Sparc, unlike `gcc` on Solaris. In a 1-stage build with `gcc`, only two testcases are affected (currently `XFAIL`ed), while in a 2-stage build more than 100 tests `FAIL` due to this issue. The reason for this `gcc`/`clang` difference is that `gcc` on 32-bit Solaris/SPARC defaults to `-mpcu=v9` where atomic ops are supported, unlike with `clang`'s default of `-mcpu=v8`. This patch changes `clang` to use `-mcpu=v9` on 32-bit Solaris/SPARC, too. Doing so uncovered two bugs: `clang -m32 -mcpu=v9` chokes with any Solaris system headers included: /usr/include/sys/isa_defs.h:461:2: error: "Both _ILP32 and _LP64 are defined" #error "Both _ILP32 and _LP64 are defined" While `clang` currently defines `__sparcv9` in a 32-bit `-mcpu=v9` compilation, neither `gcc` nor Studio `cc` do. In fact, the Studio 12.6 `cc(1)` man page clearly states: These predefinitions are valid in all modes: [...] __sparcv8 (SPARC) __sparcv9 (SPARC -m64) At the same time, the patch defines `__GCC_HAVE_SYNC_COMPARE_AND_SWAP_[1248]` for a 32-bit Sparc compilation with any V9 cpu. I've also changed `MaxAtomicInlineWidth` for V9, matching what `gcc` does and the Oracle Developer Studio 12.6: C User's Guide documents (Ch. 3, Support for Atomic Types, 3.1 Size and Alignment of Atomic C Types). The two testcases that had been `XFAIL`ed for Bug 42535 are un-`XFAIL`ed again. Tested on `sparcv9-sun-solaris2.11` and `amd64-pc-solaris2.11`. Differential Revision: https://reviews.llvm.org/D86621	2020-09-11 09:53:19 +02:00
Yaxun (Sam) Liu	041da0d828	[HIP] Add gfx1031 and gfx1030 Differential Revision: https://reviews.llvm.org/D87324	2020-09-08 16:38:34 -04:00
Cullen Rhodes	f9091e56d3	[clang][aarch64] Drop experimental from __ARM_FEATURE_SVE_BITS macro The __ARM_FEATURE_SVE_BITS feature macro is specified in the Arm C Language Extensions (ACLE) for SVE [1] (version 00bet5). From the spec, where __ARM_FEATURE_SVE_BITS==N: When N is nonzero, indicates that the implementation is generating code for an N-bit SVE target and that the arm_sve_vector_bits(N) attribute is available. This was defined in D83550 as __ARM_FEATURE_SVE_BITS_EXPERIMENTAL and enabled under the -msve-vector-bits flag to simplify initial tests. This patch drops _EXPERIMENTAL now there is support for the feature. [1] https://developer.arm.com/documentation/100987/latest Reviewed By: david-arm Differential Revision: https://reviews.llvm.org/D86720	2020-09-03 09:39:37 +00:00
Douglas Yung	8d2d0e8485	Revert "Move all fields of '-cc1' option related classes into def file databases" This reverts commit `c4a2a13074`. This commit was causing a test failure: http://lab.llvm.org:8011/builders/llvm-clang-win-x-armv7l/builds/1068	2020-09-02 10:38:34 -07:00
Daniel Grumberg	c4a2a13074	Move all fields of '-cc1' option related classes into def file databases Once the new option parsing system is committed, this will allow to generate a check to ensure that correct command line generation happens Differential Revision: https://reviews.llvm.org/D86290	2020-09-02 13:07:01 +01:00
Brad Smith	4fbf0636a2	Remove OpenBSD/sparc support	2020-08-29 20:47:18 -04:00
Dimitry Andric	fc2dac4116	[PPC] Fix platform definitions when compiling FreeBSD powerpc64 as LE As a prerequisite to doing experimental buids of pieces of FreeBSD PowerPC64 as little-endian, allow actually targeting it. This is needed so basic platform definitions are pulled in. Without it, the compiler will only run freestanding. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D73425	2020-08-29 12:03:20 +02:00
Craig Topper	71f3169e1b	[X86] Default to -mtune=generic unless -march is passed to the driver. Add TuneCPU to the AST serialization This patch defaults to -mtune=generic unless -march is present. If -march is present we'll use the empty string unless its overridden by mtune. The back should use the target cpu if the tune-cpu isn't present. It also adds AST serialization support to fix some tests that emit AST and parse it back. These tests diff the IR against the output from not going through AST. So if we don't serialize the tune CPU we fail the diff. Differential Revision: https://reviews.llvm.org/D86488	2020-08-26 14:52:03 -07:00
Abhina Sreeskantharajan	97ccf93b36	[SystemZ][z/OS] Add z/OS Target and define macros This patch adds the z/OS target and defines macros as a stepping stone towards enabling a native build on z/OS. Reviewed By: hubert.reinterpretcast Differential Revision: https://reviews.llvm.org/D85324	2020-08-25 15:51:59 -04:00
Freddy Ye	e02d081f2b	[X86] Support -march=sapphirerapids Support -march=sapphirerapids for x86. Compare with Icelake Server, it includes 14 more new features. They are amxtile, amxint8, amxbf16, avx512bf16, avx512vp2intersect, cldemote, enqcmd, movdir64b, movdiri, ptwrite, serialize, shstk, tsxldtrk, waitpkg. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D86503	2020-08-25 14:21:21 +08:00
Baptiste Saleil	512e256c0d	[PowerPC] Add clang options to control MMA support This patch adds frontend and backend options to enable and disable the PowerPC MMA operations added in ISA 3.1. Instructions using these options will be added in subsequent patches. Differential Revision: https://reviews.llvm.org/D81442	2020-08-24 09:35:55 -05:00
Craig Topper	cc7bf9bcbf	[X86] Allow 32-bit mode only CPUs with -mtune on 64-bit targets gcc errors on this, but I'm nervous that since -mtune has been ignored by clang for so long that there may be code bases out there that pass 32-bit cpus to clang.	2020-08-22 16:38:05 -07:00
Bevin Hansson	1a995a0af3	[ADT] Move FixedPoint.h from Clang to LLVM. This patch moves FixedPointSemantics and APFixedPoint from Clang to LLVM ADT. This will make it easier to use the fixed-point classes in LLVM for constructing an IR builder for fixed-point and for reusing the APFixedPoint class for constant evaluation purposes. RFC: http://lists.llvm.org/pipermail/llvm-dev/2020-August/144025.html Reviewed By: leonardchan, rjmccall Differential Revision: https://reviews.llvm.org/D85312	2020-08-20 10:29:45 +02:00
Craig Topper	724f570ad2	[X86] Add support 'tune' in target attribute This adds parsing and codegen support for tune in target attribute. I've implemented this so that arch in the target attribute implicitly disables tune from the command line. I'm not sure what gcc does here. But since -march implies -mtune. I assume 'arch' in the target attribute implies tune in the target attribute. Differential Revision: https://reviews.llvm.org/D86187	2020-08-19 15:58:19 -07:00
Martin Storsjö	cb6cf18ff5	[clang] Remove stray semicolons, fixing GCC warnings. NFC.	2020-08-19 10:41:03 +03:00
Yaxun (Sam) Liu	7546b29e76	[HIP] Support target id by --offload-arch This patch introduces support of target id by -offload-arch. Differential Revision: https://reviews.llvm.org/D60620	2020-08-18 23:43:53 -04:00
Brad Smith	d9ff48d038	WCharType and WIntType are always signed int on OpenBSD.	2020-08-18 19:59:54 -04:00
Brad Smith	592b8996bf	Hook up OpenBSD 64-bit RISC-V support	2020-08-18 18:59:55 -04:00
Craig Topper	4cbceb74bb	[X86] Add basic support for -mtune command line option in clang Building on the backend support from D85165. This parses the command line option in the driver, passes it on to CC1 and adds a function attribute. -Still need to support tune on the target attribute. -Need to use "generic" as the tuning by default. But need to change generic in the backend first. -Need to set tune if march is specified and mtune isn't. -May need to disable getHostCPUName's ability to guess CPU name from features when it doesn't have a family/model match for mtune=native. That's what gcc appears to do. Differential Revision: https://reviews.llvm.org/D85384	2020-08-18 15:13:19 -07:00
Craig Topper	5c1fe4e20f	[Target] Cache the command line derived feature map in TargetOptions. We can use this to remove some calls to initFeatureMap from Sema and CodeGen when a function doesn't have a target attribute. This reduces compile time of the linux kernel where this map is needed to diagnose some inline assembly constraints based on whether sse, avx, or avx512 is enabled. Differential Revision: https://reviews.llvm.org/D85807	2020-08-12 12:37:23 -07:00
Craig Topper	2b8ad6b604	[WebAssembly] Don't depend on the flags set by handleTargetFeatures in initFeatureMap. Properly set "simd128" in the feature map when "unimplemented-simd128" is requested. initFeatureMap is used to create the feature vector used by handleTargetFeatures. There are later calls to initFeatureMap in CodeGen that were using these flags to recreate the map. But the original feature vector should be passed to those calls. So that should be enough to rebuild the map. The only issue seemed to be that simd128 was not enabled in the map by the first call to initFeatureMap. Using the SIMDLevel set by handleTargetFeatures in the later calls allowed simd128 to be set in the later versions of the map. To fix this I've added an override of setFeatureEnabled that will update the map the first time with the correct simd dependency. Differential Revision: https://reviews.llvm.org/D85806	2020-08-12 11:43:46 -07:00
Erich Keane	aa4bc1cb79	Limit Max Vector alignment on COFF targets to 8192. COFF targets have a max object alignment of 8192, so trying to create one with a larger size results in an unreachable in WinCOFFObjectWriter. For the reproducer I have uses thread local storage, however other alignments are likely affected as well. This patch sets the MaxVectorAlign for COFF to 8192. Additionally, though there is no longer a way to reproduce that I could find, it correctly sets the MaxTLSAlign for COFF to that value as well, so that if anyone comes up with a situation where this is true, it will cause an error. Differential Revision: https://reviews.llvm.org/D85543	2020-08-12 06:35:35 -07:00
Brad Smith	5fe171321c	[Sparc] Define __GCC_HAVE_SYNC_COMPARE_AND_SWAP macros on SPARCv9	2020-08-11 00:04:24 -04:00
Krzysztof Parzyszek	7406eb4f6a	[Hexagon] Avoid creating an empty target feature If the CPU string is empty, the target feature map may end up having an empty string inserted to it. The symptom of the problem is a warning message: '+' is not a recognized feature for this target (ignoring feature) Also, the target-features attribute in the module will have an empty string in it.	2020-08-10 10:37:24 -05:00
Brad Smith	92e82a2890	int64_t and intmax_t are always (signed) long long on OpenBSD.	2020-08-09 19:43:16 -04:00
Brad Smith	4eb4ebf76a	Hook up OpenBSD 64-bit PowerPC support	2020-08-08 17:51:19 -04:00
Bevin Hansson	aa0d19a0c8	[Fixed Point] Add fixed-point shift operations and consteval. Reviewers: rjmccall, leonardchan, bjope Subscribers: cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D83212	2020-08-07 15:09:24 +02:00
Simon Pilgrim	24cca30f7f	Remove unreachable return (PR47026)	2020-08-07 11:23:43 +01:00
Craig Topper	504a197fe5	[X86] Rename X86::getImpliedFeatures to X86::updateImpliedFeatures and pass clang's StringMap directly to it. No point in building a vector of StringRefs for clang to apply to the StringMap. Just pass the StringMap and modify it directly.	2020-08-06 00:20:46 -07:00
Stanislav Mekhanoshin	ea7d0e2996	[AMDGPU] gfx1031 target Differential Revision: https://reviews.llvm.org/D85337	2020-08-05 12:36:26 -07:00
Baptiste Saleil	7aaa85627b	[PowerPC] Add options to control paired vector memops support Adds frontend and backend options to enable and disable the PowerPC paired vector memory operations added in ISA 3.1. Instructions using these options will be added in subsequent patches. Differential Revision: https://reviews.llvm.org/D83722	2020-07-29 14:00:53 -05:00
Joel E. Denny	9f2f3b9de6	[OpenMP] Implement TR8 `present` motion modifier in Clang (1/2) This patch implements Clang front end support for the OpenMP TR8 `present` motion modifier for `omp target update` directives. The next patch in this series implements OpenMP runtime support. Reviewed By: ABataev Differential Revision: https://reviews.llvm.org/D84711	2020-07-29 12:18:45 -04:00
Alexey Bader	8d27be8dba	[OpenCL] Add global_device and global_host address spaces This patch introduces 2 new address spaces in OpenCL: global_device and global_host which are a subset of a global address space, so the address space scheme will be looking like: ``` generic->global->host ->device ->private ->local constant ``` Justification: USM allocations may be associated with both host and device memory. We want to give users a way to tell the compiler the allocation type of a USM pointer for optimization purposes. (Link to the Unified Shared Memory extension: https://github.com/intel/llvm/blob/sycl/sycl/doc/extensions/USM/cl_intel_unified_shared_memory.asciidoc) Before this patch USM pointer could be only in opencl_global address space, hence a device backend can't tell if a particular pointer points to host or device memory. On FPGAs at least we can generate more efficient hardware code if the user tells us where the pointer can point - being able to distinguish between these types of pointers at compile time allows us to instantiate simpler load-store units to perform memory transactions. Patch by Dmitry Sidorov. Reviewed By: Anastasia Differential Revision: https://reviews.llvm.org/D82174	2020-07-29 17:24:53 +03:00
Joel E. Denny	69fc33f0cd	Revert "[OpenMP] Implement TR8 `present` motion modifier in Clang (1/2)" This reverts commit `3c3faae497`. It breaks a number of bots.	2020-07-28 20:30:05 -04:00
Joel E. Denny	3c3faae497	[OpenMP] Implement TR8 `present` motion modifier in Clang (1/2) This patch implements Clang front end support for the OpenMP TR8 `present` motion modifier for `omp target update` directives. The next patch in this series implements OpenMP runtime support. Reviewed By: ABataev Differential Revision: https://reviews.llvm.org/D84711	2020-07-28 19:15:18 -04:00
Joel E. Denny	a3d1f88fa7	[OpenMP][NFC] Consolidate `to` and `from` clause modifiers `to` and `from` clauses take the same modifiers, which are called "motion modifiers" in TR8, so implement handling of their modifiers once not twice. This will make it easier to implement additional motion modifiers in the future. Reviewed By: ABataev Differential Revision: https://reviews.llvm.org/D84710	2020-07-28 19:15:18 -04:00
Jinsong Ji	d28f86723f	Re-land "[PowerPC] Remove QPX/A2Q BGQ/BGP CNK support" This reverts commit `bf544fa1c3`. Fixed the typo in PPCInstrInfo.cpp.	2020-07-28 14:00:11 +00:00
Jinsong Ji	bf544fa1c3	Revert "[PowerPC] Remove QPX/A2Q BGQ/BGP CNK support" This reverts commit `adffce7153`. This is breaking test-suite, revert while investigation.	2020-07-27 21:07:00 +00:00
Jinsong Ji	adffce7153	[PowerPC] Remove QPX/A2Q BGQ/BGP CNK support Per RFC http://lists.llvm.org/pipermail/llvm-dev/2020-April/141295.html no one is making use of QPX/A2Q/BGQ/BGP CNK anymore. This patch remove the support of QPX/A2Q in llvm, BGQ/BGP in clang, CNK support in openmp/polly. Reviewed By: hfinkel Differential Revision: https://reviews.llvm.org/D83915	2020-07-27 19:24:39 +00:00
Xiangling Liao	05ad8e9429	[AIX] Implement AIX special alignment rule about double/long double Implement AIX default `power` alignment rule by adding `PreferredAlignment` and `PreferredNVAlignment` in ASTRecordLayout class. The patchh aims at returning correct value for `__alignof(x)` and `alignof(x)` under `power` alignment rules. Differential Revision: https://reviews.llvm.org/D79719	2020-07-27 15:13:03 -04:00
Ulrich Weigand	7f003957bf	[SystemZ] Implement __builtin_eh_return_data_regno Implement __builtin_eh_return_data_regno for SystemZ. Match behavior of GCC. Author: slavek-kucera Differential Revision: https://reviews.llvm.org/D84341	2020-07-24 10:28:06 +02:00
Joel E. Denny	aa82c40f0a	[OpenMP] Implement TR8 `present` map type modifier in Clang (1/2) This patch implements Clang front end support for the OpenMP TR8 `present` map type modifier. The next patch in this series implements OpenMP runtime support. This patch does not attempt to implement TR8 sec. 2.22.7.1 "map Clause", p. 319, L14-16: > If a map clause with a present map-type-modifier is present in a map > clause, then the effect of the clause is ordered before all other > map clauses that do not have the present modifier. Compare to L10-11, which Clang does not appear to implement yet: > For a given construct, the effect of a map clause with the to, from, > or tofrom map-type is ordered before the effect of a map clause with > the alloc, release, or delete map-type. This patch also does not implement the `present` implicit-behavior for `defaultmap` or the `present` motion-modifier for `target update`. Reviewed By: ABataev Differential Revision: https://reviews.llvm.org/D83061	2020-07-22 10:15:32 -04:00
Adrian Prantl	b907ad539a	[NFC] Clean up doc comment and implementation for Module::isSubModuleOf. Patch by Varun Gandhi! Differential Revision: https://reviews.llvm.org/D84087	2020-07-21 16:23:36 -07:00
Akira Hatanaka	73bc23ff86	Fix the data layout mangling specification for 'i686-pc-macho' Use 'o' for the mangling specification instead of 'e'. This fixes an error in the backend caused by a mismatch between the data layouts generated by the backend and the frontend. rdar://problem/64168540	2020-07-21 12:58:17 -07:00
Anatoly Trosinenko	16a4350f76	[MSP430] Actualize the toolchain description Reviewed By: krisb Differential Revision: https://reviews.llvm.org/D81676	2020-07-17 15:42:12 +03:00
Cullen Rhodes	bb160e769d	[Sema][AArch64] Add parsing support for arm_sve_vector_bits attribute Summary: This patch implements parsing support for the 'arm_sve_vector_bits' type attribute, defined by the Arm C Language Extensions (ACLE, version 00bet5, section 3.7.3) for SVE [1]. The purpose of this attribute is to define fixed-length (VLST) versions of existing sizeless types (VLAT). For example: #if __ARM_FEATURE_SVE_BITS==512 typedef svint32_t fixed_svint32_t __attribute__((arm_sve_vector_bits(512))); #endif Creates a type 'fixed_svint32_t' that is a fixed-length version of 'svint32_t' that is normal-sized (rather than sizeless) and contains exactly 512 bits. Unlike 'svint32_t', this type can be used in places such as structs and arrays where sizeless types can't. Implemented in this patch is the following: * Defined and tested attribute taking single argument. * Checks the argument is an integer constant expression. * Attribute can only be attached to a single SVE vector or predicate type, excluding tuple types such as svint32x4_t. * Added the `-msve-vector-bits=<bits>` flag. When specified the `__ARM_FEATURE_SVE_BITS__EXPERIMENTAL` macro is defined. * Added a language option to store the vector size specified by the `-msve-vector-bits=<bits>` flag. This is used to validate `N == __ARM_FEATURE_SVE_BITS`, where N is the number of bits passed to the attribute and `__ARM_FEATURE_SVE_BITS` is the feature macro defined under the same flag. The `__ARM_FEATURE_SVE_BITS` macro will be made non-experimental in the final patch of the series. [1] https://developer.arm.com/documentation/100987/latest This is patch 1/4 of a patch series. Reviewers: sdesmalen, rsandifo-arm, efriedma, ctetreau, cameron.mcinally, rengolin, aaron.ballman Reviewed By: sdesmalen, aaron.ballman Differential Revision: https://reviews.llvm.org/D83550	2020-07-17 10:06:54 +00:00
Zakk Chen	294d1eae75	[RISCV] Add support for -mcpu option. Summary: 1. gcc uses `-march` and `-mtune` flag to chose arch and pipeline model, but clang does not have `-mtune` flag, we uses `-mcpu` to chose both infos. 2. Add SiFive e31 and u54 cpu which have default march and pipeline model. 3. Specific `-mcpu` with rocket-rv[32\|64] would select pipeline model only, and use the driver's arch choosing logic to get default arch. Reviewers: lenary, asb, evandro, HsiangKai Reviewed By: lenary, asb, evandro Tags: #llvm, #clang Differential Revision: https://reviews.llvm.org/D71124	2020-07-16 11:46:22 -07:00
Logan Smith	2c2a297bb6	[clang][NFC] Add 'override' keyword to virtual function overrides This patch adds override to several overriding virtual functions that were missing the keyword within the clang/ directory. These were found by the new -Wsuggest-override.	2020-07-14 08:59:57 -07:00
Craig Topper	b4dbb37f32	[X86] Rename X86_CPU_TYPE_COMPAT_ALIAS/X86_CPU_TYPE_COMPAT/X86_CPU_SUBTYPE_COMPAT macros. NFC Remove _COMPAT. Drop the ARCHNAME. Remove the non-COMPAT versions that are no longer needed. We now only use these macros in places where we need compatibility with libgcc/compiler-rt. So we don't need to call out _COMPAT specifically.	2020-07-12 17:00:24 -07:00
Kevin P. Neal	d4ce862f2a	Reland "[FPEnv][Clang][Driver] Disable constrained floating point on targets lacking support." We currently have strict floating point/constrained floating point enabled for all targets. Constrained SDAG nodes get converted to the regular ones before reaching the target layer. In theory this should be fine. However, the changes are exposed to users through multiple clang options already in use in the field, and the changes are _completely_ _untested_ on almost all of our targets. Bugs have already been found, like "https://bugs.llvm.org/show_bug.cgi?id=45274". This patch disables constrained floating point options in clang everywhere except X86 and SystemZ. A warning will be printed when this happens. Use the new -fexperimental-strict-floating-point flag to force allowing strict floating point on hosts that aren't already marked as supporting it (X86 and SystemZ). Differential Revision: https://reviews.llvm.org/D80952	2020-07-10 08:49:45 -04:00
Sylvestre Ledru	bbea4d5e6b	clang: Don't show a trailing space with --version when not built from the repo Reported here: https://bugs.llvm.org/show_bug.cgi?id=38998#c15 Reviewers: hans Differential Revision: https://reviews.llvm.org/D83386	2020-07-08 14:02:02 +02:00
Craig Topper	3cbfe988bc	[X86] Merge X86TargetInfo::setFeatureEnabled and X86TargetInfo::setFeatureEnabledImpl. NFC setFeatureEnabled is a virtual function. setFeatureEnabledImpl was its implementation. This split was to avoid virtual calls when we need to call setFeatureEnabled in initFeatureMap. With C++11 we can use 'final' on setFeatureEnabled to enable the compiler to perform de-virtualization for the initFeatureMap calls.	2020-07-06 23:54:56 -07:00
Craig Topper	16f3d698f2	[X86] Move the feature dependency handling in X86TargetInfo::setFeatureEnabledImpl to a table based lookup in X86TargetParser.cpp Previously we had to specify the forward and backwards feature dependencies separately which was error prone. And as dependencies have gotten more complex it was hard to be sure the transitive dependencies were handled correctly. The way it was written was also not super readable. This patch replaces everything with a table that lists what features a feature is dependent on directly. Then we can recursively walk through the table to find the transitive dependencies. This is largely based on how we handle subtarget features in the MC layer from the tablegen descriptions. Differential Revision: https://reviews.llvm.org/D83273	2020-07-06 23:14:02 -07:00
Xiang1 Zhang	939d8309db	[X86-64] Support Intel AMX Intrinsic INTEL ADVANCED MATRIX EXTENSIONS (AMX). AMX is a new programming paradigm, it has a set of 2-dimensional registers (TILES) representing sub-arrays from a larger 2-dimensional memory image and operate on TILES. These intrinsics use direct TMM register number as its params. Spec can be found in Chapter 3 here https://software.intel.com/content/www/us/en/develop/download/intel-architecture-instruction-set-extensions-programming-reference.html Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D83111	2020-07-07 10:13:40 +08:00
Craig Topper	c359c5d534	[X86] Centalize the 'sse4' hack to a single place in X86TargetInfo::setFeatureEnabledImpl. NFCI Instead of detecting the string in 2 places. Just swap the string to 'sse4.1' or 'sse4.2' at the top of the function. Prep work for a patch to switch the rest of this function to a table based system. And I don't want to include 'sse4a' in the table.	2020-07-06 15:00:32 -07:00
Kevin P. Neal	916e2ca997	Revert "[FPEnv][Clang][Driver] Disable constrained floating point on targets lacking support." My mistake, I had a blocking reviewer. This reverts commit `39d2ae0afb`. This reverts commit `bfdafa32a0`. This reverts commit `2b35511350`. Differential Revision: https://reviews.llvm.org/D80952	2020-07-06 14:57:45 -04:00
Kevin P. Neal	39d2ae0afb	[FPEnv][Clang][Driver] Disable constrained floating point on targets lacking support. We currently have strict floating point/constrained floating point enabled for all targets. Constrained SDAG nodes get converted to the regular ones before reaching the target layer. In theory this should be fine. However, the changes are exposed to users through multiple clang options already in use in the field, and the changes are _completely_ _untested_ on almost all of our targets. Bugs have already been found, like "https://bugs.llvm.org/show_bug.cgi?id=45274". This patch disables constrained floating point options in clang everywhere except X86 and SystemZ. A warning will be printed when this happens. Differential Revision: https://reviews.llvm.org/D80952	2020-07-06 13:32:49 -04:00
Kazushi (Jam) Marukawa	df3bda047d	[VE] Correct stack alignment Summary: Change stack alignment from 64 bits to 128 bits to follow ABI correctly. And add a regression test for datalayout. Reviewers: simoll, k-ishizaka Reviewed By: simoll Subscribers: hiraditya, cfe-commits, llvm-commits Tags: #llvm, #ve, #clang Differential Revision: https://reviews.llvm.org/D83173	2020-07-06 17:25:29 +09:00
Kai Luo	68e07da3e5	[clang][PowerPC] Enable -fstack-clash-protection option for ppc64 Differential Revision: https://reviews.llvm.org/D81355	2020-07-05 03:43:56 +00:00
jasonliu	572dde55ee	[XCOFF][AIX] Use 'L..' instead of '.L' for getPrivateGlobalPrefix in DataLayout Summary: D80831 changed part of the prefix usage for AIX. But there are other places getting prefix from DataLayout. This patch intends to make prefix usage consistent on AIX. Reviewed by: hubert.reinterpretcast, daltenty Differential Revision: https://reviews.llvm.org/D81270	2020-07-03 18:25:14 +00:00
Dmitry Preobrazhensky	53422e8b4f	[AMDGPU] Added support of new inline assembler constraints Added support for constraints 'I', 'J', 'L', 'B', 'C', 'Kf', 'DA', 'DB'. See https://gcc.gnu.org/onlinedocs/gcc/Machine-Constraints.html#Machine-Constraints. Reviewers: arsenm, rampitec Differential Revision: https://reviews.llvm.org/D81657	2020-07-03 18:01:12 +03:00
Whisperity	4cf24cb868	[NFC][clang] Add missing VALIDATE_DIAG_SIZE() Originally when libCrossTU was introduced in commit `e350b0a196`, the macro which thus had all diagnostic kinds covered was not added.	2020-07-02 14:14:57 +02:00
Valentin Clement	2ddba3082c	[flang][openmp] Use common Directive and Clause enum from llvm/Frontend Summary: This patch is removing the custom enumeration for OpenMP Directives and Clauses and replace them with the newly tablegen generated one from llvm/Frontend. This is a first patch and some will follow to share the same infrastructure where possible. The next patch should use the clauses allowance defined in the tablegen file. Reviewers: jdoerfert, DavidTruby, sscalpone, kiranchandramohan, ichoyjx Reviewed By: DavidTruby, ichoyjx Subscribers: jholewinski, cfe-commits, dblaikie, MaskRay, ymandel, ichoyjx, mgorny, yaxunl, guansong, jfb, sstefan1, aaron.ballman, llvm-commits Tags: #llvm, #flang, #clang Differential Revision: https://reviews.llvm.org/D82906	2020-07-01 20:58:11 -04:00
Craig Topper	3537939cda	[X86] Move frontend CPU feature initialization to a look up table based implementation. NFCI This replaces the switch statement implementation in the clang's X86.cpp with a lookup table in X86TargetParser.cpp. I've used constexpr and copy of the FeatureBitset from SubtargetFeature.h to store the features in a lookup table. After the lookup the bitset is translated into strings for use by the rest of the frontend code. I had to modify the implementation of the FeatureBitset to avoid bugs in gcc 5.5 constexpr handling. It seems to not like the same array entry to be used on the left side and right hand side of an assignment or &= or \|=. I've also used uint32_t instead of uint64_t and sized based on the X86::CPU_FEATURE_MAX. I've initialized the features for different CPUs outside of the table so that we can express inheritance in an adhoc way. This was one of the big limitations of the switch and we had resorted to labels and gotos. Differential Revision: https://reviews.llvm.org/D82731	2020-06-30 12:04:58 -07:00
Francesco Petrogalli	d54e4dded7	[sve][acle] Enable feature macros for SVE ACLE extensions. Summary: The following feature macros have been added: __ARM_FEATURE_SVE_BF16 __ARM_FEATURE_SVE_MATMUL_INT8 __ARM_FEATURE_SVE_MATMUL_FP32 __ARM_FEATURE_SVE_MATMUL_FP64 The driver has been updated to enable them accordingly to the value of the target feature passed at command line. The SVE ACLE tests using the macros have been modified to work with the target feature instead of passing the macro at command line. Reviewers: sdesmalen, efriedma, c-rhodes, kmclaughlin, SjoerdMeijer, rengolin Subscribers: tschuett, kristof.beyls, rkruppe, psnobl, cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D82623	2020-06-30 18:33:03 +00:00
Bevin Hansson	33bae9c265	[AST] Fix handling of some edge cases in fixed-point division. Division by zero was not being handled, and division of -EPSILON / MAX did not perform rounding correctly.	2020-06-30 13:47:12 +02:00
Nick Desaulniers	7b8cf98b4a	Reland "[clang][SourceManager] cache Macro Expansions"" This reverts commit 33d63f02ce408d181e13089ee5a667fb2e1cdc78. Differential Revision: https://reviews.llvm.org/D80681	2020-06-29 12:54:32 -07:00
Nick Desaulniers	7c2cb1448a	Revert "[clang][SourceManager] cache Macro Expansions" This reverts commit `dffc142045`. Missed a hunk (D82690).	2020-06-29 12:54:32 -07:00
Craig Topper	20a60f46f5	[X86] Explicitly add popcnt feature to Intel CPUs with SSE4.2 in the frontend. Previously we inferred it if sse4.2 ended up being enabled after all feature processing. But writing -march=nehalem -mno-sse4.2 should have popcnt enabled.	2020-06-28 11:06:40 -07:00
Melanie Blower	f4aaed3bf1	Reland D81869 "Modify FPFeatures to use delta not absolute settings" This reverts commit `defd43a5b3`. with correction to solve msan report To solve https://bugs.llvm.org/show_bug.cgi?id=46166 where the floating point settings in PCH files aren't compatible, rewrite FPFeatures to use a delta in the settings rather than absolute settings. With this patch, these floating point options can be benign. Reviewers: rjmccall Differential Revision: https://reviews.llvm.org/D81869	2020-06-27 01:34:57 -07:00
Craig Topper	9e8b5a20e9	[X86] Add MOVBE and RDRND features to BDVER4. Only 6 years behind gcc. https://gcc.gnu.org/legacy-ml/gcc-patches/2014-08/msg00231.html Found while working on improving how we define CPU features for clang and auditing for correctness.	2020-06-26 23:32:17 -07:00
Craig Topper	d298acde82	[X86] Don't disable xsave when avx is disabled. Implicitly enable xsave with avx is enabled and xsave wasn't explciitly disabled CPUs with avx always have xsave, but some CPUs without avx also have xsave. So we shouldn't disable xsave just because avx is disabled. This would prevent xsave from being enabled with -march=native on CPUs with xsave and not avx. But we also don't want -mavx -mno-avx to leave xsave eanabled. So only enable xsave if avx is enabled after processing all features. I thought about just not turning xsave on with avx at all, but there might be someone out there depending on it.	2020-06-26 16:45:44 -07:00
Nick Desaulniers	dffc142045	[clang][SourceManager] cache Macro Expansions A seemingly innocuous Linux kernel change [0] seemingly blew up our compile times by over 3x, as reported by @nathanchance in [1]. The code in question uses a doubly nested macro containing GNU C statement expressions that are then passed to typeof(), which is then used in a very important macro for atomic variable access throughout most of the kernel. The inner most macro, is passed a GNU C statement expression. In this case, we have macro arguments that are GNU C statement expressions, which can contain a significant number of tokens. The upstream kernel patch caused significant build time regressions for both Clang and GCC. Since then, some of the nesting has been removed via @melver, which helps gain back most of the lost compilation time. [2] Profiles collected [3] from compilations of the slowest TU for us in the kernel show: * 51.4% time spent in clang::TokenLexer::updateLocForMacroArgTokens * 48.7% time spent in clang::SourceManager::getFileIDLocal * 35.5% time spent in clang::SourceManager::isOffsetInFileID (mostly calls from the former through to the latter). So it seems we have a pathological case for which properly tracking the SourceLocation of macro arguments is significantly harming build performance. This stands out in referenced flame graph. In fact, this case was identified previously as being problematic in commit `3339c568c4` ("[Lex] Speed up updateConsecutiveMacroArgTokens (NFC)") Looking at the above call chain, there's 3 things we can do to speed up this case. 1. TokenLexer::updateConsecutiveMacroArgTokens() calls SourceManager::isWrittenInSameFile() which calls SourceManager::getFileID(), which is both very hot and very expensive to call. SourceManger has a one entry cache, member LastFileIDLookup. If that isn't the FileID for a give source location offset, we fall back to a linear probe, and then to a binary search for the FileID. These fallbacks update the one entry cache, but noticeably they do not for the case of macro expansions! For the slowest TU to compile in the Linux kernel, it seems that we miss about 78.67% of the 68 million queries we make to getFileIDLocal that we could have had cache hits for, had we saved the macro expansion source location's FileID in the one entry cache. [4] I tried adding a separate cache item for macro expansions, and to check that before the linear then binary search fallbacks, but did not find it faster than simply allowing macro expansions into the one item cache. This alone nets us back a lot of the performance loss. That said, this is a modification of caching logic, which is playing with a double edged sword. While it significantly improves the pathological case, its hard to say that there's not an equal but opposite pathological case that isn't regressed by this change. Though non-pathological cases of builds of the Linux kernel before [0] are only slightly improved (<1%) and builds of LLVM itself don't change due to this patch. Should future travelers find this change to significantly harm their build times, I encourage them to feel empowered to revert this change. 2. SourceManager::getFileIDLocal has a FIXME hinting that the call to SourceManager::isOffsetInFileID could be made much faster since isOffsetInFileID is generic in the sense that it tries to handle the more generic case of "local" (as opposed to "loaded") files, though the caller has already determined the file to be local. This patch implements a new method that specialized for use when the caller already knows the file is local, then use that in TokenLexer::updateLocForMacroArgTokens. This should be less controversial than 1, and is likely an across the board win. It's much less significant for the pathological case, but still a measurable win once we have fallen to the final case of binary search. D82497 3. A bunch of methods in SourceManager take a default argument. SourceManager::getLocalSLocEntry doesn't do anything with this argument, yet many callers of getLocalSLocEntry setup, pass, then check this argument. This is wasted work. D82498 With this patch applied, the above profile [5] for the same pathological input looks like: * 25.1% time spent in clang::TokenLexer::updateLocForMacroArgTokens * 17.2% time spent in clang::SourceManager::getFileIDLocal and clang::SourceManager::isOffsetInFileID is no longer called, and thus falls out of the profile. There may be further improvements to the general problem of "what interval contains one number out of millions" than the current use of a one item cache, followed by linear probing, followed by binary searching. We might even be able to do something smarter in TokenLexer::updateLocForMacroArgTokens. [0] `cdd28ad2d8` [1] https://github.com/ClangBuiltLinux/linux/issues/1032 [2] https://git.kernel.org/pub/scm/linux/kernel/git/tip/tip.git/commit/?h=locking/kcsan&id=a5dead405f6be1fb80555bdcb77c406bf133fdc8 [3] https://github.com/ClangBuiltLinux/linux/issues/1032#issuecomment-633712667 [4] https://github.com/ClangBuiltLinux/linux/issues/1032#issuecomment-633741923 [5] https://github.com/ClangBuiltLinux/linux/issues/1032#issuecomment-634932736 Reviewed By: kadircet Differential Revision: https://reviews.llvm.org/D80681	2020-06-26 12:52:43 -07:00
Nick Desaulniers	8cce7af090	[SourceManager] don't check invalid param of getLocalSLocEntry() Forked from D80681. getLocalSLocEntry() has an unused parameter used to satisfy an interface of libclang (see getInclusions() in clang/tools/libclang/CIndexInclusionStack.cpp). It's pointless for callers to construct/pass/check this inout parameter that can never signify that a FileID is invalid. Reviewed By: kadircet Differential Revision: https://reviews.llvm.org/D82498	2020-06-26 10:22:26 -07:00
Melanie Blower	defd43a5b3	Revert "Revert "Revert "Modify FPFeatures to use delta not absolute settings""" This reverts commit `9518763d71`. Memory sanitizer fails in CGFPOptionsRAII::CGFPOptionsRAII dtor	2020-06-26 08:47:04 -07:00
Melanie Blower	9518763d71	Revert "Revert "Modify FPFeatures to use delta not absolute settings"" This reverts commit `b55d723ed6`. Reapply Modify FPFeatures to use delta not absolute settings To solve https://bugs.llvm.org/show_bug.cgi?id=46166 where the floating point settings in PCH files aren't compatible, rewrite FPFeatures to use a delta in the settings rather than absolute settings. With this patch, these floating point options can be benign. Reviewers: rjmccall Differential Revision: https://reviews.llvm.org/D81869	2020-06-26 08:00:08 -07:00
Melanie Blower	b55d723ed6	Revert "Modify FPFeatures to use delta not absolute settings" This reverts commit `3a748cbf86`. I'm reverting this commit because I forgot to format the commit message propertly. Sorry for the thrash.	2020-06-26 07:52:57 -07:00
Melanie Blower	3a748cbf86	Modify FPFeatures to use delta not absolute settings	2020-06-26 07:41:09 -07:00
Simon Pilgrim	0069824fea	Revert rGf0bab7875e78e01c149d12302dcc4b6d4c43e25c - "Triple.h - reduce Twine.h include to forward declarations. NFC." This causes ICEs on the clang-ppc64be buildbots and I've limited ability to triage the problem.	2020-06-26 14:46:40 +01:00
Anatoly Trosinenko	cb56fa2196	[MSP430] Update register names When writing a unit test on replacing standard epilogue sequences with `BR __mspabi_func_epilog_<N>`, by manually asm-clobbering `rN` - `r10` for N = 4..10, everything worked well except for seeming inability to clobber r4. The problem was that MSP430 code generator of LLVM used an obsolete name FP for that register. Things were worse because when `llc` read an unknown register name, it silently ignored it. That is, I cannot use `fp` register name from the C code because Clang does not accept it (exactly like GCC). But the accepted name `r4` is not recognised by `llc` (it can be used in listings passed to `llvm-mc` and even `fp` is replace to `r4` by `llvm-mc`). So I can specify any of `fp` or `r4` for the string literal of `asm(...)` but nothing in the clobber list. This patch replaces `MSP430::FP` with `MSP430::R4` in the backend code (even [MSP430 EABI](http://www.ti.com/lit/an/slaa534/slaa534.pdf) doesn't mention FP as a register name). The R0 - R3 registers, on the other hand, are left as is in the backend code (after all, they have some special meaning on the ISA level). It is just ensured clang is renaming them as expected by the downstream tools. There is probably not much sense in marking them clobbered but rename them //just in case// for use at potentially different contexts. Differential Revision: https://reviews.llvm.org/D82184	2020-06-26 15:32:07 +03:00
Simon Pilgrim	f0bab7875e	Triple.h - reduce Twine.h include to forward declarations. NFC. Move include down to a number of other files that had an implicit dependency on the Twine class.	2020-06-26 13:06:57 +01:00
Bevin Hansson	94e8ec631d	[AST] Add fixed-point division constant evaluation. Reviewers: rjmccall, leonardchan, bjope Subscribers: cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D73187	2020-06-26 13:38:11 +02:00
Bevin Hansson	53f5c8b4a1	[AST] Add fixed-point multiplication constant evaluation. Reviewers: rjmccall, leonardchan, bjope Subscribers: cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D73186	2020-06-26 13:38:11 +02:00
Bevin Hansson	eccf7fc7b3	[AST] Add fixed-point subtraction constant evaluation. Reviewers: rjmccall, leonardchan Subscribers: cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D73185	2020-06-26 13:38:11 +02:00
Craig Topper	12665f2812	[X86] Make XSAVEC/XSAVEOPT/XSAVES properly depend on XSAVE in both the frontend and the backend. These features implicitly enabled XSAVE in the frontend, but not the backend. Disabling XSAVE in the frontend disabled XSAVEOPT, but not the other 2. Nothing happened in the backend.	2020-06-26 00:14:58 -07:00
Craig Topper	a7db230d75	[X86] Add CMPXCHG16B feature to amdfam10 in the frontend. We already have this feature on it in the backend.	2020-06-25 22:55:36 -07:00
Craig Topper	6673d69226	[X86] Don't imply -mprfchw when -m3dnow is specified. Enable prefetchw in the backend with 3dnow feature. The PREFETCHW instruction was originally part of the 3DNow. But it was given its own CPUID bit on later CPUs just before 3DNow was deprecated. We were setting the -mprfchw flag if -m3dnow was passed or the CPU supported 3dnow unless -mno-prfchw was passed. But -march=native on a CPU without the PRFCHW CPUID bit set will pass -mno-prfchw. So -march=k8 will behave differently than -march=native on a K8 for example. So remove this implicit setting from the frontend and instead enable the backend to use PREFETCHW if 3dnow OR prfchw is enabled. Also enable PRFCHW flag on amdfam10/barcelona which seems to be where this CPUID bit was introduced. That CPU also supported 3dnow.	2020-06-25 12:46:52 -07:00
Craig Topper	01c18f9199	Revert "[X86] Don't imply -mprfchw when -m3dnow is specified. Enable prefetchw in the backend with 3dnow feature." This is failing on the bots. This reverts commit `636d31a5c3`.	2020-06-25 11:43:02 -07:00
Craig Topper	636d31a5c3	[X86] Don't imply -mprfchw when -m3dnow is specified. Enable prefetchw in the backend with 3dnow feature. The PREFETCHW instruction was originally part of the 3DNow. But it was given its own CPUID bit on later CPUs just before 3DNow was deprecated. We were setting the -mprfchw flag if -m3dnow was passed or the CPU supported 3dnow unless -mno-prfchw was passed. But -march=native on a CPU without the PRFCHW CPUID bit set will pass -mno-prfchw. So -march=k8 will behave differently than -march=native on a K8 for example. So remove this implicit setting from the frontend and instead enable the backend to use PREFETCHW if 3dnow OR prfchw is enabled. Also enable PRFCHW flag on amdfam10/barcelona which seems to be where this CPUID bit was introduced. That CPU also supported 3dnow.	2020-06-25 11:25:35 -07:00
Nick Desaulniers	408efffbe4	[Clang][SourceManager] optimize getFileIDLocal() Summary: A recent Linux kernel commit exposed a performance cliff in Clang. Calls to SourceManager::getFileIDLocal() when there's a cache miss against LastFileIDLookup can be relatively expensive, as getFileIDLocal() tries a few linear probes, then falls back to binary search. The use of SourceManager::isOffsetInFileID() is also relatively expensive (both isOffsetInFileID and getFileIDLocal dominated a trace of the performance cliff case). As a FIXME notes (and as @kadircet helpfully noted in review of D80681), there's a few optimizations we can do here since we've already identified that an offset is local (as opposed to "loaded"). This patch was forked off of D80681, which additionally did this and modified some caching behavior, as we expect this change to be less controversial. In terms of optimizations, we've already determined that the SLocOffset parameter to SourceManager::getFileIDLocal() is local in the caller SourceManager::getFileIDSlow(). Also, there's an early continue in the binary search loop in getFileIDLocal() that are duplicated in isOffsetInFileID() as pointed out by @kadircet. Take advantage of these to optimize the binary search patch, and remove the FIXME. Reviewers: kadircet Reviewed By: kadircet Subscribers: cfe-commits, kadircet, srhines Tags: #clang Differential Revision: https://reviews.llvm.org/D82497	2020-06-25 09:59:41 -07:00
Valentin Clement	5b9ce07a76	[openmp] Use Directive_enumSize instead of OMPD_unknown position Summary: Previously OMPD_unknown was last item in the Directive enumeration and its position was used in various comparison and assertion. With the new Directive enumeration, this should be change with llvm::omp::Directive_enumSize. This patch fix two place where it was not done in D81736. Reviewers: vdmitrie, jdoerfert, jdenny Reviewed By: jdoerfert Subscribers: yaxunl, guansong, sstefan1, cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D82518	2020-06-25 09:18:54 -04:00
Sander de Smalen	fabe67728e	[AArch64][SVE] Enable __ARM_FEATURE_SVE macros. This patch enables the following macros when their corresponding target attributes are set: __ARM_FEATURE_SVE (+sve) __ARM_FEATURE_SVE2 (+sve2) __ARM_FEATURE_SVE2_AES (+sve2-aes) __ARM_FEATURE_SVE2_BITPERM (+sve2-bitperm) __ARM_FEATURE_SVE2_SHA3 (+sve2-sha3) __ARM_FEATURE_SVE2_SM4 (+sve2-sm4) This implies that the base SVE and SVE2 ACLE (00bet2) are now feature complete, meaning that all intrinsics are implemented in LLVM and Clang. Disclaimer: To implement the ACLE we have had to fix up many parts of LLVM to make it support scalable vectors. We have also used many target-specific intrinsics to reduce reliance on parts of LLVM where we know scalable vectors may not yet be handled properly (e.g. some transformation might drop the 'scalable' flag on a vector type). While we've done a best effort with the limited testing that is available to us, we're still working to improve the stability of the implementation. Additionally, Clang may print warnings that code may have miscompiled. We find this often to be a false alarm where the wrong interfaces have been used in LLVM and where resulting code is not actually incorrect. However, this warrants a bug report and investigation. If you find any bugs or issues, please raise them on bugs.llvm.org and let us know! Reviewers: rengolin, efriedma, david-arm, SjoerdMeijer Reviewed By: SjoerdMeijer Differential Revision: https://reviews.llvm.org/D81725	2020-06-25 08:14:19 +01:00
Craig Topper	8dc92142e3	[X86] Replace PROC macros with an enum and a lookup table of processor information. This patch removes the PROC macro in favor of CPUKind enum and a table that contains information about CPUs. The current information in the table is the CPU name, CPUKind enum value, key feature for target multiversioning, and Is64Bit capable. For the strings that are aliases, I've duplicated the information in the table. This means there are more rows in the table than CPUKind enums. This replaces multiple StringSwitch's with loops through the table. They are linear searches due to the table being more logically ordered than alphabetical. The StringSwitch's would have also been linear. I've used StringLiteral on the strings in the table so we can quickly check the length while searching. I contemplated having a CPUKind for each string so there was a 1:1 mapping, but didn't want to spread more names to the places that use the enum. My ultimate goal here is to store the features for each CPU as a bitset within the table. Hoping to use constexpr to make this composable so we can group features and inherit them. After the table lookup we can turn the bitset into a list of strings for the frontend. The current switch we have for selecting features for CPUs has become difficult to maintain while trying to express inheritance relationships. Differential Revision: https://reviews.llvm.org/D82414	2020-06-24 10:46:25 -07:00
Kazushi (Jam) Marukawa	96d4ccf00c	[VE] Clang toolchain for VE Summary: This patch enables compilation of C code for the VE target with Clang. Differential Revision: https://reviews.llvm.org/D79411	2020-06-24 10:12:09 +02:00
Sam Clegg	5804a8b122	[WebAssebmly] Fully disable 'protected' visibility Emscripten doesn't use protected visibility either. Differential Revision: https://reviews.llvm.org/D82346	2020-06-23 17:50:05 -07:00
Valentin Clement	d90443b1d9	[openmp] Base of tablegen generated OpenMP common declaration Summary: As discussed previously when landing patch for OpenMP in Flang, the idea is to share common part of the OpenMP declaration between the different Frontend. While doing this it was thought that moving to tablegen instead of Macros will also give a cleaner and more powerful way of generating these declaration. This first part of a future series of patches is setting up the base .td file for DirectiveLanguage as well as the OpenMP version of it. The base file is meant to be used by other directive language such as OpenACC. In this first patch, the Directive and Clause enums are generated with tablegen instead of the macros on OMPConstants.h. The next pacth will extend this to other enum and move the Flang frontend to use it. Reviewers: jdoerfert, DavidTruby, fghanim, ABataev, jdenny, hfinkel, jhuber6, kiranchandramohan, kiranktp Reviewed By: jdoerfert, jdenny Subscribers: arphaman, martong, cfe-commits, mgorny, yaxunl, hiraditya, guansong, jfb, sstefan1, aaron.ballman, llvm-commits Tags: #llvm, #openmp, #clang Differential Revision: https://reviews.llvm.org/D81736	2020-06-23 10:32:32 -04:00
Craig Topper	0dfc8e1837	[X86] Remove encoding value from the X86_FEATURE and X86_FEATURE_COMPAT macro. NFCI This was orignally done so we could separate the compatibility values and the llvm internal only features into a separate entries in the feature array. This was needed when we explicitly had to convert the feature into the proper 32-bit chunk at every reference and we didn't want things moving around. Now everything is in an array and we have helper funtions or macros to convert encoding to index. So we renumbering is no longer an issue.	2020-06-22 11:46:21 -07:00
Anton Korobeynikov	6cb80fbe40	Revert "[MSP430] Update register names" This reverts commit `8f6620f663`.	2020-06-22 13:37:22 +03:00
Anatoly Trosinenko	8f6620f663	[MSP430] Update register names When writing a unit test on replacing standard epilogue sequences with `BR __mspabi_func_epilog_<N>`, by manually asm-clobbering `rN` - `r10` for N = 4..10, everything worked well except for seeming inability to clobber r4. The problem was that MSP430 code generator of LLVM used an obsolete name FP for that register. Things were worse because when `llc` read an unknown register name, it silently ignored it. Differential Revision: https://reviews.llvm.org/D82184	2020-06-22 13:24:03 +03:00
Eric Christopher	1f593f46f3	[AST/Lex/Parse/Sema] As part of using inclusive language within the llvm project, migrate away from the use of blacklist and whitelist.	2020-06-20 01:15:32 -07:00
Brad Smith	0f92096c0a	Revert "Hook up OpenBSD 64-bit PowerPC support"	2020-06-18 20:05:39 -04:00
Brad Smith	3008609d45	Hook up OpenBSD 64-bit PowerPC support	2020-06-18 19:19:45 -04:00
Ahsan Saghir	37e72f47a4	[PowerPC] Add -m[no-]power10-vector clang and llvm option Summary: This patch adds command line option for enabling power10-vector support. Reviewers: hfinkel, nemanjai, lei, amyk, #powerpc Reviewed By: lei, amyk, #powerpc Subscribers: wuzish, kbarton, hiraditya, shchenz, cfe-commits, llvm-commits Tags: #llvm, #clang, #powerpc Differential Revision: https://reviews.llvm.org/D80758	2020-06-16 14:47:35 -05:00
Michael Liao	e830fa260d	[clang][amdgpu] Prefer not using `fp16` conversion intrinsics. Reviewers: yaxunl, arsenm Subscribers: kzhuravl, jvesely, wdng, nhaehnle, dstuttard, tpr, t-tye, kerbowa, cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D81849	2020-06-16 10:21:56 -04:00
Stanislav Mekhanoshin	9ee272f13d	[AMDGPU] Add gfx1030 target Differential Revision: https://reviews.llvm.org/D81886	2020-06-15 16:18:05 -07:00
Dan Gohman	6604295959	[WebAssembly] WebAssembly doesn't support "protected" visibility Implement the `hasProtectedVisibility()` hook to indicate that, like Darwin, WebAssembly doesn't support "protected" visibility. On ELF, "protected" visibility is intended to be an optimization, however in practice it often [isn't], and ELF documentation generally ranges from [not mentioning it at all] to [strongly discouraging its use]. [isn't]: https://www.airs.com/blog/archives/307 [not mentioning it at all]: https://gcc.gnu.org/wiki/Visibility [strongly discouraging its use]: https://www.akkadia.org/drepper/dsohowto.pdf While here, also mention the new Reactor support in the release notes.	2020-06-12 19:52:35 -07:00
Bruno Ricci	78e636b3f2	[clang][NFC] Generate the {Type,ArrayType,UnaryExprOrType,Expression}Traits... ...enumerations from TokenKinds.def and use the new macros from TokenKinds.def to remove the hard-coded lists of traits. All the information needed to generate these enumerations is already present in TokenKinds.def. The motivation here is to be able to dump the trait spelling without hard-coding the list in yet another place. Note that this change the order of the enumerators in the enumerations (except that in the TypeTrait enumeration all unary type traits are before all binary type traits, and all binary type traits are before all n-ary type traits). Apart from the aforementioned ordering which is relied upon, after this patch no code in clang or in the various clang tools depend on the specific ordering of the enumerators. No functional changes intended. Differential Revision: https://reviews.llvm.org/D81455 Reviewed By: aaron.ballman	2020-06-11 14:35:52 +01:00
Craig Topper	ed34140e11	[X86] Move X86 stuff out of TargetParser.h and into the recently created X86TargetParser.h. NFC	2020-06-10 22:06:34 -07:00
Saiyedul Islam	4022bc2a6c	[OpenMP][AMDGCN] Support OpenMP offloading for AMDGCN architecture - Part 2 Summary: New file include to support platform dependent grid constants. It will be used by clang, libomptarget plugins, and deviceRTLs to access constant values consistently and with fast access in the deviceRTLs. Originally authored by Greg Rodgers (@gregrodgers). Reviewers: arsenm, sameerds, jdoerfert, yaxunl, b-sumner, scchan, JonChesterfield Reviewed By: arsenm Subscribers: llvm-commits, pdhaliwal, jholewinski, jvesely, wdng, nhaehnle, guansong, kerbowa, sstefan1, cfe-commits, ronlieb, gregrodgers Tags: #clang, #llvm Differential Revision: https://reviews.llvm.org/D80917	2020-06-10 18:09:59 +00:00
Craig Topper	d5c28c4094	[X86] Move CPUKind enum from clang to llvm/lib/Support. NFCI Similar to what some other targets have done. This information could be reused by other frontends so doesn't make sense to live in clang. -Rename CK_Generic to CK_None to better reflect its illegalness. -Move function for translating from string to enum into llvm. -Call checkCPUKind directly from the string to enum translation and update CPU kind to CK_None accordinly. Caller will use CK_None as sentinel for bad CPU. I'm planning to move all the CPU to feature mapping out next. As part of that I want to devise a better way to express CPUs inheriting features from an earlier CPU. Allowing this to be expressed in a less rigid way than just falling through a switch. Or using gotos as we've had to do lately. Differential Revision: https://reviews.llvm.org/D81439	2020-06-09 12:52:41 -07:00
Jonas Paulsson	515bfc66ea	[SystemZ] Implement -fstack-clash-protection Probing of allocated stack space is now done when this option is passed. The purpose is to protect against the stack clash attack (see https://www.qualys.com/2017/06/19/stack-clash/stack-clash.txt). Review: Ulrich Weigand Differential Revision: https://reviews.llvm.org/D78717	2020-06-06 18:38:36 +02:00
Ties Stuij	a6fcf5ca03	[clang][BFloat] add NEON emitter for bfloat Summary: This patch adds the bfloat16_t struct typedefs (e.g. bfloat16x8x2_t) to arm_neon.h This patch is part of a series implementing the Bfloat16 extension of the Armv8.6-a architecture, as detailed here: https://community.arm.com/developer/ip-products/processors/b/processors-ip-blog/posts/arm-architecture-developments-armv8-6-a The bfloat type, and its properties are specified in the Arm Architecture Reference Manual: https://developer.arm.com/docs/ddi0487/latest/arm-architecture-reference-manual-armv8-for-armv8-a-architecture-profile The following people contributed to this patch: - Luke Cheeseman - Simon Tatham - Ties Stuij Reviewers: t.p.northover, fpetrogalli, sdesmalen, az, LukeGeeson Reviewed By: fpetrogalli Subscribers: SjoerdMeijer, LukeGeeson, pbarrio, mgorny, kristof.beyls, ilya-biryukov, MaskRay, jkorous, arphaman, usaxena95, cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D79708	2020-06-05 14:11:51 +01:00
Ties Stuij	ecd682bbf5	[ARM] Add __bf16 as new Bfloat16 C Type Summary: This patch upstreams support for a new storage only bfloat16 C type. This type is used to implement primitive support for bfloat16 data, in line with the Bfloat16 extension of the Armv8.6-a architecture, as detailed here: https://community.arm.com/developer/ip-products/processors/b/processors-ip-blog/posts/arm-architecture-developments-armv8-6-a The bfloat type, and its properties are specified in the Arm Architecture Reference Manual: https://developer.arm.com/docs/ddi0487/latest/arm-architecture-reference-manual-armv8-for-armv8-a-architecture-profile In detail this patch: - introduces an opaque, storage-only C-type __bf16, which introduces a new bfloat IR type. This is part of a patch series, starting with command-line and Bfloat16 assembly support. The subsequent patches will upstream intrinsics support for BFloat16, followed by Matrix Multiplication and the remaining Virtualization features of the armv8.6-a architecture. The following people contributed to this patch: - Luke Cheeseman - Momchil Velikov - Alexandros Lamprineas - Luke Geeson - Simon Tatham - Ties Stuij Reviewers: SjoerdMeijer, rjmccall, rsmith, liutianle, RKSimon, craig.topper, jfb, LukeGeeson, fpetrogalli Reviewed By: SjoerdMeijer Subscribers: labrinea, majnemer, asmith, dexonsmith, kristof.beyls, arphaman, danielkiss, cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D76077	2020-06-05 10:32:43 +01:00
Craig Topper	dd863ccae1	[X86] Separate X86_CPU_TYPE_COMPAT_WITH_ALIAS from X86_CPU_TYPE_COMPAT. NFC Add a separate X86_CPU_TYPE_COMPAT_ALIAS that carries alias string and the enum from X86_CPU_TYPE_COMPAT.	2020-06-03 14:13:12 -07:00
Vyacheslav Zakharin	3a1b07506c	Define __SPIR__ macro for spir/spir64 targets. Differential Revision: https://reviews.llvm.org/D80655	2020-06-03 12:36:21 -07:00
Craig Topper	bb1d8bf270	[X86] Add CLWB to Tremont CPU. Remove CLDEMOTE, MOVDIRI, MOVDIR64B, and WAITPKG to match gcc.	2020-06-02 22:38:51 -07:00
Nick Desaulniers	8eda71616f	[Clang][A32/T32][Linux] -O1 implies -fomit-frame-pointer Summary: An upgrade of LLVM for CrOS [0] containing [1] triggered a bunch of errors related to writing to reserved registers for a Linux kernel's arm64 compat vdso (which is a aarch32 image). After a discussion on LKML [2], it was determined that -f{no-}omit-frame-pointer was not being specified. Comparing GCC and Clang [3], it becomes apparent that GCC defaults to omitting the frame pointer implicitly when optimizations are enabled, and Clang does not. ie. setting -O1 (or above) implies -fomit-frame-pointer. Clang was defaulting to -fno-omit-frame-pointer implicitly unless -fomit-frame-pointer was set explicitly. Why this becomes a problem is that the Linux kernel's arm64 compat vdso contains code that uses r7. r7 is used sometimes for the frame pointer (for example, when targeting thumb (-mthumb)). See useR7AsFramePointer() in llvm/llvm-project/llvm/lib/Target/ARM/ARMSubtarget.h. This is mostly for legacy/compatibility reasons, and the 2019 Q4 revision of the ARM AAPCS looks to standardize r11 as the frame pointer for aarch32, though this is not yet implemented in LLVM. Users that are reliant on the implicit value if unspecified when optimizations are enabled should explicitly choose -fomit-frame-pointer (new behavior) or -fno-omit-frame-pointer (old behavior). [0] https://bugs.chromium.org/p/chromium/issues/detail?id=1084372 [1] https://reviews.llvm.org/D76848 [2] https://lore.kernel.org/lkml/20200526173117.155339-1-ndesaulniers@google.com/ [3] https://godbolt.org/z/0oY39t Reviewers: kristof.beyls, psmith, danalbert, srhines, MaskRay, ostannard, efriedma Reviewed By: psmith, danalbert, srhines, MaskRay, efriedma Subscribers: efriedma, olista01, MaskRay, vhscampos, cfe-commits, llvm-commits, manojgupta, llozano, glider, hctim, eugenis, pcc, peter.smith, srhines Tags: #clang, #llvm Differential Revision: https://reviews.llvm.org/D80828	2020-06-02 15:54:14 -07:00
Matt Arsenault	301a6da8c2	AMDGPU: Fix clang side null pointer value for private The change to fold_priv_arith looks strange to me, but this was already the untested behavior for local.	2020-06-02 09:23:46 -04:00
Lei Huang	7cfded350a	[PowerPC] Add clang option -m[no-]pcrel Summary: Add user-facing front end option to turn off pc-relative memops. This will be compatible with gcc. Reviewers: stefanp, nemanjai, hfinkel, power-llvm-team, #powerpc, NeHuang, saghir Reviewed By: stefanp, NeHuang, saghir Subscribers: saghir, wuzish, shchenz, cfe-commits, kbarton, echristo Tags: #clang, #powerpc Differential Revision: https://reviews.llvm.org/D80757	2020-06-01 15:34:59 -05:00
Nemanja Ivanovic	9021ce9576	[Clang] Enable KF and KC mode for [_Complex] __float128 The headers provided with recent GNU toolchains for PPC have code that includes typedefs such as: typedef _Complex float __cfloat128 __attribute__ ((__mode__ (__KC__))) This patch allows clang to compile programs that contain #include <math.h> with -mfloat128 which it currently fails to compile. Fixes: https://bugs.llvm.org/show_bug.cgi?id=46068 Differential revision: https://reviews.llvm.org/D80374	2020-05-28 15:48:15 -05:00
Lei Huang	2368bf52cd	[PowerPC] Add support for -mcpu=pwr10 in both clang and llvm Summary: This patch simply adds support for the new CPU in anticipation of Power10. There isn't really any functionality added so there are no associated test cases at this time. Reviewers: stefanp, nemanjai, amyk, hfinkel, power-llvm-team, #powerpc Reviewed By: stefanp, nemanjai, amyk, #powerpc Subscribers: NeHuang, steven.zhang, hiraditya, llvm-commits, wuzish, shchenz, cfe-commits, kbarton, echristo Tags: #clang, #powerpc, #llvm Differential Revision: https://reviews.llvm.org/D80020	2020-05-27 13:14:25 -05:00
Alexey Bataev	a888fc6b34	[OPENMP50]Initial support for use_device_addr clause. Summary: Added parsing/sema analysis/serialization support for use_device_addr clauses. Reviewers: jdoerfert Subscribers: yaxunl, guansong, arphaman, sstefan1, llvm-commits, cfe-commits, caomhin Tags: #clang, #llvm Differential Revision: https://reviews.llvm.org/D80404	2020-05-27 11:35:31 -04:00
Lei Huang	559845f8fe	Revert "[PowerPC] Add support for -mcpu=pwr10 in both clang and llvm" This reverts commit `7eb666b155`.	2020-05-27 09:40:21 -05:00
Lei Huang	7eb666b155	[PowerPC] Add support for -mcpu=pwr10 in both clang and llvm Summary: This patch simply adds support for the new CPU in anticipation of Power10. There isn't really any functionality added so there are no associated test cases at this time. Reviewers: stefanp, nemanjai, amyk, hfinkel, power-llvm-team, #powerpc Reviewed By: stefanp, nemanjai, amyk, #powerpc Subscribers: NeHuang, steven.zhang, hiraditya, llvm-commits, wuzish, shchenz, cfe-commits, kbarton, echristo Tags: #clang, #powerpc, #llvm Differential Revision: https://reviews.llvm.org/D80020	2020-05-26 13:48:22 -05:00
Dmitry Preobrazhensky	3c6c2ecd6e	[AMDGPU] Added 'A' constraint for inline assembler Summary: 'A' constraint requires an immediate int or fp constant that can be inlined in an instruction encoding. This is the second part of the change. The llvm part has been committed as `b087b91c91`. See https://reviews.llvm.org/D78494 Reviewers: arsenm, rampitec Differential Revision: https://reviews.llvm.org/D79493	2020-05-25 17:47:06 +03:00
Nemanja Ivanovic	aede24ecaa	[PowerPC] Treat 'Z' inline asm constraint as a true memory constraint We currently emit incorrect codegen for this constraint because we set it as a constraint that allows registers. This will cause the value to be copied to the stack and that address to be passed as the address. This is not what we want. Fixes: https://bugs.llvm.org/show_bug.cgi?id=42762 Differential revision: https://reviews.llvm.org/D77542	2020-05-22 07:59:21 -05:00
Alexey Bataev	2e499eee58	[OPENMP50]Add initial support for 'affinity' clause. Summary: Added parsing/sema/serialization support for affinity clause in task directives. Reviewers: jdoerfert Subscribers: yaxunl, guansong, arphaman, llvm-commits, cfe-commits, caomhin Tags: #clang, #llvm Differential Revision: https://reviews.llvm.org/D80148	2020-05-19 08:19:09 -04:00
Alex Lorenz	11d612ac99	[clang][Preprocessor] Replace the slow translateFile call by a new, faster isMainFile check The commit `3c28a2dc6b` introduced the check that checks if we're trying to re-enter a main file when building a preamble. Unfortunately this slowed down the preamble compilation by 80-90% in some test cases, as translateFile is really slow. This change checks to see if the FileEntry is the main file without calling translateFile, but by using the new isMainFile check instead. This speeds up preamble building by 1.5-2x for certain test cases that we have. rdar://59361291 Differential Revision: https://reviews.llvm.org/D79834	2020-05-14 14:13:34 -07:00
Fangrui Song	b56b1e67e3	[gcov] Default coverage version to '408' and delete CC1 option -coverage-exit-block-before-body gcov 4.8 (r189778) moved the exit block from the last to the second. The .gcda format is compatible with 4.7 but decoding libgcov 4.7 produced .gcda with gcov [4.7,8) can mistake the exit block, emit bogus `%s:'%s' has arcs from exit block\n` warnings, and print wrong `" returned %s` for branch statistics (-b). * decoding libgcov 4.8 produced .gcda with gcov 4.7 has similar issues. Also, rename "return block" to "exit block" because the latter is the appropriate term.	2020-05-12 09:14:03 -07:00
Fangrui Song	25544ce2df	[gcov] Default coverage version to '407' and delete CC1 option -coverage-cfg-checksum Defaulting to -Xclang -coverage-version='407' makes .gcno/.gcda compatible with gcov [4.7,8) In addition, delete clang::CodeGenOptionsBase::CoverageExtraChecksum and GCOVOptions::UseCfgChecksum. We can infer the information from the version. With this change, .gcda files produced by `clang --coverage a.o` linked executable can be read by gcov 4.7~7. We don't need other -Xclang -coverage* options. There may be a mismatching version warning, though. (Note, GCC r173147 "split checksum into cfg checksum and line checksum" made gcov 4.7 incompatible with previous versions.)	2020-05-10 16:14:07 -07:00
Erich Keane	ed86058b53	Add static assert to ID Table to make sure aux targets work right. I discovered that the limit on possible builtins managed by this ObjCOrBuiltin variable is too low when combining large targets, since aux-targets are appended to the targets list. A runtime assert exists for this, however this patch creates a static-assert as well. The logic for said static-assert is to make sure we have the room for the aux-target and target to both be the largest list, which makes sure we have room for all possible combinations. I also incremented the number of bits by 1, since I discovered this currently broken. The current bit-count was 36, so this doesn't increase any size.	2020-05-07 12:49:46 -07:00
Craig Topper	16c800b8b7	[X86] Remove support for Y0 constraint as an alias for Yz in inline assembly. Neither gcc or icc support this. Split out from D79472. I want to remove more, but it looks like icc does support some things gcc doesn't and I need to double check our internal test suites.	2020-05-06 14:58:53 -07:00
Craig Topper	9bb9ff0957	[X86] Remove incomplete support for 'Y' has an inline assembly constraint by itself. Y is the start of several 2 letter constraints, but we also had partial support to recognize it by itself. But it doesn't look like it can get through clang as a single letter so the backend support for this was effectively dead.	2020-05-06 14:23:04 -07:00
Erich Keane	8a1c999c9b	Implement _ExtInt ABI for all ABIs in Clang, enable type for ABIs This is the result of an audit of all of the ABIs in clang to implement and enable the type for those targets. Additionally, this finds an issue with integer-promotion passing for a few platforms when using _ExtInt of < int, so this also corrects that resulting in signext/zeroext being on a params of those types in some platforms. Differential Revisions: https://reviews.llvm.org/D79118	2020-05-06 06:52:18 -07:00
Craig Topper	0fac1c1912	[X86] Allow Yz inline assembly constraint to choose ymm0 or zmm0 when avx/avx512 are enabled and type is 256 or 512 bits gcc supports selecting ymm0/zmm0 for the Yz constraint when used with 256 or 512 bit vector types. Fixes PR45806 Differential Revision: https://reviews.llvm.org/D79448	2020-05-05 21:12:30 -07:00
Alexey Bataev	b5be1c5419	[OPENMP50]Basic support for uses_allocators clause. Summary: Added parsing/sema/serialization supoprt for uses_allocators clause. Reviewers: jdoerfert Subscribers: yaxunl, guansong, arphaman, cfe-commits, caomhin Tags: #clang Differential Revision: https://reviews.llvm.org/D78577	2020-04-30 16:24:36 -04:00
Erich Keane	911add149a	Disable _ExtInt by default Since the _ExtInt type got into the repo, we've discovered that the ABI implications weren't completely understood. The other architectures are going to be audited (see D79118), however downstream targets aren't going to benefit from this audit. This patch disables the _ExtInt type by default and makes the target-info an opt-in. As it is audited, I'll re-enable these for all of our default targets.	2020-04-29 13:48:12 -07:00
Richard Smith	20df6038ee	Make -fno-char8_t disable the char8_t keyword, even in C++20. This fixes a regression introduced in r354736, and makes our behavior compatible with that of Clang 8 and GCC.	2020-04-28 23:49:35 -07:00
Petr Hosek	063128f979	[Fuchsia] Build compiler-rt builtins for 32-bit x86 While we don't support 32-bit architectures in Fuchsia, these are needed in the early boot phase on x86, so we build just these to satisfy that use case. Differential Revision: https://reviews.llvm.org/D78687	2020-04-24 12:18:30 -07:00
Luke Geeson	7da1905125	[AArch32] Armv8.6-a Matrix Mult Assembly + Intrinsics This patch upstreams support for the Armv8.6-a Matrix Multiplication Extension. A summary of the features can be found here: https://community.arm.com/developer/ip-products/processors/b/processors-ip-blog/posts/arm-architecture-developments-armv8-6-a This patch includes: - Assembly support for AArch32 - Intrinsics Support for AArch32 Neon Intrinsics for Matrix Multiplication Note: these extensions are optional in the 8.6a architecture and so have to be enabled by default No additional IR types or C Types are needed for this extension. This is part of a patch series, starting with BFloat16 support and the other components in the armv8.6a extension (in previous patches linked in phabricator) Based on work by: - Luke Geeson - Oliver Stannard - Luke Cheeseman Reviewers: t.p.northover, miyuki Reviewed By: miyuki Subscribers: miyuki, ostannard, kristof.beyls, hiraditya, danielkiss, cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D77872	2020-04-24 15:54:06 +01:00
Luke Geeson	832cd74913	[AArch64] Armv8.6-a Matrix Mult Assembly + Intrinsics This patch upstreams support for the Armv8.6-a Matrix Multiplication Extension. A summary of the features can be found here: https://community.arm.com/developer/ip-products/processors/b/processors-ip-blog/posts/arm-architecture-developments-armv8-6-a This patch includes: - Assembly support for AArch64 only (no SVE or Neon) - Intrinsics Support for AArch64 Armv8.6a Matrix Multiplication Instructions (No bfloat16 matrix multiplication) No IR types or C Types are needed for this extension. This is part of a patch series, starting with BFloat16 support and the other components in the armv8.6a extension (in previous patches linked in phabricator) Based on work by: - Luke Geeson - Oliver Stannard - Luke Cheeseman Reviewers: ostannard, t.p.northover, rengolin, kmclaughlin Reviewed By: kmclaughlin Subscribers: kmclaughlin, kristof.beyls, hiraditya, danielkiss, cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D77871	2020-04-24 15:54:06 +01:00
Kadir Cetinkaya	411a254af3	[clang] Make sure argument expansion locations are correct in presence of predefined buffer Summary: Macro argument expansion logic relies on skipping file IDs that created as a result of an include. Unfortunately it fails to do that for predefined buffer since it doesn't have a valid insertion location. As a result of that any file ID created for an include inside the predefined buffers breaks the traversal logic in SourceManager::computeMacroArgsCache. To fix this issue we first record number of created FIDs for predefined buffer, and then skip them explicitly in source manager. Another solution would be to just give predefined buffers a valid source location, but it is unclear where that should be.. Reviewers: sammccall Subscribers: cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D78649	2020-04-22 21:01:52 +02:00
Michael Liao	86e3b735cd	[hip] Claim builtin type `__float128` supported if the host target supports it. Reviewers: tra, yaxunl Subscribers: jvesely, nhaehnle, kerbowa, cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D78513	2020-04-21 15:56:40 -04:00
Aaron Ballman	6a30894391	C++2a -> C++20 in some identifiers; NFC.	2020-04-21 15:37:19 -04:00
Richard Smith	6bc7502385	When making modules transitively visible, don't take into account whether they have missing header files. Whether a module's headers happen to be present on the local file system should make no difference to whether we make its contents visible when importing another module that re-exports it. If we have an up-to-date AST file that we can load, that's all that matters. This fixes the ability to header syntax checking for modular headers in C++20 mode (or in prior modes where -fmodules-local-submodule-visibility is enabled but -fmodules is not).	2020-04-17 22:49:58 -07:00
Richard Smith	fc76b4ad3d	Rename IsMissingRequirement to IsUnimportable and set it for shadowed modules too. This more accurately reflects the semantics of this flag, as distinct from "IsAvailable", which (in an explicit modules world) only describes whether a module is buildable, not whether it's importable.	2020-04-17 22:48:56 -07:00
Lei Huang	10b60dde76	[PowerPC] Refactor ppcUserFeaturesCheck() Summary: This function keeps growing, refactor to use lambda. Reviewers: nemanjai, stefanp Subscribers: kbarton, shchenz, cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D78308	2020-04-17 15:19:46 -05:00
Benjamin Kramer	3ee1ec0b9d	LangOptions cannot depend on ASTContext, make it not use ASTContext directly Fixes a layering violation introduced in `2ba4e3a459`.	2020-04-16 11:46:35 +02:00
Michael Spencer	92e8af0ecb	[Clang] Expose RequiresNullTerminator in FileManager. This is needed to fix the reason `0a2be46cfd` (Modules: Invalidate out-of-date PCMs as they're discovered) and 5b44a4b07fc1d ([modules] Do not cache invalid state for modules that we attempted to load.) were reverted. These patches changed Clang to use `isVolatile` when loading modules. This had the side effect of not using mmap when loading modules, and thus greatly increased memory usage. The reason it wasn't using mmap is because `MemoryBuffer` plays some games with file size when you request null termination, and it has to disable these when `isVolatile` is set as the size may change by the time it's mmapped. Clang by default passes `RequiresNullTerminator = true`, and `shouldUseMmap` ignored if `RequiresNullTerminator` was even requested. This patch adds `RequiresNullTerminator` to the `FileManager` interface so Clang can use it when loading modules, and changes `shouldUseMmap` to only take volatility into account if `RequiresNullTerminator` is true. This is fine as both `mmap` and a `read` loop are vulnerable to modifying the file while reading, but are immune to the rename Clang does when replacing a module file. Differential Revision: https://reviews.llvm.org/D77772	2020-04-15 14:17:51 -07:00
Melanie Blower	2ba4e3a459	Move BinaryOperators.FPOptions to trailing storage Reviewers: rjmccall Differential Revision: https://reviews.llvm.org/D76384	2020-04-15 12:57:31 -07:00
Ayke van Laethem	fe06e231ff	[AVR] Define __ELF__ This symbol is defined in avr-gcc. Because AVR normally uses the ELF format, define the symbol unconditionally. This patch is needed to get Clang to compile compiler-rt. Differential Revision: https://reviews.llvm.org/D78117	2020-04-15 00:22:53 +02:00
Artem Belevich	8c635ba4a8	[CUDA] Fix missed CUDA version mappings.	2020-04-13 15:54:12 -07:00
Scott Egerton	61ff296375	[RISCV] Add Clang frontend support for Bitmanip extension This adds the __riscv_bitmanip macro and the 'b' target feature to enable it. Differential Revision: https://reviews.llvm.org/D71553	2020-04-09 18:04:22 +01:00
WangTianQing	a3dc949000	[X86] Add TSXLDTRK instructions. Summary: For more details about these instructions, please refer to the latest ISE document: https://software.intel.com/en-us/download/intel-architecture-instruction-set-extensions-programming-reference Reviewers: craig.topper, RKSimon, LuoYuanke Reviewed By: craig.topper Subscribers: mgorny, hiraditya, cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D77205	2020-04-09 13:17:29 +08:00
Artem Belevich	a9627b7ea7	[CUDA] Add partial support for recent CUDA versions. Generate PTX using newer versions of PTX and allow using sm_80 with CUDA-11. None of the new features of CUDA-10.2+ have been implemented yet, so using these versions will still produce a warning. Differential Revision: https://reviews.llvm.org/D77670	2020-04-08 11:19:44 -07:00
Artem Belevich	33386b20aa	[CUDA] Simplify GPU variant handling. NFC. Instead of hardcoding individual GPU mappings in multiple functions, keep them all in one table and use it to look up the mappings. We also don't care about 'virtual' architecture much, so the API is trimmed down down to a simpler GPU->Virtual arch name lookup. Differential Revision: https://reviews.llvm.org/D77665	2020-04-08 11:19:43 -07:00
Reid Kleckner	76221c734e	Remove llvm::Error include form Diagnostic.h Saves ~400 related LLVM ADT. llvm/ADT/Error.h takes 90ms to parse. $ diff -u <(sort thedeps-before.txt) <(sort thedeps-after.txt) \ \| grep '^[-+] ' \| sort \| uniq -c \| sort -nr 403 - /usr/local/google/home/rnk/llvm-project/llvm/include/llvm/Support/Error.h 403 - /usr/local/google/home/rnk/llvm-project/llvm/include/llvm-c/Error.h 397 - /usr/local/google/home/rnk/llvm-project/llvm/include/llvm/Support/Format.h 397 - /usr/local/google/home/rnk/llvm-project/llvm/include/llvm/Support/Debug.h 377 - /usr/local/google/home/rnk/llvm-project/llvm/include/llvm/ADT/StringExtras.h 158 - /usr/local/google/home/rnk/llvm-project/llvm/include/llvm-c/ExternC.h 138 - /usr/local/google/home/rnk/llvm-project/llvm/include/llvm/Support/ErrorOr.h 13 - /usr/local/google/home/rnk/llvm-project/llvm/include/llvm/Support/raw_ostream.h 13 - /usr/local/google/home/rnk/llvm-project/llvm/include/llvm/ADT/SmallString.h 5 - /usr/local/google/home/rnk/llvm-project/llvm/include/llvm/ADT/Twine.h	2020-04-06 10:42:17 -07:00
Johannes Doerfert	931c0cd713	[OpenMP][NFC] Move and simplify directive -> allowed clause mapping Move the listing of allowed clauses per OpenMP directive to the new macro file in `llvm/Frontend/OpenMP`. Also, use a single generic macro that specifies the directive and one allowed clause explicitly instead of a dedicated macro per directive. We save 800 loc and boilerplate for all new directives/clauses with no functional change. We also need to include the macro file only once and not once per directive. Depends on D77112. Reviewed By: JonChesterfield Differential Revision: https://reviews.llvm.org/D77113	2020-04-06 00:04:08 -05:00
Johannes Doerfert	419a559c5a	[OpenMP][NFCI] Move OpenMP clause information to `lib/Frontend/OpenMP` This is a cleanup and normalization patch that also enables reuse with Flang later on. A follow up will clean up and move the directive -> clauses mapping. Reviewed By: fghanim Differential Revision: https://reviews.llvm.org/D77112	2020-04-05 22:30:29 -05:00
Michael Liao	b952d799ca	[cuda][hip] Fix `RegisterVar` function prototype. Summary: - `RegisterVar` has `void` return type and `size_t` in its variable size parameter in HIP or CUDA 9.0+. Reviewers: tra, yaxunl Subscribers: cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D77398	2020-04-03 12:57:09 -04:00
Yaxun (Sam) Liu	a46e7d7a5f	[AMDGPU] Allow AGPR in inline asm Differential Revision: https://reviews.llvm.org/D77329	2020-04-03 09:08:13 -04:00
Matt Arsenault	ce2258c1cd	clang/AMDGPU: Stop setting old denormal subtarget features	2020-04-02 17:17:12 -04:00
Daniel Kiss	37ced5a571	[clang][AARCH64] Add __ARM_FEATURE_{PAC, BTI}_DEFAULT defines Summary: As defined by Arm C Language Extensions (ACLE) these macro defines should be set to specific values depending on -mbranch-protection. Reviewers: chill Reviewed By: chill Subscribers: danielkiss, cfe-commits, kristof.beyls Tags: #clang Differential Revision: https://reviews.llvm.org/D77134	2020-04-02 12:54:21 +02:00
Daniel Kiss	7314aea5a4	[clang] Move branch-protection from CodeGenOptions to LangOptions Summary: Reason: the option has an effect on preprocessing. Also see thread: http://lists.llvm.org/pipermail/cfe-dev/2020-March/065014.html Reviewers: chill, efriedma Reviewed By: efriedma Subscribers: efriedma, danielkiss, cfe-commits, kristof.beyls Tags: #clang Differential Revision: https://reviews.llvm.org/D77131	2020-04-02 10:31:52 +02:00
WangTianQing	d08fadd662	[X86] Add SERIALIZE instruction. Summary: For more details about this instruction, please refer to the latest ISE document: https://software.intel.com/en-us/download/intel-architecture-instruction-set-extensions-programming-reference Reviewers: craig.topper, RKSimon, LuoYuanke Reviewed By: craig.topper Subscribers: mgorny, hiraditya, cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D77193	2020-04-02 16:19:23 +08:00
Johannes Doerfert	1858f4b50d	Revert "[OpenMP][NFCI] Move OpenMP clause information to `lib/Frontend/OpenMP`" This reverts commit `c18d55998b`. Bots have reported uses that need changing, e.g., clang-tools-extra/clang-tidy/openmp/UseDefaultNoneCheck.cp as reported by http://lab.llvm.org:8011/builders/clang-ppc64be-linux/builds/46591	2020-04-02 02:23:22 -05:00
Johannes Doerfert	c18d55998b	[OpenMP][NFCI] Move OpenMP clause information to `lib/Frontend/OpenMP` This is a cleanup and normalization patch that also enables reuse with Flang later on. A follow up will clean up and move the directive -> clauses mapping. Differential Revision: https://reviews.llvm.org/D77112	2020-04-02 01:39:07 -05:00
Adrian Prantl	f4754ea0ed	Remove const qualifier from Modules returned by ExternalASTSource. (NFC) This API is used by LLDB to attach owning module information to Declarations deserialized from DWARF. Differential Revision: https://reviews.llvm.org/D75561	2020-04-01 17:46:02 -07:00
Sid Manning	81194bfeea	[Hexagon] MaxAtomicPromoteWidth and MaxAtomicInlineWidth are not getting set. Noticed when building llvm's c++ library. Differential Revision: https://reviews.llvm.org/D76546	2020-03-30 12:33:51 -05:00
Yonghong Song	ced0d1f42b	[BPF] support 128bit int explicitly in layout spec Currently, bpf does not specify 128bit alignment in its layout spec. So for a structure like struct ipv6_key_t { unsigned pid; unsigned __int128 saddr; unsigned short lport; }; clang will generate IR type %struct.ipv6_key_t = type { i32, [12 x i8], i128, i16, [14 x i8] } Additional padding is to ensure later IR->MIR can generate correct stack layout with target layout spec. But it is common practice for a tracing program to be first compiled with target flag (e.g., x86_64 or aarch64) through clang to generate IR and then go through llc to generate bpf byte code. Tracing program often refers to kernel internal data structures which needs to be compiled with non-bpf target. But such a compilation model may cause a problem on aarch64. The bcc issue https://github.com/iovisor/bcc/issues/2827 reported such a problem. For the above structure, since aarch64 has "i128:128" in its layout string, the generated IR will have %struct.ipv6_key_t = type { i32, i128, i16 } Since bpf does not have "i128:128" in its spec string, the selectionDAG assumes alignment 8 for i128 and computes the stack storage size for the above is 32 bytes, which leads incorrect code later. The x86_64 does not have this issue as it does not have "i128:128" in its layout spec as it does permits i128 to be alignmented at 8 bytes at stack. Its IR type looks like %struct.ipv6_key_t = type { i32, [12 x i8], i128, i16, [14 x i8] } The fix here is add i128 support in layout spec, the same as aarch64. The only downside is we may have less optimal stack allocation in certain cases since we require 16byte alignment for i128 instead of 8. But this is probably fine as i128 is not used widely and in most cases users should already have proper alignment. Differential Revision: https://reviews.llvm.org/D76587	2020-03-28 11:46:29 -07:00
Yaxun (Sam) Liu	369e26ca9e	[AMDGPU] Add __builtin_amdgcn_workgroup_size_x/y/z The main purpose of introducing these builtins is to add a range metadata [1, 1025) on the work group size loaded from dispatch ptr, which cannot be done by source code. Differential Revision: https://reviews.llvm.org/D76772	2020-03-28 01:03:20 -04:00
Johannes Doerfert	095cecbe0d	[OpenMP] `omp begin/end declare variant` - part 1, parsing This is the first part extracted from D71179 and cleaned up. This patch provides parsing support for `omp begin/end declare variant`, as defined in OpenMP technical report 8 (TR8) [0]. A major purpose of this patch is to provide proper math.h/cmath support for OpenMP target offloading. See PR42061, PR42798, PR42799. The current code was developed with this feature in mind, see [1]. [0] https://www.openmp.org/wp-content/uploads/openmp-TR8.pdf [1] https://reviews.llvm.org/D61399#change-496lQkg0mhRN Reviewed By: aaron.ballman Differential Revision: https://reviews.llvm.org/D74941	2020-03-27 02:30:58 -05:00
David Blaikie	819e540208	Use llvm_unreachable after a fully covered/always-returning switch	2020-03-26 20:09:57 -07:00
Sid Manning	b0da094983	[Hexagon] Add support for Linux/Musl ABI (part 2) A continuation of https://reviews.llvm.org/D72701. This adds support needed in clang. Differential Revision: https://reviews.llvm.org/D75638	2020-03-26 17:19:46 -05:00
Ties Stuij	71ae267d1f	[PATCH] [ARM] ARMv8.6-a command-line + BFloat16 Asm Support Summary: This patch introduces command-line support for the Armv8.6-a architecture and assembly support for BFloat16. Details can be found https://community.arm.com/developer/ip-products/processors/b/processors-ip-blog/posts/arm-architecture-developments-armv8-6-a in addition to the GCC patch for the 8..6-a CLI: https://gcc.gnu.org/legacy-ml/gcc-patches/2019-11/msg02647.html In detail this patch - march options for armv8.6-a - BFloat16 assembly This is part of a patch series, starting with command-line and Bfloat16 assembly support. The subsequent patches will upstream intrinsics support for BFloat16, followed by Matrix Multiplication and the remaining Virtualization features of the armv8.6-a architecture. Based on work by: - labrinea - MarkMurrayARM - Luke Cheeseman - Javed Asbar - Mikhail Maltsev - Luke Geeson Reviewers: SjoerdMeijer, craig.topper, rjmccall, jfb, LukeGeeson Reviewed By: SjoerdMeijer Subscribers: stuij, kristof.beyls, hiraditya, dexonsmith, danielkiss, cfe-commits, llvm-commits Tags: #clang, #llvm Differential Revision: https://reviews.llvm.org/D76062	2020-03-26 09:17:20 +00:00
Michael Liao	d264f02c6f	Fix `-Wreturn-type` warning. NFC.	2020-03-26 00:53:24 -04:00
zoecarver	b915aec6b5	Add method to TargetInfo to get CPU cache line size Summary: This patch adds a virtual method `getCPUCacheLineSize()` to `TargetInfo`. Currently, I've only implemented the method in `X86TargetInfo`. It's extremely important that each CPU's cache line size correct (e.g., we can't just define it as `64` across the board) so, it has been a little slow getting to this point. I'll work on the ARM CPUs next, but that will probably come later in a different patch. Tags: #clang Differential Revision: https://reviews.llvm.org/D74918	2020-03-25 09:50:38 -07:00
Hans Wennborg	c278e8f8f9	Build fix: AttributeCommonInfo::AS_C2x	2020-03-25 15:42:21 +01:00
John Brawn	bc3f171090	Don't normalise CXX11/C2X attribute names to start with :: Currently square-bracket-style (CXX11/C2X) attribute names are normalised to start with :: if they don't have a namespace. This is a bit odd, as such names are rejected when parsing, so don't do this. Differential Revision: https://reviews.llvm.org/D76704	2020-03-25 14:33:44 +00:00
Alexey Bataev	1236eb6c31	[OPENMP50]Add 'default' modifier in reduction clauses. Added full support for 'default' modifier in the reduction clauses.	2020-03-23 18:18:08 -04:00
Alexey Bataev	63828a35da	[OPENMP50]Bassic support for exclusive clause. Added basic support (parsing/sema/serialization) for exclusive clause in scan directives.	2020-03-23 13:12:52 -04:00
Alexey Bataev	06dea73307	[OPENMP50]Initial support for inclusive clause. Added parsing/sema/serialization support for inclusive clause in scan directive.	2020-03-20 14:20:38 -04:00
Alexey Bataev	fcba7c3534	[OPENMP50]Initial support for scan directive. Addedi basic parsing/sema/serialization support for scan directive.	2020-03-20 07:58:15 -04:00
Sid Manning	430c9a80c1	[Hexagon] Enable linux #defines Enable standard linux defines when the triple is Linux and the environment is musl. Differential Revision: https://reviews.llvm.org/D76310	2020-03-19 14:33:49 -05:00
Alexey Bataev	2f8894a5b8	[OPENMP50]Add support for extended device clause in target directives. Added parsing/sema/serialization support for extended device clause in executable target directives.	2020-03-18 15:02:37 -04:00
Sander de Smalen	c5b81466c2	Reland D75470 [SVE] Auto-generate builtins and header for svld1. Reworked the patch to avoid sharing a header (SVETypeFlags.h) between include/clang/Basic and utils/TableGen/SveEmitter.cpp. Now the patch generates the enum/flags which is included in TargetBuiltins.h. Also renamed one of the SveEmitter options to be in line with MVE. Summary: This is a first patch in a series for the SveEmitter to generate the arm_sve.h header file and builtins. I've tried my best to strip down this patch as best as I could, but there are still a few changes that are not necessarily exercised by the load intrinsics in this patch, mostly around the SVEType class which has some common logic to represent types from a type and prototype string. I thought it didn't make much sense to remove that from this patch and split it up.	2020-03-18 11:16:28 +00:00
Nick Desaulniers	5d90f886bc	[clang][AArch64] readd support for 'p' inline asm constraint Summary: Was accidentally removed by commit `af64948e2a` when it overrode TargetInfo::convertConstraint. Fixes: pr/45225 Reviewers: eli.friedman, sdesmalen Reviewed By: sdesmalen Subscribers: echristo, sdesmalen, kristof.beyls, cfe-commits, kmclaughlin, srhines Tags: #clang Differential Revision: https://reviews.llvm.org/D76297	2020-03-17 10:51:25 -07:00
Alexey Bataev	0f0564bb9a	[OPENMP50]Initial support for detach clause in task directive. Added parsing/sema/serialization support for detach clause.	2020-03-17 09:19:03 -04:00
Ayke van Laethem	4add249205	[AVR] Add support for the -mdouble=x flag This flag is used by avr-gcc (starting with v10) to set the width of the double type. The double type is by default interpreted as a 32-bit floating point number in avr-gcc instead of a 64-bit floating point number as is common on other architectures. Starting with GCC 10, a new option has been added to control this behavior: https://gcc.gnu.org/wiki/avr-gcc#Deviations_from_the_Standard This commit keeps the default double at 32 bits but adds support for the -mdouble flag (-mdouble=32 and -mdouble=64) to control this behavior. Differential Revision: https://reviews.llvm.org/D76181	2020-03-17 13:21:03 +01:00

... 2 3 4 5 6 ...

3840 Commits