llvm-project

Commit Graph

Author	SHA1	Message	Date
Craig Topper	dacbddf562	[RISCV] Move isValidCPUName to RISCVTargetInfo. NFC Instead of having separate implementations for RV32 and RV64, use the triple to control the Is64Bit parameter. Do the same for isValidTuneCPUName, fillValidCPUList, and fillValidTuneCPUList.	2022-08-11 10:01:56 -07:00
David Truby	13a784f368	[clang][AArch64][SVE] Change SVE_VECTOR_OPERATORS macro for VLA vectors The __ARM_FEATURE_SVE_VECTOR_OPERATORS macro should be changed to indicate that this feature is now supported on VLA vectors as well as VLS vectors. There is a complementary PR to the ACLE spec here https://github.com/ARM-software/acle/pull/213 Reviewed By: peterwaller-arm Differential Revision: https://reviews.llvm.org/D131573	2022-08-11 13:23:52 +00:00
Freddy Ye	e4888a37d3	[X86][BF16] Enable __bf16 for x86 targets. X86 psABI has updated to support __bf16 type, the ABI of which is the same as FP16. See https://discourse.llvm.org/t/patch-add-optional-bfloat16-support/63149 Reviewed By: pengfei Differential Revision: https://reviews.llvm.org/D130964	2022-08-10 09:00:47 +08:00
Fangrui Song	3f18f7c007	[clang] LLVM_FALLTHROUGH => [[fallthrough]]. NFC With C++17 there is no Clang pedantic warning or MSVC C5051. Reviewed By: aaron.ballman Differential Revision: https://reviews.llvm.org/D131346	2022-08-08 09:12:46 -07:00
David Green	8c30f4a5ab	[AArch64] Always allow the __bf16 type We would like to make the ACLE NEON and SVE intrinsics more useable by gating them on the target, not by ifdef preprocessor macros. In order to do this the types they use need to be available. This patches makes __bf16 always available under AArch64 not just when the bf16 architecture feature is present. This bringing it in-line with GCC. In subsequent patches the NEON bfloat16x8_t and SVE svbfloat16_t types (along with bfloat16_t used in arm_sve.h) will be made unconditional too. The operations valid on the types are still very limited. They can be used as a storage type, but the intrinsics used for convertions are still behind an ifdef guard in arm_neon.h/arm_bf16.h. Differential Revision: https://reviews.llvm.org/D130973	2022-08-04 18:35:27 +01:00
Jonas Paulsson	84831bdfed	[SystemZ] Make 128 bit integers be aligned to 8 bytes. The SystemZ ABI says that 128 bit integers should be aligned to only 8 bytes. Reviewed By: Ulrich Weigand, Nikita Popov Differential Revision: https://reviews.llvm.org/D130900	2022-08-03 15:39:54 +02:00
Kai Luo	1cbaf681b0	[clang][AIX] Add option to control quadword lock free atomics ABI on AIX We are supporting quadword lock free atomics on AIX. For the situation that users on AIX are using a libatomic that is lock-based for quadword types, we can't enable quadword lock free atomics by default on AIX in case user's new code and existing code accessing the same shared atomic quadword variable, we can't guarentee atomicity. So we need an option to enable quadword lock free atomics on AIX, thus we can build a quadword lock-free libatomic(also for advanced users considering atomic performance critical) for users to make the transition smooth. Reviewed By: shchenz Differential Revision: https://reviews.llvm.org/D127189	2022-07-27 01:56:25 +00:00
Kazu Hirata	3f3930a451	Remove redundaunt virtual specifiers (NFC) Identified with tidy-modernize-use-override.	2022-07-25 23:00:59 -07:00
Kazu Hirata	a210f404da	[clang] Remove redundant virtual specifies (NFC) Identified with modernize-use-override.	2022-07-24 22:02:58 -07:00
ksyx	3198364e6e	[RISCV][Clang] Add support for Zmmul extension This patch implements recently ratified extension Zmmul, a subextension of M (Integer Multiplication and Division) consisting only multiplication part of it. Differential Revision: https://reviews.llvm.org/D103313 Reviewed By: craig.topper, jrtc27, asb	2022-07-18 20:26:08 -04:00
Stanislav Mekhanoshin	9fa5a6b7e8	[AMDGPU] Support for gfx940 fp8 conversions Differential Revision: https://reviews.llvm.org/D129902	2022-07-18 11:48:43 -07:00
Kazu Hirata	cb2c8f694d	[clang] Use value instead of getValue (NFC)	2022-07-13 23:39:33 -07:00
Jolanta Jensen	07df9e918e	[NFC] Minor cleanup of usage of FloatModeKind with bitmask enums Differential Revision: https://reviews.llvm.org/D129373	2022-07-13 20:44:06 +01:00
Kai Nacke	880eb839e6	[SystemZ] Enable `-mtune=` option in clang. https://reviews.llvm.org/D128910 enabled handling of attribute "tune-cpu" in LLVM. This PR now enables option `-mtune` in clang, which then generates the new attribute. Reviewed By: uweigand Differential Revision: https://reviews.llvm.org/D129562	2022-07-13 11:39:24 -04:00
Paul Robinson	08e4fe6c61	[X86] Add RDPRU instruction Add support for the RDPRU instruction on Zen2 processors. User-facing features: - Clang option -m[no-]rdpru to enable/disable the feature - Support is implicit for znver2/znver3 processors - Preprocessor symbol __RDPRU__ to indicate support - Header rdpruintrin.h to define intrinsics - "rdpru" mnemonic supported for assembler code Internal features: - Clang builtin __builtin_ia32_rdpru - IR intrinsic @llvm.x86.rdpru Differential Revision: https://reviews.llvm.org/D128934	2022-07-06 07:17:47 -07:00
Phoebe Wang	abeeae570e	[X86] Support `_Float16` on SSE2 and up This is split from D113107 to address #56204 and https://discourse.llvm.org/t/how-to-build-compiler-rt-for-new-x86-half-float-abi/63366 Reviewed By: zahiraam, rjmccall, bkramer, MaskRay Differential Revision: https://reviews.llvm.org/D128571	2022-06-30 17:21:37 +08:00
Jolanta Jensen	32aac7babf	[NFC] Switch FloatModeKind enum class to use bitmask enums Using bitmask enums simplifies and clarifies the code. Differential Revision: https://reviews.llvm.org/D128182	2022-06-29 11:02:02 +01:00
Ben Langmuir	eab2a06f0f	Revert "Reland "[X86] Support `_Float16` on SSE2 and up"" Broke compiler-rt on Darwin: https://green.lab.llvm.org/green/job/clang-stage1-RA/29920/ This reverts commit `527ef8ca98`.	2022-06-28 10:59:03 -07:00
Phoebe Wang	527ef8ca98	Reland "[X86] Support `_Float16` on SSE2 and up" Enable `COMPILER_RT_HAS_FLOAT16` to solve the lit fail. This is split from D113107 to address #56204 and https://discourse.llvm.org/t/how-to-build-compiler-rt-for-new-x86-half-float-abi/63366 Reviewed By: zahiraam, rjmccall, bkramer Differential Revision: https://reviews.llvm.org/D128571	2022-06-28 14:38:56 +08:00
Vitaly Buka	8f7cca90af	Revert "[X86] Support `_Float16` on SSE2 and up" Breaks buildbot https://lab.llvm.org/buildbot/#/builders/37/builds/14334 This reverts commit `f5d781d627`.	2022-06-27 12:43:29 -07:00
Phoebe Wang	f5d781d627	[X86] Support `_Float16` on SSE2 and up This is split from D113107 to address #56204 and https://discourse.llvm.org/t/how-to-build-compiler-rt-for-new-x86-half-float-abi/63366 Reviewed By: zahiraam, rjmccall, bkramer Differential Revision: https://reviews.llvm.org/D128571	2022-06-27 21:37:30 +08:00
Jolanta Jensen	5830da1f86	[AArch64] Define __FP_FAST_FMA[F] Libraries use this flag to decide whether to use the fma builtin. Author: Paul Walker Differential Revision: https://reviews.llvm.org/D127655	2022-06-27 11:37:40 +01:00
Kazu Hirata	97afce08cb	[clang] Don't use Optional::hasValue (NFC) This patch replaces Optional::hasValue with the implicit cast to bool in conditionals only.	2022-06-25 22:26:24 -07:00
Kazu Hirata	3b7c3a654c	Revert "Don't use Optional::hasValue (NFC)" This reverts commit `aa8feeefd3`.	2022-06-25 11:56:50 -07:00
Kazu Hirata	aa8feeefd3	Don't use Optional::hasValue (NFC)	2022-06-25 11:55:57 -07:00
Xiang Li	77f72ac15b	[HLSL] Enable half type for hlsl. HLSL supports half type. When enable-16bit-types is not set, half will be treated as float. When enable-16bit-types is set, half will be treated like real 16bit float type and map to llvm half type. Also change CXXABI to Microsoft to match dxc behavior. The mangle name for half is "$f16@" when half is treat as native half type and "$halff@" when treat as float. In AST, half is still half. The special thing is done at clang codeGen, when NativeHalfType is false, half will translated into float. Reviewed By: aaron.ballman Differential Revision: https://reviews.llvm.org/D124790	2022-06-23 12:56:26 -07:00
Kazu Hirata	ca4af13e48	[clang] Don't use Optional::getValue (NFC)	2022-06-20 22:59:26 -07:00
Kazu Hirata	06decd0b41	[clang] Use value_or instead of getValueOr (NFC)	2022-06-18 23:21:34 -07:00
Jolanta Jensen	c80c57674e	[Clang] Allow 'Complex float __attribute__((mode(HC)))' Adding half float to types that can be represented by __attribute__((mode(xx))). Original implementation authored by George Steed. Differential Revision: https://reviews.llvm.org/D126479	2022-06-17 12:39:52 +01:00
Yaxun (Sam) Liu	af9ee3357c	[HIP] fix long double size For amdgpu target long double type is the same as double type. The width and align of long double type was incorrectly overridden when copying aux target properties, which caused assertion in codegen when emitting global variables with long double type. This patch fix that by saving and restoring width and align of long double type. Reviewed by: Artem Belevich Differential Revision: https://reviews.llvm.org/D127771 Fixes: SWDEV-335515	2022-06-14 21:57:56 -04:00
Kazu Hirata	f5ef2c5838	[clang] Convert for_each to range-based for loops (NFC)	2022-06-10 22:39:45 -07:00
Pengxuan Zheng	e3a6784ac9	[clang-cl] Add support for /kernel MSVC defines _KERNEL_MODE when /kernel is passed. Also, /kernel disables RTTI and C++ exception handling. https://docs.microsoft.com/en-us/cpp/build/reference/kernel-create-kernel-mode-binary?view=msvc-170 Reviewed By: thakis Differential Revision: https://reviews.llvm.org/D126719	2022-06-07 06:42:35 -07:00
Kazu Hirata	d93728978b	[clang] Use llvm::is_contained (NFC)	2022-06-05 17:56:40 -07:00
Paul Robinson	8869ba3662	[PS5] Add PS5OSTargetInfo class, update affected tests	2022-06-01 13:30:29 -07:00
Paul Robinson	5d005d8256	Refactor PS4OSTargetInfo into a base class and PS4 subclass; prep for PS5	2022-06-01 13:30:29 -07:00
Zi Xuan Wu (Zeson)	b86440ecde	[CSKY] Fix the conflict of default fpu features and -mfpu option The arch or cpu has its default fpu features and versions such as fpuv2_sf/fpuv3_sf. And there is also -mfpu option to specify and override fpu version and features. For example, C860 has fpuv3_sf/fpuv3_df feature as default, when -mfpu=fpv2 is given, fpuv3_sf/fpuv3_df is replaced with fpuv2_sf/fpuv2_df.	2022-05-23 10:44:55 +08:00
Jon Chesterfield	83c431fb9e	[amdgpu] Add amdgpu_kernel calling conv attribute to clang Allows emitting define amdgpu_kernel void @func() IR from C or C++. This replaces the current workflow which is to write a stub in opencl that calls an external C function implemented in C++ combined through llvm-link. Calling the resulting function still requires a manual implementation of the ABI from the host side. The primary application is for more rapid debugging of the amdgpu backend by permuting a C or C++ test file instead of manually updating an IR file. Implementation closely follows D54425. Non-amd reviewers from there. Reviewed By: yaxunl Differential Revision: https://reviews.llvm.org/D125970	2022-05-20 08:50:37 +01:00
Amy Kwan	c35ca3a1c7	[PowerPC] Implement XL compat __fnabs and __fnabss builtins. This patch implements the following floating point negative absolute value builtins that required for compatibility with the XL compiler: ``` double __fnabs(double); float __fnabss(float); ``` These builtins will emit : - fnabs on PWR6 and below, or if VSX is disabled. - xsnabsdp on PWR7 and above, if VSX is enabled. Differential Revision: https://reviews.llvm.org/D125506	2022-05-19 11:28:40 -05:00
Yaxun (Sam) Liu	559b8fc17e	[AMDGPU] emit macro __GFX9__ etc Emit predefined macros for GPU family. e.g. for GPU gfx9xx emit __GFX9__, etc. Reviewed by: Artem Belevich Differential Revision: https://reviews.llvm.org/D125909	2022-05-19 12:06:56 -04:00
Egor Zhdan	2f04e703bf	[Clang] Add DriverKit support This is the second patch that upstreams the support for Apple's DriverKit. The first patch: https://reviews.llvm.org/D118046. Differential Revision: https://reviews.llvm.org/D121911	2022-05-13 20:34:57 +01:00
Joseph Huber	002a63f937	[OpenMP] Add `__CUDA_ARCH__` definition when offloading with OpenMP Currently we define the `__CUDA_ARCH__` macro only in CUDA mode. This patch allows us to use this macro in OpenMP-offloading mode when targeting NVPTX. Reviewed By: tra, tianshilei1992 Differential Revision: https://reviews.llvm.org/D125256	2022-05-13 14:38:35 -04:00
Matt Devereau	75bb815231	[AArch64][SVE] Add aarch64_sve_pcs attribute to Clang Enable function attribute aarch64_sve_pcs at the C level, which correspondes to aarch64_sve_vector_pcs at the LLVM IR level. This requirement was created by this addition to the ARM C Language Extension: https://github.com/ARM-software/acle/pull/194 Differential Revision: https://reviews.llvm.org/D124998	2022-05-11 13:33:56 +00:00
Ting Wang	289236d597	[PowerPC] Fix PPCISD::STBRX selection issue on A2 Enable FeatureISA2_06 on Power A2 target Reviewed By: nemanjai Differential Revision: https://reviews.llvm.org/D125203	2022-05-10 20:47:51 -04:00
Ben Shi	3902ebdd57	[compiler-rt][builtins] Fix wrong ABI of AVR __mulqi3 & __mulhi3 Reviewed By: aykevl, dylanmckay Differential Revision: https://reviews.llvm.org/D125077	2022-05-06 13:46:49 +00:00
Amy Kwan	2534dc120a	[PowerPC] Enable CR bits support for Power8 and above. This patch turns on support for CR bit accesses for Power8 and above. The reason why CR bits are turned on as the default for Power8 and above is that because later architectures make use of builtins and instructions that require CR bit accesses (such as the use of setbc in the vector string isolate predicate and bcd builtins on Power10). This patch also adds the clang portion to allow for turning on CR bits in the front end if the user so desires to. Differential Revision: https://reviews.llvm.org/D124060	2022-05-02 12:06:15 -05:00
Ben Shi	42fa5bae7a	[clang][preprocessor] Add more macros to target AVR Reviewed By: MaskRay, aykevl Differential Revision: https://reviews.llvm.org/D124157	2022-05-02 04:37:57 +00:00
Kito Cheng	41b951c929	[RISCV] Fix int16 -> __fp16 conversion code gen clang emit wrong code sequence for `int16`(`short`) to `__fp16` conversion, and that should fix the code gen directly is the right way I think, but I found there is a FIXME comment in clang/Basic/TargetInfo.h say that's should be removed in future so I think just let swich to using generic LLVM IR rather than llvm.convert.to.fp16 intrinsics code gen path is enough. ``` /// Check whether llvm intrinsics such as llvm.convert.to.fp16 should be used /// to convert to and from __fp16. /// FIXME: This function should be removed once all targets stop using the /// conversion intrinsics. virtual bool useFP16ConversionIntrinsics() const { return true; } ``` Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D124509	2022-04-30 11:10:44 +08:00
Joe Nash	8bdfc73f63	[AMDGPU][clang] Definition of gfx11 subtarget Contributors: Jay Foad <jay.foad@amd.com> Konstantin Zhuravlyov <kzhuravl_dev@outlook.com> Patch 2/N for upstreaming of AMDGPU gfx11 architecture Depends on D124536 Reviewed By: foad, kzhuravl, #amdgpu, arsenm Differential Revision: https://reviews.llvm.org/D124537	2022-04-29 13:55:56 -04:00
Ulrich Weigand	1283ccb610	Support z16 processor name The recently announced IBM z16 processor implements the architecture already supported as "arch14" in LLVM. This patch adds support for "z16" as an alternate architecture name for arch14.	2022-04-21 19:58:22 +02:00
Chen Zheng	3c776c70a7	[PowerPC] add XLC compat builtin __abs Reviewed By: jsji Differential Revision: https://reviews.llvm.org/D123372	2022-04-20 05:14:22 -04:00

1 2 3 4 5 ...

937 Commits