llvm-project

Commit Graph

Author	SHA1	Message	Date
Adhemerval Zanella	c288715e95	[compiler-rt] [builtins] Use _Float16 on extendhfsf2, truncdfhf2 __truncsfhf2 if available On AArch64 it allows use the native FP16 ABI (although libcalls are not emitted for fptrunc/fpext lowering), while on other architectures the expected current semantic is preserved (arm for instance). For testing the _Float16 usage is enabled by architecture base, currently only for arm, aarch64, and arm64. This re-enabled revert done by https://reviews.llvm.org/rGb534beabeed3ba1777cd0ff9ce552d077e496726 Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D92241	2020-12-03 16:08:55 -03:00
Martin Storsjö	d3fef7a7c2	[compiler-rt] Fix building the aarch64 out-of-line atomics assembly for non-ELF platforms Move the two different definitions of FUNC_ALIGN out of the ELF specific block. Add the missing CFI_END in END_COMPILERRT_OUTLINE_FUNCTION, to go with the corresponding CFI_START in DEFINE_COMPILERRT_OUTLINE_FUNCTION_UNMANGLED. Differential Revision: https://reviews.llvm.org/D92549	2020-12-03 15:31:06 +02:00
Pavel Iliin	a4ac434c47	[AArch64] Compiler-rt interface for out-of-line atomics. Out-of-line helper functions to support LSE deployment added. This is a port of libgcc implementation: https://gcc.gnu.org/git/?p=gcc.git;h=33befddcb849235353dc263db1c7d07dc15c9faa Differential Revision: https://reviews.llvm.org/D91156	2020-12-02 20:07:12 +00:00
Martin Storsjö	2e5aaf65a3	[compiler-rt] [emutls] Handle unused parameters in a compiler agnostic way The MSVC specific pragmas disable this warning, but the pragmas themselves (when not guarded by any _MSC_VER ifdef) cause warnings for other targets, e.g. when targeting mingw. Instead silence the MSVC warnings about unused parameters by casting the parameters to void. Differential Revision: https://reviews.llvm.org/D91851	2020-12-01 10:07:53 +02:00
Reid Kleckner	b534beabee	Revert builtins fp16 support: tests do not pass on Mac Revert "[compiler-rt] [builtins] Support conversion between fp16 and fp128" & dependency Revert "[compiler-rt] [builtins] Use _Float16 on extendhfsf2, truncdfhf2 __truncsfhf2 if available" This reverts commit `7a94829881`. This reverts commit `1fb91fcf9c`.	2020-11-25 16:12:49 -08:00
Adhemerval Zanella	7a94829881	[compiler-rt] [builtins] Use _Float16 on extendhfsf2, truncdfhf2 __truncsfhf2 if available On AArch64 it allows use the native FP16 ABI (although libcalls are not emitted for fptrunc/fpext lowering), while on other architectures the expected current semantic is preserved (arm for instance). Differential Revision: https://reviews.llvm.org/D91733	2020-11-19 15:14:50 -03:00
Adhemerval Zanella	1fb91fcf9c	[compiler-rt] [builtins] Support conversion between fp16 and fp128 This patch adds both extendhftf2 and trunctfhf2 to support conversion between half-precision and quad-precision floating-point values. They are enabled iff the compiler supports _Float16. Some notes on ARM plaforms: while __fp16 is supported on all architectures, _Float16 is supported only for 32-bit ARM, 64-bit ARM, and SPIR (as indicated by clang/docs/LanguageExtensions.rst). Also, __fp16 is a storage format and promoted to 'float' for argument passing and 64-bit ARM supports floating-point convert precision to half as base armv8-a instruction. It means that although extendhfsf2, truncdfhf2 __truncsfhf2 will be built for 64-bit ARM, they will be never used in practice (compiler won't emit libcall to them). This patch does not change the ABI for 32-bit ARM, it will continue to pass _Float16 as uint16. Differential Revision: https://reviews.llvm.org/D91732	2020-11-19 15:14:50 -03:00
Zhuojia Shen	0c0eeb78eb	[builtins] Add support for single-precision-only-FPU ARM targets. This patch enables building compiler-rt builtins for ARM targets that only support single-precision floating point instructions (e.g., those with -mfpu=fpv4-sp-d16). This fixes PR42838 Differential Revision: https://reviews.llvm.org/D90698	2020-11-12 15:10:48 +00:00
Ayshe Kuran	55ec2ba4bc	Fix PR47973: Addressing integer division edge case with INT_MIN Adjustment to integer division in int_div_impl.inc to avoid undefined behaviour that can occur as a result of having INT_MIN as one of the parameters. Reviewed By: sepavloff Differential Revision: https://reviews.llvm.org/D90218	2020-11-10 15:57:06 +00:00
Alex Lorenz	701456b523	[darwin] add support for __isPlatformVersionAtLeast check for if (@available) The __isPlatformVersionAtLeast routine is an implementation of `if (@available)` check that uses the _availability_version_check API on Darwin that's supported on macOS 10.15, iOS 13, tvOS 13 and watchOS 6. Differential Revision: https://reviews.llvm.org/D90367	2020-11-02 16:28:09 -08:00
Benjamin Kramer	39a0d6889d	[X86] Add a stub for Intel's alderlake. No scheduling, no autodetection.	2020-10-24 19:01:22 +02:00
Luís Marques	58f6b16c49	[compiler-rt][builtins][RISCV] Always include __mul[sd]i3 builtin definitions The RISC-V implementations of the `__mulsi3`, `__muldi3` builtins were conditionally compiling the actual function definitions depending on whether the M extension was present or not. This caused Compiler-RT testing failures for RISC-V targets with the M extension, as when these sources were included the `librt_has_muli3` features were still being defined. These `librt_has_` definitions are used to conditionally run the respective tests. Since the actual functions were not being compiled-in, the generic test for `__muldi3` would fail. This patch makes these implementations follow the normal Compiler-RT convention of always including the definition, and conditionally running the respective tests by using the lit conditional `REQUIRES: librt_has_*`. Since the `mulsi3_test.c` wasn't actually RISC-V-specific, this patch also moves it out of the `riscv` directory. It now only depends on `librt_has_mulsi3` to run. Differential Revision: https://reviews.llvm.org/D86457	2020-10-21 09:49:03 +01:00
Alexey Baturo	303e8cdacb	[NFC][RISCV][builtins] Remove some hard-coded values from i-cache clear routine Remove some hard-coded values from i-cache clear routine Differential Revision: https://reviews.llvm.org/D87578	2020-09-24 14:32:16 +01:00
David Tenty	c455961479	[compiler-rt][AIX] Add CMake support for 32-bit Power builds This patch enables support for building compiler-rt builtins for 32-bit Power arch on AIX. For now, we leave out the specialized ppc builtin implementations for 128-bit long double and friends since those will need some special handling for AIX. Reviewed By: hubert.reinterpretcast Differential Revision: https://reviews.llvm.org/D87383	2020-09-22 16:08:58 -04:00
David Tenty	89074bdc81	[AIX][compiler-rt] Use the AR/ranlib mode flag for 32-bit and 64-bit mode since we will be building both 32-bit and 64-bit compiler-rt builtins from a single configuration. Reviewed By: hubert.reinterpretcast Differential Revision: https://reviews.llvm.org/D87113	2020-09-22 11:10:47 -04:00
Alex Richardson	aa85c6f2a5	[compiler-rt] Fix atomic support functions on 32-bit architectures The code currently uses __c11_atomic_is_lock_free() to detect whether an atomic operation is natively supported. However, this can result in a runtime function call to determine whether the given operation is lock-free and clang generating a call to e.g. __atomic_load_8 since the branch is not a constant zero. Since we are implementing those runtime functions, we must avoid those calls. This patch replaces __c11_atomic_is_lock_free() with __atomic_always_lock_free() which always results in a compile-time constant value. This problem was found while compiling atomic.c for MIPS32 since the -Watomic-alignment warning was being triggered and objdump showed an undefined reference to _atomic_is_lock_free. In addition to fixing 32-bit platforms this also enables the 16-byte case that was disabled in r153779 (`185f2edd70`). Reviewed By: efriedma Differential Revision: https://reviews.llvm.org/D86510	2020-09-21 10:21:11 +01:00
Craig Topper	c9af34027b	Add __divmodti4 to match libgcc. gcc has used this on x86-64 since at least version 7. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D80506	2020-09-16 21:56:01 -07:00
Stephen Hines	516a01b5f3	Implement __isOSVersionAtLeast for Android Add the implementation of __isOSVersionAtLeast for Android. Currently, only the major version is checked against the API level of the platform which is an integer. The API level is retrieved by reading the system property ro.build.version.sdk (and optionally ro.build.version.codename to see if the platform is released or not). Patch by jiyong@google.com Bug: 150860940 Bug: 134795810 Test: m Reviewed By: srhines Differential Revision: https://reviews.llvm.org/D86596	2020-09-15 12:54:06 -07:00
Craig Topper	f5ad9c2e0e	[builtins] Write __divmoddi4/__divmodsi4 in terms __udivmod instead of __div and multiply. Previously we calculating the remainder by multiplying the quotient and divisor and subtracting from the dividend. __udivmod can calculate the remainder while calculating the quotient. We just need to correct the sign afterward. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D87433	2020-09-10 08:08:55 -07:00
Craig Topper	35f708a3c9	[builtins] Inline __paritysi2 into __paritydi2 and inline __paritydi2 into __parityti2. No point in making __parityti2 go through 2 calls to get to __paritysi2. Reviewed By: MaskRay, efriedma Differential Revision: https://reviews.llvm.org/D87218	2020-09-07 17:57:39 -07:00
Brad Smith	8542dab909	[compiler-rt] Implement __clear_cache() on OpenBSD/arm	2020-09-06 15:54:24 -04:00
Anatoly Trosinenko	93eed63d2f	[builtins] Make __div[sdt]f3 handle denormal results This patch introduces denormal result support to soft-float division implementation unified by D85031. Reviewed By: sepavloff Differential Revision: https://reviews.llvm.org/D85032	2020-09-01 21:52:34 +03:00
Anatoly Trosinenko	0e90d8d4fe	[builtins] Unify the softfloat division implementation This patch replaces three different pre-existing implementations of __div[sdt]f3 LibCalls with a generic one - like it is already done for many other LibCalls. Reviewed By: sepavloff Differential Revision: https://reviews.llvm.org/D85031	2020-09-01 19:05:50 +03:00
Anatoly Trosinenko	11cf6346fd	[NFC][compiler-rt] Factor out __div[sdt]i3 and __mod[dt]i3 implementations Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D86400	2020-08-30 16:14:08 +03:00
Anatoly Trosinenko	fce035eae9	[NFC][compiler-rt] Factor out __mulo[sdt]i4 implementations to .inc file The existing implementations are almost identical except for width of the integer type. Factor them out to int_mulo_impl.inc for better maintainability. This patch is almost identical to D86277. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D86289	2020-08-27 14:33:48 +03:00
Anatoly Trosinenko	182d14db07	[NFC][compiler-rt] Factor out __mulv[sdt]i3 implementations to .inc file The existing implementations are almost identical except for width of the integer type. Factor them out to int_mulv_impl.inc for better maintainability. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D86277	2020-08-27 14:33:48 +03:00
David Tenty	f8454d60b8	[AIX][compiler-rt][builtins] Don't add ppc builtin implementations that require __int128 on AIX since __int128 currently isn't supported on AIX. Reviewed By: hubert.reinterpretcast Differential Revision: https://reviews.llvm.org/D85972	2020-08-25 11:35:38 -04:00
Freddy Ye	e02d081f2b	[X86] Support -march=sapphirerapids Support -march=sapphirerapids for x86. Compare with Icelake Server, it includes 14 more new features. They are amxtile, amxint8, amxbf16, avx512bf16, avx512vp2intersect, cldemote, enqcmd, movdir64b, movdiri, ptwrite, serialize, shstk, tsxldtrk, waitpkg. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D86503	2020-08-25 14:21:21 +08:00
Shoaib Meenai	2c80e2fe51	[runtimes] Use llvm-libtool-darwin for runtimes build It's full featured now and we can use it for the runtimes build instead of relying on an external libtool, which means the CMAKE_HOST_APPLE restriction serves no purpose either now. Restrict llvm-lipo to Darwin targets while I'm here, since it's only needed there. Reviewed By: phosek Differential Revision: https://reviews.llvm.org/D86367	2020-08-24 13:48:30 -07:00
Luís Marques	57903cf093	[compiler-rt][RISCV] Use muldi3 builtin assembly implementation D80465 added an assembly implementation of muldi3 for RISC-V but it didn't add it to the cmake `*_SOURCES` list, so the C implementation was being used instead. This patch fixes that. Differential Revision: https://reviews.llvm.org/D86036	2020-08-21 13:06:35 +01:00
Craig Topper	df9a9bb7be	[X86] Correct the implementation of the testFeature macro in getIntelProcessorTypeAndSubtype to do a proper bit test. Instead of ANDing with a one hot mask representing the bit to be tested, we were ANDing with just the bit number. This tests multiple bits none of them the correct one. This caused skylake-avx512, cascadelake and cooperlake to all be misdetected. Based on experiments with the Intel SDE, it seems that all of these CPUs are being detected as being cooperlake. This is bad since its the newest CPU of the 3.	2020-08-20 23:50:45 -07:00
Louis Dionne	afa1afd410	[CMake] Bump CMake minimum version to 3.13.4 This upgrade should be friction-less because we've already been ensuring that CMake >= 3.13.4 is used. This is part of the effort discussed on llvm-dev here: http://lists.llvm.org/pipermail/llvm-dev/2020-April/140578.html Differential Revision: https://reviews.llvm.org/D78648	2020-07-22 14:25:07 -04:00
Nico Weber	669b070936	cmake list formatting fix	2020-07-16 18:29:48 -04:00
Ryan Prichard	15b37e1cfa	[builtins] Omit 80-bit builtins on Android and MSVC long double is a 64-bit double-precision type on: - MSVC (32- and 64-bit x86) - Android (32-bit x86) long double is a 128-bit quad-precision type on x86_64 Android. The assembly variants of the 80-bit builtins are correct, but some of the builtins are implemented in C and require that long double be the 80-bit type passed via an x87 register. Reviewed By: compnerd Differential Revision: https://reviews.llvm.org/D82153	2020-07-16 15:11:26 -07:00
Ryan Prichard	8cbb6ccc7f	[builtins] Cleanup generic-file filtering Split filter_builtin_sources into two functions: - filter_builtin_sources that removes generic files when an arch-specific file is selected. - darwin_filter_builtin_sources that implements the EXCLUDE/INCLUDE lists (using the files in lib/builtins/Darwin-excludes). darwin_filter_builtin_sources delegates to filter_builtin_sources. Previously, lib/builtins/CMakeLists.txt had a number of calls to filter_builtin_sources (with a confusing/broken use of the `excluded_list` parameter), as well as a redundant arch-vs-generic filtering for the non-Apple code path at the end of the file. Replace all of this with a single call to filter_builtin_sources. Remove i686_SOURCES. Previously, this list contained only the arch-specific files common to 32-bit and 64-bit x86, which is a strange set. Normally the ${ARCH}_SOURCES list contains everything needed for the arch. "i686" isn't in ALL_BUILTIN_SUPPORTED_ARCH. NFCI, but i686_SOURCES won't be defined, and the order of files in ${arch}_SOURCES lists will change. Differential Revision: https://reviews.llvm.org/D82151	2020-07-13 16:53:07 -07:00
Ryan Prichard	f398e0f3d1	[builtins][Android] Define HAS_80_BIT_LONG_DOUBLE to 0 Android 32-bit x86 uses a 64-bit long double. Android 64-bit x86 uses a 128-bit quad-precision long double. Differential Revision: https://reviews.llvm.org/D82152	2020-07-13 16:53:07 -07:00
Craig Topper	b92c2bb6a2	[X86] Add CPU name strings to getIntelProcessorTypeAndSubtype and getAMDProcessorTypeAndSubtype in compiler-rt. These aren't used in compiler-rt, but I plan to make a similar change to the equivalent code in Host.cpp where the mapping from type/subtype is an unnecessary complication. Having the CPU strings here will help keep the code somewhat synchronized.	2020-07-12 12:59:25 -07:00
Danila Kutenin	68c011aa08	[builtins] Optimize udivmodti4 for many platforms. Summary: While benchmarking uint128 division we found out that it has huge latency for small divisors https://reviews.llvm.org/D83027 ``` Benchmark Time(ns) CPU(ns) Iterations -------------------------------------------------------------------------------------------------- BM_DivideIntrinsic128UniformDivisor<unsigned __int128> 13.0 13.0 55000000 BM_DivideIntrinsic128UniformDivisor<__int128> 14.3 14.3 50000000 BM_RemainderIntrinsic128UniformDivisor<unsigned __int128> 13.5 13.5 52000000 BM_RemainderIntrinsic128UniformDivisor<__int128> 14.1 14.1 50000000 BM_DivideIntrinsic128SmallDivisor<unsigned __int128> 153 153 5000000 BM_DivideIntrinsic128SmallDivisor<__int128> 170 170 3000000 BM_RemainderIntrinsic128SmallDivisor<unsigned __int128> 153 153 5000000 BM_RemainderIntrinsic128SmallDivisor<__int128> 155 155 5000000 ``` This patch suggests a more optimized version of the division: If the divisor is 64 bit, we can proceed with the divq instruction on x86 or constant multiplication mechanisms for other platforms. Once both divisor and dividend are not less than 2**64, we use branch free subtract algorithm, it has at most 64 cycles. After that our benchmarks improved significantly ``` Benchmark Time(ns) CPU(ns) Iterations -------------------------------------------------------------------------------------------------- BM_DivideIntrinsic128UniformDivisor<unsigned __int128> 11.0 11.0 64000000 BM_DivideIntrinsic128UniformDivisor<__int128> 13.8 13.8 51000000 BM_RemainderIntrinsic128UniformDivisor<unsigned __int128> 11.6 11.6 61000000 BM_RemainderIntrinsic128UniformDivisor<__int128> 13.7 13.7 52000000 BM_DivideIntrinsic128SmallDivisor<unsigned __int128> 27.1 27.1 26000000 BM_DivideIntrinsic128SmallDivisor<__int128> 29.4 29.4 24000000 BM_RemainderIntrinsic128SmallDivisor<unsigned __int128> 27.9 27.8 26000000 BM_RemainderIntrinsic128SmallDivisor<__int128> 29.1 29.1 25000000 ``` If not using divq instrinsics, it is still much better ``` Benchmark Time(ns) CPU(ns) Iterations -------------------------------------------------------------------------------------------------- BM_DivideIntrinsic128UniformDivisor<unsigned __int128> 12.2 12.2 58000000 BM_DivideIntrinsic128UniformDivisor<__int128> 13.5 13.5 52000000 BM_RemainderIntrinsic128UniformDivisor<unsigned __int128> 12.7 12.7 56000000 BM_RemainderIntrinsic128UniformDivisor<__int128> 13.7 13.7 51000000 BM_DivideIntrinsic128SmallDivisor<unsigned __int128> 30.2 30.2 24000000 BM_DivideIntrinsic128SmallDivisor<__int128> 33.2 33.2 22000000 BM_RemainderIntrinsic128SmallDivisor<unsigned __int128> 31.4 31.4 23000000 BM_RemainderIntrinsic128SmallDivisor<__int128> 33.8 33.8 21000000 ``` PowerPC benchmarks: Was ``` BM_DivideIntrinsic128UniformDivisor<unsigned __int128> 22.3 22.3 32000000 BM_DivideIntrinsic128UniformDivisor<__int128> 23.8 23.8 30000000 BM_RemainderIntrinsic128UniformDivisor<unsigned __int128> 22.5 22.5 32000000 BM_RemainderIntrinsic128UniformDivisor<__int128> 24.9 24.9 29000000 BM_DivideIntrinsic128SmallDivisor<unsigned __int128> 394 394 2000000 BM_DivideIntrinsic128SmallDivisor<__int128> 397 397 2000000 BM_RemainderIntrinsic128SmallDivisor<unsigned __int128> 399 399 2000000 BM_RemainderIntrinsic128SmallDivisor<__int128> 397 397 2000000 ``` With this patch ``` BM_DivideIntrinsic128UniformDivisor<unsigned __int128> 21.7 21.7 33000000 BM_DivideIntrinsic128UniformDivisor<__int128> 23.0 23.0 31000000 BM_RemainderIntrinsic128UniformDivisor<unsigned __int128> 21.9 21.9 33000000 BM_RemainderIntrinsic128UniformDivisor<__int128> 23.9 23.9 30000000 BM_DivideIntrinsic128SmallDivisor<unsigned __int128> 32.7 32.6 23000000 BM_DivideIntrinsic128SmallDivisor<__int128> 33.4 33.4 21000000 BM_RemainderIntrinsic128SmallDivisor<unsigned __int128> 31.1 31.1 22000000 BM_RemainderIntrinsic128SmallDivisor<__int128> 33.2 33.2 22000000 ``` My email: danilak@google.com, I don't have commit rights Reviewers: howard.hinnant, courbet, MaskRay Reviewed By: courbet Subscribers: steven.zhang, #sanitizers Tags: #sanitizers Differential Revision: https://reviews.llvm.org/D81809	2020-07-10 09:59:16 +02:00
Sid Manning	baca8f977e	[compiler-rt][Hexagon] Remove fma/fmin/max code This code should reside in the c-library. Differential Revision: https://reviews.llvm.org/D82263	2020-07-07 19:50:04 -05:00
Anatoly Trosinenko	0ee439b705	[builtins] Change si_int to int in some helper declarations This patch changes types of some integer function arguments or return values from `si_int` to the default `int` type to make it more compatible with `libgcc`. The compiler-rt/lib/builtins/README.txt has a link to the [libgcc specification](http://gcc.gnu.org/onlinedocs/gccint/Libgcc.html#Libgcc). This specification has an explicit note on `int`, `float` and other such types being just illustrations in some cases while the actual types are expressed with machine modes. Such usage of always-32-bit-wide integer type may lead to issues on 16-bit platforms such as MSP430. Provided [libgcc2.h](https://gcc.gnu.org/git/?p=gcc.git;a=blob_plain;f=libgcc/libgcc2.h;hb=HEAD) can be used as a reference for all targets supported by the libgcc, this patch fixes some existing differences in helper declarations. This patch is expected to not change behavior at all for targets with 32-bit `int` type. Differential Revision: https://reviews.llvm.org/D81285	2020-06-30 11:07:02 +03:00
Anatoly Trosinenko	a4e8f7fe3f	[builtins] Improve compatibility with 16 bit targets Some parts of existing codebase assume the default `int` type to be (at least) 32 bit wide. On 16 bit targets such as MSP430 this may cause Undefined Behavior or results being defined but incorrect. Differential Revision: https://reviews.llvm.org/D81408	2020-06-26 15:31:11 +03:00
Anatoly Trosinenko	a931ec7ca0	[builtins] Move more float128-related helpers to GENERIC_TF_SOURCES list There are two different _generic_ lists of source files in the compiler-rt/lib/builtins/CMakeLists.txt. Now there is no simple way to not use the tf-variants of helpers at all. Since there exists a separate `GENERIC_TF_SOURCES` list, it seems quite natural to move all float128-related helpers there. If it is not possible for some reason, it would be useful to have an explanation of that reason somewhere near the `GENERIC_TF_SOURCES` definition. Differential Revision: https://reviews.llvm.org/D81282	2020-06-25 22:32:49 +03:00
Craig Topper	23654d9e7a	Recommit "[X86] Calculate the needed size of the feature arrays in _cpu_indicator_init and getHostCPUName using the size of the feature enum." Hopefully this version will fix the previously buildbot failure	2020-06-22 13:32:03 -07:00
Craig Topper	bebea4221d	Revert "[X86] Calculate the needed size of the feature arrays in _cpu_indicator_init and getHostCPUName using the size of the feature enum." Seems to breaking build. This reverts commit `5ac144fe64`.	2020-06-22 12:20:40 -07:00
Craig Topper	5ac144fe64	[X86] Calculate the needed size of the feature arrays in _cpu_indicator_init and getHostCPUName using the size of the feature enum. Move 0 initialization up to the caller so we don't need to know the size.	2020-06-22 11:46:20 -07:00
Craig Topper	90406d62e5	[X86] Add cooperlake and tigerlake to the enum in cpu_model.c I forgot to do this when I added then to _cpu_indicator_init.	2020-06-21 16:20:26 -07:00
Craig Topper	0e6c9316d4	[X86] Add cooperlake detection to _cpu_indicator_init. libgcc has this enum encoding defined for a while, but their detection code is missing. I've raised a bug with them so that should get fixed soon.	2020-06-21 13:02:33 -07:00
Craig Topper	35f7d58328	[X86] Set the cpu_vendor in __cpu_indicator_init to VENDOR_OTHER if cpuid isn't supported on the CPU. We need to set the cpu_vendor to a non-zero value to indicate that we already called __cpu_indicator_init once. This should only happen on a 386 or 486 CPU.	2020-06-20 15:36:04 -07:00
Ryan Prichard	8627190f31	[builtins] Fix typos in comments Differential Revision: https://reviews.llvm.org/D82146	2020-06-19 16:08:04 -07:00
David Tenty	8aef01eed4	[AIX][compiler-rt] Pick the right form of COMPILER_RT_ALIAS for AIX Summary: we use the alias attribute, similar to what is done for ELF. Reviewers: ZarkoCA, jasonliu, hubert.reinterpretcast, sfertile Reviewed By: jasonliu Subscribers: dberris, aheejin, mstorsjo, #sanitizers Tags: #sanitizers Differential Revision: https://reviews.llvm.org/D81120	2020-06-16 14:10:40 -04:00
Craig Topper	033bf61cc5	[X86] Remove brand_id check from cpu_indicator_init. Brand index was a feature some Pentium III and Pentium 4 CPUs. It provided an index into a software lookup table to provide a brand name for the CPU. This is separate from the family/model. It's unclear to me why this index being non-zero was used to block checking family/model. None of the CPUs that had a non-zero brand index are supported by __builtin_cpu_is or target multi-versioning so this should have no real effect.	2020-06-12 20:35:48 -07:00
Craig Topper	94ccb2acbf	[X86] Combine to two feature variables in __cpu_indicator_init into an array and pass them around as pointer we can treat as an array. This simplifies the indexing code to set and test bits.	2020-06-12 18:30:41 -07:00
Craig Topper	e424a3526a	[X86] Explicitly initialize __cpu_features2 global in compiler-rt to 0. Seems like this may be needed in order for the linker to find the symbol. At least on my Mac.	2020-06-12 18:30:34 -07:00
kamlesh kumar	e31ccee1b0	[RISCV-V] Provide muldi3 builtin assembly implementation Provides an assembly implementation of muldi3 for RISC-V, to solve bug 43388. Since the implementation is the same as for mulsi3, that code was moved to `riscv/int_mul_impl.inc` and is now reused by both `mulsi3.S` and `muldi3.S`. Differential Revision: https://reviews.llvm.org/D80465	2020-06-02 21:04:55 +01:00
Kazushi (Jam) Marukawa	dedaf3a2ac	[VE] Dynamic stack allocation Summary: This patch implements dynamic stack allocation for the VE target. Changes: * compiler-rt: `__ve_grow_stack` to request stack allocation on the VE. * VE: base pointer support, dynamic stack allocation. Differential Revision: https://reviews.llvm.org/D79084	2020-05-27 10:11:06 +02:00
Craig Topper	2bb822bc90	[X86] Add family/model for Intel Comet Lake CPUs for -march=native and function multiversioning This adds the family/model returned by CPUID for some Intel Comet Lake CPUs. Instruction set and tuning wise these are the same as "skylake". These are not in the Intel SDM yet, but these should be correct.	2020-05-24 00:29:25 -07:00
Craig Topper	95bc21f32f	[X86] Add avx512vp2intersect feature to compiler-rt's feature detection to match libgcc.	2020-05-21 21:54:54 -07:00
Kamil Rytarowski	f61f6ffe11	[compiler-rt] [builtin] Switch the return type of __atomic_compare_exchange_##n to bool Summary: Synchronize the function definition with the LLVM documentation. https://llvm.org/docs/Atomics.html#libcalls-atomic GCC also returns bool for the same atomic builtin. Reviewers: theraven Reviewed By: theraven Subscribers: theraven, dberris, jfb, #sanitizers Tags: #sanitizers Differential Revision: https://reviews.llvm.org/D79845	2020-05-13 14:09:02 +02:00
Ayke van Laethem	4d41df6482	[builtins] Support architectures with 16-bit int This is the first patch in a series to add support for the AVR target. This patch includes changes to make compiler-rt more target independent by not relying on the width of an int or long. Differential Revision: https://reviews.llvm.org/D78662	2020-04-26 01:22:10 +02:00
Fangrui Song	17772995d4	[builtins] Add missing header in D77912 and make __builtin_clzll more robust	2020-04-17 08:29:58 -07:00
Ayke van Laethem	d9e5691843	[builtins] Fix unprototypes function declaration The following declarations were missing a prototype: FE_ROUND_MODE __fe_getround(); int __fe_raise_inexact(); Discovered while fixing a bug in Clang related to unprototyped function calls (see the previous commit). Differential Revision: https://reviews.llvm.org/D78205	2020-04-15 23:44:51 +02:00
Fangrui Song	b541196eb4	[builtins] Make __umodsi3/__udivdi3/__umoddi3 standalone (shift and subtract) @kamleshbhalui reported that when the Standard Extension M (Multiplication and Division) is disabled for RISC-V, `__udivdi3` will call __udivmodti4 which will in turn calls `__udivdi3`. This patch moves __udivsi3 (shift and subtract) to int_div_impl.inc `__udivXi3`, optimize a bit, add a `__umodXi3`, and use `__udivXi3` and `__umodXi3` to define `__udivsi3` `__umodsi3` `__udivdi3` `__umoddi3`. Reviewed By: kamleshbhalui Differential Revision: https://reviews.llvm.org/D77912	2020-04-14 10:38:37 -07:00
Shoaib Meenai	f481256bfe	[builtins] Build for arm64e for Darwin https://github.com/apple/swift/pull/30112/ makes the Swift standard library for iOS build for arm64e. If you're building Swift against your own LLVM, this in turn requires having the builtins built for arm64e, otherwise you won't be able to use the builtins (which will in turn lead to an undefined symbol for `__isOSVersionAtLeast`). Make the builtins build for arm64e to fix this. Differential Revision: https://reviews.llvm.org/D76041	2020-03-11 22:01:44 -07:00
Luís Marques	99a8cc2b7d	[compiler-rt][builtins][RISCV] Port __clear_cache to RISC-V Linux Implements `__clear_cache` for RISC-V Linux. We can't just use `fence.i` on Linux, because the Linux thread might be scheduled on another hart, and the `fence.i` instruction only flushes the icache of the current hart.	2020-03-05 16:44:47 +00:00
Steven Wu	387c3f74fd	[compiler-rt] Build all alias in builtin as private external on Darwin Summary: For builtin compiler-rt, it is built with visibility hidden by default to avoid the client exporting symbols from libclang static library. The compiler option -fvisibility=hidden doesn't work on the aliases in c files because they are created with inline assembly. On Darwin platform, thoses aliases are exported by default if they are reference by the client. Fix the issue by adding ".private_extern" to all the aliases if the library is built with visibility hidden. rdar://problem/58960296 Reviewers: dexonsmith, arphaman, delcypher, kledzik Reviewed By: delcypher Subscribers: dberris, jkorous, ributzka, #sanitizers, llvm-commits Tags: #sanitizers, #llvm Differential Revision: https://reviews.llvm.org/D73577	2020-02-26 09:29:11 -08:00
Sid Manning	d37cbda5f9	[Hexagon] Define __ELF__ by default. Differential Revision: https://reviews.llvm.org/D74972	2020-02-21 16:10:31 -06:00
Sam Clegg	2f172d8d3c	[compiler-rt] Compile __powitf2 under wasm See https://github.com/emscripten-core/emscripten/issues/10374 See https://reviews.llvm.org/D74274 Differential Revision: https://reviews.llvm.org/D74275	2020-02-11 17:35:07 -08:00
Petr Hosek	c96eeebca8	[CMake] compiler-rt: Add COMPILER_RT_BUILTINS_ENABLE_PIC The configuration for -fPIC in the builtins library when built standalone is unconditional, stating that the flags would "normally be added... by the llvm cmake step" This is untrue, as the llvm cmake step checks LLVM_ENABLE_PIC, which allows a client to turn off -fPIC. I've added an option when compiler-rt builtins are configured standalone, such as when built as part of the LLVM runtimes system, to guard the application of -fPIC for users that want it. Patch By: JamesNagurne Differential Revision: https://reviews.llvm.org/D72950	2020-01-31 15:57:18 -08:00
Yi Kong	acc79aa0e7	Revert "Revert `1689ad27af` "[builtins] Implement rounding mode support for i386/x86_64"" Don't build specilised fp_mode.c on MSVC since it does not support inline ASM on x86_64. This reverts commit `a19f0eec94`.	2019-11-27 17:29:20 -08:00
Lei Huang	9e676d9c7e	[PowerPC][compiler-rt][builtins]Add __fixtfti builtin on PowerPC Implements __fixtfti builtin for PowerPC. This builtin converts a long double (IBM double-double) to a signed int128. The conversion relies on the unsigned conversion of the absolute value of the long double. Tests included for both positive and negative long doubles. Patch By: Baptiste Saleil Differential Revision: https://reviews.llvm.org/D69730	2019-11-25 14:54:03 -06:00
Florian Hahn	a70c3f9f45	[compiler-rt] Don't check XCR0 when detecting avx512 on Darwin. Darwin lazily saves the AVX512 context on first use [1]: instead of checking that it already does to figure out if the OS supports AVX512, trust that the kernel will do the right thing and always assume the context save support is available. [1] https://github.com/apple/darwin-xnu/blob/xnu-4903.221.2/osfmk/i386/fpu.c#L174 Reviewers: ab, RKSimon, craig.topper Reviewed By: craig.topper Subscribers: dberris, JDevlieghere, #sanitizers, llvm-commits Tags: #sanitizers, #llvm Differential Revision: https://reviews.llvm.org/D70454	2019-11-21 09:19:17 +00:00
Hans Wennborg	a19f0eec94	Revert `1689ad27af` "[builtins] Implement rounding mode support for i386/x86_64" It broke the build with MSVC: fp_mode.c(20): error C2065: '__asm__': undeclared identifier > Differential Revision: https://reviews.llvm.org/D69870	2019-11-19 09:37:31 +01:00
Craig Topper	ff75bf6ac9	[X86] Add AMD Matisse (znver2) model number to getHostCPUName and compiler-rt's getAMDProcessorTypeAndSubtype. This is the CPUID model used on Ryzen 3000 series (Zen 2/Matisse) CPUs. Patch by Alex James Differential Revision: https://reviews.llvm.org/D70279	2019-11-18 11:57:04 -08:00
Yi Kong	1689ad27af	[builtins] Implement rounding mode support for i386/x86_64 Differential Revision: https://reviews.llvm.org/D69870	2019-11-18 10:32:40 -08:00
Lei Huang	71f4761431	[PowerPC][compiler-rt][builtins]Fix __fixunstfti builtin on PowerPC __fixunstfti converts a long double (IBM double-double) to an unsigned 128 bit integer. This patch enables it to handle a previously unhandled case in which a negative low double may impact the result of the conversion. Collaborated with @masoud.ataei and @renenkel. Patch By: Baptiste Saleil Differential Revision: https://reviews.llvm.org/D69193	2019-11-08 11:57:09 -06:00
Dan Liew	8ea148dc0c	[Builtins] Fix bug where powerpc builtins specializations didn't remove generic implementations. Summary: Previously the CMake code looked for filepaths of the form `<arch>/<filename>` as an indication that `<arch>/<filename>` provided a specialization of a top-level file `<filename>`. For powerpc there was a bug because the powerpc specialized implementations lived in `ppc/` but the architectures were `powerpc64` and `powerpc64le` which meant that CMake was looking for files at `powerpc64/<filename>` and `powerpc64le/<filename>`. The result of this is that for powerpc the builtins library contained a duplicate symbol for `divtc3` because it had the generic implementation and the specialized version in the built static library. Although we could just add similar code to what there is for arm (i.e. compute `${_arch}`) to fix this, this is extremely error prone (until r375150 no error was raised). Instead this patch takes a different approach that removes looking for the architecture name entirely. Instead this patch uses the convention that a source file in a sub-directory might be a specialization of a generic implementation and if a source file of the same name (ignoring extension) exists at the top-level then it is the corresponding generic implementation. This approach is much simpler because it doesn't require keeping track of different architecture names. This convention already existed in repository but previously it was implicit. This change makes it explicit. This patch is motivated by wanting to revert r375162 which worked around the powerpc bug found when r375150 landed. Once it lands we should revert r375162. Reviewers: phosek, beanz, compnerd, shiva0217, amyk, rupprecht, kongyi, mstorsjo, t.p.northover, weimingz, jroelofs, joerg, sidneym Subscribers: nemanjai, mgorny, kristof.beyls, jsji, shchenz, steven.zhang, #sanitizers, llvm-commits Tags: #llvm, #sanitizers Differential Revision: https://reviews.llvm.org/D69189	2019-10-30 16:20:09 -07:00
Bryan Chan	35cb3ee4ca	[AArch64][Builtins] Avoid unnecessary cache cleaning Use new control bits CTR_EL0.DIC and CTR_EL0.IDC to discover the d-cache cleaning and i-cache invalidation requirements for instruction-to-data coherence. This matches the behavior in the latest libgcc. Author: Shaokun Zhang <zhangshaokun@hisilicon.com> Reviewed By: peter.smith Differential Revision: https://reviews.llvm.org/D69247	2019-10-28 09:56:39 -04:00
Zoran Jovanovic	78c78cb5a1	[mips] [builtins] Remove clear_mips_cache Differential Revision: https://reviews.llvm.org/D69021 llvm-svn: 375110	2019-10-17 12:21:14 +00:00
David Carlier	d80c2520d9	[builtins] Unbreak build on FreeBSD armv7 after D60351 headers include reordering. Reviewers: phosek, echristo Reviewed-By: phosek Differential Revsion: https://reviews.llvm.org/D68045 llvm-svn: 374070	2019-10-08 15:45:35 +00:00
Rumeet Dhindsa	1605eb1c1c	Fix int to bool errors exposed due to r372612. Differential Revision: https://reviews.llvm.org/D67937 M lib/builtins/fp_add_impl.inc M lib/builtins/fp_lib.h M lib/builtins/fp_trunc_impl.inc llvm-svn: 372684	2019-09-24 02:59:02 +00:00
Ed Maste	1a3dd638c4	compiler-rt: use fp_t instead of long double, for consistency Most builtins accepting or returning long double use the fp_t typedef. Change the remaining few cases to do so. Differential Revision: https://reviews.llvm.org/D35034 llvm-svn: 371400	2019-09-09 13:50:20 +00:00
Yi Kong	33b8a55329	Revert "Revert "[builtins] Rounding mode support for addxf3/subxf3"" Test failure fixed. This reverts commit `e204d244ba`. llvm-svn: 371003	2019-09-05 01:05:05 +00:00
Craig Topper	5465875e93	[X86] Add support for avx512bf16 for __builtin_cpu_supports and compiler-rt's cpu indicator. llvm-svn: 370915	2019-09-04 16:01:43 +00:00
Peter Collingbourne	f7ca57468a	Move a break into the correct place. NFCI. Should silence new C fallthrough warning. llvm-svn: 369813	2019-08-23 21:27:56 +00:00
Nico Weber	d2e493c337	Fix Wnewline-eof after r368598 llvm-svn: 368613	2019-08-12 19:57:17 +00:00
Matthew G McGovern	38a1aa117f	[builtins] MSVC warning disable for clean build - https://reviews.llvm.org/D66023 - amended for ifdef/if gcc errors in previous verison llvm-svn: 368598	2019-08-12 18:08:44 +00:00
Eric Christopher	11c1847237	Revert "[sanitizers] MSVC warning disable for clean build" and follow-up that tried to fix the build as it's still broken. This reverts commit 368476 and 368480. llvm-svn: 368481	2019-08-09 20:43:36 +00:00
Martin Storsjo	96a2b25bcb	Fix compilation after SVN r368476 That revision broke compilation with this error: lib/builtins/fixunsxfdi.c:13:2: error: unterminated conditional directive #if !_ARCH_PPC llvm-svn: 368480	2019-08-09 20:36:00 +00:00
Matthew G McGovern	8e2842cc85	[sanitizers] MSVC warning disable for clean build - https://reviews.llvm.org/D66023 llvm-svn: 368476	2019-08-09 20:09:46 +00:00
Eric Christopher	1d73e228db	BMI2 support is indicated in bit eight of EBX, not nine. See Intel SDM, Vol 2A, Table 3-8: https://www.intel.com/content/dam/www/public/us/en/documents/manuals/64-ia-32-architectures-software-developer-vol-2a-manual.pdf#page=296 Differential Revision: https://reviews.llvm.org/D65766 llvm-svn: 367929	2019-08-05 21:25:59 +00:00
Nico Weber	e4001bacc2	gn build: Fix redundant object files in builtin lib. compiler-rt's builtin library has generic implementations of many functions, and then per-arch optimized implementations of some. In the CMake build, both filter_builtin_sources() and an explicit loop at the end of the build file (see D37166) filter out the generic versions if a per-arch file is present. The GN build wasn't doing this filtering. Just do the filtering manually and explicitly, instead of being clever. While here, also remove files from the mingw/arm build that are redundantly listed after D39938 / r318139 (both from the CMake and the GN build). While here, also fix a target_os -> target_cpu typo. Differential Revision: https://reviews.llvm.org/D65512 llvm-svn: 367448	2019-07-31 17:08:34 +00:00
Rainer Orth	569f92f1e1	[compiler-rt][builtins] Provide __clear_cache for SPARC While working on https://reviews.llvm.org/D40900, two tests were failing since __clear_cache aborted. While libgcc's __clear_cache is just empty, this only happens because gcc (in gcc/config/sparc/sparc.c (sparc32_initialize_trampoline, sparc64_initialize_trampoline)) emits flush insns directly. The following patch mimics that. Tested on sparcv9-sun-solaris2.11. Differential Revision: https://reviews.llvm.org/D64496 llvm-svn: 366822	2019-07-23 16:33:54 +00:00
Nikita Popov	a205ebb09c	[builtins] Fix assembly in arm sync-ops.h This assembly is part of a macro that was reformatted in D60351. The missing space between push and { results in: Error: bad instruction `push{r4, r5,r6,lr}' llvm-svn: 365957	2019-07-12 20:52:02 +00:00
Rainer Orth	4a9a772f44	Enable compiler-rt on SPARC This patch enables compiler-rt on SPARC targets. Most of the changes are straightforward: - Add 32 and 64-bit sparc to compiler-rt - lib/builtins/fp_lib.h needed to check if the int128_t and uint128_t types exist (which they don't on sparc) There's one issue of note: many asan tests fail to compile on Solaris/SPARC: fatal error: error in backend: Function "_ZN7testing8internal16BoolFromGTestEnvEPKcb": over-aligned dynamic alloca not supported. Therefore, while asan is still built, both asan and ubsan-with-asan testing is disabled. The goal is to check if asan keeps compiling on Solaris/SPARC. This serves asan in gcc, which doesn't have the problem above and works just fine. With this patch, sparcv9-sun-solaris2.11 test results are pretty good: Failing Tests (9): Builtins-sparc-sunos :: divtc3_test.c Builtins-sparcv9-sunos :: compiler_rt_logbl_test.c Builtins-sparcv9-sunos :: divtc3_test.c [...] UBSan-Standalone-sparc :: TestCases/TypeCheck/misaligned.cpp UBSan-Standalone-sparcv9 :: TestCases/TypeCheck/misaligned.cpp The builtin failures are due to Bugs 42493 and 42496. The tree contained a few additonal patches either currently in review or about to be submitted. Tested on sparcv9-sun-solaris2.11. Differential Revision: https://reviews.llvm.org/D40943 llvm-svn: 365880	2019-07-12 08:30:17 +00:00
Petr Hosek	d2d6c17760	[builtins] Use libtool for builtins when building for Apple platform compiler-rt already uses libtool instead of ar when building for Apple platform, but that's not being used when builtins are being built separately e.g. as part of the runtimes build. This change extracts the logic setting up libtool into a separate file and uses it from both the compiler-rt and standalone builtins build. Differential Revision: https://reviews.llvm.org/D62820 llvm-svn: 362466	2019-06-04 02:38:15 +00:00
Saleem Abdulrasool	aad5d51882	builtins: correct function name for AEABI If `COMPILER_RT_ARMHF_TARGET` is set , the definition of the AEABI runtime function `__aeabi_fcmpun` is misspelt: `__aeabi_fcmpum` instead of `__aeabi_fcmpun`. Patch by Konstantin Schwarz! llvm-svn: 362424	2019-06-03 17:08:13 +00:00
Petr Hosek	529118fc87	[builtins] Move the compare2f definition outside of the macro This should hopefully address the error we're seeing in older versions of Clang. Differential Revision: https://reviews.llvm.org/D62554 llvm-svn: 361909	2019-05-29 01:51:56 +00:00
Craig Topper	6dbf4a86a7	[X86] Add more icelake model numbers to compiler-rt implementation of __builtin_cpu_is. Using model numbers found in Table 2-1 of the May 2019 version of the Intel Software Developer's Manual Volume 4. llvm-svn: 361423	2019-05-22 19:51:48 +00:00
Petr Hosek	48140db797	[builtins] Deduplicate __eqsf2 and __gtsf2 via macro The only difference between __eqsf2 and __gtsf2 is whether they return 1 or -1 on NaN. Rather than duplicating all the code, use a macro to define the function twice and use an argument to decide whether to negate the return value. Differential Revision: https://reviews.llvm.org/D61919 llvm-svn: 361207	2019-05-20 23:34:24 +00:00
Craig Topper	b93f8ae7a7	[X86] Add icelake-client and tremont model numbers to compiler-rt's implementation of __builtin_cpu_is. llvm-svn: 361175	2019-05-20 16:58:38 +00:00

1 2 3 4 5 ...

538 Commits