llvm-project

Commit Graph

Author	SHA1	Message	Date
Craig Topper	35f708a3c9	[builtins] Inline __paritysi2 into __paritydi2 and inline __paritydi2 into __parityti2. No point in making __parityti2 go through 2 calls to get to __paritysi2. Reviewed By: MaskRay, efriedma Differential Revision: https://reviews.llvm.org/D87218	2020-09-07 17:57:39 -07:00
Brad Smith	8542dab909	[compiler-rt] Implement __clear_cache() on OpenBSD/arm	2020-09-06 15:54:24 -04:00
Anatoly Trosinenko	93eed63d2f	[builtins] Make __div[sdt]f3 handle denormal results This patch introduces denormal result support to soft-float division implementation unified by D85031. Reviewed By: sepavloff Differential Revision: https://reviews.llvm.org/D85032	2020-09-01 21:52:34 +03:00
Anatoly Trosinenko	0e90d8d4fe	[builtins] Unify the softfloat division implementation This patch replaces three different pre-existing implementations of __div[sdt]f3 LibCalls with a generic one - like it is already done for many other LibCalls. Reviewed By: sepavloff Differential Revision: https://reviews.llvm.org/D85031	2020-09-01 19:05:50 +03:00
Anatoly Trosinenko	11cf6346fd	[NFC][compiler-rt] Factor out __div[sdt]i3 and __mod[dt]i3 implementations Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D86400	2020-08-30 16:14:08 +03:00
Anatoly Trosinenko	fce035eae9	[NFC][compiler-rt] Factor out __mulo[sdt]i4 implementations to .inc file The existing implementations are almost identical except for width of the integer type. Factor them out to int_mulo_impl.inc for better maintainability. This patch is almost identical to D86277. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D86289	2020-08-27 14:33:48 +03:00
Anatoly Trosinenko	182d14db07	[NFC][compiler-rt] Factor out __mulv[sdt]i3 implementations to .inc file The existing implementations are almost identical except for width of the integer type. Factor them out to int_mulv_impl.inc for better maintainability. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D86277	2020-08-27 14:33:48 +03:00
David Tenty	f8454d60b8	[AIX][compiler-rt][builtins] Don't add ppc builtin implementations that require __int128 on AIX since __int128 currently isn't supported on AIX. Reviewed By: hubert.reinterpretcast Differential Revision: https://reviews.llvm.org/D85972	2020-08-25 11:35:38 -04:00
Freddy Ye	e02d081f2b	[X86] Support -march=sapphirerapids Support -march=sapphirerapids for x86. Compare with Icelake Server, it includes 14 more new features. They are amxtile, amxint8, amxbf16, avx512bf16, avx512vp2intersect, cldemote, enqcmd, movdir64b, movdiri, ptwrite, serialize, shstk, tsxldtrk, waitpkg. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D86503	2020-08-25 14:21:21 +08:00
Shoaib Meenai	2c80e2fe51	[runtimes] Use llvm-libtool-darwin for runtimes build It's full featured now and we can use it for the runtimes build instead of relying on an external libtool, which means the CMAKE_HOST_APPLE restriction serves no purpose either now. Restrict llvm-lipo to Darwin targets while I'm here, since it's only needed there. Reviewed By: phosek Differential Revision: https://reviews.llvm.org/D86367	2020-08-24 13:48:30 -07:00
Luís Marques	57903cf093	[compiler-rt][RISCV] Use muldi3 builtin assembly implementation D80465 added an assembly implementation of muldi3 for RISC-V but it didn't add it to the cmake `*_SOURCES` list, so the C implementation was being used instead. This patch fixes that. Differential Revision: https://reviews.llvm.org/D86036	2020-08-21 13:06:35 +01:00
Craig Topper	df9a9bb7be	[X86] Correct the implementation of the testFeature macro in getIntelProcessorTypeAndSubtype to do a proper bit test. Instead of ANDing with a one hot mask representing the bit to be tested, we were ANDing with just the bit number. This tests multiple bits none of them the correct one. This caused skylake-avx512, cascadelake and cooperlake to all be misdetected. Based on experiments with the Intel SDE, it seems that all of these CPUs are being detected as being cooperlake. This is bad since its the newest CPU of the 3.	2020-08-20 23:50:45 -07:00
Louis Dionne	afa1afd410	[CMake] Bump CMake minimum version to 3.13.4 This upgrade should be friction-less because we've already been ensuring that CMake >= 3.13.4 is used. This is part of the effort discussed on llvm-dev here: http://lists.llvm.org/pipermail/llvm-dev/2020-April/140578.html Differential Revision: https://reviews.llvm.org/D78648	2020-07-22 14:25:07 -04:00
Nico Weber	669b070936	cmake list formatting fix	2020-07-16 18:29:48 -04:00
Ryan Prichard	15b37e1cfa	[builtins] Omit 80-bit builtins on Android and MSVC long double is a 64-bit double-precision type on: - MSVC (32- and 64-bit x86) - Android (32-bit x86) long double is a 128-bit quad-precision type on x86_64 Android. The assembly variants of the 80-bit builtins are correct, but some of the builtins are implemented in C and require that long double be the 80-bit type passed via an x87 register. Reviewed By: compnerd Differential Revision: https://reviews.llvm.org/D82153	2020-07-16 15:11:26 -07:00
Ryan Prichard	8cbb6ccc7f	[builtins] Cleanup generic-file filtering Split filter_builtin_sources into two functions: - filter_builtin_sources that removes generic files when an arch-specific file is selected. - darwin_filter_builtin_sources that implements the EXCLUDE/INCLUDE lists (using the files in lib/builtins/Darwin-excludes). darwin_filter_builtin_sources delegates to filter_builtin_sources. Previously, lib/builtins/CMakeLists.txt had a number of calls to filter_builtin_sources (with a confusing/broken use of the `excluded_list` parameter), as well as a redundant arch-vs-generic filtering for the non-Apple code path at the end of the file. Replace all of this with a single call to filter_builtin_sources. Remove i686_SOURCES. Previously, this list contained only the arch-specific files common to 32-bit and 64-bit x86, which is a strange set. Normally the ${ARCH}_SOURCES list contains everything needed for the arch. "i686" isn't in ALL_BUILTIN_SUPPORTED_ARCH. NFCI, but i686_SOURCES won't be defined, and the order of files in ${arch}_SOURCES lists will change. Differential Revision: https://reviews.llvm.org/D82151	2020-07-13 16:53:07 -07:00
Ryan Prichard	f398e0f3d1	[builtins][Android] Define HAS_80_BIT_LONG_DOUBLE to 0 Android 32-bit x86 uses a 64-bit long double. Android 64-bit x86 uses a 128-bit quad-precision long double. Differential Revision: https://reviews.llvm.org/D82152	2020-07-13 16:53:07 -07:00
Craig Topper	b92c2bb6a2	[X86] Add CPU name strings to getIntelProcessorTypeAndSubtype and getAMDProcessorTypeAndSubtype in compiler-rt. These aren't used in compiler-rt, but I plan to make a similar change to the equivalent code in Host.cpp where the mapping from type/subtype is an unnecessary complication. Having the CPU strings here will help keep the code somewhat synchronized.	2020-07-12 12:59:25 -07:00
Danila Kutenin	68c011aa08	[builtins] Optimize udivmodti4 for many platforms. Summary: While benchmarking uint128 division we found out that it has huge latency for small divisors https://reviews.llvm.org/D83027 ``` Benchmark Time(ns) CPU(ns) Iterations -------------------------------------------------------------------------------------------------- BM_DivideIntrinsic128UniformDivisor<unsigned __int128> 13.0 13.0 55000000 BM_DivideIntrinsic128UniformDivisor<__int128> 14.3 14.3 50000000 BM_RemainderIntrinsic128UniformDivisor<unsigned __int128> 13.5 13.5 52000000 BM_RemainderIntrinsic128UniformDivisor<__int128> 14.1 14.1 50000000 BM_DivideIntrinsic128SmallDivisor<unsigned __int128> 153 153 5000000 BM_DivideIntrinsic128SmallDivisor<__int128> 170 170 3000000 BM_RemainderIntrinsic128SmallDivisor<unsigned __int128> 153 153 5000000 BM_RemainderIntrinsic128SmallDivisor<__int128> 155 155 5000000 ``` This patch suggests a more optimized version of the division: If the divisor is 64 bit, we can proceed with the divq instruction on x86 or constant multiplication mechanisms for other platforms. Once both divisor and dividend are not less than 2**64, we use branch free subtract algorithm, it has at most 64 cycles. After that our benchmarks improved significantly ``` Benchmark Time(ns) CPU(ns) Iterations -------------------------------------------------------------------------------------------------- BM_DivideIntrinsic128UniformDivisor<unsigned __int128> 11.0 11.0 64000000 BM_DivideIntrinsic128UniformDivisor<__int128> 13.8 13.8 51000000 BM_RemainderIntrinsic128UniformDivisor<unsigned __int128> 11.6 11.6 61000000 BM_RemainderIntrinsic128UniformDivisor<__int128> 13.7 13.7 52000000 BM_DivideIntrinsic128SmallDivisor<unsigned __int128> 27.1 27.1 26000000 BM_DivideIntrinsic128SmallDivisor<__int128> 29.4 29.4 24000000 BM_RemainderIntrinsic128SmallDivisor<unsigned __int128> 27.9 27.8 26000000 BM_RemainderIntrinsic128SmallDivisor<__int128> 29.1 29.1 25000000 ``` If not using divq instrinsics, it is still much better ``` Benchmark Time(ns) CPU(ns) Iterations -------------------------------------------------------------------------------------------------- BM_DivideIntrinsic128UniformDivisor<unsigned __int128> 12.2 12.2 58000000 BM_DivideIntrinsic128UniformDivisor<__int128> 13.5 13.5 52000000 BM_RemainderIntrinsic128UniformDivisor<unsigned __int128> 12.7 12.7 56000000 BM_RemainderIntrinsic128UniformDivisor<__int128> 13.7 13.7 51000000 BM_DivideIntrinsic128SmallDivisor<unsigned __int128> 30.2 30.2 24000000 BM_DivideIntrinsic128SmallDivisor<__int128> 33.2 33.2 22000000 BM_RemainderIntrinsic128SmallDivisor<unsigned __int128> 31.4 31.4 23000000 BM_RemainderIntrinsic128SmallDivisor<__int128> 33.8 33.8 21000000 ``` PowerPC benchmarks: Was ``` BM_DivideIntrinsic128UniformDivisor<unsigned __int128> 22.3 22.3 32000000 BM_DivideIntrinsic128UniformDivisor<__int128> 23.8 23.8 30000000 BM_RemainderIntrinsic128UniformDivisor<unsigned __int128> 22.5 22.5 32000000 BM_RemainderIntrinsic128UniformDivisor<__int128> 24.9 24.9 29000000 BM_DivideIntrinsic128SmallDivisor<unsigned __int128> 394 394 2000000 BM_DivideIntrinsic128SmallDivisor<__int128> 397 397 2000000 BM_RemainderIntrinsic128SmallDivisor<unsigned __int128> 399 399 2000000 BM_RemainderIntrinsic128SmallDivisor<__int128> 397 397 2000000 ``` With this patch ``` BM_DivideIntrinsic128UniformDivisor<unsigned __int128> 21.7 21.7 33000000 BM_DivideIntrinsic128UniformDivisor<__int128> 23.0 23.0 31000000 BM_RemainderIntrinsic128UniformDivisor<unsigned __int128> 21.9 21.9 33000000 BM_RemainderIntrinsic128UniformDivisor<__int128> 23.9 23.9 30000000 BM_DivideIntrinsic128SmallDivisor<unsigned __int128> 32.7 32.6 23000000 BM_DivideIntrinsic128SmallDivisor<__int128> 33.4 33.4 21000000 BM_RemainderIntrinsic128SmallDivisor<unsigned __int128> 31.1 31.1 22000000 BM_RemainderIntrinsic128SmallDivisor<__int128> 33.2 33.2 22000000 ``` My email: danilak@google.com, I don't have commit rights Reviewers: howard.hinnant, courbet, MaskRay Reviewed By: courbet Subscribers: steven.zhang, #sanitizers Tags: #sanitizers Differential Revision: https://reviews.llvm.org/D81809	2020-07-10 09:59:16 +02:00
Sid Manning	baca8f977e	[compiler-rt][Hexagon] Remove fma/fmin/max code This code should reside in the c-library. Differential Revision: https://reviews.llvm.org/D82263	2020-07-07 19:50:04 -05:00
Anatoly Trosinenko	0ee439b705	[builtins] Change si_int to int in some helper declarations This patch changes types of some integer function arguments or return values from `si_int` to the default `int` type to make it more compatible with `libgcc`. The compiler-rt/lib/builtins/README.txt has a link to the [libgcc specification](http://gcc.gnu.org/onlinedocs/gccint/Libgcc.html#Libgcc). This specification has an explicit note on `int`, `float` and other such types being just illustrations in some cases while the actual types are expressed with machine modes. Such usage of always-32-bit-wide integer type may lead to issues on 16-bit platforms such as MSP430. Provided [libgcc2.h](https://gcc.gnu.org/git/?p=gcc.git;a=blob_plain;f=libgcc/libgcc2.h;hb=HEAD) can be used as a reference for all targets supported by the libgcc, this patch fixes some existing differences in helper declarations. This patch is expected to not change behavior at all for targets with 32-bit `int` type. Differential Revision: https://reviews.llvm.org/D81285	2020-06-30 11:07:02 +03:00
Anatoly Trosinenko	a4e8f7fe3f	[builtins] Improve compatibility with 16 bit targets Some parts of existing codebase assume the default `int` type to be (at least) 32 bit wide. On 16 bit targets such as MSP430 this may cause Undefined Behavior or results being defined but incorrect. Differential Revision: https://reviews.llvm.org/D81408	2020-06-26 15:31:11 +03:00
Anatoly Trosinenko	a931ec7ca0	[builtins] Move more float128-related helpers to GENERIC_TF_SOURCES list There are two different _generic_ lists of source files in the compiler-rt/lib/builtins/CMakeLists.txt. Now there is no simple way to not use the tf-variants of helpers at all. Since there exists a separate `GENERIC_TF_SOURCES` list, it seems quite natural to move all float128-related helpers there. If it is not possible for some reason, it would be useful to have an explanation of that reason somewhere near the `GENERIC_TF_SOURCES` definition. Differential Revision: https://reviews.llvm.org/D81282	2020-06-25 22:32:49 +03:00
Craig Topper	23654d9e7a	Recommit "[X86] Calculate the needed size of the feature arrays in _cpu_indicator_init and getHostCPUName using the size of the feature enum." Hopefully this version will fix the previously buildbot failure	2020-06-22 13:32:03 -07:00
Craig Topper	bebea4221d	Revert "[X86] Calculate the needed size of the feature arrays in _cpu_indicator_init and getHostCPUName using the size of the feature enum." Seems to breaking build. This reverts commit `5ac144fe64`.	2020-06-22 12:20:40 -07:00
Craig Topper	5ac144fe64	[X86] Calculate the needed size of the feature arrays in _cpu_indicator_init and getHostCPUName using the size of the feature enum. Move 0 initialization up to the caller so we don't need to know the size.	2020-06-22 11:46:20 -07:00
Craig Topper	90406d62e5	[X86] Add cooperlake and tigerlake to the enum in cpu_model.c I forgot to do this when I added then to _cpu_indicator_init.	2020-06-21 16:20:26 -07:00
Craig Topper	0e6c9316d4	[X86] Add cooperlake detection to _cpu_indicator_init. libgcc has this enum encoding defined for a while, but their detection code is missing. I've raised a bug with them so that should get fixed soon.	2020-06-21 13:02:33 -07:00
Craig Topper	35f7d58328	[X86] Set the cpu_vendor in __cpu_indicator_init to VENDOR_OTHER if cpuid isn't supported on the CPU. We need to set the cpu_vendor to a non-zero value to indicate that we already called __cpu_indicator_init once. This should only happen on a 386 or 486 CPU.	2020-06-20 15:36:04 -07:00
Ryan Prichard	8627190f31	[builtins] Fix typos in comments Differential Revision: https://reviews.llvm.org/D82146	2020-06-19 16:08:04 -07:00
David Tenty	8aef01eed4	[AIX][compiler-rt] Pick the right form of COMPILER_RT_ALIAS for AIX Summary: we use the alias attribute, similar to what is done for ELF. Reviewers: ZarkoCA, jasonliu, hubert.reinterpretcast, sfertile Reviewed By: jasonliu Subscribers: dberris, aheejin, mstorsjo, #sanitizers Tags: #sanitizers Differential Revision: https://reviews.llvm.org/D81120	2020-06-16 14:10:40 -04:00
Craig Topper	033bf61cc5	[X86] Remove brand_id check from cpu_indicator_init. Brand index was a feature some Pentium III and Pentium 4 CPUs. It provided an index into a software lookup table to provide a brand name for the CPU. This is separate from the family/model. It's unclear to me why this index being non-zero was used to block checking family/model. None of the CPUs that had a non-zero brand index are supported by __builtin_cpu_is or target multi-versioning so this should have no real effect.	2020-06-12 20:35:48 -07:00
Craig Topper	94ccb2acbf	[X86] Combine to two feature variables in __cpu_indicator_init into an array and pass them around as pointer we can treat as an array. This simplifies the indexing code to set and test bits.	2020-06-12 18:30:41 -07:00
Craig Topper	e424a3526a	[X86] Explicitly initialize __cpu_features2 global in compiler-rt to 0. Seems like this may be needed in order for the linker to find the symbol. At least on my Mac.	2020-06-12 18:30:34 -07:00
kamlesh kumar	e31ccee1b0	[RISCV-V] Provide muldi3 builtin assembly implementation Provides an assembly implementation of muldi3 for RISC-V, to solve bug 43388. Since the implementation is the same as for mulsi3, that code was moved to `riscv/int_mul_impl.inc` and is now reused by both `mulsi3.S` and `muldi3.S`. Differential Revision: https://reviews.llvm.org/D80465	2020-06-02 21:04:55 +01:00
Kazushi (Jam) Marukawa	dedaf3a2ac	[VE] Dynamic stack allocation Summary: This patch implements dynamic stack allocation for the VE target. Changes: * compiler-rt: `__ve_grow_stack` to request stack allocation on the VE. * VE: base pointer support, dynamic stack allocation. Differential Revision: https://reviews.llvm.org/D79084	2020-05-27 10:11:06 +02:00
Craig Topper	2bb822bc90	[X86] Add family/model for Intel Comet Lake CPUs for -march=native and function multiversioning This adds the family/model returned by CPUID for some Intel Comet Lake CPUs. Instruction set and tuning wise these are the same as "skylake". These are not in the Intel SDM yet, but these should be correct.	2020-05-24 00:29:25 -07:00
Craig Topper	95bc21f32f	[X86] Add avx512vp2intersect feature to compiler-rt's feature detection to match libgcc.	2020-05-21 21:54:54 -07:00
Kamil Rytarowski	f61f6ffe11	[compiler-rt] [builtin] Switch the return type of __atomic_compare_exchange_##n to bool Summary: Synchronize the function definition with the LLVM documentation. https://llvm.org/docs/Atomics.html#libcalls-atomic GCC also returns bool for the same atomic builtin. Reviewers: theraven Reviewed By: theraven Subscribers: theraven, dberris, jfb, #sanitizers Tags: #sanitizers Differential Revision: https://reviews.llvm.org/D79845	2020-05-13 14:09:02 +02:00
Ayke van Laethem	4d41df6482	[builtins] Support architectures with 16-bit int This is the first patch in a series to add support for the AVR target. This patch includes changes to make compiler-rt more target independent by not relying on the width of an int or long. Differential Revision: https://reviews.llvm.org/D78662	2020-04-26 01:22:10 +02:00
Fangrui Song	17772995d4	[builtins] Add missing header in D77912 and make __builtin_clzll more robust	2020-04-17 08:29:58 -07:00
Ayke van Laethem	d9e5691843	[builtins] Fix unprototypes function declaration The following declarations were missing a prototype: FE_ROUND_MODE __fe_getround(); int __fe_raise_inexact(); Discovered while fixing a bug in Clang related to unprototyped function calls (see the previous commit). Differential Revision: https://reviews.llvm.org/D78205	2020-04-15 23:44:51 +02:00
Fangrui Song	b541196eb4	[builtins] Make __umodsi3/__udivdi3/__umoddi3 standalone (shift and subtract) @kamleshbhalui reported that when the Standard Extension M (Multiplication and Division) is disabled for RISC-V, `__udivdi3` will call __udivmodti4 which will in turn calls `__udivdi3`. This patch moves __udivsi3 (shift and subtract) to int_div_impl.inc `__udivXi3`, optimize a bit, add a `__umodXi3`, and use `__udivXi3` and `__umodXi3` to define `__udivsi3` `__umodsi3` `__udivdi3` `__umoddi3`. Reviewed By: kamleshbhalui Differential Revision: https://reviews.llvm.org/D77912	2020-04-14 10:38:37 -07:00
Shoaib Meenai	f481256bfe	[builtins] Build for arm64e for Darwin https://github.com/apple/swift/pull/30112/ makes the Swift standard library for iOS build for arm64e. If you're building Swift against your own LLVM, this in turn requires having the builtins built for arm64e, otherwise you won't be able to use the builtins (which will in turn lead to an undefined symbol for `__isOSVersionAtLeast`). Make the builtins build for arm64e to fix this. Differential Revision: https://reviews.llvm.org/D76041	2020-03-11 22:01:44 -07:00
Luís Marques	99a8cc2b7d	[compiler-rt][builtins][RISCV] Port __clear_cache to RISC-V Linux Implements `__clear_cache` for RISC-V Linux. We can't just use `fence.i` on Linux, because the Linux thread might be scheduled on another hart, and the `fence.i` instruction only flushes the icache of the current hart.	2020-03-05 16:44:47 +00:00
Steven Wu	387c3f74fd	[compiler-rt] Build all alias in builtin as private external on Darwin Summary: For builtin compiler-rt, it is built with visibility hidden by default to avoid the client exporting symbols from libclang static library. The compiler option -fvisibility=hidden doesn't work on the aliases in c files because they are created with inline assembly. On Darwin platform, thoses aliases are exported by default if they are reference by the client. Fix the issue by adding ".private_extern" to all the aliases if the library is built with visibility hidden. rdar://problem/58960296 Reviewers: dexonsmith, arphaman, delcypher, kledzik Reviewed By: delcypher Subscribers: dberris, jkorous, ributzka, #sanitizers, llvm-commits Tags: #sanitizers, #llvm Differential Revision: https://reviews.llvm.org/D73577	2020-02-26 09:29:11 -08:00
Sid Manning	d37cbda5f9	[Hexagon] Define __ELF__ by default. Differential Revision: https://reviews.llvm.org/D74972	2020-02-21 16:10:31 -06:00
Sam Clegg	2f172d8d3c	[compiler-rt] Compile __powitf2 under wasm See https://github.com/emscripten-core/emscripten/issues/10374 See https://reviews.llvm.org/D74274 Differential Revision: https://reviews.llvm.org/D74275	2020-02-11 17:35:07 -08:00
Petr Hosek	c96eeebca8	[CMake] compiler-rt: Add COMPILER_RT_BUILTINS_ENABLE_PIC The configuration for -fPIC in the builtins library when built standalone is unconditional, stating that the flags would "normally be added... by the llvm cmake step" This is untrue, as the llvm cmake step checks LLVM_ENABLE_PIC, which allows a client to turn off -fPIC. I've added an option when compiler-rt builtins are configured standalone, such as when built as part of the LLVM runtimes system, to guard the application of -fPIC for users that want it. Patch By: JamesNagurne Differential Revision: https://reviews.llvm.org/D72950	2020-01-31 15:57:18 -08:00
Yi Kong	acc79aa0e7	Revert "Revert `1689ad27af` "[builtins] Implement rounding mode support for i386/x86_64"" Don't build specilised fp_mode.c on MSVC since it does not support inline ASM on x86_64. This reverts commit `a19f0eec94`.	2019-11-27 17:29:20 -08:00
Lei Huang	9e676d9c7e	[PowerPC][compiler-rt][builtins]Add __fixtfti builtin on PowerPC Implements __fixtfti builtin for PowerPC. This builtin converts a long double (IBM double-double) to a signed int128. The conversion relies on the unsigned conversion of the absolute value of the long double. Tests included for both positive and negative long doubles. Patch By: Baptiste Saleil Differential Revision: https://reviews.llvm.org/D69730	2019-11-25 14:54:03 -06:00
Florian Hahn	a70c3f9f45	[compiler-rt] Don't check XCR0 when detecting avx512 on Darwin. Darwin lazily saves the AVX512 context on first use [1]: instead of checking that it already does to figure out if the OS supports AVX512, trust that the kernel will do the right thing and always assume the context save support is available. [1] https://github.com/apple/darwin-xnu/blob/xnu-4903.221.2/osfmk/i386/fpu.c#L174 Reviewers: ab, RKSimon, craig.topper Reviewed By: craig.topper Subscribers: dberris, JDevlieghere, #sanitizers, llvm-commits Tags: #sanitizers, #llvm Differential Revision: https://reviews.llvm.org/D70454	2019-11-21 09:19:17 +00:00
Hans Wennborg	a19f0eec94	Revert `1689ad27af` "[builtins] Implement rounding mode support for i386/x86_64" It broke the build with MSVC: fp_mode.c(20): error C2065: '__asm__': undeclared identifier > Differential Revision: https://reviews.llvm.org/D69870	2019-11-19 09:37:31 +01:00
Craig Topper	ff75bf6ac9	[X86] Add AMD Matisse (znver2) model number to getHostCPUName and compiler-rt's getAMDProcessorTypeAndSubtype. This is the CPUID model used on Ryzen 3000 series (Zen 2/Matisse) CPUs. Patch by Alex James Differential Revision: https://reviews.llvm.org/D70279	2019-11-18 11:57:04 -08:00
Yi Kong	1689ad27af	[builtins] Implement rounding mode support for i386/x86_64 Differential Revision: https://reviews.llvm.org/D69870	2019-11-18 10:32:40 -08:00
Lei Huang	71f4761431	[PowerPC][compiler-rt][builtins]Fix __fixunstfti builtin on PowerPC __fixunstfti converts a long double (IBM double-double) to an unsigned 128 bit integer. This patch enables it to handle a previously unhandled case in which a negative low double may impact the result of the conversion. Collaborated with @masoud.ataei and @renenkel. Patch By: Baptiste Saleil Differential Revision: https://reviews.llvm.org/D69193	2019-11-08 11:57:09 -06:00
Dan Liew	8ea148dc0c	[Builtins] Fix bug where powerpc builtins specializations didn't remove generic implementations. Summary: Previously the CMake code looked for filepaths of the form `<arch>/<filename>` as an indication that `<arch>/<filename>` provided a specialization of a top-level file `<filename>`. For powerpc there was a bug because the powerpc specialized implementations lived in `ppc/` but the architectures were `powerpc64` and `powerpc64le` which meant that CMake was looking for files at `powerpc64/<filename>` and `powerpc64le/<filename>`. The result of this is that for powerpc the builtins library contained a duplicate symbol for `divtc3` because it had the generic implementation and the specialized version in the built static library. Although we could just add similar code to what there is for arm (i.e. compute `${_arch}`) to fix this, this is extremely error prone (until r375150 no error was raised). Instead this patch takes a different approach that removes looking for the architecture name entirely. Instead this patch uses the convention that a source file in a sub-directory might be a specialization of a generic implementation and if a source file of the same name (ignoring extension) exists at the top-level then it is the corresponding generic implementation. This approach is much simpler because it doesn't require keeping track of different architecture names. This convention already existed in repository but previously it was implicit. This change makes it explicit. This patch is motivated by wanting to revert r375162 which worked around the powerpc bug found when r375150 landed. Once it lands we should revert r375162. Reviewers: phosek, beanz, compnerd, shiva0217, amyk, rupprecht, kongyi, mstorsjo, t.p.northover, weimingz, jroelofs, joerg, sidneym Subscribers: nemanjai, mgorny, kristof.beyls, jsji, shchenz, steven.zhang, #sanitizers, llvm-commits Tags: #llvm, #sanitizers Differential Revision: https://reviews.llvm.org/D69189	2019-10-30 16:20:09 -07:00
Bryan Chan	35cb3ee4ca	[AArch64][Builtins] Avoid unnecessary cache cleaning Use new control bits CTR_EL0.DIC and CTR_EL0.IDC to discover the d-cache cleaning and i-cache invalidation requirements for instruction-to-data coherence. This matches the behavior in the latest libgcc. Author: Shaokun Zhang <zhangshaokun@hisilicon.com> Reviewed By: peter.smith Differential Revision: https://reviews.llvm.org/D69247	2019-10-28 09:56:39 -04:00
Zoran Jovanovic	78c78cb5a1	[mips] [builtins] Remove clear_mips_cache Differential Revision: https://reviews.llvm.org/D69021 llvm-svn: 375110	2019-10-17 12:21:14 +00:00
David Carlier	d80c2520d9	[builtins] Unbreak build on FreeBSD armv7 after D60351 headers include reordering. Reviewers: phosek, echristo Reviewed-By: phosek Differential Revsion: https://reviews.llvm.org/D68045 llvm-svn: 374070	2019-10-08 15:45:35 +00:00
Rumeet Dhindsa	1605eb1c1c	Fix int to bool errors exposed due to r372612. Differential Revision: https://reviews.llvm.org/D67937 M lib/builtins/fp_add_impl.inc M lib/builtins/fp_lib.h M lib/builtins/fp_trunc_impl.inc llvm-svn: 372684	2019-09-24 02:59:02 +00:00
Ed Maste	1a3dd638c4	compiler-rt: use fp_t instead of long double, for consistency Most builtins accepting or returning long double use the fp_t typedef. Change the remaining few cases to do so. Differential Revision: https://reviews.llvm.org/D35034 llvm-svn: 371400	2019-09-09 13:50:20 +00:00
Yi Kong	33b8a55329	Revert "Revert "[builtins] Rounding mode support for addxf3/subxf3"" Test failure fixed. This reverts commit `e204d244ba`. llvm-svn: 371003	2019-09-05 01:05:05 +00:00
Craig Topper	5465875e93	[X86] Add support for avx512bf16 for __builtin_cpu_supports and compiler-rt's cpu indicator. llvm-svn: 370915	2019-09-04 16:01:43 +00:00
Peter Collingbourne	f7ca57468a	Move a break into the correct place. NFCI. Should silence new C fallthrough warning. llvm-svn: 369813	2019-08-23 21:27:56 +00:00
Nico Weber	d2e493c337	Fix Wnewline-eof after r368598 llvm-svn: 368613	2019-08-12 19:57:17 +00:00
Matthew G McGovern	38a1aa117f	[builtins] MSVC warning disable for clean build - https://reviews.llvm.org/D66023 - amended for ifdef/if gcc errors in previous verison llvm-svn: 368598	2019-08-12 18:08:44 +00:00
Eric Christopher	11c1847237	Revert "[sanitizers] MSVC warning disable for clean build" and follow-up that tried to fix the build as it's still broken. This reverts commit 368476 and 368480. llvm-svn: 368481	2019-08-09 20:43:36 +00:00
Martin Storsjo	96a2b25bcb	Fix compilation after SVN r368476 That revision broke compilation with this error: lib/builtins/fixunsxfdi.c:13:2: error: unterminated conditional directive #if !_ARCH_PPC llvm-svn: 368480	2019-08-09 20:36:00 +00:00
Matthew G McGovern	8e2842cc85	[sanitizers] MSVC warning disable for clean build - https://reviews.llvm.org/D66023 llvm-svn: 368476	2019-08-09 20:09:46 +00:00
Eric Christopher	1d73e228db	BMI2 support is indicated in bit eight of EBX, not nine. See Intel SDM, Vol 2A, Table 3-8: https://www.intel.com/content/dam/www/public/us/en/documents/manuals/64-ia-32-architectures-software-developer-vol-2a-manual.pdf#page=296 Differential Revision: https://reviews.llvm.org/D65766 llvm-svn: 367929	2019-08-05 21:25:59 +00:00
Nico Weber	e4001bacc2	gn build: Fix redundant object files in builtin lib. compiler-rt's builtin library has generic implementations of many functions, and then per-arch optimized implementations of some. In the CMake build, both filter_builtin_sources() and an explicit loop at the end of the build file (see D37166) filter out the generic versions if a per-arch file is present. The GN build wasn't doing this filtering. Just do the filtering manually and explicitly, instead of being clever. While here, also remove files from the mingw/arm build that are redundantly listed after D39938 / r318139 (both from the CMake and the GN build). While here, also fix a target_os -> target_cpu typo. Differential Revision: https://reviews.llvm.org/D65512 llvm-svn: 367448	2019-07-31 17:08:34 +00:00
Rainer Orth	569f92f1e1	[compiler-rt][builtins] Provide __clear_cache for SPARC While working on https://reviews.llvm.org/D40900, two tests were failing since __clear_cache aborted. While libgcc's __clear_cache is just empty, this only happens because gcc (in gcc/config/sparc/sparc.c (sparc32_initialize_trampoline, sparc64_initialize_trampoline)) emits flush insns directly. The following patch mimics that. Tested on sparcv9-sun-solaris2.11. Differential Revision: https://reviews.llvm.org/D64496 llvm-svn: 366822	2019-07-23 16:33:54 +00:00
Nikita Popov	a205ebb09c	[builtins] Fix assembly in arm sync-ops.h This assembly is part of a macro that was reformatted in D60351. The missing space between push and { results in: Error: bad instruction `push{r4, r5,r6,lr}' llvm-svn: 365957	2019-07-12 20:52:02 +00:00
Rainer Orth	4a9a772f44	Enable compiler-rt on SPARC This patch enables compiler-rt on SPARC targets. Most of the changes are straightforward: - Add 32 and 64-bit sparc to compiler-rt - lib/builtins/fp_lib.h needed to check if the int128_t and uint128_t types exist (which they don't on sparc) There's one issue of note: many asan tests fail to compile on Solaris/SPARC: fatal error: error in backend: Function "_ZN7testing8internal16BoolFromGTestEnvEPKcb": over-aligned dynamic alloca not supported. Therefore, while asan is still built, both asan and ubsan-with-asan testing is disabled. The goal is to check if asan keeps compiling on Solaris/SPARC. This serves asan in gcc, which doesn't have the problem above and works just fine. With this patch, sparcv9-sun-solaris2.11 test results are pretty good: Failing Tests (9): Builtins-sparc-sunos :: divtc3_test.c Builtins-sparcv9-sunos :: compiler_rt_logbl_test.c Builtins-sparcv9-sunos :: divtc3_test.c [...] UBSan-Standalone-sparc :: TestCases/TypeCheck/misaligned.cpp UBSan-Standalone-sparcv9 :: TestCases/TypeCheck/misaligned.cpp The builtin failures are due to Bugs 42493 and 42496. The tree contained a few additonal patches either currently in review or about to be submitted. Tested on sparcv9-sun-solaris2.11. Differential Revision: https://reviews.llvm.org/D40943 llvm-svn: 365880	2019-07-12 08:30:17 +00:00
Petr Hosek	d2d6c17760	[builtins] Use libtool for builtins when building for Apple platform compiler-rt already uses libtool instead of ar when building for Apple platform, but that's not being used when builtins are being built separately e.g. as part of the runtimes build. This change extracts the logic setting up libtool into a separate file and uses it from both the compiler-rt and standalone builtins build. Differential Revision: https://reviews.llvm.org/D62820 llvm-svn: 362466	2019-06-04 02:38:15 +00:00
Saleem Abdulrasool	aad5d51882	builtins: correct function name for AEABI If `COMPILER_RT_ARMHF_TARGET` is set , the definition of the AEABI runtime function `__aeabi_fcmpun` is misspelt: `__aeabi_fcmpum` instead of `__aeabi_fcmpun`. Patch by Konstantin Schwarz! llvm-svn: 362424	2019-06-03 17:08:13 +00:00
Petr Hosek	529118fc87	[builtins] Move the compare2f definition outside of the macro This should hopefully address the error we're seeing in older versions of Clang. Differential Revision: https://reviews.llvm.org/D62554 llvm-svn: 361909	2019-05-29 01:51:56 +00:00
Craig Topper	6dbf4a86a7	[X86] Add more icelake model numbers to compiler-rt implementation of __builtin_cpu_is. Using model numbers found in Table 2-1 of the May 2019 version of the Intel Software Developer's Manual Volume 4. llvm-svn: 361423	2019-05-22 19:51:48 +00:00
Petr Hosek	48140db797	[builtins] Deduplicate __eqsf2 and __gtsf2 via macro The only difference between __eqsf2 and __gtsf2 is whether they return 1 or -1 on NaN. Rather than duplicating all the code, use a macro to define the function twice and use an argument to decide whether to negate the return value. Differential Revision: https://reviews.llvm.org/D61919 llvm-svn: 361207	2019-05-20 23:34:24 +00:00
Craig Topper	b93f8ae7a7	[X86] Add icelake-client and tremont model numbers to compiler-rt's implementation of __builtin_cpu_is. llvm-svn: 361175	2019-05-20 16:58:38 +00:00
Leonard Chan	992021335c	[NFC][compiler-rt][builtins] Tidy and match comments for floating point operations Differential Revision: https://reviews.llvm.org/D61762 llvm-svn: 360389	2019-05-09 22:48:30 +00:00
Martin Storsjo	b1f3910283	Avoid duplicate function aliases on MinGW after SVN r359835 On MinGW, the same alias mechanism as for ELF, using __attribute__((__alias__())), is used. llvm-svn: 359865	2019-05-03 07:43:23 +00:00
Reid Kleckner	3961507ba1	Fix check-builtins on Windows after alias changes llvm-svn: 359835	2019-05-02 22:11:55 +00:00
Petr Hosek	e62915bcc1	[builtins] Use __APPLE__ instead of __MACH__ in check The latter doesn't seem to be working for all targets. This addresses the issue introduced in r359413. llvm-svn: 359423	2019-04-29 08:38:43 +00:00
Petr Hosek	cb929dcebe	[builtins] Fix the missing assembly on Darwin This was introduced in r359413. llvm-svn: 359421	2019-04-29 07:45:15 +00:00
Petr Hosek	ba45daab14	[builtins] Fix the typo in the preprocessor check This was introduced in r359413. llvm-svn: 359419	2019-04-29 06:30:50 +00:00
Petr Hosek	84da0e1bb7	[builtins] Use aliases for function redirects Symbol aliases are supported by all platforms that compiler-rt builtins target, and we can use these instead of function redirects to avoid the extra indirection. This is part of the cleanup proposed in "[RFC] compiler-rt builtins cleanup and refactoring". Differential Revision: https://reviews.llvm.org/D60931 llvm-svn: 359413	2019-04-29 00:46:23 +00:00
Petr Hosek	0ba22f51d1	[builtins] Use single line C++/C99 comment style Use the uniform single line C++/99 style for code comments. This is part of the cleanup proposed in "[RFC] compiler-rt builtins cleanup and refactoring". Differential Revision: https://reviews.llvm.org/D60352 llvm-svn: 359411	2019-04-28 22:47:49 +00:00
Petr Hosek	082b89b25f	[builtins] Reformat builtins with clang-format Update formatting to use the LLVM style. This is part of the cleanup proposed in "[RFC] compiler-rt builtins cleanup and refactoring". Differential Revision: https://reviews.llvm.org/D60351 llvm-svn: 359410	2019-04-28 21:53:32 +00:00
Yi Kong	815a4c902d	[builtins] Build x86_64 with GENERIC_TF_SOURCES llvm-svn: 358706	2019-04-18 19:29:03 +00:00
Yi Kong	64c32362f0	[builtins] Add __cmpsf2 for ARM version of comparesf2 The generic version of comparesf2 defines __cmpsf2 alias for libgcc compatibility, but the ARM overlay is missing the alias. Differential Revision: https://reviews.llvm.org/D60805 llvm-svn: 358542	2019-04-17 01:30:33 +00:00
Petr Hosek	40442658db	[gn] Support for building compiler-rt builtins This is support for building compiler-rt builtins, The library build should be complete for a subset of supported platforms, but not all CMake options have been replicated in GN. We always use the just built compiler to build all the runtimes, which is equivalent to the CMake runtimes build. This simplifies the build configuration because we don't need to support arbitrary host compiler and can always assume the latest Clang. With GN's toolchain support, this is significantly more efficient than the CMake runtimes build. Differential Revision: https://reviews.llvm.org/D60331 llvm-svn: 357821	2019-04-05 21:30:40 +00:00
Reid Kleckner	6af8e1e64c	Remove unneeded ymath.h include from int_math.h This avoids a conflict between stdbool.h, which defines bool to _Bool in xkeycheck.h. From what I can tell, ymath.h is an internal header, and the intention is that users should include math.h directly instead. It doesn't appear to provide declarations of anything required for our builtins. This include was added back in r249513 from 2015, and it's possible that ymath.h provided something this code needed at the time, but today it does not. llvm-svn: 357728	2019-04-04 21:47:15 +00:00
Yi Kong	e204d244ba	Revert "[builtins] Rounding mode support for addxf3/subxf3" This reverts commit `2cabea054e`. Test failure on buildbots. llvm-svn: 357048	2019-03-27 04:18:37 +00:00
Yi Kong	2cabea054e	[builtins] Rounding mode support for addxf3/subxf3 Implement rounding mode support for addxf3/subxf3. On architectures that implemented the support, this will access the corresponding floating point environment register to apply the correct rounding. For other architectures, it will keep the current behaviour and use IEEE-754 default rounding mode (to nearest, ties to even). ARM32/AArch64 support implemented in this change. i386 and AMD64 will be added in a follow up change. Differential Revision: https://reviews.llvm.org/D57143 llvm-svn: 357035	2019-03-26 22:01:22 +00:00
Hubert Tong	4e7a218abf	Fix typos in compiler-rt/lib/builtins/atomic.c Summary: This patch fixes typos in file compiler-rt/lib/builtins/atomic.c. Reviewers: jasonliu, hubert.reinterpretcast, jfb Reviewed By: jfb Subscribers: t.p.northover, theraven, dberris, jfb, jdoerfert, #sanitizers, llvm-commits Tags: #llvm, #sanitizers Differential Revision: https://reviews.llvm.org/D59228 Patch by Xing Xue. llvm-svn: 356844	2019-03-23 18:39:54 +00:00
Sterling Augustine	86724e40bf	Make __cpu_model a hidden symbol, to match libgcc. Also hide __cpu_inicator_init and __cpu_features2 for similar reasons. Summary: Make __cpu_model a hidden symbol, to match libgcc. Subscribers: llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D59561 llvm-svn: 356581	2019-03-20 17:37:23 +00:00
Eli Friedman	d674d96bc5	[builtins] Divide shouldn't underflow if rounded result would be normal. We were treating certain edge cases that are actually normal as denormal results, and flushing them to zero; we shouldn't do that. Not sure this is the cleanest way to implement this edge case, but I wanted to avoid adding any code on the common path. Differential Revision: https://reviews.llvm.org/D59070 llvm-svn: 356529	2019-03-19 21:55:58 +00:00
Craig Topper	938d3f461b	[X86] Add 'znver2' and 'cascadelake' support to __cpu_indicator_init. For 'cascadelake' this is adding a 'avx512vnni' feature check to the 0x55 skylake-avx512 model check. These CPUs use the same model number and only differ in the stepping number. But the feature flag is simpler than collecting all the stepping numbers. For 'znver2' this is just syncing with LLVM's Host.cpp. llvm-svn: 354927	2019-02-26 21:51:05 +00:00

1 2 3 4 5 ...

519 Commits