llvm-project

Commit Graph

Author	SHA1	Message	Date
Nemanja Ivanovic	9019b55b60	[PowerPC] Fix byte ordering of ld/st with length on BE The builtins vec_xl_len_r and vec_xst_len_r actually use the wrong side of the vector on big endian Power9 systems. We never spotted this before because there was no such thing as a big endian distro that supported Power9. Now we have AIX and the elements are in the wrong part of the vector. This just fixes it so the elements are loaded to and stored from the right side of the vector.	2021-07-30 14:37:24 -05:00
Nemanja Ivanovic	1c50a5da36	[PowerPC] Implement partial vector ld/st builtins for XL compatibility XL provides functions __vec_ldrmb/__vec_strmb for loading/storing a sequence of 1 to 16 bytes in big endian order, right justified in the vector register (regardless of target endianness). This is equivalent to vec_xl_len_r/vec_xst_len_r which are only available on Power9. This patch simply uses the Power9 functions when compiled for Power9, but provides a more general implementation for Power8. Differential revision: https://reviews.llvm.org/D106757	2021-07-26 13:19:52 -05:00
Qiu Chaofan	240dde9482	[PowerPC] Change altivec indexed load/store builtins argument type This patch changes the index argument of lvxl?/lve[bhw]x and stvxl?/stve[bhw]x builtins from int to long. Because on 64-bit subtargets, an extra extsw will always been generated, which is incorrect. Reviewed By: nemanjai Differential Revision: https://reviews.llvm.org/D106530	2021-07-27 00:26:50 +08:00
Bardia Mahjour	2071ce9d45	[Altivec] Use signed comparison for vec_all_* and vec_any_* interfaces We are currently being inconsistent in using signed vs unsigned comparisons for vec_all_* and vec_any_* interfaces that use vector bool types. For example we use signed comparison for vec_all_ge(vector signed char, vector bool char) but unsigned comparison for when the arguments are swapped. GCC and XL use signed comparison instead. This patch makes clang consistent with itself and with XL and GCC. Reviewed By: nemanjai Differential Revision: https://reviews.llvm.org/D105666	2021-07-12 11:41:16 -04:00
Nemanja Ivanovic	84e429693f	[PowerPC] Fix rounding mode for vec_round in altivec.h The function is supposed to be the equivalent of rint() (as in round to nearest, ties to even) rather than round() (round to nearest, ties away from zero). In fact, the instruction we emit without VSX is vrfin which is correct. However, with VSX we emit xvrspi which is the equivalent of round() and therefore incorrect. Since there is no equivalent VSX instruction, simply use vrfin regardless of availability of VSX.	2021-07-12 06:11:27 -05:00
Nemanja Ivanovic	41ce5ec5f6	[PowerPC] Remove unnecessary 64-bit guards from altivec.h A number of functions in the header have guards for 64-bit only that were presumably added as some of the functions in the blocks use vector __int128 which is only available in 64-bit mode. A more appropriate guard (__SIZEOF_INT128__) has been added for those functions since, making the 64-bit guards redundant. This patch removes those guards as they inadvertently guard code that uses vector long long which does not actually require 64-bit mode.	2021-07-12 04:59:00 -05:00
Nemanja Ivanovic	ef906573a1	[PowerPC] Fix vec_add for 64-bit on pre-Power7 subtargets The shift of the carry was actually incorrect.	2021-06-24 18:42:44 -05:00
Nemanja Ivanovic	7cd2833311	[PowerPC] Add vec_vupkhpx and vec_vupklpx for XL compatibility These are old names for these functions that XL still supports.	2021-05-14 08:02:00 -05:00
Nemanja Ivanovic	39e4676ca7	[PowerPC] Provide doubleword vector predicate form comparisons on Power7 There are two reasons this shouldn't be restricted to Power8 and up: 1. For XL compatibility 2. Because clang will expand comparison operators to these intrinsics* *Without this patch, the following causes a selection error: int test(vector signed long a, vector signed long b) { return a < b; } This patch provides the handling for the intrinsics in the back end and removes the Power8 guards from the predicate functions (vec_{all\|any}_{eq\|ne\|gt\|ge\|lt\|le}).	2021-05-13 04:56:56 -05:00
Nemanja Ivanovic	1faf3b195e	[PowerPC] Re-commit `ed87f512bb` This was reverted in `3761b9a234` just as I was about to commit the fix. This patch inlcudes the necessary fix.	2021-05-06 09:50:12 -05:00
Nico Weber	3761b9a234	Revert "[PowerPC] Provide some P8-specific altivec overloads for P7" This reverts commit `ed87f512bb`. Breaks check-clang, see e.g. https://lab.llvm.org/buildbot/#/builders/139/builds/3818	2021-05-06 10:01:16 -04:00
Nemanja Ivanovic	ed87f512bb	[PowerPC] Provide some P8-specific altivec overloads for P7 This adds additional support for XL compatibility. There are a number of functions in altivec.h that produce a single instruction (or a very short sequence) for Power8 but can be done on Power7 without scalarization. XL provides these implementations. This patch adds the following overloads for doubleword vectors: vec_add vec_cmpeq vec_cmpgt vec_cmpge vec_cmplt vec_cmple vec_sl vec_sr vec_sra	2021-05-06 08:37:36 -05:00
Nemanja Ivanovic	bfd60b36f8	[PowerPC] Add floating point overloads for vec_sldw These are added for compatibility with XLC.	2021-04-30 20:29:03 -05:00
Nemanja Ivanovic	c3da07d216	[PowerPC] Provide fastmath sqrt and div functions in altivec.h This adds the long overdue implementations of these functions that have been part of the ABI document and are now part of the "Power Vector Intrinsic Programming Reference" (PVIPR). The approach is to add new builtins and to emit code with the fast flag regardless of whether fastmath was specified on the command line. Differential revision: https://reviews.llvm.org/D101209	2021-04-30 19:17:48 -05:00
Nemanja Ivanovic	19b29b1ed1	[PowerPC] Provide XL-compatible builtins in altivec.h There are some interfaces in altivec.h that are not compatible between Clang and XL (although Clang is compatible with GCC). Currently, we have found 3 but there may be others. Clang/GCC signatures: vector double vec_ctf(vector signed long long) vector double vec_ctf(vector unsigned long long) vector signed long long vec_cts(vector double) vector unsigned long long vec_ctu(vector double) XL signatures: vector float vec_ctf(vector signed long long) vector float vec_ctf(vector unsigned long long) vector signed int vec_cts(vector double) vector unsigned int vec_ctu(vector double) This patch provides the XL behaviour under the __XL_COMPAT_ALTIVEC__ macro for users that rely on XL behaviour. Differential revision: https://reviews.llvm.org/D101130	2021-04-23 15:13:46 -05:00
Nemanja Ivanovic	6725b90a02	[PowerPC] Add vec_ctsl and vec_ctul to altivec.h These are added for compatibility with XLC. They are similar to vec_cts and vec_ctu except that the result is a doubleword vector regardless of the parameter type.	2021-04-23 11:03:38 -05:00
Nemanja Ivanovic	7a5641d651	[PowerPC] Add missing casts for vec_xlds and vec_load_splats The previous commits just missed some pointer casts and ended up producing warnings.	2021-04-22 10:31:00 -05:00
Nemanja Ivanovic	1cc1d9db28	[PowerPC] Add vec_vclz as an alias for vec_cntlz in altivec.h Another addition for compatibility with XLC. The functions have the same overloads so just add it as a preprocessor define.	2021-04-22 10:31:00 -05:00
Nemanja Ivanovic	e43963db24	[PowerPC] Add vec_load_splats to altivec.h Add these overloads for compatibility with XLC. This is a word load-and-splat.	2021-04-22 10:31:00 -05:00
Nemanja Ivanovic	a0e6189712	[PowerPC] Add vec_xlds to altivec.h Add these overloads for compatibility with XLC. This is a doubleword load-and-splat.	2021-04-22 10:31:00 -05:00
Nemanja Ivanovic	a1d325af67	[PowerPC] Add vec_roundz as alias for vec_trunc in altivec.h Add the overloads for compatibility with XLC.	2021-04-22 10:31:00 -05:00
Nemanja Ivanovic	1550c47c18	[PowerPC] Add vec_roundp as alias for vec_ceil Add the overloads for compatibility with XLC.	2021-04-22 10:30:59 -05:00
Nemanja Ivanovic	51692c6c63	[PowerPC] Add missing VSX guard for vec_roundm with vector double The guard was missed in the previous commit.	2021-04-22 10:30:59 -05:00
Nemanja Ivanovic	3a46667059	[PowerPC] Add vec_roundm as alias for vec_floor in altivec.h Add the overloads for compatibility with XLC.	2021-04-22 10:30:59 -05:00
Nemanja Ivanovic	3bcd0ece43	[PowerPC] Add vec_roundc as alias for vec_rint in altivec.h For compatibility with XLC, add these overloads.	2021-04-22 05:31:38 -05:00
Nemanja Ivanovic	06411edb9f	[PowerPC][NFC] Provide legacy names for VSX loads and stores Before we unified the names of the builtins across all the compilers, there were a number of synonyms between them. There is code out there that uses XL naming for some of these loads and stores. This just adds those names.	2021-03-25 06:32:40 -05:00
Nemanja Ivanovic	4020932706	[PowerPC] Make altivec.h work with AIX which has no __int128 There are a number of functions in altivec.h that use vector __int128 which isn't supported on AIX. Those functions need to be guarded for targets that don't support the type. Furthermore, the functions that produce quadword instructions without using the type need a builtin. This patch adds the macro guards to altivec.h using the __SIZEOF_INT128__ which is only defined on targets that support the __int128 type.	2021-03-24 00:35:51 -05:00
Nemanja Ivanovic	4146864735	[PowerPC][NFC] Use valid type for offset in altivec.h We currently use signed long long instead of ptrdiff_t for offsets in altivec.h. This has never really presented a problem because all platforms where we use these are 64-bit. However, now that we have 32-bit targets, we need to use a meaningful type.	2021-03-23 08:45:37 -05:00
Nemanja Ivanovic	2f782a796a	[PowerPC] Add more missing overloads to altivec.h Add overloads that perform subtraction on v1i128 that take and produce vector unsigned char to avoid needing to use __int128. The overloads are suffixed with _u128 and are needed for targets where __int128 isn't supported (AIX).	2021-03-23 05:52:36 -05:00
Nemanja Ivanovic	54e4654f04	[PowerPC] Add more missing overloads to altivec.h Add overloads that perform addition on v1i128 that take and produce vector unsigned char to avoid needing to use __int128. The overloads are suffixed with _u128 and are needed for targets where __int128 isn't supported (AIX).	2021-03-23 05:09:19 -05:00
Nemanja Ivanovic	10cc5bcd86	[PowerPC] Add more missing overloads to altivec.h Add vec_permi as a synonym for vec_xxpermdi (but only for doubleword vectors).	2021-03-22 23:09:41 -05:00
Nemanja Ivanovic	b5e96e0ad6	[PowerPC] Add more missing overloads to altivec.h Add vec_gbb as a synonym for vec_vgbbd but for doubleword vectors.	2021-03-22 22:25:28 -05:00
Nemanja Ivanovic	d8e574c8e6	[PowerPC] Add more missing overloads to altivec.h Add vec_cvf as a synonym for vec_doublee/vec_floate.	2021-03-22 22:08:43 -05:00
Nemanja Ivanovic	bef2cb9062	[PowerPC] Add more missing overloads to altivec.h Add vec_ctd which is similar to vec_ctf except the return type is vector double rather than vector float.	2021-03-22 20:23:07 -05:00
Nemanja Ivanovic	b5fae4b9b2	[PowerPC] Add more missing overloads to altivec.h We are missing more predicate forms for 'vector double' and some tests. This adds the missing overloads and completes the set of test cases for them.	2021-03-12 10:51:57 -06:00
Nemanja Ivanovic	f4ad7a1a15	[PowerPC] Add missing double precision vec_all overloads to altivec.h We somehow missed vec_all_nlt, vec_all_nle and vec_all_numeric overloads for double precision vectors when VSX is enabled.	2021-03-05 18:42:12 -06:00
Nemanja Ivanovic	1ff93618e5	[PowerPC] Add missing overloads of vec_promote to altivec.h The VSX-only overloads (for 8-byte element vectors) are missing. Add the missing overloads and convert element numbering to modulo arithmetic to match GCC and XLC.	2021-03-01 21:40:30 -06:00
Nemanja Ivanovic	38a34e207f	[PowerPC] Use modulo arithmetic for vec_extract in altivec.h These interfaces are not covered in the ELFv2 ABI but are rather implemented to emulate those available in GCC/XLC. However, the ones in the other compilers are documented to perform modulo arithmetic on the element number. This patch just brings clang inline with the other compilers at -O0 (with optimization, clang already does the right thing).	2021-03-01 19:49:26 -06:00
Esme-Yi	ffa67873a3	[PowerPC] Add variants of 64-bit vector types for vec_sel. Summary: This patch added variants of vec_sel and fixed bugzilla 46770. Reviewed By: nemanjai Differential Revision: https://reviews.llvm.org/D94162	2021-01-11 03:52:16 +00:00
Masoud Ataei	fc750f609d	[PPC] Fixing a typo in altivec.h. Commenting out an unnecessary macro	2020-12-08 19:21:02 +00:00
Albion Fung	1af037f643	[PowerPC] Correct cpsgn's behaviour on PowerPC to match that of the ABI This patch fixes the reversed behaviour exhibited by cpsgn on PPC. It now matches the ABI. Differential Revision: https://reviews.llvm.org/D84962	2020-11-05 15:35:14 -05:00
Albion Fung	d30155feaa	[PowerPC] Implementation of 128-bit Binary Vector Rotate builtins This patch implements 128-bit Binary Vector Rotate builtins for PowerPC10. Differential Revision: https://reviews.llvm.org/D86819	2020-10-16 18:03:22 -04:00
Esme-Yi	e3475f5b91	[PowerPC] Add builtins for xvtdiv(dp\|sp) and xvtsqrt(dp\|sp). Summary: This patch implements the builtins for xvtdivdp, xvtdivsp, xvtsqrtdp, xvtsqrtsp. The instructions correspond to the following builtins: int vec_test_swdiv(vector double v1, vector double v2); int vec_test_swdivs(vector float v1, vector float v2); int vec_test_swsqrt(vector double v1); int vec_test_swsqrts(vector float v1); This patch depends on D88274, which fixes the bug in copying from CRRC to GPRC/G8RC. Reviewed By: steven.zhang, amyk Differential Revision: https://reviews.llvm.org/D88278	2020-10-04 16:24:20 +00:00
Amy Kwan	6b136b19cb	[Power10] Implement custom codegen for the vec_replace_elt and vec_replace_unaligned builtins. This patch implements custom codegen for the vec_replace_elt and vec_replace_unaligned builtins. These builtins map to the @llvm.ppc.altivec.vinsw and @llvm.ppc.altivec.vinsd intrinsics depending on the arguments. The main motivation for doing custom codegen for these intrinsics is because there are float and double versions of the builtin. Normally, the converting the float to an integer would be done via fptoui in the IR. This is incorrect as fptoui truncates the value and we must ensure the value is not truncated. Therefore, we provide custom codegen to utilize bitcast instead as bitcasts do not truncate. Differential Revision: https://reviews.llvm.org/D83500	2020-09-23 22:55:25 -05:00
Amy Kwan	2e7117f847	[PowerPC] Implement the 128-bit vec_[all\|any]_[eq \| ne \| lt \| gt \| le \| ge] builtins in Clang/LLVM This patch implements the vec_[all\|any]_[eq \| ne \| lt \| gt \| le \| ge] builtins for vector signed/unsigned __int128. Differential Revision: https://reviews.llvm.org/D87910	2020-09-23 16:49:40 -04:00
Albion Fung	88cdbeab41	[PowerPC] Implement Vector signed/unsigned __int128 overloads for the comparison builtins This patch implements Vector signed/unsigned __int128 overloads for the comparison builtins. Differential Revision: https://reviews.llvm.org/D87804	2020-09-23 16:49:40 -04:00
Albion Fung	d7eb917a7c	[PowerPC] Implementation of 128-bit Binary Vector Mod and Sign Extend builtins This patch implements 128-bit Binary Vector Mod and Sign Extend builtins for PowerPC10. Differential: https://reviews.llvm.org/D87394#inline-815858	2020-09-23 01:18:14 -05:00
Amy Kwan	079757b551	[PowerPC] Implement Vector String Isolate Builtins in Clang/LLVM This patch implements the vector string isolate (predicate and non-predicate versions) builtins. The predicate builtins are custom selected within PPCISelDAGToDAG. Differential Revision: https://reviews.llvm.org/D87671	2020-09-22 11:31:44 -05:00
Amy Kwan	b3147058de	[PowerPC] Implement the 128-bit Vector Divide Extended Builtins in Clang/LLVM This patch implements the 128-bit vector divide extended builtins in Clang/LLVM. These builtins map to the vdivesq and vdiveuq instructions respectively. Differential Revision: https://reviews.llvm.org/D87729	2020-09-22 11:31:44 -05:00
Amy Kwan	37e7673c21	[PowerPC] Implement Move to VSR Mask builtins in LLVM/Clang This patch implements the vec_gen[b\|h\|w\|d\|q]m function prototypes in altivec.h in order to utilize the move to VSR with mask instructions introduced in Power10. Differential Revision: https://reviews.llvm.org/D82725	2020-09-18 18:16:14 -05:00

1 2 3 4

168 Commits