llvm-project

Author	SHA1	Message	Date
Craig Topper	5ece4cfe1e	[X86] Implement broadcastf32x2 and broadcasti32x2 intrinsics using __builtin_shufflevector instead of builtins This patch implements the broadcastf32x2/broadcasti32x2 intrinsics using __builtin_shufflevector. Differential Revision: https://reviews.llvm.org/D37287 llvm-svn: 312135	2017-08-30 16:15:12 +00:00
Saleem Abdulrasool	65101adb16	Headers: explicitly specify double-word alignment GCC will interpret `__attribute__((__aligned__))` as 8-byte alignment on ARM, but clang will not. Explicitly specify the alignment. This mirrors the declaration in libunwind. llvm-svn: 311576	2017-08-23 16:57:55 +00:00
Saleem Abdulrasool	75cfabef35	Headers: give _Unwind_Control_Block double-word alignment The C++ ABI requires that the exception object (which under AEABI is the `_Unwind_Control_Block`) is double-word aligned. The attribute was applied to the `_Unwind_Exception` type, but not the `_Unwind_Control_Block`. This should fix the libunwind test for the alignment of the exception type. llvm-svn: 311563	2017-08-23 15:35:33 +00:00
Yaxun Liu	a3c3d7b442	[OpenCL] Remove extra select functions from opencl-c.h OpenCL spec v2.0 s6.13.6: gentype select (gentype a, gentype b, igentype c) gentype select (gentype a, gentype b, ugentype c) igentype and ugentype must have the same number of elements and bits as gentype. Differential Revision: https://reviews.llvm.org/D36259 llvm-svn: 310160	2017-08-05 02:23:47 +00:00
Yaxun Liu	39195062c2	Add OpenCL 2.0 atomic builtin functions as Clang builtin OpenCL 2.0 atomic builtin functions have a scope argument which is ideally represented as synchronization scope argument in LLVM atomic instructions. Clang supports translating Clang atomic builtin functions to LLVM atomic instructions. However it currently does not support synchronization scope of LLVM atomic instructions. Without this, users have to use LLVM assembly code to implement OpenCL atomic builtin functions. This patch adds OpenCL 2.0 atomic builtin functions as Clang builtin functions, which supports generating LLVM atomic instructions with synchronization scope operand. Currently only constant memory scope argument is supported. Support of non-constant memory scope argument will be added later. Differential Revision: https://reviews.llvm.org/D28691 llvm-svn: 310082	2017-08-04 18:16:31 +00:00
Bruno Cardoso Lopes	d89a1eb4fb	[Headers][Darwin] Allow #include_next<float.h> to work on Darwin prior to 10.7 This fixes PR31504 and it's a follow up from adding #include_next<float.h> for Darwin in r289018. rdar://problem/29856682 llvm-svn: 309752	2017-08-01 22:10:36 +00:00
Simon Pilgrim	c14865c0c5	[X86][AVX] Ensure vector non-temporal load/store intrinsics force pointer alignment (PR33830) Clang specifies a max type alignment of 16 bytes on darwin targets (annoyingly in the driver not via cc1), meaning that the builtin nontemporal stores don't correctly align the loads/stores to 32 or 64 bytes when required, resulting in lowering to temporal unaligned loads/stores. This patch casts the vectors to explicitly aligned types prior to the load/store to ensure that the required alignment is respected. Differential Revision: https://reviews.llvm.org/D35996 llvm-svn: 309488	2017-07-29 15:33:34 +00:00
Simon Pilgrim	0b37ffbbf9	Strip trailing whitespace. NFCI. llvm-svn: 309383	2017-07-28 14:01:51 +00:00
Saleem Abdulrasool	b5eca2f9a2	Headers: fix _Unwind_{G,S}etGR for non-EHABI targets The EHABI definition was being inlined into the users even when EHABI was not in use. Adjust the condition to ensure that the right version is defined. llvm-svn: 309327	2017-07-27 21:56:25 +00:00
Saleem Abdulrasool	9c13bbe953	Headers: improve ARM EHABI coverage of unwind.h Ensure that we define the `_Unwind_Control_Block` structure used on ARM EHABI targets. This is needed for building libc++abi with the unwind.h from the resource dir. A minor fallout of this is that we needed to create a typedef for _Unwind_Exception to work across ARM EHABI and non-EHABI targets. The structure definitions here are based originally on the documentation from ARM under the "Exception Handling ABI for the ARM® Architecture" Section 7.2. They are then adjusted to more closely reflect the definition in libunwind from LLVM. Those changes are compatible in layout but permit easier use in libc++abi and help maintain compatibility between libunwind and the compiler provided definition. llvm-svn: 309226	2017-07-26 22:55:23 +00:00
Mandeep Singh Grang	79249e1be7	[clang] Add ARM64 support to armintr.h for MSVC compatibility Summary: This fixes compiling with headers from the Windows SDK for ARM64. Reviewers: compnerd, ruiu, mstorsjo Reviewed By: compnerd, mstorsjo Subscribers: mgorny, aemerson, javed.absar, kristof.beyls, llvm-commits, cfe-commits Differential Revision: https://reviews.llvm.org/D35862 llvm-svn: 309081	2017-07-26 05:29:40 +00:00
Ulrich Weigand	6af2559562	[SystemZ] Add support for IBM z14 processor (3/3) This patch updates the vecintrin.h header file to provide the new set of high-level vector built-in functions. This matches the updated definition implemented by other compilers for the platform, indicated by the pre-defined macro __VEC__ == 10302. Note that some of the new functions (notably those involving the vector float data type) are only available with -march=z14 (indicated by __ARCH__ == 12). llvm-svn: 308199	2017-07-17 17:47:35 +00:00
Ekaterina Romanova	03ecd774ba	[DOXYGEN] Corrected typos and incorrect parameter descriptions. Corrected several typos and incorrect parameter descriptions that Sony's technical writer found during review. I got an OK from Eric Christopher to commit doxygen comments without prior code review upstream. llvm-svn: 307838	2017-07-12 20:18:55 +00:00
Zvi Rackover	064f00061b	X86 Intrinsics: _bit_scan_forward should not be under #ifdef __RDRND__ Summary: The _bit_scan_forward and _bit_scan_reverse intrinsics were accidentally masked under the preprocessor checks that prune intrinsics definitions for the benefit of faster compile-time on Windows. This patch moves the definitions out of that region. Fixes pr33722 Reviewers: craig.topper, aaboud, thakis Reviewed By: craig.topper Subscribers: cfe-commits Differential Revision: https://reviews.llvm.org/D35184 llvm-svn: 307524	2017-07-10 07:13:56 +00:00
Craig Topper	b2f8b311d1	[X86] Add more feature flag bit defines to cpuid.h for gcc compatibility. llvm-svn: 307507	2017-07-09 17:43:11 +00:00
Craig Topper	f6e8408a11	[X86] Add __get_cpuid_count to cpuid.h. Update __get_cpuid to check the maximum level support before accessing the leaf. Rename level to leaf everywhere. This matches gcc behavior. llvm-svn: 307506	2017-07-09 17:43:10 +00:00
Ekaterina Romanova	cb3603a4eb	[DOXYGEN] Corrected several typos and incorrect parameter descriptions that Sony's technical writer found during review. I got an OK from Eric Christopher to commit doxygen comments without prior code review upstream. llvm-svn: 304840	2017-06-06 22:58:01 +00:00
Benjamin Kramer	c796245431	[PPC] Make altivec conversion function macros. The second argument must be a constant, otherwise instruction selection will fail. always_inline is not enough for isel to always fold everything away at -O0. Sadly the overloading turned this into a big macro mess. Fixes PR33212. llvm-svn: 304205	2017-05-30 11:37:29 +00:00
Oren Ben Simhon	140c1fb9ec	[X86] Adding avx512_vpopcntdq feature set and its intrinsics AVX512_VPOPCNTDQ is a new feature set that was published by Intel. The patch represents the Clang side of the addition of six intrinsics for two new machine instructions (vpopcntd and vpopcntq). It also includes the addition of the new feature set. Differential Revision: https://reviews.llvm.org/D33170 llvm-svn: 303857	2017-05-25 13:44:11 +00:00
Tony Jiang	9aa2c0383d	[PowerPC] Implement vec_xxsldwi builtin. The vec_xxsldwi builtin is missing from altivec.h. This has been requested by developers working on libvpx for VP9 support for Google. The patch fixes PR: https://bugs.llvm.org/show_bug.cgi?id=32653 Differential Revision: https://reviews.llvm.org/D33236 llvm-svn: 303766	2017-05-24 15:54:13 +00:00
Tony Jiang	bbc48e9164	[PowerPC] Implement vec_xxpermdi builtin. The vec_xxpermdi builtin is missing from altivec.h. This has been requested by developers working on libvpx for VP9 support for Google. The patch fixes PR: https://bugs.llvm.org/show_bug.cgi?id=32653 Differential Revision: https://reviews.llvm.org/D33053 llvm-svn: 303760	2017-05-24 15:13:32 +00:00
Ekaterina Romanova	bfc1e3a84e	(1) Fixed mismatch in intrinsics names in declarations and in doxygen comments. (2) Removed \c commands that are no longer necessary, since the same effect is achieved by the <c> ... </c> sequence. I got an OK from Eric Christopher to commit doxygen comments without prior code review upstream. llvm-svn: 303228	2017-05-17 01:46:11 +00:00
Ekaterina Romanova	1d4a0f270c	[DOXYGEN] Minor improvements in doxygen comments. Separated very long brief sections into two sections. I got an OK from Eric Christopher to commit doxygen comments without prior code review upstream. llvm-svn: 303031	2017-05-15 03:25:04 +00:00
Egor Churaev	44800c5aba	[OpenCL] Added checking OpenCL version for cl_khr_mipmap_image built-ins Reviewers: Anastasia, cfe-commits Reviewed By: Anastasia Subscribers: bader, yaxunl Differential Revision: https://reviews.llvm.org/D32897 llvm-svn: 302630	2017-05-10 08:23:01 +00:00
Simon Pilgrim	073c4e66b0	[X86][LWP] Remove MSVC LWP intrinsics stubs. Now provided in lwpintrin.h llvm-svn: 302559	2017-05-09 17:50:16 +00:00
Simon Pilgrim	7855510ae3	[X86][LWP] Removing LWP todo comment. NFCI. LWP / lwpintrin.h is now supported llvm-svn: 302557	2017-05-09 17:43:16 +00:00
Simon Pilgrim	3511348dbb	[X86][LWP] Add clang support for LWP instructions. This patch adds support for the LightWeight Profiling (LWP) instructions which are available on all AMD Bulldozer class CPUs (bdver1 to bdver4). Differential Revision: https://reviews.llvm.org/D32770 llvm-svn: 302418	2017-05-08 12:09:45 +00:00
Sam Parker	b9ea36f9c1	[ARM] ACLE Chapter 9 intrinsics Implemented the remaining integer data processing intrinsics from the ARM ACLE v2.1 spec, such as parallel arithmetic and DSP style multiplications. Differential Revision: https://reviews.llvm.org/D32282 llvm-svn: 302131	2017-05-04 08:37:59 +00:00
Simon Pilgrim	96d02f5503	[X86][AVX] Added support for _mm256_zext* helper intrinsics (PR32839) llvm-svn: 301749	2017-04-29 17:17:06 +00:00
Ekaterina Romanova	ea8702d393	[DOXYGEN] Minor improvements in doxygen comments. - I removed doxygen comments for the intrinsics that "alias" the other existing documented intrinsics and that only slightly differ in spelling (single underscores vs. double underscores). #define _tzcnt_u16(a) (__tzcnt_u16((a))) It will be very hard to keep the documentation for these "aliases" in sync with the documentation for the intrinsics they alias to. Out of sync documentation will be more confusing than no documentation. I got an OK from Eric Christopher to commit doxygen comments without prior code review upstream. llvm-svn: 301652	2017-04-28 16:45:39 +00:00
Simon Pilgrim	99ed27053d	[X86][SSE] Add _mm_set_pd1 (PR32827) Matches _mm_set_ps1 implementation llvm-svn: 301637	2017-04-28 10:28:32 +00:00
Duncan P. N. Exon Smith	e77a3aff6f	Headers: Make the type of SIZE_MAX the same as size_t size_t is usually defined as unsigned long, but on 64-bit platforms, stdint.h currently defines SIZE_MAX using "ull" (unsigned long long). Although this is the same width, it doesn't necessarily have the same alignment or calling convention. It also triggers printf warnings when using the format flag "%zu" to print SIZE_MAX. This changes SIZE_MAX to reuse the compiler-provided __SIZE_MAX__, and provides similar fixes for the other integers: - INTPTR_MIN - INTPTR_MAX - UINTPTR_MAX - PTRDIFF_MIN - PTRDIFF_MAX - INTMAX_MIN - INTMAX_MAX - UINTMAX_MAX - INTMAX_C() - UINTMAX_C() ... and fixes the typedefs for intptr_t and uintptr_t to use __INTPTR_TYPE__ and __UINTPTR_TYPE__ instead of int32_t, effectively reverting r89224, r89226, and r89237 (r89221 already having been effectively reverted). We can probably also kill __INTPTR_WIDTH__, __INTMAX_WIDTH__, and __UINTMAX_WIDTH__ in a follow-up, but I was hesitant to delete all the per-target CHECK lines in this commit since those might serve their own purpose. rdar://problem/11811377 llvm-svn: 301593	2017-04-27 21:49:45 +00:00
Eric Fiselier	56be04284f	Use __CLANG_ATOMIC_TYPE_LOCK_FREE macros in `stdatomic.h` Summary: This patch makes the header `stdatomic.h` work when `-fms-compatibility` is specified. Reviewers: rsmith Reviewed By: rsmith Subscribers: cfe-commits Differential Revision: https://reviews.llvm.org/D32322 llvm-svn: 300919	2017-04-20 23:07:38 +00:00
Ekaterina Romanova	0a40d67b20	[DOXYGEN] Minor improvements in doxygen comments. - To be consistent with the rest of the intrinsics headers, I removed the tags <i> .. </i> for marking instruction names in italics in smmintrin.h. - Formatting changes to fit into 80 characters. I got an OK from Eric Christopher to commit doxygen comments without prior code review upstream. llvm-svn: 300578	2017-04-18 19:44:07 +00:00
Simon Pilgrim	9f6e79c5e4	[X86][SSE] Update MOVNTDQA non-temporal loads to generic implementation (clang) MOVNTDQA non-temporal aligned vector loads can be correctly represented using generic builtin loads, allowing us to remove the existing x86 intrinsics. LLVM companion patch: D31767. Differential Revision: https://reviews.llvm.org/D31766 llvm-svn: 300326	2017-04-14 15:05:57 +00:00
Sanjay Patel	bd0d0068ef	[x86] fix AVX FP cmp intrinsic documentation (PR28110) This copies the text used in the #define statements to the code comments. The conflicting text comes from AMD manuals, but those are wrong. Sadly, that FP cmp text has not been updated even after some docs were updated for Zen: http://support.amd.com/en-us/search/tech-docs ( AMD64 Architecture Programmer's Manual Volume 4 ) See PR28110 for more discussion: https://bugs.llvm.org/show_bug.cgi?id=28110 Differential Revision: https://reviews.llvm.org/D31428 llvm-svn: 300068	2017-04-12 15:19:08 +00:00
Hans Wennborg	5c3c51fe05	Implement _interlockedbittestandset as a builtin It's used by MS headers in VS 2017 without including intrin.h, so we can't implement it in the header anymore. Differential Revision: https://reviews.llvm.org/D31736 llvm-svn: 299782	2017-04-07 16:41:47 +00:00
Craig Topper	01bba17819	Recommit r299321 '[X86] Add __extension__ to f16c macro intrinsics to suppress warnings about compound literals when compiled for with earlier language standards enabled.' The bot didn't recover after the revert. So it looks like this wasn't the issue. llvm-svn: 299397	2017-04-03 22:59:30 +00:00
Craig Topper	27b71e5b1b	Revert r299321 '[X86] Add __extension__ to f16c macro intrinsics to suppress warnings about compound literals when compiled for with earlier language standards enabled.' to see if recovers a fuzzer bot. llvm-svn: 299382	2017-04-03 19:43:47 +00:00
Craig Topper	bf82498301	[AVX-512] Fix a couple more intrinsic macros I missed in r299346. llvm-svn: 299347	2017-04-03 03:51:57 +00:00
Craig Topper	ac9959eb53	[AVX-512] Fix some intrinsic macros that use the wrong macro parameter names and don't have parentheses around them. Thanks to Matthew Barr for reporting this issue. llvm-svn: 299346	2017-04-03 03:41:29 +00:00
Craig Topper	ce272ae2c5	[X86] Add __extension__ to f16c macro intrinsics to suppress warnings about compound literals when compiled for with earlier language standards enabled. Fixes PR32491. llvm-svn: 299321	2017-04-02 03:02:53 +00:00
Hans Wennborg	043f402586	[X86] Implement __readgsqword (and the rest) as builtins (PR32373) It seems MS headers have started using __readgsqword, and since it's used in a header that doesn't include intrin.h, we can't implement it as an inline function anymore. That was already the case for __readfsdword, which Saleem added support for in r220859. This patch reuses that codegen to implement all of __read[fg]s{byte,word,dword,qword}. Differential Revision: https://reviews.llvm.org/D31248 llvm-svn: 298538	2017-03-22 19:13:13 +00:00
Ekaterina Romanova	6a5702a093	[DOXYGEN] Improvements to smmintrin.h and emmintrin.h intrinsics. I made some small changes in smmintrin.h and emmintrin.h intrinsics. - changed some regular comments '//' into doxygen-style comments '///' where necessary - removed some trailing spaces in doxygen comments. I got an OK from Eric Christopher to commit doxygen comments without prior code review upstream. llvm-svn: 298371	2017-03-21 13:34:06 +00:00
Simon Pilgrim	60e924985c	[X86][AVX512] Add _mm512_cvtsd_f64 and _mm512_cvtss_f32 intrinsics (PR32305) Differential Revision: https://reviews.llvm.org/D31155 llvm-svn: 298364	2017-03-21 12:46:13 +00:00
Eric Christopher	5ba576ffe6	Fix parsing of htmxlintrin.h in C++ mode - Fix a variable naming mismatch - Fix gcc extension pointer arithmetic on void to cast to char *. - Test that the header (and htmintrin.h) parse. llvm-svn: 298318	2017-03-20 22:31:33 +00:00
Anastasia Stulova	bb27dfe049	[OpenCL] Fix extension guards for atomic functions Review: D30830 Patch by James Price! llvm-svn: 298256	2017-03-20 15:02:54 +00:00
Igor Breger	f050b797ac	[X86][AVX512][Clang][Intrinsics] Adding missing intrinsics to Clang. Summary: Adding missing intrinsics: _mm512_set_epi16, _mm512_set_epi8, _mm512_permutevar_epi32, _mm512_mask_permutevar_epi32 Reviewers: zvi, guyblank, eladcohen, craig.topper Reviewed By: craig.topper Subscribers: craig.topper, cfe-commits Differential Revision: https://reviews.llvm.org/D31034 llvm-svn: 298208	2017-03-19 08:27:16 +00:00
Craig Topper	6afc436a78	[AVX-512] Change the input type for some load intrinsics to take void type like the spec (and the test cases say). llvm-svn: 298042	2017-03-17 05:59:25 +00:00
Craig Topper	2e5058c403	[AVX-512] Add missing typecasts and parentheses to _mm512_mask_i64gather_ps. My macro cleanup script I used on the others last year must have missed it. llvm-svn: 298040	2017-03-17 05:14:37 +00:00
Bruno Cardoso Lopes	ae1249e4f2	[Headers] Reapply: Add #include_next for tgmath.h on Darwin Reapply r289181 but rename the include guard to avoid conflict with the one from Darwin. Allow darwin to provide additional definitions and implementation specific values for tgmath.h on Apple platforms. rdar://problem/19019845 llvm-svn: 298013	2017-03-16 23:19:00 +00:00
Egor Churaev	60c30ae1f1	[OpenCL] Implement as_type operator as alias of __builtin_astype. Reviewers: Anastasia Reviewed By: Anastasia Subscribers: cfe-commits, yaxunl, bader Differential Revision: https://reviews.llvm.org/D28136 llvm-svn: 297947	2017-03-16 12:15:10 +00:00
Reid Kleckner	b04cb9ab7a	[MS] Add support for __ud2 and __int2c MSVC intrinsics This was requested in PR31958 and elsewhere. llvm-svn: 297057	2017-03-06 19:43:16 +00:00
Oren Ben Simhon	259b091669	[X86] DAZ Macros Relocation The DAZ feature introduces the denormal zero support for x86. Currently the definitions are located under SSE3 header, however there are some SSE2 targets that support the feature as well. Differential Revision: https://reviews.llvm.org/D30194 llvm-svn: 296296	2017-02-26 11:58:15 +00:00
Simon Pilgrim	a81d45a1ba	[X86][XOP] Fix type conversion warning in vpcmov generic implementations. llvm-svn: 295584	2017-02-18 23:47:34 +00:00
Craig Topper	117892098a	[X86] Replace XOP vpcmov builtins with native vector logical operations. llvm-svn: 295570	2017-02-18 21:15:30 +00:00
Ekaterina Romanova	ff266f5236	Added doxygen comments to smmintrin.h's intrinsics. Note: The doxygen comments are automatically generated based on Sony's intrinsics document. I got an OK from Eric Christopher to commit doxygen comments without prior code review upstream. llvm-svn: 295404	2017-02-17 02:49:50 +00:00
Anastasia Stulova	58984e7087	[OpenCL] Correct ndrange_t implementation Removed ndrange_t as Clang builtin type and added as a struct type in the OpenCL header. Use type name to do the Sema checking in enqueue_kernel and modify IR generation accordingly. Review: D28058 Patch by Dmitry Borisenkov! llvm-svn: 295311	2017-02-16 12:27:47 +00:00
Craig Topper	f0d1147fae	[AVX-512] Replace 512-bit masked packss/packus builtins and replace with new unmasked builtins. These new unmasked builtins will enable us to easily support optimizing these builtins in InstCombine in the backend. llvm-svn: 295291	2017-02-16 06:32:07 +00:00
Reid Kleckner	2a02c2e331	Fix some warnings in intrin.h llvm-svn: 295082	2017-02-14 18:38:19 +00:00
Reid Kleckner	04f9f91da6	[MS] Implement the __fastfail intrinsic as a builtin __fastfail terminates the process immediately with a special system call. It does not run any process shutdown code or exception recovery logic. Fixes PR31854 llvm-svn: 294606	2017-02-09 18:31:06 +00:00
Craig Topper	4574226c3f	[X86] Clzero flag addition and inclusion under znver1 1. Adds the command line flag for clzero. 2. Includes the clzero flag under znver1. 3. Defines the macro for clzero. 4. Adds a new file which has the intrinsic definition for clzero instruction. Patch by Ganesh Gopalasubramanian with some additional tests from me. Differential revision: https://reviews.llvm.org/D29386 llvm-svn: 294559	2017-02-09 06:10:14 +00:00
Ekaterina Romanova	ae7b82eaf8	Doxygen comments for prfchwintrin.h Added doxygen comments to prfchwintrin.h's intrinsics. Note: The doxygen comments are automatically generated based on Sony's intrinsics document. I got an OK from Eric Christopher to commit doxygen comments without prior code review upstream. llvm-svn: 293745	2017-02-01 07:37:40 +00:00
Anastasia Stulova	d1f390ef99	[OpenCL] Diagnose write_only image3d when extension is disabled Prior to OpenCL 2.0, image3d_t can only be used with the write_only access qualifier when the cl_khr_3d_image_writes extension is enabled, see e.g. OpenCL 1.1 s6.8b. Require the extension for write_only image3d_t types and guard uses of write_only image3d_t in the OpenCL header. Patch by Sven van Haastregt! Review: https://reviews.llvm.org/D28860 llvm-svn: 293050	2017-01-25 12:18:50 +00:00
Paul Robinson	a363d14538	Guard __gnuc_va_list typedef. Differential Revision: http://reviews.llvm.org/D28620 llvm-svn: 292819	2017-01-23 19:09:21 +00:00
Tim Shen	867be0d14c	[Altivec] Change vec_sl to a << (b % (sizeof(a) * 8)) For a << b (as original vec_sl does), if b >= sizeof(a) * 8, the behavior is undefined. However, Power instructions do define the behavior, which is equivalent to a << (b % (sizeof(a) * 8)). This patch changes altivec.h to use a << (b % (sizeof(a) * 8)), to ensure the consistent semantic of the instructions. Then it combines the generated multiple instructions back to a single shift. This patch handles left shift only. Right shift, on the other hand, is more complicated, considering arithmetic/logical right shift. Differential Revision: https://reviews.llvm.org/D28037 llvm-svn: 292659	2017-01-20 22:05:33 +00:00
Craig Topper	367c86ddbe	[AVX-512] Replace subvector broadcast builtins with shufflevectors and selects. Verified that the backend codegens this equally well. llvm-svn: 292329	2017-01-18 02:17:10 +00:00
Ekaterina Romanova	2e041c9c20	[DOXYGEN] Documentation for the newly added x86 intrinsics. Added doxygen comments for the newly added intrinsics in avxintrin.h, namely _mm256_cvtsd_f64, _mm256_cvtsi256_si32 and _mm256_cvtss_f32 Added doxygen comments for the new intrinsics in emmintrin.h, namely _mm_loadu_si64 and _mm_load_sd. Explicit parameter names were added for _mm_clflush and _mm_setcsr The rest of the changes are editorial, removing trailing spaces at the end of the lines. Differential Revision: https://reviews.llvm.org/D28503 llvm-svn: 291876	2017-01-13 01:14:08 +00:00
Tony Jiang	974e4c7899	[PowerPC] Fix the wrong implementation of builtin vec_rlnm. llvm-svn: 291702	2017-01-11 20:59:42 +00:00
Sean Fertile	96d9e0ec05	Add vec_insert4b and vec_extract4b functions to altivec.h Add builtins for the functions and custom codegen mapping the builtins to their corresponding intrinsics and handling the endian related swapping. https://reviews.llvm.org/D26546 llvm-svn: 291179	2017-01-05 21:43:30 +00:00
Justin Lebar	b8f7a3b8b1	[CUDA] Rename keywords used in macro so they don't conflict with MSVC. Summary: MSVC seems to use "__in" and "__out" for its own purposes, so we have to pick different names in this macro. Reviewers: tra Subscribers: cfe-commits Differential Revision: https://reviews.llvm.org/D28325 llvm-svn: 291138	2017-01-05 16:54:11 +00:00
Justin Lebar	11d5116904	[CUDA] Don't define functions that the CUDA headers themselves define on Windows. Reviewers: tra Subscribers: cfe-commits Differential Revision: https://reviews.llvm.org/D28324 llvm-svn: 291137	2017-01-05 16:53:55 +00:00
Justin Lebar	1863d611f8	[Windows] Remove functions in intrin.h that are defined in Builtin.def. Summary: These duplicate declarations cause a problem for CUDA compiles on Windows. All implicitly-defined functions are host+device, and this applies to the declarations in Builtin.def. But then when we see the declarations in intrin.h, they have no attributes, so are host-only functions. This is an error. (A better fix might be to make these builtins host-only, but that is a much bigger change.) Reviewers: rnk Subscribers: cfe-commits, echristo Differential Revision: https://reviews.llvm.org/D28317 llvm-svn: 291128	2017-01-05 16:51:37 +00:00
Artem Belevich	60f25f70c8	[CUDA] Pre-include sm_60 and sm_61 headers. CUDA-8.0 comes with new headers which nvcc pre-includes via cuda_runtime.h Clang now makes them available as well. Differential Revision: https://reviews.llvm.org/D28301 llvm-svn: 290982	2017-01-04 18:39:29 +00:00
Ekaterina Romanova	c9ed514632	[DOXYGEN] Improved doxygen comments for xmmintrin.h intrinsics. Added \n commands to insert line breaks where necessary, since one long line of documentation is nearly unreadable. Formatted comments to fit into 80 chars. In some cases added \a command in front of the parameter names to display them in italics. llvm-svn: 290619	2016-12-27 18:53:29 +00:00
Craig Topper	70536f4e47	[AVX-512] Replace masked 512-bit pmuldq and pmuludq builtins with the newly added unmasked versions and selects. llvm-svn: 290580	2016-12-27 04:04:57 +00:00
Craig Topper	32866ab800	Revert r290574 "foo" This was supposed to be merged with another commit with a real commit message. Sorry. llvm-svn: 290579	2016-12-27 04:03:29 +00:00
Craig Topper	c5ab78d4c3	Revert r290575 "[AVX-512] Replace masked 512-bit pmuldq and pmuludq builtins with the newly added unmasked versions and selects." I failed to merge this with r290574. llvm-svn: 290578	2016-12-27 04:03:25 +00:00
Craig Topper	6ad5bcc8ac	[AVX-512] Replace masked 512-bit pmuldq and pmuludq builtins with the newly added unmasked versions and selects. llvm-svn: 290575	2016-12-27 03:46:16 +00:00
Craig Topper	39b9e32493	foo llvm-svn: 290574	2016-12-27 03:46:13 +00:00
Ekaterina Romanova	dffe45b3e6	[DOXYGEN] Improved doxygen comments for x86 intrinsics. Improved doxygen comments for the following intrinsics headers: __wmmintrin_pclmul.h, bmiintrin.h, emmintrin.h, f16cintrin.h, immintrin.h, mmintrin.h, pmmintrin.h, tmmintrin.h Added \n commands to insert line breaks where necessary, since one long line of documentation is nearly unreadable. Formatted comments to fit into 80 chars. In some cases added \a command in front of the parameter names to display them in italics. llvm-svn: 290561	2016-12-27 00:49:38 +00:00
Marina Yatsina	c42fd03bf8	[inline-asm]No error for conflict between inputs\outputs and clobber list According to extended asm syntax, a case where the clobber list includes a variable from the inputs or outputs should be an error - conflict. For example: const long double a = 0.0; int main() { char b; double t1 = a; __asm__ ("fucompp": "=a" (b) : "u" (t1), "t" (t1) : "cc", "st", "st(1)"); return 0; } This should conflict with the output - t1 which is st, and st which is st as well. The patch fixes it. Commit on behalf of Ziv Izhar. Differential Revision: https://reviews.llvm.org/D15075 llvm-svn: 290539	2016-12-26 12:23:42 +00:00
Ekaterina Romanova	16166a4d71	[DOXYGEN] Improved doxygen comments for tmmintrin.h intrinsics. Added \n commands to insert line breaks where necessary to make the documentation more readable. Formatted comments to fit into 80 chars. llvm-svn: 290458	2016-12-23 23:36:26 +00:00
Ekaterina Romanova	6de0cd870b	[DOXYGEN] Improved doxygen comments for tmmintrin.h intrinsics. Tagged parameter names with \a doxygen command to display parameters in italics. Added \n commands to insert a line break to make the documentation more readable. Formatted comments to fit into 80 chars. llvm-svn: 290455	2016-12-23 22:47:16 +00:00
Yaxun Liu	5b74665a41	Recommit r289979 [OpenCL] Allow disabling types and declarations associated with extensions Fixed undefined behavior due to cast integer to bool in initializer list. llvm-svn: 290056	2016-12-18 05:18:55 +00:00
Yaxun Liu	35f6d66b0d	Revert r289979 due to regressions llvm-svn: 289991	2016-12-16 21:23:55 +00:00
Yaxun Liu	2e8331cab6	[OpenCL] Allow disabling types and declarations associated with extensions Added a map to associate types and declarations with extensions. Refactored existing diagnostic for disabled types associated with extensions and extended it to declarations for generic situation. Fixed some bugs for types associated with extensions. Allow users to use pragma to declare types and functions for supported extensions, e.g. #pragma OPENCL EXTENSION the_new_extension_name : begin // declare types and functions associated with the extension here #pragma OPENCL EXTENSION the_new_extension_name : end Differential Revision: https://reviews.llvm.org/D21698 llvm-svn: 289979	2016-12-16 19:22:08 +00:00
Bruno Cardoso Lopes	88458c31e7	Revert "[Headers] Add #include_next for tgmath.h on Darwin" Reverts r289181: it's currently breaking modules using simd.h in 10.12 SDK. This reverts commit 6e73e3464e96a4e00492c24aa790d36e1adb5702. llvm-svn: 289487	2016-12-12 23:06:58 +00:00
Craig Topper	678b07fe3c	[AVX-512] Remove masking from 512-bit vpermil builtins. The backend now has versions without masking so wrap it with select. This will allow the backend to constant fold these to generic shuffle vectors like 128-bit and 256-bit without having to worry about handling masking. llvm-svn: 289351	2016-12-11 01:26:52 +00:00
Craig Topper	cdd3603c04	[AVX-512] Remove masking from 512-bit pshufb builtin. The backend now has a version without masking so wrap it with select. This will allow the backend to constant fold these to generic shuffle vectors like 128-bit and 256-bit without having to worry about handling masking. llvm-svn: 289345	2016-12-10 23:09:52 +00:00
Craig Topper	5391c98341	[AVX-512] Remove 128/256-bit masked vpermilvar builtins and replace with select and the avx unmasked builtins. llvm-svn: 289338	2016-12-10 20:27:39 +00:00
Ekaterina Romanova	0c1c3bbc78	[DOXYGEN] Improved doxygen comments for x86 intrinsics headers. Tagged instruction names with <c> INSTR_NAME </c> to display them in typewriter font. In the past, \c command was used, unfortunately it applied to only one word. <c> .. </c> has the same meaning, but applies to all words in between the tags. llvm-svn: 289249	2016-12-09 18:35:50 +00:00
Bruno Cardoso Lopes	052e6ddf27	[Headers] Add #include_next for tgmath.h on Darwin Allow darwin to provide additional definitions and implementation specific values for tgmath.h on Apple platforms. rdar://problem/19019845 llvm-svn: 289181	2016-12-09 03:30:46 +00:00
Ekaterina Romanova	08da283295	[DOXYGEN] Improved doxygen comments for xmmintrin.h intrinsics. Tagged parameter names with \a doxygen command to display parameters in italics. Formatted comments to fit into 80 chars. llvm-svn: 289159	2016-12-08 23:58:39 +00:00
Ekaterina Romanova	3494a597e9	[DOXYGEN] Improved doxygen comments. Improved doxygen comments for fxsrintrin.h and mmintrin.h intrinsics by tagging parameter names with \a doxygen command to display parameters in italics. Formatted comments to fit into 80 chars. llvm-svn: 289154	2016-12-08 23:32:07 +00:00
Ekaterina Romanova	797b0ebf2d	[DOXYGEN] Improved doxygen comments for emmintrin.h intrinsics. Tagged parameter names with \a doxygen command to display parameters in italics. Formatted comments to fit into 80 chars. llvm-svn: 289116	2016-12-08 22:10:51 +00:00
Ekaterina Romanova	a8fde7ce8b	[DOXYGEN] Improved doxygen comments. Improved doxygen comments for __wmmintrin_pclmul.h and ammintrin.h intrinsics by tagging parameter names with \a doxygen command to display parameters in italics. Formatted comments to fit into 80 chars. llvm-svn: 289083	2016-12-08 17:57:23 +00:00
Ekaterina Romanova	d6042197db	[DOXYGEN] Improved doxygen comments for avxintrin.h intrinsics. Tagged parameter names with \a doxygen command to display them in italics. Formatted comments to fit into 80 chars. llvm-svn: 289022	2016-12-08 04:09:17 +00:00
Bruno Cardoso Lopes	d93779da15	[Headers] Enable #include_next<float.h> on Darwin Allows darwin targets to provide additional definitions and implementation specific values for float.h rdar://problem/21961491 llvm-svn: 289018	2016-12-08 02:13:56 +00:00
Ekaterina Romanova	4c77e8940e	[DOXYGEN] Updated instruction names corresponding to avxintrin.h intrinsics. Documentation for some of the avxintrin.h's intrinsics erroneously said that non VEX-prefixed instructions could be generated. This was fixed. I tried several different solutions to achieve pretty printing of unordered lists (nested and non-nested) in param sections in doxygen. llvm-svn: 287990	2016-11-26 19:38:19 +00:00

1274 Commits