llvm-project

Commit Graph

Author	SHA1	Message	Date
Stephen Lin	73de7bf5de	AArch64/PowerPC/SystemZ/X86: This patch fixes the interface, usage, and all in-tree implementations of TargetLoweringBase::isFMAFasterThanMulAndAdd in order to resolve the following issues with fmuladd (i.e. optional FMA) intrinsics: 1. On X86(-64) targets, ISD::FMA nodes are formed when lowering fmuladd intrinsics even if the subtarget does not support FMA instructions, leading to laughably bad code generation in some situations. 2. On AArch64 targets, ISD::FMA nodes are formed for operations on fp128, resulting in a call to a software fp128 FMA implementation. 3. On PowerPC targets, FMAs are not generated from fmuladd intrinsics on types like v2f32, v8f32, v4f64, etc., even though they promote, split, scalarize, etc. to types that support hardware FMAs. The function has also been slightly renamed for consistency and to force a merge/build conflict for any out-of-tree target implementing it. To resolve, see comments and fixed in-tree examples. llvm-svn: 185956	2013-07-09 18:16:56 +00:00
Nico Rieck	51969be724	Reuse %rax after calling __chkstk on win64 Reapply this as I reverted the wrong commit. llvm-svn: 185807	2013-07-08 11:20:11 +00:00
Nico Rieck	4801303ce1	Revert "Proper va_arg/va_copy lowering on win64" This reverts commit 2b52880592a525cfe04d8f9008a35da8c2ea94c3. Needs review. llvm-svn: 185806	2013-07-08 11:19:44 +00:00
Nico Rieck	43b51056d6	Revert "Reuse %rax after calling __chkstk on win64" This reverts commit 01f8d579f7672872324208ac5bc4ac311e81b22e. llvm-svn: 185781	2013-07-08 01:30:57 +00:00
Nico Rieck	7adf6111a8	Reuse %rax after calling __chkstk on win64 llvm-svn: 185778	2013-07-07 16:48:39 +00:00
Nico Rieck	99ef2890c0	Proper va_arg/va_copy lowering on win64 llvm-svn: 185763	2013-07-06 18:08:19 +00:00
Jakob Stoklund Olesen	db429d9483	Remove the EXCEPTIONADDR, EHSELECTION, and LSDAADDR ISD opcodes. These exception-related opcodes are not used any longer. llvm-svn: 185625	2013-07-04 13:54:20 +00:00
Jakob Stoklund Olesen	a1f5b901a5	Revert r185595-185596 which broke buildbots. Revert "Simplify landing pad lowering." Revert "Remove the EXCEPTIONADDR, EHSELECTION, and LSDAADDR ISD opcodes." llvm-svn: 185600	2013-07-04 00:26:30 +00:00
Jakob Stoklund Olesen	f33ec531fa	Remove the EXCEPTIONADDR, EHSELECTION, and LSDAADDR ISD opcodes. These exception-related opcodes are not used any longer. llvm-svn: 185596	2013-07-03 23:56:31 +00:00
Craig Topper	31ee5866de	Use SmallVectorImpl::iterator/const_iterator instead of SmallVector to avoid specifying the vector size. llvm-svn: 185540	2013-07-03 15:07:05 +00:00
Elena Demikhovsky	6769c50d9e	Optimized integer vector multiplication operation by replacing it with shift/xor/sub when it is possible. Fixed a bug in SDIV, where the const operand is not a splat constant vector. llvm-svn: 184931	2013-06-26 10:55:03 +00:00
Chad Rosier	295bd43adb	The getRegForInlineAsmConstraint function should only accept MVT value types. llvm-svn: 184642	2013-06-22 18:37:38 +00:00
Bill Wendling	8f26840c5a	Don't cache the instruction and register info from the TargetMachine, because the internals of TargetMachine could change. No functionality change intended. llvm-svn: 183571	2013-06-07 21:00:34 +00:00
Andrew Trick	ad6d08ac6f	Order CALLSEQ_START and CALLSEQ_END nodes. Fixes PR16146: gdb.base__call-ar-st.exp fails after pre-RA-sched=source fixes. Patch by Xiaoyi Guo! This also fixes an unsupported dbg.value test case. Codegen was previously incorrect but the test was passing by luck. llvm-svn: 182885	2013-05-29 22:03:55 +00:00
Andrew Trick	ef9de2a739	Track IR ordering of SelectionDAG nodes 2/4. Change SelectionDAG::getXXXNode() interfaces as well as call sites of these functions to pass in SDLoc instead of DebugLoc. llvm-svn: 182703	2013-05-25 02:42:55 +00:00
Michael J. Spencer	df1ecbd734	Replace Count{Leading,Trailing}Zeros_{32,64} with count{Leading,Trailing}Zeros. llvm-svn: 182680	2013-05-24 22:23:49 +00:00
Nadav Rotem	7b66c47051	X86: Fix a bug in EltsFromConsecutiveLoads. We can't generate new loads without chains. llvm-svn: 182507	2013-05-22 19:28:41 +00:00
Benjamin Kramer	d76cc186fc	X86: When expanding PCMPGTQ to PCMPGTD we always want to compare the lower halves as unsigned. Take #2 on fixing PR15977. llvm-svn: 182486	2013-05-22 17:01:12 +00:00
Benjamin Kramer	18ef6b22b9	X86: When emulating unsigned PCMPGTQ with PCMPGTD, fix the sign bit for the smaller type. Otherwise we'll get a mix of signed and unsigned compares. Fixes PR15977. llvm-svn: 182364	2013-05-21 09:58:54 +00:00
Matt Arsenault	75865923c9	Add LLVMContext argument to getSetCCResultType llvm-svn: 182180	2013-05-18 00:21:46 +00:00
Benjamin Kramer	fc33e1d99b	X86: Make shuffle -> shift conversion more aggressive about undefs. Shuffles that only move an element into position 0 of the vector are common in the output of the loop vectorizer and often generate suboptimal code when SSSE3 is not available. Lower them to vector shifts if possible. We still prefer palignr over psrldq because it has higher throughput on sandybridge. llvm-svn: 182102	2013-05-17 14:48:34 +00:00
David Majnemer	66fb70de38	Remove a recently redundant transform from X86ISelLowering. X86ISelLowering has support to treat: (icmp ne (and (xor %flags, -1), (shl 1, flag)), 0) as if it were actually: (icmp eq (and %flags, (shl 1, flag)), 0) However, r179386 has code at the InstCombine level to handle this. llvm-svn: 181145	2013-05-05 02:00:10 +00:00
Nadav Rotem	42932bdcd0	Fix an odd comment. llvm-svn: 181136	2013-05-04 23:24:56 +00:00
Michael Liao	06badde1ac	80-col fixup. llvm-svn: 180915	2013-05-02 09:22:04 +00:00
Michael Liao	afafa98fa8	Avoid duplicating logic on frame register selecting when lowering eh_return No functionality change llvm-svn: 180914	2013-05-02 09:18:38 +00:00
Michael Liao	31d39a4a47	Avoid duplicating logic on frame register selecting when lowering frameaddr No functionality change llvm-svn: 180912	2013-05-02 08:21:56 +00:00
Tim Northover	16aba17024	Remove unused ShouldFoldAtomicFences flag. I think it's almost impossible to fold atomic fences profitably under LLVM/C++11 semantics. As a result, this is now unused and just cluttering up the target interface. llvm-svn: 179940	2013-04-20 12:32:43 +00:00
Tim Northover	a2b533906a	Remove unused MEMBARRIER DAG node; it's been replaced by ATOMIC_FENCE. llvm-svn: 179939	2013-04-20 12:32:17 +00:00
Michael Liao	b53d8963ce	ArrayRefize getMachineNode(). No functionality change. llvm-svn: 179901	2013-04-19 22:22:57 +00:00
Michael Liao	e28fab22c4	Use 'array_lengthof' as possible to avoid magic numbers llvm-svn: 179833	2013-04-19 04:03:37 +00:00
Benjamin Kramer	c557828805	X86: Add an SSE2 lowering for 64 bit compares when pcmpgtq (SSE4.2) isn't available. This pattern started popping up in vectorized min/max reductions. llvm-svn: 179797	2013-04-18 21:37:45 +00:00
Michael Liao	55658d4222	Optimize vector select from all 0s or all 1s As packed comparisons in AVX/SSE produce all 0s or all 1s in each SIMD lane, vector select could be simplified to AND/OR or removed if one or both values being selected is all 0s or all 1s. llvm-svn: 179267	2013-04-11 05:15:54 +00:00
Michael Liao	f7bf87051a	Enhance bool simplifcation in X86 to handle more cases This patch is revised based on patch from Victor Umansky <victor.umansky@intel.com>. More cases are handled in X86's bool simplification, i.e. - SETCC_CARRY - value is truncated to i1 with AND As a by-product, PR5443 is also fixed. llvm-svn: 179265	2013-04-11 04:43:09 +00:00
Evan Cheng	ac0469c5d0	__sincosf_stret returns sinf / cosf in bits 0:31 and 32:63 of xmm0, not in xmm0 / xmm1. rdar://13599493 llvm-svn: 179141	2013-04-10 01:26:07 +00:00
Bill Wendling	eb108bad50	Use the target options specified on a function to reset the back-end. During LTO, the target options on functions within the same Module may change. This would necessitate resetting some of the back-end. Do this for X86, because it's a Friday afternoon. llvm-svn: 178917	2013-04-05 21:52:40 +00:00
Benjamin Kramer	b60633fb87	X86: Promote sitofp <8 x i16> to <8 x i32> when AVX is available. A vector sext + sitofp is a lot cheaper than 8 scalar conversions. llvm-svn: 178448	2013-03-31 12:49:15 +00:00
Benjamin Kramer	70671b9937	Remove the old CodePlacementOpt pass. It was superseded by MachineBlockPlacement and disabled by default since LLVM 3.1. llvm-svn: 178349	2013-03-29 17:14:24 +00:00
Michael Liao	a486a11dcf	Add support of RDSEED defined in AVX2 extension llvm-svn: 178314	2013-03-28 23:41:26 +00:00
Michael Liao	5fff5c7b26	Enhance boolean simplification to handle 16-/64-bit RDRAND - RDRAND always clears the destination value when a random value is not available (i.e. CF == 0). This value is truncated or zero-extended as the false boolean value to be returned. Boolean simplification needs to skip this 'zext' or 'trunc' node. llvm-svn: 178312	2013-03-28 23:38:52 +00:00
Michael Liao	96b42608ab	Skip moving call address loading into callseq when targets prefer register indirect call. To enable a load of a call address to be folded with that call, this load is moved from outside of callseq into callseq. Such a moving adds a non-glued node (that load) into a glued sequence. This non-glue load is only removed when DAG selection folds them into a memory form call instruction. When such instruction selection is disabled, it breaks DAG schedule. To prevent that, such moving is disabled when target favors register indirect call. Previous workaround disabling CALL32m/CALL64m insn selection is removed. llvm-svn: 178308	2013-03-28 23:13:21 +00:00
Timur Iskhodzhanov	a2fd5fdd7a	Make Win32 put the SRet address into EAX, fixes PR15556 llvm-svn: 178291	2013-03-28 21:30:04 +00:00
Preston Gurd	663e6f9558	For the current Atom processor, the fastest way to handle a call indirect through a memory address is to load the memory address into a register and then call indirect through the register. This patch implements this improvement by modifying SelectionDAG to force a function address which is a memory reference to be loaded into a virtual register. Patch by Sriram Murali. llvm-svn: 178171	2013-03-27 19:14:02 +00:00
Hal Finkel	1996f3d87f	Fix typo (common to both X86 and PPC) Thanks to Bill Schmidt for pointing this out during code review! llvm-svn: 178170	2013-03-27 19:10:42 +00:00
Michael Liao	03f9ad0e67	Add XTEST codegen support llvm-svn: 178083	2013-03-26 22:47:01 +00:00
Michael Liao	5fbcd81793	Revise alignment checking/calculation on 256-bit unaligned memory access - It's still considered aligned when the specified alignment is larger than the natural alignment; - The new alignment for the high 128-bit vector should be min(16, alignment) as the pointer is advanced by 16, a power-of-2 offset. llvm-svn: 177947	2013-03-25 23:50:10 +00:00
Michael Liao	0f4ea0c4a9	Fix PR15296 - Move SRA/SRL/SHL lowering support from DAG combination to DAG lowering to support extended 256-bit integer in AVX but not AVX2. llvm-svn: 177478	2013-03-20 02:33:21 +00:00
Michael Liao	5a4e81d2e8	Mark all variable shifts needing customizing - Prepare moving logic from DAG combining into DAG lowering. There's no functionality change. llvm-svn: 177477	2013-03-20 02:28:20 +00:00
Michael Liao	48e8a3727c	Move scalar immediate shift lowering into a dedicated func - no functionality change llvm-svn: 177476	2013-03-20 02:20:36 +00:00
Nadav Rotem	0f1bc60d51	Optimize sext <4 x i8> and <4 x i16> to <4 x i64>. Patch by Ahmad, Muhammad T <muhammad.t.ahmad@intel.com> llvm-svn: 177421	2013-03-19 18:38:27 +00:00
Anton Korobeynikov	3e7005f1c1	TLS support for MinGW targets. MinGW is almost completely compatible to MSVC, with the exception of the _tls_array global not being available. Patch by David Nadlinger! llvm-svn: 177257	2013-03-18 08:12:28 +00:00

1 2 3 4 5 ...

2344 Commits