llvm-project

Commit Graph

Author	SHA1	Message	Date
Nick Lewycky	b82c0ec5a6	Add missing file for test. llvm-svn: 163272	2012-09-05 23:52:20 +00:00
Nick Lewycky	cfc2fe9163	Teach libObject about some more ELF relocations. llvm-objdump -r now knows every relocation in C++ hello world built with debug info. llvm-svn: 163271	2012-09-05 23:48:54 +00:00
Manman Ren	f3fedb6935	JumpThreading: when default destination is the destination of some cases in a switch, make sure we include the value for the cases when calculating edge value from switch to the default destination. rdar://12241132 llvm-svn: 163270	2012-09-05 23:45:58 +00:00
Jack Carter	b4dbc17acd	Mips specific llvm assembler support for ALU instructions. This includes register support. Test case included. Contributer: Vladimir Medic llvm-svn: 163268	2012-09-05 23:34:03 +00:00
Tim Northover	c8d867d42d	Strip old MachineInstrs after we know we can put them back. Previous patch accidentally decided it couldn't convert a VFP to a NEON instruction after it had already destroyed the old one. Not a good move. llvm-svn: 163230	2012-09-05 18:37:53 +00:00
Pranav Bhandarkar	823f9ebaa3	LLVM Bug Fix 13709: Remove needless lsr(Rp, #32 ) instruction access the subreg_hireg of register pair Rp. * lib/Target/Hexagon/HexagonPeephole.cpp(PeepholeDoubleRegsMap): New DenseMap similar to PeepholeMap that additionally records subreg info too. (runOnMachineFunction): Record information in PeepholeDoubleRegsMap and copy propagate the high sub-reg of Rp0 in Rp1 = lsr(Rp0, #32) to the instruction Rx = COPY Rp1:logreg_subreg. * test/CodeGen/Hexagon/remove_lsr.ll: New test. llvm-svn: 163214	2012-09-05 16:01:40 +00:00
Silviu Baranga	3f40d87207	Fixed the DAG combiner to better handle the folding of AND nodes for vector types. The previous code was making the assumption that the length of the bitmask returned by isConstantSplat was equal to the size of the vector type. Now we first make sure that the splat value has at least the length of the vector lane type, then we only use as many fields as we have available in the splat value. llvm-svn: 163203	2012-09-05 08:57:21 +00:00
Logan Chien	eeaaf65cb6	Fix UseInitArray option for MIPS target. llvm-svn: 163193	2012-09-05 06:17:17 +00:00
Dan Gohman	df476e5e93	Make provenance checking conservative in cases when pointers-to-strong-pointers may be in play. These can lead to retains and releases happening in unstructured ways, foiling the optimizer. This fixes rdar://12150909. llvm-svn: 163180	2012-09-04 23:16:20 +00:00
Jakob Stoklund Olesen	c7579cdded	Move tie checks into MachineVerifier::visitMachineOperand. llvm-svn: 163152	2012-09-04 18:38:28 +00:00
Preston Gurd	cdf540d5d6	Generic Bypass Slow Div - CodeGenPrepare pass for identifying div/rem ops - Backend specifies the type mapping using addBypassSlowDivType - Enabled only for Intel Atom with O2 32-bit -> 8-bit - Replace IDIV with instructions which test its value and use DIVB if the value is positive and less than 256. - In the case when the quotient and remainder of a divide are used a DIV and a REM instruction will be present in the IR. In the non-Atom case they are both lowered to IDIVs and CSE removes the redundant IDIV instruction, using the quotient and remainder from the first IDIV. However, due to this optimization CSE is not able to eliminate redundant IDIV instructions because they are located in different basic blocks. This is overcome by calculating both the quotient (DIV) and remainder (REM) in each basic block that is inserted by the optimization and reusing the result values when a subsequent DIV or REM instruction uses the same operands. - Test cases check for the presents of the optimization when calculating either the quotient, remainder, or both. Patch by Tyler Nowicki! llvm-svn: 163150	2012-09-04 18:22:17 +00:00
Sergei Larin	4d8986af12	Porting Hexagon MI Scheduler to the new API. Change current Hexagon MI scheduler to use new converging scheduler. Integrates DFA resource model into it. llvm-svn: 163137	2012-09-04 14:49:56 +00:00
Arnold Schwaighofer	f00fb1c581	Patch to implement UMLAL/SMLAL instructions for the ARM architecture This patch corrects the definition of umlal/smlal instructions and adds support for matching them to the ARM dag combiner. Bug 12213 Patch by Yin Ma! llvm-svn: 163136	2012-09-04 14:37:49 +00:00
Elena Demikhovsky	cbe99bbb36	This patch optimizes shuffle instruction - generates 2 instructions instead of 4. Since this specific shuffle is widely used in many workloads we have ~10% performance on them. shufflevector <8 x float> %A, <8 x float> %B, <8 x i32> <i32 0, i32 8, i32 2, i32 10, i32 4, i32 12, i32 6, i32 14> vmovaps (%rdx), %ymm0 vshufps $8, %ymm0, %ymm0, %ymm0 vmovaps (%rcx), %ymm1 vshufps $8, %ymm0, %ymm1, %ymm1 vunpcklps %ymm0, %ymm1, %ymm0 vmovaps (%rcx), %ymm0 vmovsldup (%rdx), %ymm1 vblendps $85, %ymm0, %ymm1, %ymm0 llvm-svn: 163134	2012-09-04 12:49:02 +00:00
Nadav Rotem	03dcd85b56	LICM may hoist an instruction with undefined behavior above a trap. Scan the body of the loop and find instructions that may trap. Use this information when deciding if it is safe to hoist or sink instructions. Notice that we can optimize the search of instructions that may throw in the case of nested loops. rdar://11518836 llvm-svn: 163132	2012-09-04 10:25:04 +00:00
Alexey Samsonov	c942e6b781	Add support for fetching inlining context (stack of source code locations) by instruction address from DWARF. Add --inlining flag to llvm-dwarfdump to demonstrate and test this functionality, so that "llvm-dwarfdump --inlining --address=0x..." now works much like "addr2line -i 0x...", provided that the binary has debug info (Clang's -gline-tables-only is enough). llvm-svn: 163128	2012-09-04 08:12:33 +00:00
Bob Wilson	dcc54decd5	Fix more fallout from r158919, similar to PR13547. This code used to only handle malloc-like calls, which do not read memory. r158919 changed it to check isNoAliasFn(), which includes strdup-like and realloc-like calls, but it was not checking for dependencies on the memory read by those calls. llvm-svn: 163106	2012-09-03 05:15:15 +00:00
Nuno Lopes	750f83a752	escape special char when handling CXX_FOR_OCAMLOPT llvm-svn: 163098	2012-09-02 15:16:51 +00:00
Nuno Lopes	19e8933063	fix test's RUN lines llvm-svn: 163097	2012-09-02 15:07:25 +00:00
Nadav Rotem	9d83202620	Not all targets have efficient ISel code generation for select instructions. For example, the ARM target does not have efficient ISel handling for vector selects with scalar conditions. This patch adds a TLI hook which allows the different targets to report which selects are supported well and which selects should be converted to CF duting codegen prepare. llvm-svn: 163093	2012-09-02 12:10:19 +00:00
Benjamin Kramer	599a4bb6ea	LoopRotation: Make the brute force DomTree update more brute force. We update until we hit a fixpoint. This is probably slow but also slightly simplifies the code. It should also fix the occasional invalid domtrees observed when building with expensive checking. I couldn't find a case where this had a measurable slowdown, but if someone finds a pathological case where it does we may have to find a cleverer way of updating dominators here. Thanks to Duncan for the test case. llvm-svn: 163091	2012-09-02 11:57:22 +00:00
Nadav Rotem	500d691d4a	Generate better select code by allowing the target to use scalar select, and not sign-extend. llvm-svn: 163086	2012-09-02 08:20:07 +00:00
Pete Cooper	2117ac40c9	Revert "Take account of boolean vector contents when promoting a build vector from i1 to some other type. rdar://problem/12210060" This reverts commit 5dd9e214fb92847e947f9edab170f9b4e52b908f. Thanks to Duncan for explaining how this should have been done. Conflicts: test/CodeGen/X86/vec_select.ll llvm-svn: 163064	2012-09-01 17:37:55 +00:00
Logan Chien	cea0354c1b	Fix Thumb2 fixup kind in the integrated-as. llvm-svn: 163063	2012-09-01 15:06:36 +00:00
Owen Anderson	90e0eaffa8	Teach DAG combine a number of tricks to simplify FMA expressions in fast-math mode. llvm-svn: 163051	2012-09-01 06:04:27 +00:00
NAKAMURA Takumi	d35a4ff88b	llvm/test/CodeGen/X86/fp-fast.ll: Suppress FMA4 on AMD Bulldozer host, corresponding to r162999. llvm-svn: 163041	2012-09-01 00:26:28 +00:00
Manman Ren	3590361bf0	Fix Atom bots for r163036. llvm-svn: 163040	2012-09-01 00:17:06 +00:00
Manman Ren	26c5d0f607	SelectionDAG: when constructing VZEXT_LOAD from other loads, make sure its output chain is correctly setup. As an example, if the original load must happen before later stores, we need to make sure the constructed VZEXT_LOAD is constrained to be before the stores. rdar://11457792 llvm-svn: 163036	2012-08-31 23:16:57 +00:00
Craig Topper	908e685102	Mark FMA4 instructions as commutable and add them to the folding tables. llvm-svn: 163035	2012-08-31 23:10:34 +00:00
Michael Liao	3224543bf9	Fix PR12359 - In addition to undefined, if V2 is zero vector, skip 2nd PSHUFB and POR as well as PSHUFB will zero elements with negative indices. Patch by Sriram Murali <sriram.murali@intel.com> llvm-svn: 163018	2012-08-31 20:12:31 +00:00
Jack Carter	b3f3b17e16	The instruction DINS may be transformed into DINSU or DEXTM depending on the size of the extraction and its position in the 64 bit word. This patch allows support of the dext transformations with mips64 direct object output. 0 <= msb < 32 0 <= lsb < 32 0 <= pos < 32 1 <= size <= 32 DINS The field is entirely contained in the right-most word of the doubleword 32 <= msb < 64 0 <= lsb < 32 0 <= pos < 32 2 <= size <= 64 DINSM The field straddles the words of the doubleword 32 <= msb < 64 32 <= lsb < 64 32 <= pos < 64 1 <= size <= 32 DINSU The field is entirely contained in the left-most word of the doubleword llvm-svn: 163010	2012-08-31 18:06:48 +00:00
Craig Topper	c0387f6b23	Mark FMA3 instructions as commutable so that the operands to the multiply part can be commuted. llvm-svn: 163001	2012-08-31 16:31:13 +00:00
Craig Topper	c30fdbc46c	Add support for converting llvm.fma to fma4 instructions. llvm-svn: 162999	2012-08-31 15:40:30 +00:00
Jakob Stoklund Olesen	96f87069c4	Don't enforce ordered inline asm operands. I was too optimistic, inline asm can have tied operands that don't follow the def order. Fixes PR13742. llvm-svn: 162998	2012-08-31 15:34:59 +00:00
NAKAMURA Takumi	2762dadf2c	llvm/test/CodeGen/X86/vec_select.ll: Fix failure on xmm-less hosts, to add -mattr=+sse2. FIXME: Should this be tested with both +avx and -avx,+sse2? llvm-svn: 162983	2012-08-31 10:02:22 +00:00
Jakob Stoklund Olesen	d3bda3c5b9	Fix a couple of typos in EmitAtomic. Thumb2 instructions are mostly constrained to rGPR, not tGPR which is for Thumb1. rdar://problem/12203728 llvm-svn: 162968	2012-08-31 02:08:34 +00:00
Jim Grosbach	e423e865fe	X86: Fix encoding of 'movd %xmm0, %rax' The assembly string for the VMOVPQIto64rr instruction incorrectly lacked the 'v' prefix, resulting in mis-assembly of the vanilla movd instruction. llvm-svn: 162963	2012-08-31 00:30:30 +00:00
Pete Cooper	e969340fea	Take account of boolean vector contents when promoting a build vector from i1 to some other type. rdar://problem/12210060 llvm-svn: 162960	2012-08-30 23:58:52 +00:00
Owen Anderson	d1545e3715	Try to make this test more generic to unbreak buildbots. llvm-svn: 162958	2012-08-30 23:51:20 +00:00
Owen Anderson	cc61f87cf7	Teach the DAG combiner to turn chains of FADDs (x+x+x+x+...) into FMULs by constants. This is only enabled in unsafe FP math mode, since it does not preserve rounding effects for all such constants. llvm-svn: 162956	2012-08-30 23:35:16 +00:00
Michael Gottesman	2dc1120f15	[llvm] Updated the test fold-vector-select so that we test the vector selects exhaustively. llvm-svn: 162953	2012-08-30 23:11:49 +00:00
Nadav Rotem	ea973bda26	Currently targets that do not support selects with scalar conditions and vector operands - scalarize the code. ARM is such a target because it does not support CMOV of vectors. To implement this efficientlyi, we broadcast the condition bit and use a sequence of NAND-OR to select between the two operands. This is the same sequence we use for targets that don't have vector BLENDs (like SSE2). rdar://12201387 llvm-svn: 162926	2012-08-30 19:17:29 +00:00
Michael Liao	bbd10792c2	Introduce 'UseSSEx' to force SSE legacy encoding - Add 'UseSSEx' to force SSE legacy insn not being selected when AVX is enabled. As the penalty of inter-mixing SSE and AVX instructions, we need prevent SSE legacy insn from being generated except explicitly specified through some intrinsics. For patterns supported by both SSE and AVX, so far, we force AVX insn will be tried first relying on AddedComplexity or position in td file. It's error-prone and introduces bugs accidentally. 'UseSSEx' is disabled when AVX is turned on. For SSE insns inherited by AVX, we need this predicate to force VEX encoding or SSE legacy encoding only. For insns not inherited by AVX, we still use the previous predicates, i.e. 'HasSSEx'. So far, these insns fall into the following categories: * SSE insns with MMX operands * SSE insns with GPR/MEM operands only (xFENCE, PREFETCH, CLFLUSH, CRC, and etc.) * SSE4A insns. * MMX insns. * x87 insns added by SSE. 2 test cases are modified: - test/CodeGen/X86/fast-isel-x86-64.ll AVX code generation is different from SSE one. 'vcvtsi2sdq' cannot be selected by fast-isel due to complicated pattern and fast-isel fallback to materialize it from constant pool. - test/CodeGen/X86/widen_load-1.ll AVX code generation is different from SSE one after fixing SSE/AVX inter-mixing. Exec-domain fixing prefers 'vmovapd' instead of 'vmovaps'. llvm-svn: 162919	2012-08-30 16:54:46 +00:00
Benjamin Kramer	17ce7117d0	Fix test case. llvm-svn: 162913	2012-08-30 15:42:45 +00:00
Benjamin Kramer	afdfdb5cff	LoopRotate: Also rotate loops with multiple exits. The old PHI updating code in loop-rotate was replaced with SSAUpdater a while ago, it has no problems with comples PHIs. What had to be fixed is detecting whether a loop was already rotated and updating dominators when multiple exits were present. This change increases overall code size a bit, mostly due to additional loop unrolling opportunities. Passes test-suite and selfhost with -verify-dom-info. Fixes PR7447. Thanks to Andy for the input on the domtree updating code. llvm-svn: 162912	2012-08-30 15:39:42 +00:00
Nadav Rotem	d5f5777b77	It is illegal to transform (sdiv (ashr X c1) c2) -> (sdiv x (2^c1 * c2)), because C always rounds towards zero. Thanks Dirk and Ben. llvm-svn: 162899	2012-08-30 11:23:20 +00:00
Tim Northover	ca9f384ff8	Add support for moving pure S-register to NEON pipeline if desired llvm-svn: 162898	2012-08-30 10:17:45 +00:00
Michael Liao	271f11b571	Should put test case under test/ExecutionEngine/MCJIT/ llvm-svn: 162885	2012-08-30 00:43:57 +00:00
Michael Liao	3c8980646b	Fix PR13727 - The root cause is that target constant materialization in X86 fast-isel creates a PC-rel addressing which may overflow 32-bit range in non-Small code model if .rodata section is allocated too far away from code segment in MCJIT, which uses Large code model so far. - Follow the similar logic to fix non-Small code model in fast-isel by skipping non-Small code model. llvm-svn: 162881	2012-08-30 00:30:16 +00:00
Hal Finkel	1859d26528	Reserve space for the mandatory traceback fields on PPC64. We need to reserve space for the mandatory traceback fields, though leaving them as zero is appropriate for now. Although the ABI calls for these fields to be filled in fully, no compiler on Linux currently does this, and GDB does not read these fields. GDB uses the first word of zeroes during exception handling to find the end of the function and the size field, allowing it to compute the beginning of the function. DWARF information is used for everything else. We need the extra 8 bytes of pad so the size field is found in the right place. As a comparison, GCC fills in a few of the fields -- language, number of saved registers -- but ignores the rest. IBM's proprietary OSes do make use of the full traceback table facility. Patch by Bill Schmidt. llvm-svn: 162854	2012-08-29 20:22:24 +00:00
Benjamin Kramer	8bcc971174	Make MemoryBuiltins aware of TargetLibraryInfo. This disables malloc-specific optimization when -fno-builtin (or -ffreestanding) is specified. This has been a problem for a long time but became more severe with the recent memory builtin improvements. Since the memory builtin functions are used everywhere, this required passing TLI in many places. This means that functions that now have an optional TLI argument, like RecursivelyDeleteTriviallyDeadFunctions, won't remove dead mallocs anymore if the TLI argument is missing. I've updated most passes to do the right thing. Fixes PR13694 and probably others. llvm-svn: 162841	2012-08-29 15:32:21 +00:00
Jush Lu	e87e559e62	[arm-fast-isel] Add support for ARM PIC. llvm-svn: 162823	2012-08-29 02:41:21 +00:00
NAKAMURA Takumi	e9205613f6	Create llvm/test/Object/Mips/lit.local.cfg to check Mips in targets_to_build. llvm-svn: 162819	2012-08-29 01:37:57 +00:00
NAKAMURA Takumi	e1f9fc11d4	llvm/test: [CMake] Add profile_rt-shared to deps. llvm-svn: 162813	2012-08-29 00:37:56 +00:00
NAKAMURA Takumi	87abb0ee34	llvm/test/Analysis/Profiling: Mark 3 of them as REQUIRES: loadable_module. FIXME: profile_rt.dll could be built on win32. llvm-svn: 162811	2012-08-29 00:37:46 +00:00
Jack Carter	fc93516642	Moved input for objdump test from Mips to Inputs. llvm-svn: 162808	2012-08-29 00:10:48 +00:00
Manman Ren	abbb01abea	Profile: set branch weight metadata with data generated from profiling. This patch implements ProfileDataLoader which loads profile data generated by -insert-edge-profiling and updates branch weight metadata accordingly. Patch by Alastair Murray. llvm-svn: 162799	2012-08-28 22:21:25 +00:00
Jack Carter	cd6b0e1368	The instruction DEXT may be transformed into DEXTU or DEXTM depending on the size of the extraction and its position in the 64 bit word. This patch allows support of the dext transformations with mips64 direct object output. 0 <= msb < 32 0 <= lsb < 32 0 <= pos < 32 1 <= size <= 32 DINS The field is entirely contained in the right-most word of the doubleword 32 <= msb < 64 0 <= lsb < 32 0 <= pos < 32 2 <= size <= 64 DINSM The field straddles the words of the doubleword 32 <= msb < 64 32 <= lsb < 64 32 <= pos < 64 1 <= size <= 32 DINSU The field is entirely contained in the left-most word of the doubleword llvm-svn: 162782	2012-08-28 20:07:41 +00:00
Jack Carter	551efd7fd9	Some of the instructions in the Mips instruction set are revision delimited. llvm-mc -disassemble access these through the -mattr option. llvm-objdump -disassemble had no such way to set the attribute so some instructions were just not recognized for disassembly. This patch accepts llvm-mc mechanism for specifying the attributes. llvm-svn: 162781	2012-08-28 19:24:49 +00:00
Jack Carter	c20a21b855	Some instructions are passed to the assembler to be transformed to the final instruction variant. An example would be dsrll which is transformed into dsll32 if the shift value is greater than 32. For direct object output we need to do this transformation in the codegen. If the instruction was inside branch delay slot, it was being missed. This patch corrects this oversight. llvm-svn: 162779	2012-08-28 19:07:39 +00:00
Roman Divacky	8c4b6a307e	Emit word of zeroes after the last instruction as a start of the mandatory traceback table on PowerPC64. This helps gdb handle exceptions. The other mandatory fields are ignored by gdb and harder to implement so just add there a FIXME. Patch by Bill Schmidt. PR13641. llvm-svn: 162778	2012-08-28 19:06:55 +00:00
Hal Finkel	742b535e40	Add PPC Freescale e500mc and e5500 subtargets. Add subtargets for Freescale e500mc (32-bit) and e5500 (64-bit) to the PowerPC backend. Patch by Tobias von Koch. llvm-svn: 162764	2012-08-28 16:12:39 +00:00
Benjamin Kramer	9c0a807c27	InstCombine: Guard the transform introduced in r162743 against large ints and non-const shifts. llvm-svn: 162751	2012-08-28 13:08:13 +00:00
Nadav Rotem	d457787fed	Make sure that we don't call getZExtValue on values > 64 bits. Thanks Benjamin for noticing this. llvm-svn: 162749	2012-08-28 12:23:22 +00:00
Nadav Rotem	11935b29f3	Teach InstCombine to canonicalize [SU]div+[AL]shl patterns. For example: %1 = lshr i32 %x, 2 %2 = udiv i32 %1, 100 rdar://12182093 llvm-svn: 162743	2012-08-28 10:01:43 +00:00
Bill Wendling	cc56718038	The commutative flag is already correctly set within the multiclass. If we set it here, then a 'register-memory' version would wrongly get the commutative flag. <rdar://problem/12180135> llvm-svn: 162741	2012-08-28 07:36:46 +00:00
Craig Topper	bd509eea4a	Merge AVX_SET0PSY/AVX_SET0PDY/AVX2_SET0 into a single post-RA pseudo. llvm-svn: 162738	2012-08-28 07:05:28 +00:00
NAKAMURA Takumi	cdfe1d1cdb	llvm/test/CodeGen/X86/pr12312.ll: Add -mtriple=x86_64-unknown-unknown. llvm-svn: 162736	2012-08-28 04:04:29 +00:00
Michael Liao	b7d85b6328	Fix PR12312 - Add a target-specific DAG optimization to recognize a pattern PTEST-able. Such a pattern is a OR'd tree with X86ISD::OR as the root node. When X86ISD::OR node has only its flag result being used as a boolean value and all its leaves are extracted from the same vector, it could be folded into an X86ISD::PTEST node. llvm-svn: 162735	2012-08-28 03:34:40 +00:00
Jakob Stoklund Olesen	87cb471e52	Remove extra MayLoad/MayStore flags from atomic_load/store. These extra flags are not required to properly order the atomic load/store instructions. SelectionDAGBuilder chains atomics as if they were volatile, and SelectionDAG::getAtomic() sets the isVolatile bit on the memory operands of all atomic operations. The volatile bit is enough to order atomic loads and stores during and after SelectionDAG. This means we set mayLoad on atomic_load, mayStore on atomic_store, and mayLoad+mayStore on the remaining atomic read-modify-write operations. llvm-svn: 162733	2012-08-28 03:11:32 +00:00
Akira Hatanaka	b5af7121b1	Fix mips' long branch pass. Instructions emitted to compute branch offsets now use immediate operands instead of symbolic labels. This change was needed because there were problems when R_MIPS_HI16/LO16 relocations were used to make shared objects. llvm-svn: 162731	2012-08-28 03:03:05 +00:00
Akira Hatanaka	adb14f56c7	Fix bug 13532. In SelectionDAGLegalize::ExpandLegalINT_TO_FP, expand INT_TO_FP nodes without using any f64 operations if f64 is not a legal type. Patch by Stefan Kristiansson. llvm-svn: 162728	2012-08-28 02:12:42 +00:00
Hal Finkel	686f2ee226	Allow remat of LI on PPC. Allow load-immediates to be rematerialised in the register coalescer for PPC. This makes test/CodeGen/PowerPC/big-endian-formal-args.ll fail, because it relies on a register move getting emitted. The immediate load is equivalent, so change this test case. Patch by Tobias von Koch. llvm-svn: 162727	2012-08-28 02:10:33 +00:00
Hal Finkel	5ab378037f	Eliminate redundant CR moves on PPC32. The 32-bit ABI requires CR bit 6 to be set if the call has fp arguments and unset if it doesn't. The solution up to now was to insert a MachineNode to set/unset the CR bit, which produces a CR vreg. This vreg was then copied into CR bit 6. When the register allocator saw a bunch of these in the same function, it allocated the set/unset CR bit in some random CR register (1 extra instruction) and then emitted CR moves before every vararg function call, rather than just setting and unsetting CR bit 6 directly before every vararg function call. This patch instead inserts a PPCcrset/PPCcrunset instruction which are then matched by a dedicated instruction pattern. Patch by Tobias von Koch. llvm-svn: 162725	2012-08-28 02:10:27 +00:00
Hal Finkel	e39526a789	Optimize zext on PPC64. The zeroextend IR instruction is lowered to an 'and' node with an immediate mask operand, which in turn gets legalised to a sequence of ori's & ands. This can be done more efficiently using the rldicl instruction. Patch by Tobias von Koch. llvm-svn: 162724	2012-08-28 02:10:15 +00:00
Bill Wendling	988a47d7e5	Make sure we add the predicate after all of the registers are added. <rdar://problem/12183003> llvm-svn: 162703	2012-08-27 22:12:44 +00:00
NAKAMURA Takumi	fee50c8cf1	llvm/test/CodeGen/X86/fma.ll: Add -march=x86, or two tests would fail on non-x86 hosts. llvm-svn: 162667	2012-08-27 11:50:26 +00:00
NAKAMURA Takumi	10eb4cfc3e	llvm/test/CodeGen/X86/fma_patterns.ll: Add -mtriple=x86_64. It was incompatible on i686 and Windows x64. llvm-svn: 162664	2012-08-27 09:37:54 +00:00
Craig Topper	bfc1d0ed48	Commit test change for r162658. llvm-svn: 162660	2012-08-27 07:55:50 +00:00
Alexey Samsonov	034e57a297	Add basic support for .debug_ranges section to LLVM's DebugInfo library. This section (introduced in DWARF-3) is used to define instruction address ranges for functions that are not contiguous and can't be described by low_pc/high_pc attributes (this is the usual case for inlined subroutines). The patch is the first step to support fetching complete inlining info from DWARF. Reviewed by Benjamin Kramer. llvm-svn: 162657	2012-08-27 07:17:47 +00:00
Anitha Boyapati	0dd589c5f1	FMA3 tests on bdver2 target for changes made in rev 162012. Also made corresponding changes to existing tests for darwin triple to ensure that same pattern is tested for bdver2 target. llvm-svn: 162655	2012-08-27 06:59:01 +00:00
Craig Topper	9e4f0aae17	Make sure that FMA3 is favored even when FMA4 is also enabled. Test case for r162454. llvm-svn: 162653	2012-08-27 03:38:15 +00:00
Jakob Stoklund Olesen	c2272df1be	Infer instruction properties from single-instruction patterns. Previously, instructions without a primary patterns wouldn't get their properties inferred. Now, we use all single-instruction patterns for inference, including 'def : Pat<>' instances. This causes a lot of instruction flags to change. - Many instructions no longer have the UnmodeledSideEffects flag because their flags are now inferred from a pattern. - Instructions with intrinsics will get a mayStore flag if they already have UnmodeledSideEffects and a mayLoad flag if they already have mayStore. This is because intrinsics properties are linear. - Instructions with atomic_load patterns get a mayStore flag because atomic loads can't be reordered. The correct workaround is to create pseudo-instructions instead of using normal loads. PR13693. llvm-svn: 162614	2012-08-24 22:46:53 +00:00
Akira Hatanaka	4a08a4a8b6	Disable Mips' delay slot filler when optimization level is O0. llvm-svn: 162589	2012-08-24 20:40:15 +00:00
Akira Hatanaka	e8e4ef102d	In MipsDAGToDAGISel::SelectAddr, fold add node into address operand, if its second operand is MipsISD::GPRel. llvm-svn: 162584	2012-08-24 20:21:49 +00:00
Manman Ren	cf10446ffa	BranchProb: modify the definition of an edge in BranchProbabilityInfo to handle the case of multiple edges from one block to another. A simple example is a switch statement with multiple values to the same destination. The definition of an edge is modified from a pair of blocks to a pair of PredBlock and an index into the successors. Also set the weight correctly when building SelectionDAG from LLVM IR, especially when converting a Switch. IntegersSubsetMapping is updated to calculate the weight for each cluster. llvm-svn: 162572	2012-08-24 18:14:27 +00:00
Roman Divacky	ace4707ea6	Lower constant pools and jump tables via TOC on PPC64/SVR4. In collaboration with Adhemerval Zanella. llvm-svn: 162562	2012-08-24 16:26:02 +00:00
Eric Christopher	bb69a27dbc	Use DW_FORM_flag_present to save space in debug information if we're not in darwin gdb compat mode. Fixes rdar://10975088 llvm-svn: 162526	2012-08-24 01:14:27 +00:00
Eric Christopher	acb7115bde	Remove the DW_AT_MIPS_linkage name attribute when we don't need it output (we're emitting a specification already and the information isn't changing) and we're not in old gdb compat mode. Saves 1% on the debug information for a build of llvm. Fixes rdar://11043421 llvm-svn: 162493	2012-08-23 22:52:55 +00:00
Eric Christopher	20b76a77c3	Turn these two options in to trinary state so that they can be turned on and off separate from the platform if you're on darwin. llvm-svn: 162487	2012-08-23 22:36:40 +00:00
Eric Christopher	35ceacbcc2	Make this darwin specific to try to silence the bots. llvm-svn: 162435	2012-08-23 07:18:46 +00:00
Eric Christopher	7782618271	Emit pubtypes only when going for darwin gdb compatibility. rdar://10393214 llvm-svn: 162434	2012-08-23 07:10:56 +00:00
Eric Christopher	6cb5594979	Filecheck-ize. llvm-svn: 162433	2012-08-23 07:10:51 +00:00
Benjamin Kramer	e07728b936	SimplifyLibCalls: Give all safely-shrinkable libcalls the same treatment. llvm-svn: 162383	2012-08-22 19:39:15 +00:00
Chad Rosier	1df1fb511f	Whitespace. llvm-svn: 162370	2012-08-22 17:34:11 +00:00
Chad Rosier	671dc096ae	Add test case for r162368. llvm-svn: 162369	2012-08-22 17:31:04 +00:00
Stepan Dyatkovskiy	99120e04be	Rejected 169195. As Duncan commented, bitcasting to proper type is wrong approach. We need to insert some valid TRANCATE node here. llvm-svn: 162354	2012-08-22 09:33:55 +00:00
Akira Hatanaka	ad4950258b	Add register Mips::GP to the list of reserved registers if target is bare-metal to prevent it from being clobbered. mips uses $gp to access small data section. This bug was originally reported by Carl Norum. llvm-svn: 162340	2012-08-22 03:18:13 +00:00
Akira Hatanaka	9d957842e1	Add option disable-mips-delay-filler. Turn on mips' delay slot filler by default. Patch by Carl Norum. llvm-svn: 162339	2012-08-22 02:51:28 +00:00
Jack Carter	77064c0590	For mips64 switch statements in subroutines could generate within the codegen EK_GPRel64BlockAddress. This was not supported for direct object output and resulted in an assertion. This change adds support for EK_GPRel64BlockAddress for direct object. One fallout from this is to turn on rela relocations for mips64 to match gas. llvm-svn: 162334	2012-08-22 00:49:30 +00:00

1 2 3 4 5 ...

16968 Commits