llvm-project

Commit Graph

Author	SHA1	Message	Date
Evan Cheng	b3ed09703c	Backing out 55521. Not safe. llvm-svn: 55548	2008-08-29 22:13:21 +00:00
Evan Cheng	960b17a3c2	Swap fp comparison operands and change predicate to allow load folding. llvm-svn: 55521	2008-08-28 23:48:31 +00:00
Owen Anderson	42ccd39689	These assertions should be return false's instead, allowing the client to detect the failure. llvm-svn: 55377	2008-08-26 18:50:40 +00:00
Owen Anderson	27fb3dcbc7	Make TargetInstrInfo::copyRegToReg return a bool indicating whether the copy requested was inserted or not. This allows bitcast in fast isel to properly handle the case where an appropriate reg-to-reg copy is not available. llvm-svn: 55375	2008-08-26 18:03:31 +00:00
Owen Anderson	4f6bf04616	Convert uses of std::vector in TargetInstrInfo to SmallVector. This change had to be propoagated down into all the targets and up into all clients of this API. llvm-svn: 54802	2008-08-14 22:49:33 +00:00
Dan Gohman	4e2f3ace2c	Add an EXTRACTPSmr pattern to match the pattern that X86ISelLowering creates. llvm-svn: 54544	2008-08-08 18:30:21 +00:00
Dan Gohman	527ca7e253	Re-enable elimination of unnecessary SUBREG_TO_REG instructions in LowerSubregs, and fix an x86-64 isel bug that this exposed. SUBREG_TO_REG for x86-64 implicit zero extension is only safe for isel to generate when the source is known to always have zeros in the high 32 bits. The EXTRACT_SUBREG instruction does not clear the high 32 bits. llvm-svn: 54444	2008-08-07 02:54:50 +00:00
Dan Gohman	2ce6f2ad5e	Rename SDOperand to SDValue. llvm-svn: 54128	2008-07-27 21:46:04 +00:00
Evan Cheng	e001643358	Use movaps instead of movups to spill 16-byte vector values when default alignment is >= 16. This fixes some massive performance regressions. llvm-svn: 53844	2008-07-21 06:34:17 +00:00
Anton Korobeynikov	b7a49925a1	Use aligned stack spills, where possible. This fixes PR2549. llvm-svn: 53784	2008-07-19 06:30:51 +00:00
Dan Gohman	1705968102	Add a new function, ReplaceAllUsesOfValuesWith, which handles bulk replacement of multiple values. This is slightly more efficient than doing multiple ReplaceAllUsesOfValueWith calls, and theoretically could be optimized even further. However, an important property of this new function is that it handles the case where the source value set and destination value set overlap. This makes it feasible for isel to use SelectNodeTo in many very common cases, which is advantageous because SelectNodeTo avoids a temporary node and it doesn't require CSEMap updates for users of values that don't change position. Revamp MorphNodeTo, which is what does all the work of SelectNodeTo, to handle operand lists more efficiently, and to correctly handle a number of corner cases to which its new wider use exposes it. This commit also includes a change to the encoding of post-isel opcodes in SDNodes; now instead of being sandwiched between the target-independent pre-isel opcodes and the target-dependent pre-isel opcodes, post-isel opcodes are now represented as negative values. This makes it possible to test if an opcode is pre-isel or post-isel without having to know the size of the current target's post-isel instruction set. These changes speed up llc overall by 3% and reduce memory usage by 10% on the InstructionCombining.cpp testcase with -fast and -regalloc=local. llvm-svn: 53728	2008-07-17 19:10:17 +00:00
Dan Gohman	9a542a4d5f	Add a utility function to MachineInstr for testing whether an instruction has exactly one MachineMemOperand, and change some X86 lowering code to make use of it. llvm-svn: 53498	2008-07-12 00:10:52 +00:00
Dan Gohman	3b46030375	Pool-allocation for MachineInstrs, MachineBasicBlocks, and MachineMemOperands. The pools are owned by MachineFunctions. This drastically reduces the number of calls to malloc/free made during the "Emit" phase of scheduling, as well as later phases in CodeGen. Combined with other changes, this speeds up the "instruction selection" phase of CodeGen by 10% in some cases. llvm-svn: 53212	2008-07-07 23:14:23 +00:00
Dan Gohman	38740a98b2	Make DenseMap's insert return a pair, to more closely resemble std::map. llvm-svn: 53177	2008-07-07 17:46:23 +00:00
Evan Cheng	7d98a48f15	- Remove calls to copyKillDeadInfo which is an N^2 function. Instead, propagate kill / dead markers as new instructions are constructed in foldMemoryOperand, convertToThressAddress, etc. - Also remove LiveVariables::instructionChanged, etc. Replace all calls with cheaper calls which update VarInfo kill list. llvm-svn: 53097	2008-07-03 09:09:37 +00:00
Evan Cheng	c939f45fe7	commuteInstruction should preserve dead markers. llvm-svn: 53060	2008-07-03 00:04:51 +00:00
Owen Anderson	30cc028e4a	Make LiveVariables even more optional, by making it optional in the call to TargetInstrInfo::convertToThreeAddressInstruction Also, if LV isn't around, then TwoAddr doesn't need to be updating flags, since they won't have been set in the first place. llvm-svn: 53058	2008-07-02 23:41:07 +00:00
Dan Gohman	fb19f9402b	Split ISD::LABEL into ISD::DBG_LABEL and ISD::EH_LABEL, eliminating the need for a flavor operand, and add a new SDNode subclass, LabelSDNode, for use with them to eliminate the need for a label id operand. Change instruction selection to let these label nodes through unmodified instead of creating copies of them. Teach the MachineInstr emitter how to emit a MachineInstr directly from an ISD label node. This avoids the need for allocating SDNodes for the label id and flavor value, as well as SDNodes for each of the post-isel label, label id, and label flavor. llvm-svn: 52943	2008-07-01 00:05:16 +00:00
Evan Cheng	3f2ceac565	If it's determined safe, remat MOV32r0 (i.e. xor r, r) and others as it is instead of using the longer MOV32ri instruction. llvm-svn: 52670	2008-06-24 07:10:51 +00:00
Evan Cheng	03553bb59a	Add option to commuteInstruction() which forces it to create a new (commuted) instruction. llvm-svn: 52308	2008-06-16 07:33:11 +00:00
Duncan Sands	13237ac3b9	Wrap MVT::ValueType in a struct to get type safety and better control the abstraction. Rename the type to MVT. To update out-of-tree patches, the main thing to do is to rename MVT::ValueType to MVT, and rewrite expressions like MVT::getSizeInBits(VT) in the form VT.getSizeInBits(). Use VT.getSimpleVT() to extract a MVT::SimpleValueType for use in switch statements (you will get an assert failure if VT is an extended value type - these shouldn't exist after type legalization). This results in a small speedup of codegen and no new testsuite failures (x86-64 linux). llvm-svn: 52044	2008-06-06 12:08:01 +00:00
Dan Gohman	3388d022ac	Use PMULDQ for v2i64 multiplies when SSE4.1 is available. And add load-folding table entries for PMULDQ and PMULLD. llvm-svn: 51489	2008-05-23 17:49:40 +00:00
Dan Gohman	eabd647cd5	Change target-specific classes to use more precise static types. This eliminates the need for several awkward casts, including the last dynamic_cast under lib/Target. llvm-svn: 51091	2008-05-14 01:58:56 +00:00
Bill Wendling	1e11768a4f	Constify the machine instruction passed into the "is{Trivially,Really}ReMaterializable" methods. llvm-svn: 51001	2008-05-12 20:54:26 +00:00
Evan Cheng	fa8f9f937a	Undo r50574. We are already ensuring the folded load address is 16-byte aligned. llvm-svn: 50578	2008-05-02 17:01:01 +00:00
Evan Cheng	50f82f2c8e	Not safe folding a load + FsXORPSrr into FsXORPSrm. It's loading a FR64 value but the load folding variant expects a 16-byte aligned address. llvm-svn: 50574	2008-05-02 07:50:58 +00:00
Nicolas Geoffray	984e7199cc	Don't forget to update the current operand when getting the size of an instruction. llvm-svn: 50007	2008-04-20 23:36:47 +00:00
Evan Cheng	147cb764b5	Don't forget about sub-register indices when rematting instructions. llvm-svn: 49830	2008-04-16 23:44:44 +00:00
Nicolas Geoffray	ae84bbdbed	Infrastructure for getting the machine code size of a function and an instruction. X86, PowerPC and ARM are implemented llvm-svn: 49809	2008-04-16 20:10:13 +00:00
Dan Gohman	3bc3ddd638	Rename MemOperand to MachineMemOperand. This was suggested by review feedback from Chris quite a while ago. No functionality change. llvm-svn: 49348	2008-04-07 19:35:22 +00:00
Evan Cheng	b86595fb0a	ReMat of load from stub in pic mode extends the life of pic base. Currently spiller doesn't do a good job of estimating the impact. Disable for now. llvm-svn: 49059	2008-04-01 23:26:12 +00:00
Evan Cheng	19a6dd9f2a	Remove unnecessary and non-deterministic checking code. Re-enable remat of load from gv stub. llvm-svn: 49054	2008-04-01 21:38:20 +00:00
Evan Cheng	306e3dcff4	Disabling remat of load from gv stub (temporarily) again to fix llvmgcc bootstrap miscompare. llvm-svn: 49037	2008-04-01 07:33:13 +00:00
Evan Cheng	e4f77c69ac	It's not safe to fold a load from GV stub or constantpool into a two-address use. llvm-svn: 49002	2008-03-31 23:19:51 +00:00
Evan Cheng	ed6e34fe41	Move reMaterialize() from TargetRegisterInfo to TargetInstrInfo. llvm-svn: 48995	2008-03-31 20:40:39 +00:00
Evan Cheng	1973a46cd3	Re-apply 48911. llvm-svn: 48977	2008-03-31 07:54:19 +00:00
Evan Cheng	b8654202dd	Backing out 48911 for now. It's breaking stuff. llvm-svn: 48922	2008-03-28 17:49:06 +00:00
Evan Cheng	9ae4d7b719	Load from stub is already re-materializable. llvm-svn: 48911	2008-03-28 06:49:25 +00:00
Evan Cheng	308e564693	Code clean up. llvm-svn: 48856	2008-03-27 01:45:11 +00:00
Evan Cheng	29e62a59f3	Allow certain lea instructions to be rematerialized. llvm-svn: 48855	2008-03-27 01:41:09 +00:00
Evan Cheng	4fb07c6500	Remove an unused command line option. llvm-svn: 48854	2008-03-27 01:30:24 +00:00
Dan Gohman	883cbfd0ba	Add CMP32mr and friends to the load-unfolding table. Among other things, this allows the scheduler to unfold a load operand in the 2008-01-08-SchedulerCrash.ll testcase, so it now successfully clones the comparison to avoid a pushf+popf. llvm-svn: 48777	2008-03-25 16:53:19 +00:00
Chris Lattner	5abbe6cef5	Add support for calls that return two FP values in ST(0)/ST(1). llvm-svn: 48634	2008-03-21 06:38:26 +00:00
Christopher Lamb	d3d0ad3f58	Make insert_subreg a two-address instruction, vastly simplifying LowerSubregs pass. Add a new TII, subreg_to_reg, which is like insert_subreg except that it takes an immediate implicit value to insert into rather than a register. llvm-svn: 48412	2008-03-16 03:12:01 +00:00
Christopher Lamb	dd55d3f1b2	Get rid of a pseudo instruction and replace it with subreg based operation on real instructions, ridding the asm printers of the hack used to do this previously. In the process, update LowerSubregs to be careful about eliminating copies that have side affects. Note: the coalescer will have to be careful about this too, when it starts coalescing insert_subreg nodes. llvm-svn: 48329	2008-03-13 05:47:01 +00:00
Chris Lattner	7b27ccfd5e	coalesce away 80-bit floating point copies. llvm-svn: 48241	2008-03-11 19:30:09 +00:00
Chris Lattner	7930d8e775	convert a massive if statement to a switch. llvm-svn: 48240	2008-03-11 19:28:17 +00:00
Christopher Lamb	342e4104d3	Missed part of recommit. llvm-svn: 48224	2008-03-11 10:27:36 +00:00
Chris Lattner	a4fa0ad30d	abort with an assert instead of a cerr to get line# llvm-svn: 48199	2008-03-10 23:56:08 +00:00
Evan Cheng	d4e1d9eeb2	Revert 48125, 48126, and 48130 for now to unbreak some x86-64 tests. llvm-svn: 48167	2008-03-10 19:31:26 +00:00
Christopher Lamb	4ba3f0430b	Allow insert_subreg into implicit, target-specific values. Change insert/extract subreg instructions to be able to be used in TableGen patterns. Use the above features to reimplement an x86-64 pseudo instruction as a pattern. llvm-svn: 48130	2008-03-10 06:12:08 +00:00
Chris Lattner	86829f0ff7	teach X86InstrInfo::copyRegToReg how to copy into ST(0) from an RFP register class. Teach ScheduleDAG how to handle CopyToReg with different src/dst reg classes. This allows us to compile trivial inline asms that expect stuff on the top of x87-fp stack. llvm-svn: 48107	2008-03-09 09:15:31 +00:00
Chris Lattner	b79bafcec8	add some code to support cross-register class copying from RST -> RFP{32/64/80}. We only handle ST(0) for now. llvm-svn: 48104	2008-03-09 08:46:19 +00:00
Chris Lattner	c4c9dde04c	rearrange some code, no functionality change. llvm-svn: 48101	2008-03-09 07:58:04 +00:00
Evan Cheng	42cb72e52c	Turning on remat of pic loads. llvm-svn: 47524	2008-02-23 02:07:42 +00:00
Evan Cheng	4d17671997	No need recognize load from a fixed argument slot as re-materializable. LiveIntervalAnalysis already handles it as a special case. llvm-svn: 47522	2008-02-23 01:47:44 +00:00
Evan Cheng	94ba37f8e3	Allow re-materialization of pic load (controlled by -remat-pic-load for now). llvm-svn: 47476	2008-02-22 09:25:47 +00:00
Evan Cheng	244183ef0d	commuteInstr() can now commute non-ssa machine instrs. llvm-svn: 47043	2008-02-13 02:46:49 +00:00
Evan Cheng	3b3286d4bc	It's not always safe to fold movsd into xorpd, etc. Check the alignment of the load address first to make sure it's 16 byte aligned. llvm-svn: 46893	2008-02-08 21:20:40 +00:00
Evan Cheng	8d59dd119b	Added missing entries in X86 load / store folding tables. llvm-svn: 46866	2008-02-08 00:12:56 +00:00
Evan Cheng	1bc1cae318	In some cases, e.g. ADD32ri, no transformation is made. Guide against it. llvm-svn: 46849	2008-02-07 08:29:53 +00:00
Chris Lattner	18df33d0c8	fix a wordo that gordon noticed :) llvm-svn: 45896	2008-01-12 00:53:16 +00:00
Chris Lattner	6da61c2515	Any x86 instruction that reads from an invariant location is invariant. This allows us to sink things like: cvtsi2sd 32(%esp), %xmm1 when reading from the argument area, for example. llvm-svn: 45895	2008-01-12 00:35:08 +00:00
Chris Lattner	596875118c	rename MachineInstr::setInstrDescriptor -> setDesc llvm-svn: 45871	2008-01-11 18:10:50 +00:00
Chris Lattner	806dd0e2ac	remove xchg and shift-reg-by-1 instructions, which are dead. llvm-svn: 45870	2008-01-11 18:00:50 +00:00
Chris Lattner	c8226f32e9	Simplify the side effect stuff a bit more and make licm/sinking both work right according to the new flags. This removes the TII::isReallySideEffectFree predicate, and adds TII::isInvariantLoad. It removes NeverHasSideEffects+MayHaveSideEffects and adds UnmodeledSideEffects as machine instr flags. Now the clients can decide everything they need. I think isRematerializable can be implemented in terms of the flags we have now, though I will let others tackle that. llvm-svn: 45843	2008-01-10 23:08:24 +00:00
Chris Lattner	f171482a66	verify that the frame index is immutable before remat'ing (still disabled) or being side-effect free. llvm-svn: 45816	2008-01-10 04:16:31 +00:00
Chris Lattner	9129f51f9b	add a testcase llvm-svn: 45768	2008-01-09 00:37:18 +00:00
Bill Wendling	a3bdad153f	Operand 1 should be a register. We don't care if it's a preg, vreg, or 0. llvm-svn: 45699	2008-01-07 08:05:29 +00:00
Chris Lattner	03ad885039	rename TargetInstrDescriptor -> TargetInstrDesc. Make MachineInstr::getDesc return a reference instead of a pointer, since it can never be null. llvm-svn: 45695	2008-01-07 07:27:27 +00:00
Chris Lattner	b0d06b4381	Move a bunch more accessors from TargetInstrInfo to TargetInstrDescriptor llvm-svn: 45680	2008-01-07 03:13:06 +00:00
Chris Lattner	f0f438a517	remove MachineOpCode typedef. llvm-svn: 45679	2008-01-07 02:48:55 +00:00
Chris Lattner	e55e115616	Add predicates methods to TargetOperandInfo, and switch all clients over to using them, instead of diddling Flags directly. Change the various flags from const variables to enums. llvm-svn: 45677	2008-01-07 02:39:19 +00:00
Chris Lattner	a98c679de0	Rename MachineInstr::getInstrDescriptor -> getDesc(), which reflects that it is cheap and efficient to get. Move a variety of predicates from TargetInstrInfo into TargetInstrDescriptor, which makes it much easier to query a predicate when you don't have TII around. Now you can use MI->getDesc()->isBranch() instead of going through TII, and this is much more efficient anyway. Not all of the predicates have been moved over yet. Update old code that used MI->getInstrDescriptor()->Flags to use the new predicates in many places. llvm-svn: 45674	2008-01-07 01:56:04 +00:00
Owen Anderson	2a3be7bb6c	Move even more functionality from MRegisterInfo into TargetInstrInfo. Some day I'll get it all moved over... llvm-svn: 45672	2008-01-07 01:35:02 +00:00
Bill Wendling	5fa2c64b78	Fix comment. llvm-svn: 45638	2008-01-05 23:30:51 +00:00
Bill Wendling	be984cf10b	Chris and Evan noticed that this check was compleatly fubared. I was checking that there was a from a global instead of a load from the stub for a global, which is the one that's safe to hoist. Consider this program: volatile char G[100]; int B(char *F, int N) { for (; N > 0; --N) F[N] = G[N]; } In static mode, we shouldn't be hoisting the load from G: $ llc -relocation-model=static -o - a.bc -march=x86 -machine-licm LBB1_1: # bb.preheader leal -1(%eax), %edx testl %edx, %edx movl $1, %edx cmovns %eax, %edx xorl %esi, %esi LBB1_2: # bb movb _G(%eax), %bl movb %bl, (%ecx,%eax) llvm-svn: 45626	2008-01-05 09:18:04 +00:00
Chris Lattner	e0f0c4aa02	enable sinking and licm of loads from the argument area. I'd like to enable this for remat, but can't due to an RA bug. llvm-svn: 45623	2008-01-05 06:10:42 +00:00
Chris Lattner	d4738ee8e1	simplify some code by using shorter accessors. llvm-svn: 45622	2008-01-05 05:28:30 +00:00
Chris Lattner	86f4a2e4f2	revert my previous patch. llvm-svn: 45621	2008-01-05 05:26:26 +00:00
Chris Lattner	69d7902cf1	factor some code better to avoid redundancy between isReallySideEffectFree and isReallyTriviallyReMaterializable. Why is a load from a global considered side-effect-free but not rematable? llvm-svn: 45620	2008-01-05 05:19:56 +00:00
Owen Anderson	6bb0c52628	Move some more functionality from MRegisterInfo to TargetInstrInfo. llvm-svn: 45603	2008-01-04 23:57:37 +00:00
Bill Wendling	e1f28e7871	Machine LICM will check that operands are defined outside of the loop. Also check that register isn't 0 before going further. llvm-svn: 45498	2008-01-02 21:10:40 +00:00
Owen Anderson	eee14601b1	Move some more instruction creation methods from RegisterInfo into InstrInfo. llvm-svn: 45484	2008-01-01 21:11:32 +00:00
Chris Lattner	b0fb17fd34	Fix a bug in my previous patch: refer to the impl not the pure virtual version. It's unclear why gcc would ever compile this... llvm-svn: 45476	2008-01-01 01:05:34 +00:00
Chris Lattner	25568e4cef	Fix a problem where lib/Target/TargetInstrInfo.h would include and use a header file from libcodegen. This violates a layering order: codegen depends on target, not the other way around. The fix to this is to split TII into two classes, TII and TargetInstrInfoImpl, which defines stuff that depends on libcodegen. It is defined in libcodegen, where the base is not. llvm-svn: 45475	2008-01-01 01:03:04 +00:00
Owen Anderson	7a73ae9a86	Move copyRegToReg from MRegisterInfo to TargetInstrInfo. This is part of the Machine-level API cleanup instigated by Chris. llvm-svn: 45470	2007-12-31 06:32:00 +00:00
Chris Lattner	a10fff51d9	Rename SSARegMap -> MachineRegisterInfo in keeping with the idea that "machine" classes are used to represent the current state of the code being compiled. Given this expanded name, we can start moving other stuff into it. For now, move the UsedPhysRegs and LiveIn/LoveOuts vectors from MachineFunction into it. Update all the clients to match. This also reduces some needless #includes, such as MachineModuleInfo from MachineFunction. llvm-svn: 45467	2007-12-31 04:13:23 +00:00
Chris Lattner	a5bb370aa4	Add new shorter predicates for testing machine operands for various types: e.g. MO.isMBB() instead of MO.isMachineBasicBlock(). I don't plan on switching everything over, so new clients should just start using the shorter names. Remove old long accessors, switching everything over to use the short accessor: getMachineBasicBlock() -> getMBB(), getConstantPoolIndex() -> getIndex(), setMachineBasicBlock -> setMBB(), etc. llvm-svn: 45464	2007-12-30 23:10:15 +00:00
Chris Lattner	5c4637816e	Use MachineOperand::getImm instead of MachineOperand::getImmedValue. Likewise setImmedValue -> setImm llvm-svn: 45453	2007-12-30 20:49:49 +00:00
Bill Wendling	7749a9014b	If we have a load of a global address that's not modified during the function, then go ahead and hoist it out of the loop. This is the result: $ cat a.c volatile int G; int A(int N) { for (; N > 0; --N) G++; } $ llc -o - -relocation-model=pic _A: ... LBB1_2: # bb movl L_G$non_lazy_ptr-"L1$pb"(%eax), %esi incl (%esi) incl %edx cmpl %ecx, %edx jne LBB1_2 # bb ... $ llc -o - -relocation-model=pic -machine-licm _A: ... movl L_G$non_lazy_ptr-"L1$pb"(%eax), %eax LBB1_2: # bb incl (%eax) incl %edx cmpl %ecx, %edx jne LBB1_2 # bb ... I'm limiting this to the MOV32rm x86 instruction for now. llvm-svn: 45444	2007-12-30 03:18:58 +00:00
Chris Lattner	f3ebc3f3d2	Remove attribution from file headers, per discussion on llvmdev. llvm-svn: 45418	2007-12-29 20:36:04 +00:00
Bill Wendling	b3d85a5d4b	Add "mayHaveSideEffects" and "neverHasSideEffects" flags to some instructions. I based what flag to set on whether it was already marked as "isRematerializable". If there was a further check to determine if it's "really" rematerializable, then I marked it as "mayHaveSideEffects" and created a check in the X86 back-end similar to the remat one. llvm-svn: 45132	2007-12-17 23:07:56 +00:00
Bill Wendling	3f19dfe794	Reverting 44702. It wasn't correct to rename them. llvm-svn: 44727	2007-12-08 23:58:46 +00:00
Bill Wendling	2b07d8c5a0	Renaming: isTriviallyReMaterializable -> hasNoSideEffects isReallyTriviallyReMaterializable -> isTriviallyReMaterializable llvm-svn: 44702	2007-12-08 07:17:56 +00:00
Arnold Schwaighofer	9ccea99165	Added tail call optimization to the x86 back end. It can be enabled by passing -tailcallopt to llc. The optimization is performed if the following conditions are satisfied: * caller/callee are fastcc * elf/pic is disabled OR elf/pic enabled + callee is in module + callee has visibility protected or hidden llvm-svn: 42870	2007-10-11 19:40:01 +00:00
Evan Cheng	82bc90ac60	Under 64-bit mode use LEA64_32r instead of LEA64r to save a byte. llvm-svn: 42783	2007-10-09 07:14:53 +00:00
Evan Cheng	18109c88c3	Allow x86 compare to be commutable by default. llvm-svn: 42761	2007-10-08 18:27:46 +00:00
Evan Cheng	1151ffde70	Commute x86 cmove instructions by swapping the operands and change the condition to its inverse. Testing this as llcbeta llvm-svn: 42661	2007-10-05 23:13:21 +00:00
Evan Cheng	484cab7a2f	Enable convertToThreeAddress for X86 by default. llvm-svn: 42655	2007-10-05 22:31:10 +00:00
Evan Cheng	d3ccf00870	INC64_32r -> LEA64_32r is better than INC64_32r -> LEA32r, but it still can cause performance degradation. llvm-svn: 42653	2007-10-05 21:55:32 +00:00
Evan Cheng	fa2c828687	In 64-bit mode, avoid using leal with 32-bit 32-bit address size, e.g. leal 1(%ecx), %edi, which requires 67H prefix. llvm-svn: 42647	2007-10-05 20:34:26 +00:00
Evan Cheng	aac0f8e351	Add support to convert more 64-bit instructions to 3-address instructions. llvm-svn: 42642	2007-10-05 18:20:36 +00:00
Evan Cheng	a8a9c15e30	Testing convertToThreeeAddress as X86 llcbeta. llvm-svn: 42630	2007-10-05 08:04:01 +00:00
Evan Cheng	5fb5a1f389	Enabling new condition code modeling scheme. llvm-svn: 42459	2007-09-29 00:00:36 +00:00
Evan Cheng	e95f391ef1	Added support for new condition code modeling scheme (i.e. physical register dependency). These are a bunch of instructions that are duplicated so the x86 backend can support both the old and new schemes at the same time. They will be deleted after all the kinks are worked out. llvm-svn: 42285	2007-09-25 01:57:46 +00:00
Dan Gohman	3243e10ef0	Add 64-bit jmp instructions to the list of instructions that can terminate a block with no fall-through. llvm-svn: 42029	2007-09-17 15:19:08 +00:00
Dan Gohman	48ea03d169	Add patterns for SHLD64* and SHRD64*. llvm-svn: 41975	2007-09-14 23:17:45 +00:00
Evan Cheng	483e1ce16e	Add implicit def of EFLAGS on those instructions that may modify flags. llvm-svn: 41962	2007-09-14 21:48:26 +00:00
Owen Anderson	e2f23a3abf	Add lengthof and endof templates that hide a lot of sizeof computations. Patch by Sterling Stein! llvm-svn: 41758	2007-09-07 04:06:50 +00:00
Evan Cheng	189df733ed	Fix a bug in X86InstrInfo::convertToThreeAddress that caused it to codegen: leal (,%rcx,8), %rcx It should be leal (,%rcx,8), %ecx llvm-svn: 41735	2007-09-06 00:14:41 +00:00
Christopher Lamb	d36d30b53c	Add 2-addr to 3-addr promotion code that allows 32-bit LEA to be used via subregisters when 16-bit LEA is disabled. llvm-svn: 41007	2007-08-10 21:18:25 +00:00
Evan Cheng	936d17aa1b	Don't pollute the meaning of isUnpredicatedTerminator. llvm-svn: 40537	2007-07-26 17:32:14 +00:00
Evan Cheng	d771e05121	isUnpredicatedTerminator should treat conditional branches as unpredicated terminator. llvm-svn: 37960	2007-07-06 23:22:03 +00:00
Dale Johannesen	3d7008cd49	Refactor X87 instructions. As a side effect, all their names are changed. llvm-svn: 37876	2007-07-04 21:07:47 +00:00
Dale Johannesen	a2b3c175db	Fix for PR 1505 (and 1489). Rewrite X87 register model to include f32 variants. Some factoring improvments forthcoming. llvm-svn: 37847	2007-07-03 00:53:03 +00:00
Dan Gohman	e8c1e428f2	Revert the earlier change that removed the M_REMATERIALIZABLE machine instruction flag, and use the flag along with a virtual member function hook for targets to override if there are instructions that are only trivially rematerializable with specific operands (i.e. constant pool loads). llvm-svn: 37728	2007-06-26 00:48:07 +00:00
Dan Gohman	9e82064924	Replace M_REMATERIALIZIBLE and the newly-added isOtherReMaterializableLoad with a general target hook to identify rematerializable instructions. Some instructions are only rematerializable with specific operands, such as loads from constant pools, while others are always rematerializable. This hook allows both to be identified as being rematerializable with the same mechanism. llvm-svn: 37644	2007-06-19 01:48:05 +00:00
Dale Johannesen	616627b002	Do not treat FP_REG_KILL as terminator in branch analysis (X86). llvm-svn: 37578	2007-06-14 22:03:45 +00:00
Dan Gohman	4a4a8eb00e	Add a target hook to allow loads from constant pools to be rematerialized, and an implementation for x86. llvm-svn: 37576	2007-06-14 20:50:44 +00:00
Dale Johannesen	c68554683d	Handle blocks with 2 unconditional branches in AnalyzeBranch. llvm-svn: 37571	2007-06-13 17:59:52 +00:00
Evan Cheng	5514bbef46	Add a utility routine to check for unpredicated terminator instruction. llvm-svn: 37528	2007-06-08 21:59:56 +00:00
Evan Cheng	fc94eb66d2	BlockHasNoFallThrough() now returns true if block ends with a return instruction. llvm-svn: 37266	2007-05-21 18:44:17 +00:00
Evan Cheng	e20dd92792	RemoveBranch() and InsertBranch() now returns number of instructions deleted / inserted. llvm-svn: 37193	2007-05-18 00:18:17 +00:00
Evan Cheng	8cd224e81c	Relex assertions to account for additional implicit def / use operands. llvm-svn: 36430	2007-04-25 07:12:14 +00:00
Bill Wendling	e2324ca17d	Remove some invalid instructions from this check. llvm-svn: 36404	2007-04-24 21:17:46 +00:00
Bill Wendling	ac5b650a54	Adding more MMX instructions. llvm-svn: 35638	2007-04-03 23:48:32 +00:00
Bill Wendling	e7b2a864f2	Add FEMMS and ADDQ. Renamed MMX recipes to prepend the MMX_ to them. llvm-svn: 35616	2007-04-03 06:00:37 +00:00
Chris Lattner	bcd38852f2	Compile CodeGen/X86/lea-3.ll:test2 to: _test3: leaq (,%rdi,4), %rax orq %rdi, %rax ret instead of: _test2: movq %rdi, %rax shlq $2, %rax orq %rdi, %rax ret llvm-svn: 35434	2007-03-28 18:12:31 +00:00
Chris Lattner	c56e4920d2	Fix a problem building llvm-gcc on amd64-unknown-freebsd6.2, due to the system assembler not groking legal instructions like "leal (,%esi,8), %ecx". llvm-svn: 35393	2007-03-28 00:58:40 +00:00
Chris Lattner	3e1d917e80	Two changes: 1) codegen a shift of a register as a shift, not an LEA. 2) teach the RA to convert a shift to an LEA instruction if it wants something in three-address form. This gives us asm diffs like: - leal (,%eax,4), %eax + shll $2, %eax which is faster on some processors and smaller on all of them. and, more interestingly: - movl 24(%esi), %eax - leal (,%eax,4), %edi + movl 24(%esi), %edi + shll $2, %edi Without #2, #1 was a significant pessimization in some cases. This implements CodeGen/X86/shift-codegen.ll llvm-svn: 35204	2007-03-20 06:08:29 +00:00
Bill Wendling	6092ce25cf	Added "padd*" support for MMX. Added MMX move stuff to X86InstrInfo so that moves, loads, etc. are recognized. llvm-svn: 35031	2007-03-08 22:09:11 +00:00
Jim Laskey	f9e5445ed4	Make LABEL a builtin opcode. llvm-svn: 33537	2007-01-26 14:34:52 +00:00
Evan Cheng	07fc107e90	convertToThreeAddress() is now responsible for updating live info as well as inserting the new MI's. llvm-svn: 32097	2006-12-01 21:52:41 +00:00
Evan Cheng	20350c4025	Change MachineInstr ctor's to take a TargetInstrDescriptor reference instead of opcode and number of operands. llvm-svn: 31947	2006-11-27 23:37:22 +00:00
Evan Cheng	7ae482c52a	Fix a potential bug: MOVPDI2DI, etc. are not copy instructions. llvm-svn: 31794	2006-11-16 23:22:26 +00:00
Evan Cheng	dc2c8748a7	Properly transfer kill / dead info. llvm-svn: 31765	2006-11-15 20:58:11 +00:00
Evan Cheng	dbd3d294e6	Matches MachineInstr changes. llvm-svn: 31712	2006-11-13 23:36:35 +00:00
Chris Lattner	7443600ef8	fix wonky indentation llvm-svn: 31298	2006-10-30 22:27:23 +00:00
Chris Lattner	53ebf20c26	add another target hook for branch folding. llvm-svn: 31262	2006-10-28 17:29:57 +00:00
Chris Lattner	3a897f31fa	Implement support for branch condition reversal. llvm-svn: 31099	2006-10-21 05:52:40 +00:00
Chris Lattner	d881660366	Simplify code, no functionality change llvm-svn: 31097	2006-10-21 05:42:09 +00:00
Chris Lattner	6fca75ec05	allow insertion of a conditional branch with fall-through llvm-svn: 31095	2006-10-21 05:34:23 +00:00
Chris Lattner	20e75d4635	update assert message llvm-svn: 31093	2006-10-21 04:42:29 +00:00
Chris Lattner	8366b874a9	bugfix llvm-svn: 31074	2006-10-20 20:44:34 +00:00
Chris Lattner	c0fb567e23	Implement branch analysis/xform hooks required by the branch folding pass. llvm-svn: 31065	2006-10-20 17:42:20 +00:00
Chris Lattner	0408e5b7ce	expose DWARF_LABEL opcode# so the branch folder can update debug info properly. llvm-svn: 31024	2006-10-17 22:41:45 +00:00
Chris Lattner	63007919dc	remove some dead code llvm-svn: 30938	2006-10-13 20:40:42 +00:00
Chris Lattner	d9e4bf5285	update comments llvm-svn: 30663	2006-09-28 23:33:12 +00:00
Evan Cheng	11b0a5dbd4	Committing X86-64 support. llvm-svn: 30177	2006-09-08 06:48:29 +00:00
Chris Lattner	13a5dcddce	Fix a long-standing wart in the code generator: two-address instruction lowering actually removes one of the operands, instead of just assigning both operands the same register. This make reasoning about instructions unnecessarily complex, because you need to know if you are before or after register allocation to match up operand #'s with the target description file. Changing this also gets rid of a bunch of hacky code in various places. This patch also includes changes to fold loads into cmp/test instructions in the X86 backend, along with a significant simplification to the X86 spill folding code. llvm-svn: 30108	2006-09-05 02:12:02 +00:00
Evan Cheng	66ed41cac1	Can't commute shufps. The high / low parts elements come from different vectors. llvm-svn: 29275	2006-07-25 20:25:40 +00:00
Evan Cheng	c0f90bef47	Commute shufps / shufpd. llvm-svn: 28577	2006-05-30 23:34:30 +00:00
Evan Cheng	f21045a5cd	Somehow I lost a condition when I was shuffling some code around. Anyway, only transform a shufps to pshufd when the first two operands are the same. llvm-svn: 28575	2006-05-30 22:13:36 +00:00
Evan Cheng	c8c172eaae	Fix a build breaker. llvm-svn: 28574	2006-05-30 21:45:53 +00:00
Evan Cheng	a4fc5b8699	Oops. PSHUFD is only available with SSE2. llvm-svn: 28573	2006-05-30 21:30:59 +00:00
Evan Cheng	66f849bd7b	Allow shufps x, x, mask to be converted to pshufd x, mask to save a move. llvm-svn: 28565	2006-05-30 20:26:50 +00:00
Evan Cheng	feca91a516	These can be transformed into lea as well. Not that we use this feature currently... llvm-svn: 28393	2006-05-19 18:43:41 +00:00
Evan Cheng	dd7230c9e0	Add MOV16_rm / MOV32_rm and MOV16_mr / MOV32_mr to isLoadFromStackSlot and isStoreToStackSlot llvm-svn: 28223	2006-05-11 07:33:49 +00:00
Evan Cheng	9733bde74c	Fixing truncate. Previously we were emitting truncate from r16 to r8 as movw. That is we promote the destination operand to r16. So %CH = TRUNC_R16_R8 %BP is emitted as movw %bp, %cx. This is incorrect. If %cl is live, it would be clobbered. Ideally we want to do the opposite, that is emitted it as movb ??, %ch But this is not possible since %bp does not have a r8 sub-register. We are now defining a new register class R16_ which is a subclass of R16 containing only those 16-bit registers that have r8 sub-registers (i.e. AX - DX). We isel the truncate to two instructions, a MOV16to16_ to copy the value to the R16_ class, followed by a TRUNC_R16_R8. Due to bug 770, the register colaescer is not going to coalesce between R16 and R16_. That will be fixed later so we can eliminate the MOV16to16_. Right now, it can only be eliminated if we are lucky that source and destination registers are the same. llvm-svn: 28164	2006-05-08 08:01:26 +00:00
Chris Lattner	bfc2c68386	Teach the codegen about instructions used for SSE spill code, allowing it to optimize cases where it has to spill a lot llvm-svn: 27801	2006-04-18 16:44:51 +00:00
Evan Cheng	ebf1006d16	- More efficient extract_vector_elt with shuffle and movss, movsd, movd, etc. - Some bug fixes and naming inconsistency fixes. llvm-svn: 27377	2006-04-03 20:53:28 +00:00
Evan Cheng	e7ee6a5e32	Support for scalar to vector with zero extension. llvm-svn: 27091	2006-03-24 23:15:12 +00:00
Evan Cheng	1208d9179a	- Remove scalar to vector pseudo ops. They are just wrong. - Handle FR32 to VR128:v4f32 and FR64 to VR128:v2f64 with aliases of MOVAPS and MOVAPD. Mark them as move instructions and hope they will be deleted. llvm-svn: 26919	2006-03-21 07:09:35 +00:00
Evan Cheng	24c461b51e	1. Use pxor instead of xoraps / xorapd to clear FR32 / FR64 registers. This proves to be worth 20% on Ptrdist/ks. Might be related to dependency breaking support. 2. Added FsMOVAPSrr and FsMOVAPDrr as aliases to MOVAPSrr and MOVAPDrr. These are used for FR32 / FR64 reg-to-reg copies. 3. Tell reg-allocator to generate MOVSSrm / MOVSDrm and MOVSSmr / MOVSDmr to spill / restore FsMOVAPSrr and FsMOVAPDrr. llvm-svn: 26241	2006-02-16 22:45:17 +00:00
Chris Lattner	5123346708	fix operand numbers llvm-svn: 25915	2006-02-02 20:38:12 +00:00
Chris Lattner	bb53acd03c	Move isLoadFrom/StoreToStackSlot from MRegisterInfo to TargetInstrInfo,a far more logical place. Other methods should also be moved if anyoneis interested. :) llvm-svn: 25913	2006-02-02 20:12:32 +00:00
Evan Cheng	b3ea2677a4	Tell codegen MOVAPSrr and MOVAPDrr are copies. llvm-svn: 25889	2006-02-01 23:03:16 +00:00
Nate Begeman	9d7008b08d	Properly split f32 and f64 into separate register classes for scalar sse fp fixing a bunch of nasty hackery llvm-svn: 23735	2005-10-14 22:06:00 +00:00
Nate Begeman	8293d0e232	Teach the register allocator that movaps is also a move instruction llvm-svn: 22451	2005-07-16 02:00:20 +00:00
Nate Begeman	8a0933608a	First round of support for doing scalar FP using the SSE2 ISA extension and XMM registers. There are many known deficiencies and fixmes, which will be addressed ASAP. The major benefit of this work is that it will allow the LLVM register allocator to allocate FP registers across basic blocks. The x86 backend will still default to x87 style FP. To enable this work, you must pass -enable-sse-scalar-fp and either -sse2 or -sse3 to llc. An example before and after would be for: double foo(double *P) { double Sum = 0; int i; for (i = 0; i < 1000; ++i) Sum += P[i]; return Sum; } The inner loop looks like the following: x87: .LBB_foo_1: # no_exit fldl (%esp) faddl (%eax,%ecx,8) fstpl (%esp) incl %ecx cmpl $1000, %ecx #FP_REG_KILL jne .LBB_foo_1 # no_exit SSE2: addsd (%eax,%ecx,8), %xmm0 incl %ecx cmpl $1000, %ecx #FP_REG_KILL jne .LBB_foo_1 # no_exit llvm-svn: 22340	2005-07-06 18:59:04 +00:00
Misha Brukman	c88330ad13	* Remove trailing whitespace * Convert tabs to spaces llvm-svn: 21426	2005-04-21 23:38:14 +00:00
Chris Lattner	25be208e02	When commuting these instructions, make sure to actually swap the operands too. llvm-svn: 19694	2005-01-19 16:55:52 +00:00
Chris Lattner	d54845f530	Improve coverage of the X86 instruction set by adding 16-bit shift doubles. llvm-svn: 19687	2005-01-19 07:31:24 +00:00
Chris Lattner	2947801735	Teach the code generator that shrd/shld is commutable if it has an immediate. This allows us to generate this: foo: mov %EAX, DWORD PTR [%ESP + 4] mov %EDX, DWORD PTR [%ESP + 8] shld %EDX, %EDX, 2 shl %EAX, 2 ret instead of this: foo: mov %EAX, DWORD PTR [%ESP + 4] mov %ECX, DWORD PTR [%ESP + 8] mov %EDX, %EAX shrd %EDX, %ECX, 30 shl %EAX, 2 ret Note the magically transmogrifying immediate. llvm-svn: 19686	2005-01-19 07:11:01 +00:00
Chris Lattner	733aac1270	Disable 2->3 address promotion of add and inc instructions to LEA's. In addition to being three address, LEA's don't set the flags. This fixes 186.crafty. llvm-svn: 19251	2005-01-02 04:18:17 +00:00
Chris Lattner	b7782d77c1	Implement the convertToThreeAddress method, add support for inverting JP/JNP branches. llvm-svn: 19247	2005-01-02 02:37:07 +00:00
Chris Lattner	4d7af1c680	Fix a warning llvm-svn: 15409	2004-08-01 19:31:30 +00:00
Alkis Evlogimenos	bb635a27a4	Align breaks. llvm-svn: 15371	2004-07-31 10:05:44 +00:00
Chris Lattner	fcef7655fa	Add breaks llvm-svn: 15365	2004-07-31 09:53:31 +00:00
Alkis Evlogimenos	ce15f8f4c9	Simplify code a bit. llvm-svn: 15364	2004-07-31 09:44:32 +00:00
Alkis Evlogimenos	aaf642103a	Correctly spell 'unconditional'. llvm-svn: 15363	2004-07-31 09:41:44 +00:00
Alkis Evlogimenos	f57d78a87d	Implement insertGoto and reverseBranchCondition for the X86. llvm-svn: 15362	2004-07-31 09:38:47 +00:00
Alkis Evlogimenos	ea81b79a97	A big X86 instruction rename. The instructions are renamed to make their names more decriptive. A name consists of the base name, a default operand size followed by a character per operand with an optional special size. For example: ADD8rr -> add, 8-bit register, 8-bit register IMUL16rmi -> imul, 16-bit register, 16-bit memory, 16-bit immediate IMUL16rmi8 -> imul, 16-bit register, 16-bit memory, 8-bit immediate MOVSX32rm16 -> movsx, 32-bit register, 16-bit memory llvm-svn: 11995	2004-02-29 08:50:03 +00:00
Chris Lattner	ed01da8f0b	Adjust to change in TII ctor arguments llvm-svn: 11987	2004-02-29 06:31:44 +00:00
Chris Lattner	ca89812db7	These two virtual methods are never called. llvm-svn: 11984	2004-02-29 05:59:33 +00:00
Alkis Evlogimenos	8358cc573d	Move MOTy::UseType enum into MachineOperand. This eliminates the switch statements in the constructors and simplifies the implementation of the getUseType() member function. You will have to specify defs using MachineOperand::Def instead of MOTy::Def though (similarly for Use and UseAndDef). llvm-svn: 11715	2004-02-22 19:23:26 +00:00
Alkis Evlogimenos	8cdd0215bf	Remove getAllocatedRegNum(). Use getReg() instead. llvm-svn: 11393	2004-02-13 21:01:20 +00:00
Chris Lattner	92252f42c3	Don't use MachineOperator::is(Phys\|Virt)Register llvm-svn: 11276	2004-02-10 20:31:28 +00:00
Chris Lattner	259e98ed27	Tighten up checks llvm-svn: 11274	2004-02-10 20:25:13 +00:00
Alkis Evlogimenos	aeb8a80c71	FpMOV is also a move instruction. llvm-svn: 11055	2004-02-01 08:22:16 +00:00
Alkis Evlogimenos	52564b2071	Add TargetInstrInfo::isMoveInstr() to support coalescing in register allocation. llvm-svn: 10633	2003-12-28 17:35:08 +00:00
Brian Gaeke	960707c335	Put all LLVM code into the llvm namespace, as per bug 109. llvm-svn: 9903	2003-11-11 22:41:34 +00:00
John Criswell	482202a601	Added LLVM project notice to the top of every C++ source file. Header files will be on the way. llvm-svn: 9298	2003-10-20 19:43:21 +00:00
Chris Lattner	05e2f385a7	* Start using tablegen'd instruction descriptions * Fix bug in the createNOP method, which was not marking the operands of the generated XCHG as useanddef. I don't think this method is actually used, so it wasn't breaking anything, but it should be fixed anyway... llvm-svn: 7539	2003-08-03 21:55:55 +00:00
Misha Brukman	e2402c65d0	Reword to remove reference to how things worked in the past. llvm-svn: 6323	2003-05-24 01:08:43 +00:00
Misha Brukman	d21a02ad58	Implement the TargetInstrInfo's createNOPinstr() and isNOPinstr() interface. llvm-svn: 6320	2003-05-24 00:09:50 +00:00
Chris Lattner	b4d58d7f9e	Rename MachineInstrInfo -> TargetInstrInfo llvm-svn: 5272	2003-01-14 22:00:31 +00:00
Chris Lattner	b54343a64f	Add comments, switch uses/defs to match InstrInfo.def file llvm-svn: 5102	2002-12-18 01:05:54 +00:00
Chris Lattner	0d80874f6c	* Move information about Implicit Defs/Uses into X86InstrInfo.def. * Expose information about implicit defs/uses of register through the MachineInstrInfo.h file. llvm-svn: 4877	2002-12-03 05:42:53 +00:00
Misha Brukman	1a72c637fb	Added -- C++ -- mode to the comments. llvm-svn: 4826	2002-11-22 22:42:50 +00:00
Chris Lattner	cf72e52df3	Expose base opcode llvm-svn: 4742	2002-11-18 06:56:24 +00:00
Chris Lattner	0018e8d5fc	Start to add more information to instr.def llvm-svn: 4741	2002-11-18 05:37:11 +00:00
Chris Lattner	9289d7d693	Reorganize printing interface a bit llvm-svn: 4728	2002-11-17 22:53:13 +00:00
Chris Lattner	87b84a6913	Set the destination register field based on the target specific flags llvm-svn: 4442	2002-10-30 01:15:31 +00:00
Chris Lattner	27d247978b	Rename X86InstructionInfo to X86InstrInfo llvm-svn: 4413	2002-10-29 21:05:24 +00:00
Chris Lattner	f57420ee17	Minor renaming llvm-svn: 4410	2002-10-29 20:48:56 +00:00
Chris Lattner	16cbd41c21	Implement MachineInstrInfo interface llvm-svn: 4394	2002-10-29 17:43:19 +00:00
Chris Lattner	d92fb0058b	Initial checkin of X86 backend. We can instruction select exactly one instruction 'ret void'. Wow. llvm-svn: 4284	2002-10-25 22:55:53 +00:00

... 15 16 17 18 19 ...

1009 Commits