llvm-project

Commit Graph

Author	SHA1	Message	Date
Eli Friedman	5286833f4a	Revert earlier unnecessary hack. Make sure we correctly force on 64bit and cmov for 64-bit targets. llvm-svn: 134768	2011-07-08 23:07:42 +00:00
Evan Cheng	45543ba4e8	Fix indentation. llvm-svn: 134764	2011-07-08 22:49:55 +00:00
Evan Cheng	284b467d9f	Add support for ARM / Thumb mode switching with .code 16 and .code 32. llvm-svn: 134760	2011-07-08 22:36:29 +00:00
Jim Grosbach	39c67b5e08	Mark tBRIND as predicable. llvm-svn: 134758	2011-07-08 22:33:49 +00:00
Evan Cheng	60fc0fca5c	Restore old behavior. Always auto-detect features unless cpu or features are specified. llvm-svn: 134757	2011-07-08 22:30:25 +00:00
Jim Grosbach	59a3ab6e46	Pseudo-ize tBRIND. llvm-svn: 134755	2011-07-08 22:25:23 +00:00
Eli Friedman	e2f76c4ade	Default 64-bit target features and SSE2 on when a triple specifies x86-64. Clean up all the other hacks which are now unnecessary. llvm-svn: 134753	2011-07-08 22:16:47 +00:00
Jim Grosbach	7471937ad7	Make tBX_RET and tBX_RET_vararg predicable. The normal tBX instruction is predicable, so there's no reason the pseudos for using it as a return shouldn't be. Gives us some nice code-gen improvements as can be seen by the test changes. In particular, several tests now have to disable if-conversion because it works too well and defeats the test. llvm-svn: 134746	2011-07-08 21:50:04 +00:00
Julien Lerouge	112fcc164a	Add _allrem, _aullrem and _allmul to the runtime for MSVC. http://llvm.org/bugs/show_bug.cgi?id=10305 llvm-svn: 134744	2011-07-08 21:40:25 +00:00
Cameron Zwarich	f03fa189ca	Add an intrinsic and codegen support for fused multiply-accumulate. The intent is to use this for architectures that have a native FMA instruction. llvm-svn: 134742	2011-07-08 21:39:21 +00:00
Evan Cheng	964cb5feb0	For non-x86 host, used generic as CPU name. llvm-svn: 134741	2011-07-08 21:14:14 +00:00
Jim Grosbach	d61ae786bd	Pseudo-ize tBX_RET and tBX_RET_vararg. llvm-svn: 134739	2011-07-08 21:10:35 +00:00
Benjamin Kramer	debe69fb37	Plug a leak by giving the AsmParser ownership of the MCSubtargetInfo. Found by valgrind. llvm-svn: 134738	2011-07-08 21:06:23 +00:00
Jim Grosbach	cb1b0b7130	Shuffle productions around a bit. No functional change. llvm-svn: 134737	2011-07-08 21:04:05 +00:00
Jim Grosbach	204c128f66	Use tPseudoExpand for tTAILJMPrND and tTAILJMPr. llvm-svn: 134734	2011-07-08 20:39:19 +00:00
Jim Grosbach	4af8647e17	Use tPseudoExpand for tTAILJMPd and tTAILJMPdND. llvm-svn: 134732	2011-07-08 20:32:21 +00:00
Benjamin Kramer	dbdff47cb3	Silence compiler warning. llvm-svn: 134730	2011-07-08 20:18:13 +00:00
Jim Grosbach	3840c90f73	Add more info to FIXME. llvm-svn: 134729	2011-07-08 20:18:11 +00:00
Jim Grosbach	166cd88645	Move Thumb tail call pseudos to Thumb.td file. Fix a FIXME. llvm-svn: 134727	2011-07-08 20:13:35 +00:00
Evan Cheng	22e9d8f40e	TargetAsmParser doesn't need reference to Target. llvm-svn: 134721	2011-07-08 19:33:14 +00:00
Jim Grosbach	dbfb29d6c0	Use ARMPseudoExpand for ARM tail calls. llvm-svn: 134719	2011-07-08 18:50:22 +00:00
Jim Grosbach	7ddc1d709f	Shuffle productions around a bit. No functional change. llvm-svn: 134714	2011-07-08 18:26:27 +00:00
Jim Grosbach	2dfe8e3ccd	Use ARMPseudoExpand for BLr9, BLr9_pred, BXr9, and BXr9_pred. TableGen'erated MC lowering pseudo-expansion. llvm-svn: 134712	2011-07-08 18:15:12 +00:00
Chandler Carruth	f21cebf6dd	Add CMake support for the new TableGen file introduced in r134705. llvm-svn: 134707	2011-07-08 17:54:08 +00:00
Jim Grosbach	95dee40343	Use TableGen'erated pseudo lowering for ARM. Hook up the TableGen lowering for simple pseudo instructions for ARM and use it for a subset of the many pseudos the backend has as proof of concept. More conversions to come. llvm-svn: 134705	2011-07-08 17:40:42 +00:00
Evan Cheng	4d1ca96bfc	Eliminate asm parser's dependency on TargetMachine: - Each target asm parser now creates its own MCSubtatgetInfo (if needed). - Changed AssemblerPredicate to take subtarget features which tablegen uses to generate asm matcher subtarget feature queries. e.g. "ModeThumb,FeatureThumb2" is translated to "(Bits & ModeThumb) != 0 && (Bits & FeatureThumb2) != 0". llvm-svn: 134678	2011-07-08 01:53:10 +00:00
Akira Hatanaka	f9a85356bc	Raise assertion when MachineOperand has unexpected target flag. llvm-svn: 134671	2011-07-08 00:42:35 +00:00
Akira Hatanaka	8ccd65842d	Make sure variable Kind is assigned a value to suppress warning. llvm-svn: 134668	2011-07-08 00:26:25 +00:00
Nick Lewycky	9badf60203	Let the inline asm 'q' constraint match float, and on 64-bit double too. Fixes PR9602! llvm-svn: 134665	2011-07-08 00:19:27 +00:00
Eric Christopher	7a2a0f80de	Go ahead and emit the barrier on x86-64 even without sse2. The processor supports it just fine. Fixes PR9675 and rdar://9740801 llvm-svn: 134664	2011-07-08 00:04:56 +00:00
Akira Hatanaka	9c6028f98e	Lower MachineInstr to MC Inst and print to .s files. llvm-svn: 134661	2011-07-07 23:56:50 +00:00
Eric Christopher	719c29702f	Handle fpcr register. Part of PR10299 and rdar://9740322 llvm-svn: 134653	2011-07-07 22:54:12 +00:00
Eric Christopher	9721396dab	Add support for the X86 'l' constraint. Fixes PR10149 and rdar://9738585 llvm-svn: 134648	2011-07-07 22:29:07 +00:00
Akira Hatanaka	28d6677a53	Remove unnecessary newline. llvm-svn: 134645	2011-07-07 22:06:18 +00:00
Evan Cheng	13bcc6c1c7	Add Mode64Bit feature and sink it down to MC layer. llvm-svn: 134641	2011-07-07 21:06:52 +00:00
Bill Wendling	1a423d8b35	Move a function out-of-line. llvm-svn: 134640	2011-07-07 21:05:13 +00:00
Akira Hatanaka	9f6f6f6ecc	Rather than having printMemOperand change the way memory operands are printed based on a modifier, split it into two functions. llvm-svn: 134637	2011-07-07 20:54:20 +00:00
Akira Hatanaka	77a9e6e7df	Define class MipsMCInstLower. llvm-svn: 134633	2011-07-07 20:24:54 +00:00
Akira Hatanaka	ddd1265316	Change visibility of MipsAsmPrinter. llvm-svn: 134630	2011-07-07 20:10:52 +00:00
Akira Hatanaka	04da3658c6	Define class MipsMCSymbolRefExpr. llvm-svn: 134629	2011-07-07 19:27:22 +00:00
Akira Hatanaka	9d1936a270	Simplify MipsRegisterInfo::eliminateFrameIndex. llvm-svn: 134628	2011-07-07 19:13:09 +00:00
Evan Cheng	6dbe713a49	Rewrite comment in English. llvm-svn: 134627	2011-07-07 19:09:06 +00:00
Evan Cheng	1834f5dcb6	Rename attribute 'thumb' to a more descriptive 'thumb-mode'. llvm-svn: 134626	2011-07-07 19:05:12 +00:00
Akira Hatanaka	2e766ed2f8	Reverse order of operands of address operand mem so that the base operand comes before the offset. This change will enable simplification of function MipsRegisterInfo::eliminateFrameIndex. llvm-svn: 134625	2011-07-07 18:57:00 +00:00
Akira Hatanaka	ac4db9251b	Add missing return statement. llvm-svn: 134622	2011-07-07 18:27:36 +00:00
Oscar Fuentes	32a45e5aeb	Update CMake library dependencies llvm-svn: 134616	2011-07-07 16:33:00 +00:00
Douglas Gregor	cc4a55f6f2	Fix CMake build llvm-svn: 134614	2011-07-07 15:59:22 +00:00
Cameron Zwarich	148220306f	The VMLA instruction and its friends are not actually fused; they're plain old multiply-accumulate instructions with separate rounding steps. llvm-svn: 134609	2011-07-07 08:28:52 +00:00
Evan Cheng	f2c2616e72	Sink feature IsThumb into MC layer. llvm-svn: 134608	2011-07-07 08:26:46 +00:00
Evan Cheng	1a72add615	Compute feature bits at time of MCSubtargetInfo initialization. llvm-svn: 134606	2011-07-07 07:07:08 +00:00
Bill Wendling	667be58220	Use ArrayRef instead of a std::vector&. llvm-svn: 134595	2011-07-07 04:42:01 +00:00
Evan Cheng	8b2bda09a5	Change some ARM subtarget features to be single bit yes/no in order to sink them down to MC layer. Also fix tests. llvm-svn: 134590	2011-07-07 03:55:05 +00:00
Bill Wendling	b6adf46f62	Add a target hook to encode the compact unwind information. llvm-svn: 134577	2011-07-07 00:54:13 +00:00
Evan Cheng	2bd65363a8	Factor ARM triple parsing out of ARMSubtarget. Another step towards making ARM subtarget info available to MC. llvm-svn: 134569	2011-07-07 00:08:19 +00:00
Evan Cheng	928ce72bcd	Add ARM MC registry routines. llvm-svn: 134547	2011-07-06 22:02:34 +00:00
Evan Cheng	3ddfbd325d	Rename files for consistency. llvm-svn: 134546	2011-07-06 22:01:53 +00:00
Jim Grosbach	7c301ea093	Mark ARM pseudo-instructions as isPseudo. This allows us to remove the (bogus and unneeded) encoding information from the pseudo-instruction class definitions. All of the pseudos that haven't been converted yet and still need encoding information instance from the normal instruction classes and explicitly set isCodeGenOnly, and so are distinct from this change. llvm-svn: 134540	2011-07-06 21:35:46 +00:00
Jim Grosbach	4db363af7c	Remove un-used encoding info from Pseudo MLAv5. Pseudo-instructions don't have encoding information, as they're lowered to real instructions by the time we're doing binary encoding. llvm-svn: 134533	2011-07-06 20:57:35 +00:00
Bill Wendling	5ace8edfd6	Constify getCompactUnwindRegNum. llvm-svn: 134527	2011-07-06 20:33:48 +00:00
Evan Cheng	ab37af9af3	createMCInstPrinter doesn't need TargetMachine anymore. llvm-svn: 134525	2011-07-06 19:45:42 +00:00
Kevin Enderby	6ee1d2bd78	Changed the X86 PUSH64i8 record to use the i64i8imm ParserMatchClass so that a push with a small constant produces a 2-byte push. llvm-svn: 134501	2011-07-06 17:23:46 +00:00
Evan Cheng	4d806e2830	Remove the AsmWriterEmitter (unused) feature that rely on TargetSubtargetInfo. llvm-svn: 134457	2011-07-06 02:02:33 +00:00
Eli Friedman	415412e82f	Add assembler/disassembler support for non-AVX pclmulqdq. While I'm here, use proper aliases for the pclmullqlqdq and friends. PR10269. llvm-svn: 134424	2011-07-05 18:21:20 +00:00
Jim Grosbach	ea53901dc9	ARM estimateStackSize() needs to account for simplified call frames. If the function allocates reserved stack space for callee argument frames, estimateStackSize() needs to account for that, as it doesn't show up as ordinary frame objects. Otherwise, a callee with a large argument list will throw off the calculations for whether to allocate an emergency spill slot and we get assert() failures in the register scavenger. rdar://9715469 llvm-svn: 134415	2011-07-05 16:05:50 +00:00
Roman Divacky	cc5e53383e	Remove accidentaly left node from previous iteration of the patch. Noticed by Benjamin Kramer! llvm-svn: 134376	2011-07-04 15:42:45 +00:00
Roman Divacky	075491f2cd	Make the i64 and f64 be 64bit ABI aligned in the target description. This is what both the ABI and clang says. llvm-svn: 134367	2011-07-03 16:24:07 +00:00
Duncan Sands	4bea037504	Remove unused array. llvm-svn: 134323	2011-07-02 16:36:24 +00:00
Jakob Stoklund Olesen	e925f22b40	Consistent diagnostic capitalization and redundant context elimination. llvm-svn: 134311	2011-07-02 07:23:40 +00:00
Jakob Stoklund Olesen	25a404eb81	Include a source location when complaining about bad inline assembly. Add a MI->emitError() method that the backend can use to report errors related to inline assembly. Call it from X86FloatingPoint.cpp when the constraints are wrong. This enables proper clang diagnostics from the backend: $ clang -c pr30848.c pr30848.c:5:12: error: Inline asm output regs must be last on the x87 stack __asm__ ("" : "=u" (d)); /* { dg-error "output regs" } */ ^ 1 error generated. llvm-svn: 134307	2011-07-02 03:53:34 +00:00
Eric Christopher	a8a56f7e5c	TargetConstant immediates won't be placed into registers so tighten up the valid constant check earlier. rdar://9692967 llvm-svn: 134286	2011-07-01 23:04:38 +00:00
Evan Cheng	c9c090d7a5	Rename XXXGenSubtarget.inc to XXXGenSubtargetInfo.inc for consistency. llvm-svn: 134281	2011-07-01 22:36:09 +00:00
Evan Cheng	0711c4d489	Add MCSubtargetInfo target registry stuff. llvm-svn: 134279	2011-07-01 22:25:04 +00:00
Eli Friedman	d24a7da658	Calling-convention specifications for illegal types are no-ops. Simplify based on this. llvm-svn: 134264	2011-07-01 21:33:28 +00:00
Jim Grosbach	cf1464d943	ARMv7M vs. ARMv7E-M support. The DSP instructions in the Thumb2 instruction set are an optional extension in the Cortex-M* archtitecture. When present, the implementation is considered an "ARMv7E-M implementation," and when not, an "ARMv7-M implementation." Add a subtarget feature hook for the v7e-m instructions and hook it up. The cortex-m3 cpu is an example of a v7m implementation, while the cortex-m4 is a v7e-m implementation. rdar://9572992 llvm-svn: 134261	2011-07-01 21:12:19 +00:00
Evan Cheng	0d639a28aa	Rename TargetSubtarget to TargetSubtargetInfo for consistency. llvm-svn: 134259	2011-07-01 21:01:15 +00:00
Evan Cheng	54b68e3432	- Added MCSubtargetInfo to capture subtarget features and scheduling itineraries. - Refactor TargetSubtarget to be based on MCSubtargetInfo. - Change tablegen generated subtarget info to initialize MCSubtargetInfo and hide more details from targets. llvm-svn: 134257	2011-07-01 20:45:01 +00:00
Jim Grosbach	68b0e8456e	Fix off-by-one error. (low two bits always zero, so off by one bit of encoded value). llvm-svn: 134247	2011-07-01 19:07:09 +00:00
Evan Cheng	703a0fbf39	Hide the call to InitMCInstrInfo into tblgen generated ctor. llvm-svn: 134244	2011-07-01 17:57:27 +00:00
Jim Grosbach	4def704a21	Pseudo-ize t2MOVCC[ri]. t2MOVCC[ri] are just t2MOV[ri] instructions, so properly pseudo-ize them. The Thumb1 versions, tMOVCC[ri] were only present for use by the size- reduction pass, so they're no longer necessary at all and can be deleted. llvm-svn: 134242	2011-07-01 17:14:11 +00:00
Akira Hatanaka	f2bcad972d	Improve Mips back-end's handling of DBG_VALUE. llvm-svn: 134224	2011-07-01 01:04:43 +00:00
Eric Christopher	29f1db85dd	Add support for the 'j' immediate constraint. This is conditionalized on supporting the instruction that the constraint is for 'movw'. Part of rdar://9119939 llvm-svn: 134222	2011-07-01 01:00:07 +00:00
Eric Christopher	c011d31543	Add support for the ARM 't' register constraint. And another testcase for the 'x' register constraint. Part of rdar://9119939 llvm-svn: 134220	2011-07-01 00:30:46 +00:00
Eric Christopher	f09b0f1043	We'll return a null RC by default if we can't match. Part of rdar://9119939 llvm-svn: 134217	2011-07-01 00:19:27 +00:00
Eric Christopher	f1c74595aa	Add support for the 'x' constraint. Part of rdar://9307836 and rdar://9119939 llvm-svn: 134215	2011-07-01 00:14:47 +00:00
Eric Christopher	1f054f27af	Capitalize the unsigned part of the initializer. llvm-svn: 134211	2011-06-30 23:59:16 +00:00
Eric Christopher	cf2007ca78	Rename Pair to RCPair lacking any better naming ideas. llvm-svn: 134210	2011-06-30 23:50:52 +00:00
Bill Wendling	3f049b8b7e	Use the correct registers on X86_64. llvm-svn: 134208	2011-06-30 23:47:14 +00:00
Jakob Stoklund Olesen	d0e2352b65	Fix a problem with fast-isel return values introduced in r134018. We would put the return value from long double functions in the wrong register. This fixes gcc.c-torture/execute/conversion.c llvm-svn: 134205	2011-06-30 23:42:18 +00:00
Jim Grosbach	e9cc901814	Refact ARM Thumb1 tMOVr instruction family. Merge the tMOVr, tMOVgpr2tgpr, tMOVtgpr2gpr, and tMOVgpr2gpr instructions into tMOVr. There's no need to keep them separate. Giving the tMOVr instruction the proper GPR register class for its operands is sufficient to give the register allocator enough information to do the right thing directly. llvm-svn: 134204	2011-06-30 23:38:17 +00:00
Eric Christopher	f45daac30f	Add support for the 'h' constraint. Part of rdar://9119939 llvm-svn: 134203	2011-06-30 23:23:01 +00:00
Bill Wendling	b403f0c4ed	Add target a target hook to get the register number used by the compact unwind encoding for the registers it knows about. Return -1 if it can't handle that register. llvm-svn: 134202	2011-06-30 23:20:32 +00:00
Eric Christopher	c486b47b15	Add a convenience typedef for std::pair<unsigned, const TargetRegisterClass*>. No functional change. Part of rdar://9119939 llvm-svn: 134198	2011-06-30 22:17:01 +00:00
Jim Grosbach	b98ab91e39	Thumb1 register to register MOV instruction is predicable. Fix a FIXME and allow predication (in Thumb2) for the T1 register to register MOV instructions. This allows some better codegen with if-conversion (as seen in the test updates), plus it lays the groundwork for pseudo-izing the tMOVCC instructions. llvm-svn: 134197	2011-06-30 22:10:46 +00:00
Jakob Stoklund Olesen	2034261972	Tweak error messages to match GCC. Should fix gcc.target/i386/pr30848.c llvm-svn: 134193	2011-06-30 21:30:30 +00:00
Jim Grosbach	e4750ef6ec	Pseudo-ize the Thumb tTPsoft instruction. It's just a call to a special helper function. Get rid of the T2 variant entirely, as it's identical to the Thumb1 version. llvm-svn: 134178	2011-06-30 19:38:01 +00:00
Jim Grosbach	353da73186	Pseudo-ize the t2LDMIA_RET instruction. It's just a t2LDMIA_UPD instruction with extra codegen properties, so it doesn't need the encoding information. As a side-benefit, we now correctly recognize for instruction printing as a 'pop' instruction. llvm-svn: 134173	2011-06-30 18:25:42 +00:00
Jim Grosbach	417671a7b1	Pseudo-ize the Thumb tPOP_RET instruction. It's just a tPOP instruction with additional code-gen properties, so it doesn't need encoding information. llvm-svn: 134172	2011-06-30 17:34:04 +00:00
Jim Grosbach	cfe3b14d77	Kill dead code. llvm-svn: 134131	2011-06-30 02:23:05 +00:00
Jim Grosbach	ed5134a921	Size reducing SP adjusting t2ADDri needs to check predication. tADDrSPi is not predicable, so we can't size-reduce a t2ADDri to it if the predicate is anything other than "always." llvm-svn: 134130	2011-06-30 02:22:49 +00:00
Evan Cheng	0b33a323ac	Fix ARMSubtarget feature parsing. llvm-svn: 134129	2011-06-30 02:12:44 +00:00
Evan Cheng	fe6e405e8c	Fix the ridiculous SubtargetFeatures API where it implicitly expects CPU name to be the first encoded as the first feature. It then uses the CPU name to look up features / scheduling itineray even though clients know full well the CPU name being used to query these properties. The fix is to just have the clients explictly pass the CPU name! llvm-svn: 134127	2011-06-30 01:53:36 +00:00
Joerg Sonnenberger	91e5662075	Recognize the xstorerng alias for VIA PadLock's xstore instruction. llvm-svn: 134126	2011-06-30 01:38:03 +00:00
Eric Christopher	16cde8ad36	Make sure we use the correct register class here since we'll need to care about spill values. llvm-svn: 134122	2011-06-30 01:05:46 +00:00
Eric Christopher	c932173773	Fix a small thinko for constant i64 lock/orq optimization where we we didn't have an opcode for 64-bit constant or expressions. Fixes rdar://9692967 llvm-svn: 134121	2011-06-30 00:48:30 +00:00
Jim Grosbach	a8a8067dec	Remove redundant Thumb2 ADD/SUB SP instruction definitions. Unlike Thumb1, Thumb2 does not have dedicated encodings for adjusting the stack pointer. It can just use the normal add-register-immediate encoding since it can use all registers as a source, not just R0-R7. The extra instruction definitions are just duplicates of the normal instructions with the (not well enforced) constraint that the source register was SP. llvm-svn: 134114	2011-06-29 23:25:04 +00:00
Jakob Stoklund Olesen	9f4cc4645b	Always adjust the stack pointer immediately after the call. Some x86-32 calls pop values off the stack, and we need to readjust the stack pointer after the call. This happens when ADJCALLSTACKUP is eliminated. It could happen that spill code was inserted between the CALL and ADJCALLSTACKUP instructions, and we would compute wrong stack pointer offsets for those frame index references. Fix this by inserting the stack pointer adjustment immediately after the call instead of where the ADJCALLSTACKUP instruction was erased. I don't have a test case since we don't currently insert code in that position. We will soon, though. I am testing a regalloc patch that didn't work on Linux because of this. llvm-svn: 134113	2011-06-29 23:11:39 +00:00
Cameron Zwarich	34c8f51d65	In the ARM global merging pass, allow extraneous alignment specifiers. This pass already makes the assumption, which is correct on ARM, that a type's alignment is less than its alloc size. This improves codegen with Clang (which inserts a lot of extraneous alignment specifiers) and fixes <rdar://problem/9695089>. llvm-svn: 134106	2011-06-29 22:24:25 +00:00
Eric Christopher	1b8b9419ba	Remove getRegClassForInlineAsmConstraint from the ARM port. Part of rdar://9643582 llvm-svn: 134095	2011-06-29 21:10:36 +00:00
Eric Christopher	03e756b93b	Remove todo. llvm-svn: 134094	2011-06-29 21:05:54 +00:00
Jim Grosbach	d86f34d631	Refactor away tSpill and tRestore pseudos in ARM backend. The tSpill and tRestore instructions are just copies of the tSTRspi and tLDRspi instructions, respectively. Just use those directly instead. llvm-svn: 134092	2011-06-29 20:26:39 +00:00
Eric Christopher	e449141788	Add a TODO for the Alpha port inline asm constraints. llvm-svn: 134089	2011-06-29 19:41:27 +00:00
Eric Christopher	372a5c2a98	Move Alpha from getRegClassForInlineAsmConstraint to getRegForInlineAsmConstraint. Part of rdar://9643582 llvm-svn: 134088	2011-06-29 19:40:01 +00:00
Eric Christopher	eaf77dc2bd	Update comment for getRegForInlineAsmConstraint for Mips. llvm-svn: 134087	2011-06-29 19:33:04 +00:00
Eric Christopher	dabd8a7bef	Move the Blackfin port away from getRegClassForInlineAsmConstraint by creating a few specific register classes. Part of rdar://9643582 llvm-svn: 134086	2011-06-29 19:30:29 +00:00
Eric Christopher	d0e48c84f0	Remove getRegClassForInlineAsmConstraint from MBlaze. Add a TODO comment for the port. Part of rdar://9643582 llvm-svn: 134085	2011-06-29 19:12:24 +00:00
Eric Christopher	9519c08a43	Remove getRegClassForInlineAsmConstraint for Mips. Part of rdar://9643582 llvm-svn: 134084	2011-06-29 19:04:31 +00:00
Eric Christopher	ff740621f1	Remove getRegClassForInlineAsmConstraint from sparc. Part of rdar://9643582 llvm-svn: 134083	2011-06-29 18:53:10 +00:00
Eric Christopher	790d882caa	Move XCore from getRegClassForInlineAsmConstraint to getRegForInlineAsmConstraint. Part of rdar://9643582 llvm-svn: 134080	2011-06-29 17:53:29 +00:00
Eric Christopher	7e5f2350d3	Use getRegForInlineAsmConstraint instead of custom defining regclasses via vectors. Part of rdar://9643582 llvm-svn: 134079	2011-06-29 17:23:50 +00:00
NAKAMURA Takumi	7e26682c71	Fix CMake build. llvm-svn: 134055	2011-06-29 03:26:17 +00:00
Evan Cheng	8264e272a9	Sink SubtargetFeature and TargetInstrItineraries (renamed MCInstrItineraries) into MC. llvm-svn: 134049	2011-06-29 01:14:12 +00:00
Evan Cheng	194c3dc01f	Move CallFrameSetupOpcode and CallFrameDestroyOpcode to TargetInstrInfo. llvm-svn: 134030	2011-06-28 21:14:33 +00:00
Evan Cheng	0beca53a29	Hide more details in tablegen generated MCRegisterInfo ctor function. llvm-svn: 134027	2011-06-28 20:44:22 +00:00
Evan Cheng	df8974ef2f	Add MCInstrInfo registeration machinery. llvm-svn: 134026	2011-06-28 20:29:03 +00:00
Evan Cheng	1e210d08d8	Merge XXXGenRegisterNames.inc into XXXGenRegisterInfo.inc llvm-svn: 134024	2011-06-28 20:07:07 +00:00
Evan Cheng	6cc775f905	- Rename TargetInstrDesc, TargetOperandInfo to MCInstrDesc and MCOperandInfo and sink them into MC layer. - Added MCInstrInfo, which captures the tablegen generated static data. Chang TargetInstrInfo so it's based off MCInstrInfo. llvm-svn: 134021	2011-06-28 19:10:37 +00:00
Jakob Stoklund Olesen	7297e7e223	Clean up the handling of the x87 fp stack to make it more robust. Drop the FpMov instructions, use plain COPY instead. Drop the FpSET/GET instruction for accessing fixed stack positions. Instead use normal COPY to/from ST registers around inline assembly, and provide a single new FpPOP_RETVAL instruction that can access the return value(s) from a call. This is still necessary since you cannot tell from the CALL instruction alone if it returns anything on the FP stack. Teach fast isel to use this. This provides a much more robust way of handling fixed stack registers - we can tolerate arbitrary FP stack instructions inserted around calls and inline assembly. Live range splitting could sometimes break x87 code by inserting spill code in unfortunate places. As a bonus we handle floating point inline assembly correctly now. llvm-svn: 134018	2011-06-28 18:32:28 +00:00
Chad Rosier	6b610b387d	Remove warning: 'c0' may be used uninitialized in this function. llvm-svn: 134014	2011-06-28 17:26:57 +00:00
Roman Divacky	4394e68c24	Implement ISD::VAARG lowering on PPC32. llvm-svn: 134005	2011-06-28 15:30:42 +00:00
Rafael Espindola	09932bd5b6	Fix cmake build. llvm-svn: 133989	2011-06-28 03:17:03 +00:00
Jim Grosbach	16896325a6	ARM Thumb2 asm syntax optional destination operand for binary operators. When the destination operand is the same as the first source register operand for arithmetic instructions, the destination operand may be omitted. For example, the following two instructions are equivalent: and r1, #ff and r1, r1, #ff rdar://9672867 llvm-svn: 133973	2011-06-28 00:19:13 +00:00
Jim Grosbach	a6f7a1efcc	ARM Assembly support for Thumb mov-immediate. Correctly parse the forms of the Thumb mov-immediate instruction: 1. 8-bit immediate 0-255. 2. 12-bit shifted-immediate. The 16-bit immediate "movw" form is also legal with just a "mov" mnemonic, but is not yet supported. More parser logic necessary there due to fixups. llvm-svn: 133966	2011-06-27 23:54:06 +00:00
Jim Grosbach	1a13cd77f1	ARM Asm parsing of Thumb2 move immediate. Thumb2 MOV mnemonic can accept both cc_out and predication. We don't (yet) encode the instruction properly, but this gets the parsing part. llvm-svn: 133945	2011-06-27 21:38:03 +00:00
Evan Cheng	8d71a75777	More refactoring. Move getRegClass from TargetOperandInfo to TargetInstrInfo. llvm-svn: 133944	2011-06-27 21:26:13 +00:00
Jim Grosbach	0ceb5473a3	Add exception necessitated by 133938. llvm-svn: 133939	2011-06-27 20:59:10 +00:00
Jim Grosbach	07d9027088	ARM assembly carry set/clear condition code aliases for 'hi/lo' llvm-svn: 133938	2011-06-27 20:40:29 +00:00
Jim Grosbach	89bbfc434a	ARM assembler support for ldmfd/stmfd mnemonics. llvm-svn: 133936	2011-06-27 20:32:18 +00:00
Jim Grosbach	29882a75eb	ARM assembler support for vpush/vpop. Add aliases for the vpush/vpop mnemonics to the VFP load/store multiple writeback instructions w/ SP as the base pointer. rdar://9683231 llvm-svn: 133932	2011-06-27 20:00:07 +00:00
Jim Grosbach	b5ee311602	ARM Assembly syntax support for arithmetic implied dest operand. When the destination operand is the same as the first source register operand for arithmetic instructions, the destination operand may be omitted. For example, the following two instructions are equivalent: sub r2, r2, #6 sub r2, #6 rdar://9682597 llvm-svn: 133925	2011-06-27 19:09:15 +00:00
Evan Cheng	d9997acd14	Merge XXXGenRegisterDesc.inc XXXGenRegisterNames.inc XXXGenRegisterInfo.h.inc into XXXGenRegisterInfo.inc. llvm-svn: 133922	2011-06-27 18:32:37 +00:00
Jakob Stoklund Olesen	ff653a2eed	Grow the X86FloatingPoint register map to hold 16 registers. This allows for more live scratch registers which is needed to handle live ST registers before return and inline asm instructions. llvm-svn: 133903	2011-06-27 04:08:36 +00:00
Chad Rosier	15db390f8f	Replace dyn_cast<> with cast<> since the cast is already guarded by the necessary check. llvm-svn: 133874	2011-06-25 18:51:28 +00:00
Dan Bailey	b7ee561399	PTX: Reverting implementation of i8. The .b8 operations in PTX are far more limiting than I first thought. The mov operation isn't even supported, so there's no way of converting a .pred value into a .b8 without going via .b16, which is not sensible. An improved implementation needs to use the fact that loads and stores automatically extend and truncate to implement support for EXTLOAD and TRUNCSTORE in order to correctly support boolean values. llvm-svn: 133873	2011-06-25 18:16:28 +00:00
Chad Rosier	bde13d3f76	Enable tail call optimization in the presence of a byval (x86-32 and x86-64). <rdar://problem/9483883> llvm-svn: 133858	2011-06-25 02:04:56 +00:00
Douglas Gregor	03bf47c0f0	Unbreak CMake build llvm-svn: 133853	2011-06-25 00:51:50 +00:00
Evan Cheng	b2681bef4f	Add include guard. llvm-svn: 133847	2011-06-24 23:59:54 +00:00
Evan Cheng	3b960aca17	Rename TargetDesc to MCTargetDesc llvm-svn: 133846	2011-06-24 23:53:19 +00:00
Jim Grosbach	28fcafb502	Refactor MachO relocation generaration into the Target directories. Move the target-specific RecordRelocation logic out of the generic MC MachObjectWriter and into the target-specific object writers. This allows nuking quite a bit of target knowledge from the supposedly target-independent bits in lib/MC. llvm-svn: 133844	2011-06-24 23:44:37 +00:00
Rafael Espindola	38c3c7f386	Fix cmake build. llvm-svn: 133830	2011-06-24 22:01:28 +00:00
Chad Rosier	e553e75b15	Hoist simple check above more complex checking to avoid unnecessary overheads. No functional change intended. llvm-svn: 133824	2011-06-24 21:15:36 +00:00
Evan Cheng	e862d59eee	- Add MCRegisterInfo registration machinery. Also added x86 registration routines. - Rename TargetRegisterDesc to MCRegisterDesc. llvm-svn: 133820	2011-06-24 20:42:09 +00:00
Jim Grosbach	6629b574b3	ARM movw/movt fixups need to mask the high bits. The fixup value comes in as the whole 32-bit value, so for the lo16 fixup, the upper bits need to be masked off. Previously we assumed the masking had already been done and asserted. rdar://9635991 llvm-svn: 133818	2011-06-24 20:06:59 +00:00
Dan Bailey	dd01c4ac7a	PTX: Add support for i8 type and introduce associated .b8 registers The i8 type is required for boolean values, but can only use ld, st and mov instructions. The i1 type continues to be used for predicates. llvm-svn: 133814	2011-06-24 19:27:10 +00:00
Chad Rosier	fa8d89327f	The Neon VCVT (between floating-point and fixed-point, Advanced SIMD) instructions can be used to match combinations of multiply/divide and VCVT (between floating-point and integer, Advanced SIMD). Basically the VCVT immediate operand that specifies the number of fraction bits corresponds to a floating-point multiply or divide by the corresponding power of 2. For example, VCVT (floating-point to fixed-point, Advanced SIMD) can replace a combination of VMUL and VCVT (floating-point to integer) as follows: Example (assume d17 = <float 8.000000e+00, float 8.000000e+00>): vmul.f32 d16, d17, d16 vcvt.s32.f32 d16, d16 becomes: vcvt.s32.f32 d16, d16, #3 Similarly, VCVT (fixed-point to floating-point, Advanced SIMD) can replace a combinations of VCVT (integer to floating-point) and VDIV as follows: Example (assume d17 = <float 8.000000e+00, float 8.000000e+00>): vcvt.f32.s32 d16, d16 vdiv.f32 d16, d17, d16 becomes: vcvt.f32.s32 d16, d16, #3 llvm-svn: 133813	2011-06-24 19:23:04 +00:00
Justin Holewinski	1be944a466	PTX: Add preliminary support for outputting debug information in the form of .file and .loc directives. Ideally, we would utilize the existing support in AsmPrinter for this, but I cannot find a way to get .file and .loc directives to print without the rest of the associated DWARF sections, which ptxas cannot handle. llvm-svn: 133812	2011-06-24 19:19:18 +00:00
Akira Hatanaka	35792089e7	Change the chain input of nodes that load the address of a function. This change enables SelectionDAG::getLoad at MipsISelLowering.cpp:1914 to return a pre-existing node instead of redundantly create a new node every time it is called. llvm-svn: 133811	2011-06-24 19:01:25 +00:00
Akira Hatanaka	ca88b4abec	Prevent generation of redundant addiu instructions that compute address of static variables or functions. llvm-svn: 133803	2011-06-24 17:55:19 +00:00
Justin Holewinski	7d65756aba	PTX: Re-work target sm/compute selection and add some basic GPU targets: g80, gt200, gf100(fermi) llvm-svn: 133799	2011-06-24 16:27:49 +00:00
Rafael Espindola	cff9c5e9a0	Fix CellSPU CMakeList.txt. llvm-svn: 133792	2011-06-24 13:58:45 +00:00
Evan Cheng	8e731166d0	Fix CellSPU CMakeLists.txt llvm-svn: 133787	2011-06-24 05:04:48 +00:00
Evan Cheng	247533179a	Starting to refactor Target to separate out code that's needed to fully describe target machine from those that are only needed by codegen. The goal is to sink the essential target description into MC layer so we can start building MC based tools without needing to link in the entire codegen. First step is to refactor TargetRegisterInfo. This patch added a base class MCRegisterInfo which TargetRegisterInfo is derived from. Changed TableGen to separate register description from the rest of the stuff. llvm-svn: 133782	2011-06-24 01:44:41 +00:00
Eli Friedman	5c958bb528	Add support for movntil/movntiq mnemonics. Reported on llvmdev. llvm-svn: 133759	2011-06-23 21:07:47 +00:00
Evan Cheng	8b2a2a1158	Rename TargetOptions::StackAlignment to StackAlignmentOverride. llvm-svn: 133739	2011-06-23 18:15:47 +00:00
Evan Cheng	1b049f5851	Remove TargetOptions.h dependency from ARMSubtarget. llvm-svn: 133738	2011-06-23 18:15:17 +00:00
Justin Holewinski	6cdd72a9ca	PTX: Always use registers for return values, but use .param space for device parameters if SM >= 2.0 - Update test cases to be more robust against register allocation changes - Bump up the number of registers to 128 per type - Include Python script to re-generate register file with any number of registers llvm-svn: 133736	2011-06-23 18:10:13 +00:00
Justin Holewinski	90b336ba0c	PTX: Whitespace fixes and remove commented out code llvm-svn: 133734	2011-06-23 18:10:07 +00:00
Justin Holewinski	9b459d9d7b	PTX: Prevent DCE from eliminating st.param calls, and unify the handling of st.param and ld.param FIXME: Test cases still need to be updated llvm-svn: 133733	2011-06-23 18:10:05 +00:00
Justin Holewinski	9472541ba1	PTX: Use .param space for parameters in device functions for SM >= 2.0 FIXME: DCE is eliminating the final st.param.x calls, figure out why llvm-svn: 133732	2011-06-23 18:10:03 +00:00
Evan Cheng	3a0c5e52ff	Remove TargetOptions.h dependency from X86Subtarget. llvm-svn: 133726	2011-06-23 17:54:54 +00:00
Dylan Noblesmith	f490726633	CppBackend: fixup for api change This broke after r133364. llvm-svn: 133709	2011-06-23 12:11:37 +00:00
Jay Foad	61ea0e4692	Reinstate r133513 (reverted in r133700) with an additional fix for a -Wshorten-64-to-32 warning in Instructions.h. llvm-svn: 133708	2011-06-23 09:09:15 +00:00
Eric Christopher	96513120b7	Revert r133513: "Reinstate r133435 and r133449 (reverted in r133499) now that the clang self-hosted build failure has been fixed (r133512)." Due to some additional warnings. llvm-svn: 133700	2011-06-23 06:24:52 +00:00
Bill Wendling	9af2fa9d1b	Use the presence of the __compact_unwind section to indicate that a target supports compact unwind info instead of having a separate flag indicating this. llvm-svn: 133685	2011-06-23 05:13:28 +00:00
Evan Cheng	ee9b90a727	Get rid of one getStackAlignment(). RegisterInfo shouldn't need to know about stack alignment. llvm-svn: 133679	2011-06-23 01:53:43 +00:00
Bill Wendling	f942585dae	Add a flag that indicates whether a target supports compact unwind info or not. llvm-svn: 133662	2011-06-22 23:16:51 +00:00
Bill Wendling	d346304373	Add a __LD,__compact_unwind section. If the linker supports it, this will hold the CIE and FDE information in a compact format. The implementation of the compact unwinding emission is coming soon. llvm-svn: 133658	2011-06-22 22:22:24 +00:00
Jim Grosbach	51897047da	Add missing header. llvm-svn: 133640	2011-06-22 20:40:30 +00:00
Jim Grosbach	2354f87a9d	Move ARMMachObjectWriter to its own file. Just tidy up a bit. No functional change. llvm-svn: 133638	2011-06-22 20:14:52 +00:00
Justin Holewinski	08d0f3550a	PTX: Fix FrameIndex mapping bug llvm-svn: 133619	2011-06-22 16:07:03 +00:00
Dan Bailey	55ec2a8929	Test Commit. llvm-svn: 133613	2011-06-22 09:04:30 +00:00
Justin Holewinski	6fafebfb6a	PTX: Add signed integer comparisons llvm-svn: 133599	2011-06-22 02:09:50 +00:00
Justin Holewinski	54e3c0f5d9	PTX: Add .address_size directive if PTX version >= 2.3 Patch by Wei-Ren Chen llvm-svn: 133589	2011-06-22 00:43:56 +00:00
Nick Lewycky	ef9c497e4c	Add support for assembling "movq" when it's correct to do so, while continuing to emit "movd" across the board to continue supporting a Darwin assembler bug. This is the reincarnation of r133452. llvm-svn: 133565	2011-06-21 22:45:41 +00:00
Eric Christopher	e256cd0565	Handle the memory-ness of all U+ ARM constraints. Noticed on inspection. llvm-svn: 133553	2011-06-21 22:10:57 +00:00
Evan Cheng	8d971ad5b7	Reorg. No functionality change. llvm-svn: 133533	2011-06-21 19:00:54 +00:00
Bob Wilson	646dd0f4d1	Revert r133452: "Emit movq for 64-bit register to XMM register moves..." This is breaking compiler-rt and llvm-gcc builds on MacOSX when not using the integrated assembler. llvm-svn: 133524	2011-06-21 17:35:13 +00:00
Anna Zaks	083f0b5a7e	Add support for sadd.with.overflow and uadd.with.overflow intrinsics to the CBackend by emitting definitions for each intrinsic that occurs in the module. llvm-svn: 133522	2011-06-21 17:18:15 +00:00
Jay Foad	a97a2c998e	Reinstate r133435 and r133449 (reverted in r133499) now that the clang self-hosted build failure has been fixed (r133512). llvm-svn: 133513	2011-06-21 10:33:19 +00:00
Evan Cheng	4c0bd9629d	Teach dag combine to match halfword byteswap patterns. 1. (((x) & 0xFF00) >> 8) \| (((x) & 0x00FF) << 8) => (bswap x) >> 16 2. ((x&0xff)<<8)\|((x&0xff00)>>8)\|((x&0xff000000)>>8)\|((x&0x00ff0000)<<8)) => (rotl (bswap x) 16) This allows us to eliminate most of the def : Pat patterns for ARM rev16 revsh instructions. It catches many more cases for ARM and x86. rdar://9609108 llvm-svn: 133503	2011-06-21 06:01:08 +00:00
Chad Rosier	184f3b37e2	Revert r133435 and r133449 to appease buildbots. llvm-svn: 133499	2011-06-21 02:09:03 +00:00
Akira Hatanaka	27029885f0	Add A0 and A1 to the list of registers used for returning a value in order to handle functions with return type Complex long long. llvm-svn: 133497	2011-06-21 01:28:11 +00:00
Akira Hatanaka	5b350be79d	Coding style fixes. llvm-svn: 133496	2011-06-21 01:02:03 +00:00
Akira Hatanaka	4c406e7457	Re-apply 132758 and 132768 which were speculatively reverted in 132777. llvm-svn: 133494	2011-06-21 00:40:49 +00:00
Justin Holewinski	cd4484d25d	PTX: Fix conversion between predicates and value types llvm-svn: 133454	2011-06-20 18:42:48 +00:00
Nick Lewycky	c7df192279	Emit movq for 64-bit register to XMM register moves, but continue to accept movd when assembling. llvm-svn: 133452	2011-06-20 18:33:26 +00:00
Justin Holewinski	e8ae1db4d8	PTX: Fix if-then-else formatting and add missing asserts llvm-svn: 133447	2011-06-20 17:08:56 +00:00
Justin Holewinski	c7b073da0f	PTX: Add basic register spilling code The current implementation generates stack loads/stores, which are really just mov instructions from/to "special" registers. This may not be the most efficient implementation, compared to an approach where the stack registers are directly folded into instructions, but this is easier to implement and I have yet to see a case where ptxas is unable to see through this kind of register usage and know what is really going on. llvm-svn: 133443	2011-06-20 15:56:20 +00:00
Roman Divacky	254f82112d	Don't apply on PPC64 the 32bit ADDIC optimizations as there's no overflow with 32bit values. llvm-svn: 133439	2011-06-20 15:28:39 +00:00
Jay Foad	e03c05c35a	Change how PHINodes store their operands. Change PHINodes to store simple pointers to their incoming basic blocks, instead of full-blown Uses. Note that this loses an optimization in SplitCriticalEdge(), because we can no longer walk the use list of a BasicBlock to find phi nodes. See the comment I removed starting "However, the foreach loop is slow for blocks with lots of predecessors". Extend replaceAllUsesWith() on a BasicBlock to also update any phi nodes in the block's successors. This mimics what would have happened when PHINodes were proper Users of their incoming blocks. (Note that this only works if OldBB->replaceAllUsesWith(NewBB) is called when OldBB still has a terminator instruction, so it still has some successors.) llvm-svn: 133435	2011-06-20 14:38:01 +00:00
Jay Foad	372ad64b4d	Make better use of the PHINode API. Change various bits of code to make better use of the existing PHINode API, to insulate them from forthcoming changes in how PHINodes store their operands. llvm-svn: 133434	2011-06-20 14:18:48 +00:00
Jay Foad	6002068c13	Fix a FIXME by making GlobalVariable::getInitializer() return a const Constant *. llvm-svn: 133400	2011-06-19 18:37:11 +00:00
Benjamin Kramer	25e17b0f89	Remove unused but set variables. llvm-svn: 133347	2011-06-18 11:09:41 +00:00
Jakob Stoklund Olesen	b68fee1e82	Delete unneeded allocation order override. llvm-svn: 133331	2011-06-18 02:30:02 +00:00
Jakob Stoklund Olesen	831ae0105a	Switch ARM to using AltOrders instead of MethodBodies. This slightly changes the GPR allocation order on Darwin where R9 is not a callee-saved register: Before: %R0 %R1 %R2 %R3 %R12 %R9 %LR %R4 %R5 %R6 %R8 %R10 %R11 After: %R0 %R1 %R2 %R3 %R9 %R12 %LR %R4 %R5 %R6 %R8 %R10 %R11 llvm-svn: 133326	2011-06-18 01:14:46 +00:00
Jakob Stoklund Olesen	3337f7d50a	Switch x86 to using AltOrders instead of MethodBodies. llvm-svn: 133325	2011-06-18 01:14:43 +00:00
Jakob Stoklund Olesen	19d968e62f	Reserve D16-D13 on subtargets that don't support them. llvm-svn: 133321	2011-06-18 00:53:27 +00:00
Jakob Stoklund Olesen	d3fec5edc1	Zap the last reference to allocation_order_begin(). llvm-svn: 133310	2011-06-17 23:17:13 +00:00
Jakob Stoklund Olesen	157e6a79a1	SI, DI, BP, and SP don't have 8-bit sub-registers in x86 mode. llvm-svn: 133308	2011-06-17 23:15:00 +00:00
Bill Wendling	b74b9de151	Use the verbose asm flag instead of a new flag for decoding the LSDA. llvm-svn: 133292	2011-06-17 20:55:01 +00:00
Evan Cheng	7552a62af5	Add an alternative rev16 pattern. We should figure out a better way to handle these complex rev patterns. rdar://9609108 llvm-svn: 133289	2011-06-17 20:47:21 +00:00
Bill Wendling	e303114b3c	Add an option that allows one to "decode" the LSDA. The LSDA is a bit difficult for the non-initiated to read. Even with comments, it's not always clear what's going on. This wraps the ASM streamer in a class that retains the LSDA and then emits a human-readable description of what's going on in it. So instead of having to make sense of: Lexception1: .byte 255 .byte 155 .byte 168 .space 1 .byte 3 .byte 26 Lset0 = Ltmp7-Leh_func_begin1 .long Lset0 Lset1 = Ltmp812-Ltmp7 .long Lset1 Lset2 = Ltmp913-Leh_func_begin1 .long Lset2 .byte 3 Lset3 = Ltmp812-Leh_func_begin1 .long Lset3 Lset4 = Leh_func_end1-Ltmp812 .long Lset4 .long 0 .byte 0 .byte 1 .byte 0 .byte 2 .byte 125 .long __ZTIi@GOTPCREL+4 .long __ZTIPKc@GOTPCREL+4 you can read this instead: ## Exception Handling Table: Lexception1 ## @LPStart Encoding: omit ## @TType Encoding: indirect pcrel sdata4 ## @TType Base: 40 bytes ## @CallSite Encoding: udata4 ## @Action Table Size: 26 bytes ## Action 1: ## A throw between Ltmp7 and Ltmp812 jumps to Ltmp913 on an exception. ## For type(s): __ZTIi@GOTPCREL+4 __ZTIPKc@GOTPCREL+4 ## Action 2: ## A throw between Ltmp812 and Leh_func_end1 does not have a landing pad. llvm-svn: 133286	2011-06-17 20:35:21 +00:00
Roman Divacky	d041962c20	Fix a few places where 32bit instructions/registerset were used on PPC64. llvm-svn: 133260	2011-06-17 15:21:10 +00:00
Justin Holewinski	3604d9a421	PTX: Adjust rounding modes * rounding modes for fp add, mul, sub now use .rn * float -> int rounding correctly uses .rzi not .rni * 32bit fdiv for sm13 uses div.rn (instead of div.approx) * 32bit fdiv for sm10 now uses div (instead of div.approx) Approx is not IEEE 754 compatible (and should be optionally set by a flag to the backend instead). The .rn rounding modifier is the PTX default anyway, but it's better to be explicit. All these modifiers should be available by using __fmul_rz functions for example, but support will need to be added for this in the backend. Patch by Dan Bailey llvm-svn: 133253	2011-06-17 12:12:42 +00:00
Jakob Stoklund Olesen	3982029f60	Allocate SystemZ callee-saved registers backwards: R13-R6 The reserved R14-R15 are always saved in the prolog, and using CSRs starting from R13 allows them to be saved in one instruction. Thanks to Anton for explaining this. llvm-svn: 133233	2011-06-17 03:47:30 +00:00
Cameron Zwarich	033026ffc0	Update an insertion point iterator after replacing a return instruction with a tail call pseudoinstruction. This fixes <rdar://problem/9624333>. llvm-svn: 133227	2011-06-17 02:16:43 +00:00
Jakob Stoklund Olesen	66773c3398	Explicitly invoke ArrayRef constructor to keep gcc happy. Patch by Richard Smith! llvm-svn: 133220	2011-06-17 00:18:25 +00:00
Jakob Stoklund Olesen	801f7ab321	Rename TRI::getAllocationOrder() to getRawAllocationOrder(). Also switch the return type to ArrayRef<unsigned> which works out nicely for ARM's implementation of this function because of the clever ArrayRef constructors. The name change indicates that the returned allocation order may contain reserved registers as has been the case for a while. llvm-svn: 133216	2011-06-16 23:31:16 +00:00
Owen Anderson	5fc8b77f83	Change the REG_SEQUENCE SDNode to take an explict register class ID as its first operand. This operand is lowered away by the time we reach MachineInstrs, so the actual register-allocation handling of them doesn't need to change. This is intended to support using REG_SEQUENCE SDNode's with type MVT::untyped, and is part of the long road to eliminating some of the hacks we currently use to support register pairs and other strange constraints, particularly on ARM NEON. llvm-svn: 133178	2011-06-16 18:17:13 +00:00
Bruno Cardoso Lopes	d66ab9ead1	Mark ldrexd/strexd w/ volatile memory by default llvm-svn: 133175	2011-06-16 18:11:32 +00:00
Justin Holewinski	7f191b2a3b	PTX: Finish new calling convention implementation llvm-svn: 133172	2011-06-16 17:50:00 +00:00
Justin Holewinski	6b356c1f3f	PTX: Rename register classes for readability and combine int and fp registers llvm-svn: 133171	2011-06-16 17:49:58 +00:00
Dan Gohman	8eb36ef497	Add a comment describing why transforming (shl x, 1) to (add x, x) is to be considered safe enough in this context. llvm-svn: 133159	2011-06-16 15:55:48 +00:00
Justin Holewinski	5ccf812b1d	PTX: Fix whitespace errors llvm-svn: 133158	2011-06-16 15:17:11 +00:00
Bruno Cardoso Lopes	bbf2ab990f	Add AVX suport for fpextend. Original patch by Syoyo Fujita with more comments by me. llvm-svn: 133153	2011-06-16 07:03:21 +00:00
Chad Rosier	2730162bee	Revision r128665 added an optimization to make use of NEON multiplier accumulator forwarding. Specifically (from SVN log entry): Distribute (A + B) * C to (A * C) + (B * C) to make use of NEON multiplier accumulator forwarding: vadd d3, d0, d1 vmul d3, d3, d2 => vmul d3, d0, d2 vmla d3, d1, d2 Make sure it catches cases where operand 1 is add/fadd/sub/fsub, which was intended in the original revision. llvm-svn: 133127	2011-06-16 01:21:54 +00:00
Bruno Cardoso Lopes	5444a7b4cd	Silence warnings in non assert builds. Patch by David Blaikie llvm-svn: 133118	2011-06-16 00:40:02 +00:00
Jakob Stoklund Olesen	99f35eab45	Use set operations instead of plain lists to enumerate register classes. This simplifies many of the target description files since it is common for register classes to be related or contain sequences of numbered registers. I have verified that this doesn't change the files generated by TableGen for ARM and X86. It alters the allocation order of MBlaze GPR and Mips FGR32 registers, but I believe the change is benign. llvm-svn: 133105	2011-06-15 23:28:14 +00:00
John McCall	4b7a8d68ae	Add a new function attribute, nonlazybind, which inhibits lazy-loading optimizations when emitting calls to the function; instead those calls may use faster relocations which require the function to be immediately resolved upon loading the dynamic object featuring the call. This is useful when it is known that the function will be called frequently and pervasively and therefore there is no merit in delaying binding of the function. Currently only implemented for x86-64, where it turns into a call through the global offset table. Patch by Dan Gohman, who assures me that he's going to add LangRef documentation for this once it's committed. llvm-svn: 133080	2011-06-15 20:36:13 +00:00
Jakob Stoklund Olesen	5977109f14	Remove custom allocation orders in SystemZ. Note that this actually changes code generation, and someone who understands this target better should check the changes. - R12Q is now allocatable. I think it was omitted from the allocation order by mistake since it isn't reserved. It as apparently used as a GOT pointer sometimes, and it should probably be reserved if that is the case. - The GR64 registers are allocated in a different order now. The register allocator will automatically put the CSRs last. There were other changes to the order that may have been significant. The test fix is because r0 and r1 swapped places in the allocation order. llvm-svn: 133067	2011-06-15 18:02:56 +00:00
Evan Cheng	678b691aa3	Another revsh pattern. rdar://9609059 llvm-svn: 133064	2011-06-15 17:17:48 +00:00
Roman Divacky	6874b26d0f	Make PPC64CompilationCallback compilable no non-darwin platforms. Patch by Nathan Whitehorn! llvm-svn: 133059	2011-06-15 15:29:47 +00:00
Owen Anderson	86fd3c0058	Replace the statically generated hashtables for checking register relationships with just scanning the (typically tiny) static lists. At the time I wrote this code (circa 2007), TargetRegisterInfo was using a std::set to perform these queries. Switching to the static hashtables was an obvious improvement, but in reality there's no reason to do anything other than scan. With this change, total LLC time on a whole-program 403.gcc is reduced by approximately 1.5%, almost all of which comes from a 15% reduction in LiveVariables time. It also reduces the binary size of LLC by 86KB, thanks to eliminating a bunch of very large static tables. llvm-svn: 133051	2011-06-15 06:53:50 +00:00
Bob Wilson	4b12a11f30	A minor simplification: no functional change. llvm-svn: 133047	2011-06-15 06:04:34 +00:00
Evan Cheng	6d02d9044b	PerformBFICombine - (bfi A, (and B, Mask1), Mask2) -> (bfi A, B, Mask2) iff the bits being cleared by the AND are not demanded by the BFI. The previous BFI dag combine rule was actually incorrect (or used to be correct until BFI representation changed). rdar://9609030 llvm-svn: 133034	2011-06-15 01:12:31 +00:00
Tanya Lattner	e9e6705cf9	Add an optimization that looks for a specific pair-wise add pattern and generates a vpaddl instruction instead of scalarizing the add. Includes a test case. llvm-svn: 133027	2011-06-14 23:48:48 +00:00
Anna Zaks	cd7f70e8b5	Anna's test commit (#2 ). llvm-svn: 133023	2011-06-14 22:40:29 +00:00
Eli Friedman	164b1d753a	PR10136: fix PPCTargetLowering::LowerCall_SVR4 so that a necessary CopyToReg doesn't appear to be dead. Roman, since you're writing tests for other PPC-SVR4 vararg-related stuff, would you mind writing a test for this? llvm-svn: 133018	2011-06-14 22:16:20 +00:00
Anna Zaks	d7f7fcd3cb	Anna's test commit. llvm-svn: 133017	2011-06-14 22:10:12 +00:00
Evan Cheng	965ed2e790	Also recognize ARM v4t and v5e variants. llvm-svn: 133002	2011-06-14 18:08:33 +00:00
Bruno Cardoso Lopes	dc9ff3a4b1	Add one more argument to the prefetch intrinsic to indicate whether it's a data or instruction cache access. Update the targets to match it and also teach autoupgrade. llvm-svn: 132976	2011-06-14 04:58:37 +00:00
Nick Lewycky	34a425b075	Fit banner in 80-col and adjust whitespace. No functionality changes. llvm-svn: 132964	2011-06-14 03:23:52 +00:00
Jim Grosbach	7ef7ddd2df	Clean up a few 80 column violations. llvm-svn: 132946	2011-06-13 22:54:22 +00:00
Jim Grosbach	dca8531821	Fix coordination for using R4 in Thumb1 as a scratch for SP restore. The logic for reserving R4 for use as a scratch needs to match that for actually using it. Also, it's not necessary for immediate <=508, so adjust the value checked. llvm-svn: 132934	2011-06-13 21:18:25 +00:00
Benjamin Kramer	c970849ea0	InstCombine: Fold A-b == C --> b == A-C if A and C are constants. The backend already knew this trick. llvm-svn: 132915	2011-06-13 15:24:24 +00:00
Rafael Espindola	defd4b0875	AnalyzeBranch doesn't change which successors a bb has, just the order we try to branch to them. Before we were creating successor lists with duplicated entries. Fixing that found a bug in isBlockOnlyReachableByFallthrough that would causes it to return the wrong answer for ----------- ... jne foo jmp bar foo: ---------- llvm-svn: 132882	2011-06-12 03:20:32 +00:00
Charles Davis	7ed40cbded	Put FrameSetup flag on x86 instructions that set up the call frame. No functionality change. Later on, we'll use the flag to emit SEH pseudo-ops that describe how the call frame was built. llvm-svn: 132880	2011-06-12 01:45:54 +00:00
Eli Friedman	1735b29196	Make sure to pass OpFlags into MachineInstrBuilder::addExternalSymbol; the memcpy/memset symbol doesn't get marked up correctly in PIC modes otherwise. Should fix llvm-x86_64-linux-checks buildbot. Followup to r132864. llvm-svn: 132869	2011-06-11 01:55:07 +00:00
Eli Friedman	cd2124a3f0	Add full x86 fast-isel support for memcpy and memset. rdar://9431466 llvm-svn: 132864	2011-06-10 23:39:36 +00:00
Cameron Zwarich	890197859b	Provide an ARMCCState subclass of CCState so that ARM clients will always set CallOrPrologue correctly and eliminate the existing setter. llvm-svn: 132856	2011-06-10 20:59:24 +00:00
Eli Friedman	87ef38784e	PR10092 (second try): Don't crash on a load without a momoperand; fast-isel creates loads like this. llvm-svn: 132826	2011-06-10 01:13:01 +00:00
Eli Friedman	5abfd79900	Chris fixed this README a while back by changing how clang generates code for structs like the given struct. llvm-svn: 132815	2011-06-09 23:02:19 +00:00
Cameron Zwarich	361548d4b4	A CCState was being created without setting whether it is in the Call or Prologue state, causing an assertion failure downstream. This fixes <rdar://problem/9562908>. This really seems like it should always be set at CCState creation time, so mistakes like this can never happen. I'll take a look at doing that. llvm-svn: 132811	2011-06-09 22:30:07 +00:00
Roman Divacky	4b5665a1f7	Fix emission of PPC64 assembler on non-darwin platforms by splitting VK_PPC_{HA,LO}16 into darwin and gas variants. Darwin wants {ha,lo}16(symbol) while gnu as wants symbol@{ha,l}. llvm-svn: 132802	2011-06-09 20:25:38 +00:00
Eli Friedman	9008377c2d	Revert 132789; it breaks tests. My mistake. llvm-svn: 132795	2011-06-09 19:33:30 +00:00
Eli Friedman	c095116710	Add a check to make sure we don't crash with strange configurations where we do fast-isel, then try to fold instructions. PR10092. llvm-svn: 132789	2011-06-09 18:55:00 +00:00
Jakob Stoklund Olesen	5750ca7089	Remove custom allocation order boilerplate that is no longer needed. The register allocators automatically filter out reserved registers and place the callee saved registers last in the allocation order, so custom methods are no longer necessary just for that. Some targets still use custom allocation orders: ARM/Thumb: The high registers are removed from GPR in thumb mode. The NEON allocation orders prefer to use non-VFP2 registers first. X86: The GR8 classes omit AH-DH in x86-64 mode to avoid REX trouble. SystemZ: Some of the allocation orders are omitting R12 aliases without explanation. I don't understand this target well enough to fix that. It looks like all the boilerplate could be removed by reserving the right registers. llvm-svn: 132781	2011-06-09 16:56:59 +00:00
Eric Christopher	f15601f19a	Speculatively revert 132758 and 132768 to try to fix the Windows buildbots. llvm-svn: 132777	2011-06-09 16:03:19 +00:00
Duncan Sands	eeb50c8fd2	Enable printf() to iprintf() optimization for the TCE target. Patch by Pekka Jaaskelainen. llvm-svn: 132774	2011-06-09 11:11:45 +00:00
Akira Hatanaka	0683a7212e	Initial support for inline asm memory operand constraints. llvm-svn: 132768	2011-06-09 03:31:05 +00:00
Eric Christopher	0713a9d8fc	Add a parameter to CCState so that it can access the MachineFunction. No functional change. Part of PR6965 llvm-svn: 132763	2011-06-08 23:55:35 +00:00
Akira Hatanaka	4e9af454f7	Fix bug in lowering of DYNAMIC_STACKALLOC nodes. The correct offset of the dynamically allocated stack area was not set. llvm-svn: 132758	2011-06-08 21:28:09 +00:00
Akira Hatanaka	195a1e2184	Reorganize code in MipsTargetLowering::LowerCall to improve readability. llvm-svn: 132756	2011-06-08 17:39:33 +00:00
Akira Hatanaka	41956cf6e3	Refactor MipsTargetLowering::EmitInstrWithCustomInserter. llvm-svn: 132726	2011-06-07 19:28:39 +00:00
Akira Hatanaka	e99b08d6c3	Put back removed line. llvm-svn: 132725	2011-06-07 19:03:14 +00:00
Akira Hatanaka	1550678765	Coding style fixes. - Fix indentation. - Move comments. - Fit lines in 80 columns. - Remove dead code. llvm-svn: 132724	2011-06-07 18:58:42 +00:00
Akira Hatanaka	dde4aac02b	Use tabs to separate opcode and operand strings. llvm-svn: 132718	2011-06-07 18:16:51 +00:00
Akira Hatanaka	d8373a4680	Add comments for wrapper node patterns in MipsInstrInfo.td. llvm-svn: 132717	2011-06-07 18:00:14 +00:00
Akira Hatanaka	08b7a779ef	Add test case for C++ exception handling and fix the following mistakes in MipsFrameLowering::emitPrologue: - cfi directives are not inserted at the right location or in the right order. - The source MachineLocation for the cfi directive that changes the cfa register to $fp should be MachineLocation::VirtualFP. - A PROLOG_LABEL that marks the beginning of cfi_offset directives for callee-saved register is emitted even when no callee-saved registers are saved. - When a callee-saved double precision register is saved, two cfi_offset directives, one for each of the paired single precision registers, should be emitted. llvm-svn: 132703	2011-06-07 02:17:21 +00:00
Andrew Trick	410172bf5e	Fix for setjmp/longjmp exception handling on ARM. setjmp clobbers CPSR. rdar://problem/9556069 llvm-svn: 132699	2011-06-07 00:08:49 +00:00
Stuart Hastings	e0d3426e1a	Followup to 132458, omit unnecessary stack copy when x87 input is a load. rdar://problem/6373334 llvm-svn: 132696	2011-06-06 23:15:58 +00:00
Stuart Hastings	be605494ac	Reapply 132424 with fixes. This fixes PR10068. rdar://problem/5993888 llvm-svn: 132606	2011-06-03 23:53:54 +00:00
Jakob Stoklund Olesen	e1c6d3acb4	Blackfin always uses a reserved call frame. Materializing the stack pointer update before a call requires a scratch register that may not be available. llvm-svn: 132601	2011-06-03 22:45:18 +00:00
Eric Christopher	354b2a25f3	Make the Uv constraint a memory operand. This doesn't solve the addressing mode problem mentioned in r132559. Backend part of rdar://9037836 and part of rdar://9119939 llvm-svn: 132561	2011-06-03 17:24:37 +00:00
Roman Divacky	a4a59aebd9	Fix wrong usages of CTR/MCTR where CTR8/MCTR8 was meant. - Check for MTCTR8 in addition to MTCTR when looking up a hazard. - When lowering an indirect call use CTR8 when targeting 64bit. - Introduce BCTR8 that uses CTR8 and use it on 64bit when expanding ISD::BRIND. The last change fixes PR8487. With those changes, we are able to compile a running "ls" and "sh" on FreeBSD/PowerPC64. llvm-svn: 132552	2011-06-03 15:47:49 +00:00
Eli Friedman	86585798af	Add ARM fast-isel support for materializing the address of a global in cases where the global uses an indirect symbol. rdar://9431157 llvm-svn: 132522	2011-06-03 01:13:19 +00:00
Eric Christopher	de9399bf76	Have LowerOperandForConstraint handle multiple character constraints. Part of rdar://9119939 llvm-svn: 132510	2011-06-02 23:16:42 +00:00
Jakob Stoklund Olesen	60cdf8e727	Flag unallocatable register classes instead of giving them empty allocation orders. llvm-svn: 132509	2011-06-02 23:07:24 +00:00
Jakob Stoklund Olesen	75703ca76f	Make it possible to have unallocatable register classes. Some register classes are only used for instruction operand constraints. They should never be used for virtual registers. Previously, those register classes were given an empty allocation order, but now you can say 'let isAllocatable=0' in the register class definition. TableGen calculates if a register is part of any allocatable register class, and makes that information available in TargetRegisterDesc::inAllocatableClass. The goal here is to eliminate use cases for overriding allocation_order_* methods. llvm-svn: 132508	2011-06-02 23:07:20 +00:00
Tanya Lattner	f0759ef271	Fix encoding for VEXTdf. llvm-svn: 132486	2011-06-02 21:25:24 +00:00
Rafael Espindola	aa318ae495	Revert 132424 to fix PR10068. llvm-svn: 132479	2011-06-02 19:57:47 +00:00
Stuart Hastings	8d530ad22a	Omit unnecessary stack copy when x87 input is a load. rdar://problem/6373334 llvm-svn: 132458	2011-06-02 15:57:11 +00:00
Jakob Stoklund Olesen	aff1060207	Use TRI::has{Sub,Super}ClassEq() where possible. No functional change. llvm-svn: 132455	2011-06-02 05:43:46 +00:00
Rafael Espindola	d6860522b2	Don't hardcode the %reg format in the streamer. llvm-svn: 132451	2011-06-02 02:34:55 +00:00
Akira Hatanaka	2446869410	Detect FI\|cst pattern in MipsDAGToDAGISel::SelectAddr. Patch by Sasa Stankovic. llvm-svn: 132448	2011-06-02 01:03:14 +00:00
Akira Hatanaka	6627752050	Custom-lower FRAMEADDR. Patch by Sasa Stankovic. llvm-svn: 132444	2011-06-02 00:24:44 +00:00
Stuart Hastings	7adc95f69e	Recommit 132404 with fixes. rdar://problem/5993888 llvm-svn: 132424	2011-06-01 21:33:14 +00:00
Stuart Hastings	aab130d995	Revert 132404 to appease a buildbot. rdar://problem/5993888 llvm-svn: 132419	2011-06-01 19:52:20 +00:00
Stuart Hastings	7b7c102f2c	Add support for x86 CMPEQSS and friends. These instructions do a floating-point comparison, generate a mask of 0s or 1s, and generally DTRT with NaNs. Only profitable when the user wants a materialized 0 or 1 at runtime. rdar://problem/5993888 llvm-svn: 132404	2011-06-01 17:17:45 +00:00
Jakob Stoklund Olesen	56ce3a0f01	Fix PR10059 and future variations by handling all register subclasses. Add TargetRegisterInfo::hasSubClassEq and use it to check for compatible register classes instead of trying to list all register classes in X86's getLoadStoreRegOpcode. llvm-svn: 132398	2011-06-01 15:32:10 +00:00
Stuart Hastings	9f20804216	FGETSIGN support for x86, using movmskps/pd. Will be enabled with a patch to TargetLowering.cpp. rdar://problem/5660695 llvm-svn: 132388	2011-06-01 04:39:42 +00:00
Bruno Cardoso Lopes	f771a0f490	Fix uninitialized variables and silence warnings llvm-svn: 132355	2011-05-31 20:25:26 +00:00
Richard Osborne	4dae7379ef	Fix 80 column violations. llvm-svn: 132341	2011-05-31 16:30:33 +00:00
Richard Osborne	2f14b0bb1d	Add XCore intrinsic for crc8. llvm-svn: 132340	2011-05-31 16:24:49 +00:00
Richard Osborne	542f9a2bcf	Add XCore intrinsic for crc32. llvm-svn: 132336	2011-05-31 14:47:36 +00:00
Bruno Cardoso Lopes	394f516d16	Fix ssat and ssat16 encodings for ARM and Thumb. The bit position value must be encoded decremented by one. Only add encoding tests for ssat16 because ssat can't be parsed yet. llvm-svn: 132324	2011-05-31 03:33:27 +00:00
Bruno Cardoso Lopes	98fc4c8bbc	This patch implements atomic intrinsics atomic.load.add (sub,and,or,xor, nand), atomic.swap and atomic.cmp.swap, all in i8, i16 and i32 versions. The intrinsics are implemented by creating pseudo-instructions, which are then expanded in the method MipsTargetLowering::EmitInstrWithCustomInserter. Patch by Sasa Stankovic. llvm-svn: 132323	2011-05-31 02:54:07 +00:00
Bruno Cardoso Lopes	bf3c1251e0	This patch implements the thread local storage. Implemented are General Dynamic, Initial Exec and Local Exec TLS models. Patch by Sasa Stankovic llvm-svn: 132322	2011-05-31 02:53:58 +00:00
Rafael Espindola	08600bcf65	Use the dwarf->llvm mapping to print register names in the cfi directives. Fixes PR9826. llvm-svn: 132317	2011-05-30 20:20:15 +00:00
Rafael Espindola	cd0d2fd21f	Split ppc dwarf regnums into ppc64 and ppc32 flavours. llvm-svn: 132315	2011-05-30 18:24:44 +00:00
Rafael Espindola	ddffa0e160	Introduce the DwarfRegAlias class for declaring that two registers have the same dwarf number. This will be used for creating a dwarf number to register mapping. The only case that needs this so far is the XMM/YMM registers that unfortunately do have the same numbers. llvm-svn: 132314	2011-05-30 17:49:59 +00:00

... 4 5 6 7 8 ...

18433 Commits