llvm-project

Commit Graph

Author	SHA1	Message	Date
Akira Hatanaka	0f693a8a77	[mips] Custom-legalize BR_JT. In N64-static, GOT address is needed to compute the branch address. llvm-svn: 176580	2013-03-06 21:32:03 +00:00
Michael Liao	da22b30be5	Fix PR15355 - Clear 'mayStore' flag when loading from the atomic variable before the spin loop - Clear kill flag from one use to multiple use in registers forming the address to that atomic variable - don't use a physical register as live-in register in BB (neither entry nor landing pad.) by copying it into virtual register (patch by Cameron Zwarich) llvm-svn: 176538	2013-03-06 00:17:04 +00:00
Akira Hatanaka	1454ed8ad3	[mips] Remove android calling convention. This calling convention was added just to handle functions which return vector of floats. The fix committed in r165585 solves the problem. llvm-svn: 176530	2013-03-05 23:22:30 +00:00
Akira Hatanaka	e092f72956	[mips] Fix MipsCC::analyzeReturn so that, in soft-float mode, fp128 gets returned in registers $2 and $4. llvm-svn: 176527	2013-03-05 22:54:59 +00:00
Akira Hatanaka	5f3ba9e595	[mips] Fix MipsTargetLowering::LowerCallResult and LowerReturn to correctly handle fp128 returns. llvm-svn: 176523	2013-03-05 22:41:55 +00:00
Akira Hatanaka	3b7391d140	[mips] Fix MipsTargetLowering::LowerCall to pass fp128 arguments in floating point registers. llvm-svn: 176521	2013-03-05 22:20:28 +00:00
Akira Hatanaka	4b634fa3b3	[mips] Correct handling of fp128 (long double) formals and read long double parameters from floating point registers if target is mips64 hard float. llvm-svn: 176520	2013-03-05 22:13:04 +00:00
Meador Inge	b904e6e467	Add more functions to the TLI. This patch adds many more functions to the target library information. All of the functions being added were discovered while doing the migration of the simplify-libcalls attribute annotation functionality to the functionattrs pass. As a part of that work the attribute annotation logic will query TLI to determine if a function should be annotated or not. Signed-off-by: Meador Inge <meadori@codesourcery.com> llvm-svn: 176514	2013-03-05 21:47:40 +00:00
Jyotsna Verma	457801f7ab	reverting patch 176508. llvm-svn: 176513	2013-03-05 20:29:23 +00:00
Jyotsna Verma	7179e712dd	Hexagon: Add support for lowering block address. llvm-svn: 176508	2013-03-05 19:37:46 +00:00
Vincent Lejeune	fe32bd87c2	R600: Do not predicate vector op llvm-svn: 176507	2013-03-05 19:12:06 +00:00
Jyotsna Verma	0eeea14e3e	Hexagon: Expand addc, adde, subc and sube. llvm-svn: 176505	2013-03-05 19:04:47 +00:00
Benjamin Kramer	5dc831801a	Update cmake build. llvm-svn: 176501	2013-03-05 18:54:05 +00:00
Jyotsna Verma	f1214a8ab7	Hexagon: Use MO operand flags to mark constant extended instructions. llvm-svn: 176500	2013-03-05 18:51:42 +00:00
Jyotsna Verma	f4e324f4fb	Hexagon: Add encoding bits to the TFR64 instructions. Set isMoveImm, isAsCheapAsAMove flags for TFRI instructions. llvm-svn: 176499	2013-03-05 18:42:28 +00:00
Vincent Lejeune	68b6b6ddfb	R600: initial scheduler code This is a skeleton for a pre-RA MachineInstr scheduler strategy. Currently it only tries to expose more parallelism for ALU instructions (this also makes the distribution of GPR channels more uniform and increases the chances of ALU instructions to be packed together in a single VLIW group). Also it tries to reduce clause switching by grouping instructions of the same kind (ALU/FETCH/CF) together. Vincent Lejeune: - Support for VLIW4 Slot assignment - Recomputation of ScheduleDAG to get more parallelism opportunities Tom Stellard: - Fix assertion failure when trying to determine an instruction's slot based on its destination register's class - Fix some compiler warnings Vincent Lejeune: [v2] - Remove recomputation of ScheduleDAG (will be provided in a later patch) - Improve estimation of an ALU clause size so that heuristic does not emit cf instructions at the wrong position. - Make schedule heuristic smarter using SUnit Depth - Take constant read limitations into account Vincent Lejeune: [v3] - Fix some uninitialized values in ConstPair - Add asserts to ensure an ALU slot is always populated llvm-svn: 176498	2013-03-05 18:41:32 +00:00
Vincent Lejeune	0b72f1021d	R600: Remove LowerConstCopyPass and lower CONST_COPY right after ISel. Maintaining CONST_COPY Instructions until Pre Emit may prevent some ifcvt case and taking them in account for scheduling is difficult for no real benefit. llvm-svn: 176488	2013-03-05 15:04:55 +00:00
Vincent Lejeune	3b6f20e944	R600: Turn BUILD_VECTOR into Reg_Sequence Reviewed-by: Tom Stellard <thomas.stellard at amd.com> llvm-svn: 176487	2013-03-05 15:04:49 +00:00
Vincent Lejeune	10a5e4773e	R600: CONST_ADDRESS node is not marked as mayLoad anymore Reviewed-by: Tom Stellard <thomas.stellard at amd.com> mayLoad complicates scheduling and does not bring any useful info as the location is not writeable at all. llvm-svn: 176486	2013-03-05 15:04:42 +00:00
Vincent Lejeune	a199d01e4d	R600: Use MUL_IEEE for trig/fdiv intrinsic Reviewed-by: Tom Stellard <thomas.stellard at amd.com> llvm-svn: 176485	2013-03-05 15:04:37 +00:00
Vincent Lejeune	743dca0446	R600: Add support for indirect addressing of non default const buffer NOTE: This is a candidate for the Mesa stable branch. llvm-svn: 176484	2013-03-05 15:04:29 +00:00
David Sehr	4c8979cd4d	The current X86 NOP padding uses one long NOP followed by the remainder in one-byte NOPs. If the processor actually executes those NOPs, as it sometimes does with aligned bundling, this can have a performance impact. From my micro-benchmarks run on my one machine, a 15-byte NOP followed by twelve one-byte NOPs is about 20% worse than a 15 followed by a 12. This patch changes NOP emission to emit as many 15-byte (the maximum) as possible followed by at most one shorter NOP. llvm-svn: 176464	2013-03-05 00:02:23 +00:00
Akira Hatanaka	c7828356aa	[mips] Print move instructions. "move $4, $5" is printed instead of "or $4, $5, $zero". llvm-svn: 176455	2013-03-04 22:25:01 +00:00
Jack Carter	0e149b04f6	Mips specific inline assembler constraint 'R'. 'R': An address that can be used in a non-macro load or store. This patch includes a positive test case. llvm-svn: 176452	2013-03-04 21:33:15 +00:00
Preston Gurd	485296d1e8	Bypass Slow Divides * Only apply divide bypass optimization when not optimizing for size. * Fixed bug caused by constant for 0 value of type Int32, used dividend type to generate the constant instead. * For atom x86-64 apply the divide bypass to use 16-bit divides instead of 64-bit divides when operand values are small enough. * Added lit tests for 64-bit divide bypass. Patch by Tyler Nowicki! llvm-svn: 176442	2013-03-04 18:13:57 +00:00
Tom Stellard	b2f2f960ce	R600: Clean up datalayout strings so they better match hardware capabilities llvm-svn: 176439	2013-03-04 17:40:28 +00:00
Jia Liu	434874db6f	Mips ISD typo llvm-svn: 176426	2013-03-04 01:06:54 +00:00
Jim Grosbach	a3c5c769d6	ARM: Creating a vector from a lane of another. The VDUP instruction source register doesn't allow a non-constant lane index, so make sure we don't construct a ARM::VDUPLANE node asking it to do so. rdar://13328063 http://llvm.org/bugs/show_bug.cgi?id=13963 llvm-svn: 176413	2013-03-02 20:16:24 +00:00
Jim Grosbach	c6f1914ef0	Clean up code format a bit. llvm-svn: 176412	2013-03-02 20:16:19 +00:00
Jim Grosbach	54efea0a7a	Tidy up. Trailing whitespace. llvm-svn: 176411	2013-03-02 20:16:15 +00:00
Arnold Schwaighofer	99cba9697a	ARM NEON: Fix v2f32 float intrinsics Mark them as expand, they are not legal as our backend does not match them. llvm-svn: 176410	2013-03-02 19:38:33 +00:00
Arnold Schwaighofer	20ef54f4c1	X86 cost model: Adjust cost for custom lowered vector multiplies This matters for example in following matrix multiply: int mmult(int rows, int cols, int m1, int m2, int m3) { int i, j, k, val; for (i=0; i<rows; i++) { for (j=0; j<cols; j++) { val = 0; for (k=0; k<cols; k++) { val += m1[i][k] * m2[k][j]; } m3[i][j] = val; } } return(m3); } Taken from the test-suite benchmark Shootout. We estimate the cost of the multiply to be 2 while we generate 9 instructions for it and end up being quite a bit slower than the scalar version (48% on my machine). Also, properly differentiate between avx1 and avx2. On avx-1 we still split the vector into 2 128bits and handle the subvector muls like above with 9 instructions. Only on avx-2 will we have a cost of 9 for v4i64. I changed the test case in test/Transforms/LoopVectorize/X86/avx1.ll to use an add instead of a mul because with a mul we now no longer vectorize. I did verify that the mul would be indeed more expensive when vectorized with 3 kernels: for (i ...) r += a[i] * 3; for (i ...) m1[i] = m1[i] * 3; // This matches the test case in avx1.ll and a matrix multiply. In each case the vectorized version was considerably slower. radar://13304919 llvm-svn: 176403	2013-03-02 04:02:52 +00:00
Andrew Trick	63474629e8	Added FIXME for future Hexagon cleanup. llvm-svn: 176400	2013-03-02 01:43:08 +00:00
Akira Hatanaka	ece459bb66	[mips] Fix inefficient code generation. This patch eliminates the need to emit a constant move instruction when this pattern is matched: (select (setgt a, Constant), T, F) The pattern above effectively turns into this: (conditional-move (setlt a, Constant + 1), F, T) llvm-svn: 176384	2013-03-01 21:52:08 +00:00
Akira Hatanaka	a4c0341514	Fix indentation. llvm-svn: 176380	2013-03-01 21:22:21 +00:00
Michael Liao	6af16fc3b7	Fix PR10475 - ISD::SHL/SRL/SRA must have either both scalar or both vector operands but TLI.getShiftAmountTy() so far only returns a scalar type. As a result, backend logic assuming that breaks. - Rename the original TLI.getShiftAmountTy() to TLI.getScalarShiftAmountTy() and re-define TLI.getShiftAmountTy() to return target-specified scalar type or the same vector type as the 1st operand. - Fix most TICG logic assuming TLI.getShiftAmountTy() a simple scalar type. llvm-svn: 176364	2013-03-01 18:40:30 +00:00
Chad Rosier	9660343b42	Add support for using non-pic code for arm and thumb1 when emitting the sjlj dispatch code. As far as I can tell the thumb2 code is behaving as expected. I was able to compile and run the associated test case for both arm and thumb1. rdar://13066352 llvm-svn: 176363	2013-03-01 18:30:38 +00:00
Jyotsna Verma	8425643728	Hexagon: Add constant extender support framework. llvm-svn: 176358	2013-03-01 17:37:13 +00:00
Christian Konig	d0e3da1818	R600/SI: handle all registers in copyPhysReg v2 v2: based on Michel's patch, but now allows copying of all register sizes. Signed-off-by: Michel Dänzer <michel.daenzer@amd.com> Signed-off-by: Christian König <christian.koenig@amd.com> llvm-svn: 176346	2013-03-01 09:46:27 +00:00
Christian Konig	1f344cda53	R600/SI: remove S_MOV immediate patterns They won't match anyway. Signed-off-by: Christian König <christian.koenig@amd.com> llvm-svn: 176345	2013-03-01 09:46:22 +00:00
Christian Konig	8465296420	R600/SI: remove GPR*AlignEncode It's much easier to specify the encoding with tablegen directly. Signed-off-by: Christian König <christian.koenig@amd.com> llvm-svn: 176344	2013-03-01 09:46:17 +00:00
Christian Konig	01fd1f6b36	R600/SI: fix warning about overloaded virtual Signed-off-by: Christian König <christian.koenig@amd.com> llvm-svn: 176343	2013-03-01 09:46:11 +00:00
Christian Konig	862fd9fa2c	R600/SI: fix inserting waits for unordered defines Signed-off-by: Christian König <christian.koenig@amd.com> llvm-svn: 176342	2013-03-01 09:46:04 +00:00
Duncan Sands	2cb41d372c	GCC thinks that this variable might be used uninitialized (it isn't). llvm-svn: 176341	2013-03-01 09:46:03 +00:00
Akira Hatanaka	e9e588dd72	[mips] Remove unused option. Fix 80-column violations. llvm-svn: 176330	2013-03-01 02:17:02 +00:00
Akira Hatanaka	8f7bfb39be	[mips] Add the capability to search delay slot filling instructions in successor basic blocks. Currently this is off by default. llvm-svn: 176329	2013-03-01 02:03:51 +00:00
Akira Hatanaka	28dc83ceb3	[mips] Do not add SecondLastInst to list BranchInstrs if there is only one terminator. No functionality change. llvm-svn: 176326	2013-03-01 01:22:26 +00:00
Akira Hatanaka	7320b2364d	[mips] Define an overloaded version of function MipsInstrInfo::AnalyzeBranchAdd. This function will be used later when the capability to search delay slot filling instructions in successor blocks is added. No intended functionality changes. llvm-svn: 176325	2013-03-01 01:10:17 +00:00
Akira Hatanaka	e44e30ca5a	[mips] Add options to disable searching backward and in successor blocks. llvm-svn: 176321	2013-03-01 01:02:36 +00:00
Akira Hatanaka	e01ff9dc60	[mips] Add capability to search in the forward direction for instructions that can fill the delay slot. Currently, this is off by default. llvm-svn: 176320	2013-03-01 00:50:52 +00:00
Akira Hatanaka	f815db5bcb	[mips] Define helper function searchRange No functionality change. llvm-svn: 176318	2013-03-01 00:26:14 +00:00
Akira Hatanaka	50e174d95d	[mips] Rename function findDelayInstr to searchBackward. llvm-svn: 176317	2013-03-01 00:20:16 +00:00
Akira Hatanaka	eb33ced08f	[mips] Define class MemDefsUses. This class tracks dependence between memory instructions using underlying objects of memory operands. llvm-svn: 176313	2013-03-01 00:16:31 +00:00
Chad Rosier	537ff50b5d	Tidy up; no functional change. llvm-svn: 176288	2013-02-28 19:16:42 +00:00
Chad Rosier	11a9828745	Style; no functional change. llvm-svn: 176285	2013-02-28 18:54:27 +00:00
Yiannis Tsiouris	d4842e5ee9	Re-format comments (and check commit access) llvm-svn: 176270	2013-02-28 16:59:10 +00:00
Tim Northover	ce17020c97	AArch64: remove post-encoder method from FCMP (immediate) instructions. The work done by the post-encoder (setting architecturally unused bits to 0 as required) can be done by the existing operand that covers the "#0.0". This removes at least one use of the discouraged PostEncoderMethod uses. llvm-svn: 176261	2013-02-28 14:46:14 +00:00
Tim Northover	c3c5c0971d	AArch64: be more careful resorting to inefficient addressing for weak vars. If an otherwise weak var is actually defined in this unit, it can't be undefined at runtime so we can use normal global variable sequences (ADRP/ADD) to access it. llvm-svn: 176259	2013-02-28 14:36:31 +00:00
Tim Northover	b9d4fd210b	AArch64: don't drop GlobalAddress offset when handling extern_weak decls. llvm-svn: 176258	2013-02-28 14:36:24 +00:00
Tim Northover	9fafdf6d5a	AArch64: Use cbnz instead of cmp/b.ne pair for atomic operations. llvm-svn: 176253	2013-02-28 13:52:07 +00:00
Jim Grosbach	5f21587648	ARM: FMA is legal only if VFP4 is available. rdar://13306723 llvm-svn: 176212	2013-02-27 21:31:12 +00:00
Chad Rosier	d3e47ca423	Remove this instance of dl as it's defined in a previous scope. llvm-svn: 176208	2013-02-27 20:34:14 +00:00
Tim Northover	29931ab21d	ARM: permit full range of valid ADR immediates. This fixes an issue where trying to assemble valid ADR instructions would cause LLVM to hit a failed assertion. Patch by Keith Walker. llvm-svn: 176189	2013-02-27 16:43:09 +00:00
Nadav Rotem	08ab877cc7	Revert r176166 because it broke one of the lit tests. llvm-svn: 176171	2013-02-27 05:56:20 +00:00
Nadav Rotem	85e1211fbf	std::string to StringRef. llvm-svn: 176166	2013-02-27 05:23:56 +00:00
Reed Kotler	5bf8020d83	Fix cut/paste error in a comment. llvm-svn: 176165	2013-02-27 04:20:14 +00:00
Reed Kotler	bb3094aa1e	Add the skeleton for the Mips constant island pass. It will only be used for Mips 16 at this time. llvm-svn: 176161	2013-02-27 03:33:58 +00:00
Bill Schmidt	8ea7af8e44	Fix PR15332 (patch by Florian Zeitz). There's no need to generate a stack frame for PPC32 SVR4 when there are no local variables assigned to the stack, i.e., when no red zone is needed. (PPC64 supports a red zone, but PPC32 does not.) llvm-svn: 176124	2013-02-26 21:28:57 +00:00
Christian Konig	e500e445c5	R600/SI: Add promotion of e32 to e64 in operand folding Signed-off-by: Christian König <christian.koenig@amd.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 176105	2013-02-26 17:52:47 +00:00
Christian Konig	f741fbfb1b	R600/SI: add VOP mapping functions Make it possible to map between e32 and e64 encoding opcodes. Signed-off-by: Christian König <christian.koenig@amd.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 176104	2013-02-26 17:52:42 +00:00
Christian Konig	6612ac39c9	R600/SI: swap operands if it helps folding Signed-off-by: Christian König <christian.koenig@amd.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 176103	2013-02-26 17:52:36 +00:00
Christian Konig	76edd4f2bc	R600/SI: add some more instruction flags Signed-off-by: Christian König <christian.koenig@amd.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 176102	2013-02-26 17:52:29 +00:00
Christian Konig	f82901af2a	R600/SI: add post ISel folding for SI v2 Include immediate folding and SGPR limit handling for VOP3 instructions. v2: remove leftover hasExtraSrcRegAllocReq Signed-off-by: Christian König <christian.koenig@amd.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 176101	2013-02-26 17:52:23 +00:00
Christian Konig	d910b7d534	R600/SI: add folding helper Signed-off-by: Christian König <christian.koenig@amd.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 176100	2013-02-26 17:52:16 +00:00
Christian Konig	d303996918	R600/SI: fix VOP3b encoding v2 v2: document why we hardcode VCC for now. This is a candidate for the mesa-stable branch. Signed-off-by: Christian König <christian.koenig@amd.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 176099	2013-02-26 17:52:09 +00:00
Christian Konig	0f0a8fe2dd	R600/SI: fix and cleanup SI register definition v2 Prevent producing real strange tablegen code by using proper register sizes, alignments and hierarchy. Also cleanup the unused definitions and add some comments. v2: add SGPR 512 bit registers, stop registers from wrapping around, fix SGPR alignment This is a candidate for the mesa-stable branch. Signed-off-by: Christian König <christian.koenig@amd.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 176098	2013-02-26 17:52:03 +00:00
Christian Konig	d76ed54b60	R600/SI: fix stupid typo This is a candidate for the mesa-stable branch. Signed-off-by: Christian König <christian.koenig@amd.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 176097	2013-02-26 17:51:57 +00:00
Akira Hatanaka	979899e5cc	[mips] Use class RegDefsUses to track register defs and uses. No functionality change. llvm-svn: 176070	2013-02-26 01:30:05 +00:00
Chad Rosier	1b33e8d63e	[fast-isel] Make sure the FastLowerArguments function checks to make sure the arguments type is a simple type. rdar://13290455 llvm-svn: 176066	2013-02-26 01:05:31 +00:00
Michael Liao	609a527286	Refine fix to PR10499, no functionality change - Put expensive checking after simple one llvm-svn: 176060	2013-02-25 23:16:36 +00:00
Michael Liao	ab97668061	Fix PR10499 - Check whether SSE is available before lowering all 1s vector building with PCMPEQD, which is only available from SSE2 llvm-svn: 176058	2013-02-25 23:01:03 +00:00
Chad Rosier	a92ef4ba5b	[fast-isel] Add X86FastIsel::FastLowerArguments to handle functions with 6 or fewer scalar integer (i32 or i64) arguments. It completely eliminates the need for SDISel for trivial functions. Also, add the new llc -fast-isel-abort-args option, which is similar to -fast-isel-abort option, but for formal argument lowering. llvm-svn: 176052	2013-02-25 21:59:35 +00:00
Chad Rosier	669bb3ee77	[ms-inline asm] Add support for the pushad/popad mnemonics. rdar://13254235 llvm-svn: 176036	2013-02-25 19:06:27 +00:00
Bill Schmidt	b454829981	Fix missing relocation for TLS addressing peephole optimization. Report and fix due to Kai Nacke. Testcase update by me. llvm-svn: 176029	2013-02-25 16:44:35 +00:00
Reed Kotler	bd1058a877	Make pseudos FEXT_CCRX16_ins and FEXT_CCRXI16_ins into custom emitters. llvm-svn: 176007	2013-02-25 02:25:47 +00:00
Reed Kotler	7a86b3dc2b	Make pseudo FEXT_T8I816_ins into a custom emitter. llvm-svn: 176002	2013-02-24 23:17:51 +00:00
Bill Schmidt	c68c6df884	Fix PR14364. This removes a const_cast hack from PPCRegisterInfo::hasReservedSpillSlot(). The proper place to save the frame index for the CR spill slot is in the PPCFunctionInfo object, not the PPCRegisterInfo object. No new test cases, as this just reimplements existing function. Existing tests such as test/CodeGen/PowerPC/crsave.ll are sufficient. llvm-svn: 175998	2013-02-24 17:34:50 +00:00
Francois Pichet	d1cef3ecd3	Typo llvm-svn: 175991	2013-02-24 12:34:13 +00:00
Nadav Rotem	b532fca92c	Revert r169638 because it broke Mesa llvmpipe tests. Fix PR15239. llvm-svn: 175985	2013-02-24 07:09:35 +00:00
Reed Kotler	e2bead7a2d	Make pseudo FEXT_T8I816_ins a custom inserter. It should be expanded as early as possible, which means during instruction selection. llvm-svn: 175984	2013-02-24 06:16:39 +00:00
Reed Kotler	80070bd439	Add new base instruction def for cmpi, cmp, slt and sltu so that def/uses proper. Fixed this already a few days ago for slti. llvm-svn: 175975	2013-02-23 23:37:03 +00:00
Benjamin Kramer	ee23dcb461	X86: Disable cmov-memory patterns on subtargets without cmov. Fixes PR15115. llvm-svn: 175962	2013-02-23 10:40:58 +00:00
Reed Kotler	dacee2bb44	Expand pseudos/macros for Selt. This is the last of the complex macros. The rest is some small misc. stuff. llvm-svn: 175950	2013-02-23 03:09:56 +00:00
Jim Grosbach	9be2d71512	ARM: Convenience aliases for 'srs*' instructions. Handle an implied 'sp' operand. rdar://11466783 llvm-svn: 175940	2013-02-23 00:52:09 +00:00
Akira Hatanaka	02b0e48f6a	[mips] Emit call16 operator instead of got_disp. The former allows lazy binding. llvm-svn: 175920	2013-02-22 21:10:03 +00:00
Peter Collingbourne	7b57621fb3	x86_64: designate most general purpose and SSE registers as callee save under coldcc llvm-svn: 175911	2013-02-22 19:19:44 +00:00
Michel Danzer	0cc991e17b	R600/SI: Add pattern for sign extension of i1 to i32. 16 more little piglits with radeonsi. NOTE: This is a candidate for the Mesa stable branch. Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 175887	2013-02-22 11:22:58 +00:00
Michel Danzer	00fb283560	R600/SI: Add pattern for logical or of i1 values. 24 more little piglits with radeonsi. NOTE: This is a candidate for the Mesa stable branch. Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 175886	2013-02-22 11:22:54 +00:00
Michel Danzer	c3ea4041b9	R600/SI: Add pattern for fceil. 9 more little piglits with radeonsi. NOTE: This is a candidate for the Mesa stable branch. Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 175885	2013-02-22 11:22:49 +00:00
Kristof Beyls	0ba797e8f7	Make ARMAsmPrinter generate the correct alignment specifier syntax in instructions. The Printer will now print instructions with the correct alignment specifier syntax, like vld1.8 {d16}, [r0:64] llvm-svn: 175884	2013-02-22 10:01:33 +00:00
Reed Kotler	fbe4e863db	Fix a nomenclature mistake. Slt->Slti in the functions. The "i" refers to the immediate operand of the slti or cmp function. llvm-svn: 175865	2013-02-22 05:59:39 +00:00
Reed Kotler	4416cdadd5	Expand mips16 SelT form pseudos/macros. llvm-svn: 175862	2013-02-22 05:10:51 +00:00
Andrew Trick	57ecf603c4	Remove code copied from GenRegisterInfo.inc. There's no apparent reason this code was copied from generated source into a .cpp. It sets a bad example for those working on other targets and trying to understand the register info API. llvm-svn: 175849	2013-02-22 01:15:08 +00:00
Eli Bendersky	8da87163ca	Move the eliminateCallFramePseudoInstr method from TargetRegisterInfo to TargetFrameLowering, where it belongs. Incidentally, this allows us to delete some duplicated (and slightly different!) code in TRI. There are potentially other layering problems that can be cleaned up as a result, or in a similar manner. The refactoring was OK'd by Anton Korobeynikov on llvmdev. Note: this touches the target interfaces, so out-of-tree targets may be affected. llvm-svn: 175788	2013-02-21 20:05:00 +00:00
Anshuman Dasgupta	d062c70444	Hexagon: Expand cttz, ctlz, and ctpop for now. llvm-svn: 175783	2013-02-21 19:39:40 +00:00
Evan Cheng	ab28b9ae73	Radar numbers don't belong in source code. llvm-svn: 175775	2013-02-21 18:37:54 +00:00
Bill Schmidt	836c45badf	Trivial cleanup llvm-svn: 175771	2013-02-21 17:26:05 +00:00
Bill Schmidt	27917785ae	Large code model support for PowerPC. Large code model is identical to medium code model except that the addis/addi sequence for "local" accesses is never used. All accesses use the addis/ld sequence. The coding changes are straightforward; most of the patch is taken up with creating variants of the medium model tests for large model. llvm-svn: 175767	2013-02-21 17:12:27 +00:00
Eli Bendersky	e93249befa	getX86SubSuperRegister has a special mode with High=true for i64 which exists solely to enable it to call itself for i8 with some registers. The proposed patch simplifies the function somewhat to make the High bit only meaningful for the i8 mode, which makes sense. No functional difference (getX86SubSuperRegister is not getting called from anywhere outside with i64 and High=true). llvm-svn: 175762	2013-02-21 16:40:18 +00:00
Christian Konig	71088e68e8	R600/SI: inline V_ADD|SUB_F32 patterns Signed-off-by: Christian König <christian.koenig@amd.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> Reviewed-by: Michel Dänzer <michel.daenzer@amd.com> llvm-svn: 175758	2013-02-21 15:17:41 +00:00
Christian Konig	7c9de8e6e8	R600/SI: replace IMPLICIT_DEF with SIOperand.ZERO Signed-off-by: Christian König <christian.koenig@amd.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> Reviewed-by: Michel Dänzer <michel.daenzer@amd.com> llvm-svn: 175757	2013-02-21 15:17:36 +00:00
Christian Konig	2aca043312	R600/SI: replace SI_V_CNDLT with a pattern It actually fixes quite a bunch of piglit tests. This is a candidate for the mesa-stable branch. Signed-off-by: Christian König <christian.koenig@amd.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> Reviewed-by: Michel Dänzer <michel.daenzer@amd.com> llvm-svn: 175756	2013-02-21 15:17:32 +00:00
Christian Konig	8dbe6f617c	R600/SI: use patterns for clamp, fabs, fneg Instead of using custom inserters, it's simpler and should make DAG folding easier. Signed-off-by: Christian König <christian.koenig@amd.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> Reviewed-by: Michel Dänzer <michel.daenzer@amd.com> llvm-svn: 175755	2013-02-21 15:17:27 +00:00
Christian Konig	bf114b42a8	R600/SI: add all the other missing asm operands v2 v2: put implicit parameters in [] Signed-off-by: Christian König <christian.koenig@amd.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> Reviewed-by: Michel Dänzer <michel.daenzer@amd.com> llvm-svn: 175754	2013-02-21 15:17:22 +00:00
Christian Konig	08e768b4cf	R600/SI: add the missing M*BUF|IMG asm operands Signed-off-by: Christian König <christian.koenig@amd.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> Reviewed-by: Michel Dänzer <michel.daenzer@amd.com> llvm-svn: 175753	2013-02-21 15:17:17 +00:00
Christian Konig	e0130a2f25	R600/SI: add the missing S_* asm operands Signed-off-by: Christian König <christian.koenig@amd.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> Reviewed-by: Michel Dänzer <michel.daenzer@amd.com> llvm-svn: 175752	2013-02-21 15:17:13 +00:00
Christian Konig	f5754a011d	R600/SI: rework VOP3 classes Order the classes and add asm operands. Signed-off-by: Christian König <christian.koenig@amd.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> Reviewed-by: Michel Dänzer <michel.daenzer@amd.com> llvm-svn: 175751	2013-02-21 15:17:09 +00:00
Christian Konig	b19849a682	R600/SI: simplify VOPC_* pattern v2 Fixing asm operation names. v2: fix name of the e64 encoding, also add asm operands Signed-off-by: Christian König <christian.koenig@amd.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> Reviewed-by: Michel Dänzer <michel.daenzer@amd.com> llvm-svn: 175750	2013-02-21 15:17:04 +00:00
Christian Konig	ae034e63f1	R600/SI: rework VOP2_* pattern v2 Fixing asm operation names. v2: use ZERO constant, also add asm operands Signed-off-by: Christian König <christian.koenig@amd.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> Reviewed-by: Michel Dänzer <michel.daenzer@amd.com> llvm-svn: 175749	2013-02-21 15:16:58 +00:00
Christian Konig	3da7017e81	R600/SI: rework VOP1_* patterns v2 Fixing asm operation names. v2: use ZERO constant, also add asm operands Signed-off-by: Christian König <christian.koenig@amd.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> Reviewed-by: Michel Dänzer <michel.daenzer@amd.com> llvm-svn: 175748	2013-02-21 15:16:53 +00:00
Christian Konig	eabf8333d6	R600/SI: add constant for inline zero operand Signed-off-by: Christian König <christian.koenig@amd.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> Reviewed-by: Michel Dänzer <michel.daenzer@amd.com> llvm-svn: 175747	2013-02-21 15:16:49 +00:00
Christian Konig	72d5d5c754	R600/SI: cleanup SIInstrInfo.td and SIInstrFormat.td Those two files got mixed up. Signed-off-by: Christian König <christian.koenig@amd.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> Reviewed-by: Michel Dänzer <michel.daenzer@amd.com> llvm-svn: 175746	2013-02-21 15:16:44 +00:00
Tom Stellard	0d171c8877	R600: Fix for Unigine when MachineSched is enabled Fixes for-loop.cl piglit test Patch By: Vincent Lejeune Reviewed-by: Tom Stellard <thomas.stellard@amd.com> NOTE: This is a candidate for the Mesa stable branch. llvm-svn: 175742	2013-02-21 15:06:59 +00:00
Bill Schmidt	49498dac9d	Code review cleanup for r175697 llvm-svn: 175739	2013-02-21 14:35:42 +00:00
Michel Danzer	7f02a8c7a7	R600/SI: Make sure M0 is loaded for V_INTERP_MOV_F32 NOTE: This is a candidate for the Mesa stable branch. Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 175733	2013-02-21 08:57:10 +00:00
Reed Kotler	97ba5f2772	Expand the sel pseudo/macro. This generates basic blocks where previously there were inline br .+4 instructions. Soon everything can enjoy the full instruction scheduling experience. llvm-svn: 175718	2013-02-21 04:22:38 +00:00
Jack Carter	dc46338e2d	Mips specific standalone assembler addressing mode %hi and %lo. The constructs %hi() and %lo() represent the high and low 16 bits of the address. Because the 16 bit offset field of an LW instruction is interpreted as signed, if bit 15 of the low part is 1 then the low part will act as a negative and 1 needs to be added to the high part. Contributor: Vladimir Medic llvm-svn: 175707	2013-02-21 02:09:31 +00:00
Bill Schmidt	f5b474c6c6	PPCDAGToDAGISel::PostprocessISelDAG() This patch implements the PPCDAGToDAGISel::PostprocessISelDAG virtual method to perform post-selection peephole optimizations on the DAG representation. One optimization is implemented here: folds to clean up complex addressing expressions for thread-local storage and medium code model. It will also be useful for large code model sequences when those are added later. I originally thought about doing this on the MI representation prior to register assignment, but it's difficult to do effective global dead code elimination at that point. DCE is trivial on the DAG representation. A typical example of a candidate code sequence in assembly: addis 3, 2, globalvar@toc@ha addi 3, 3, globalvar@toc@l lwz 5, 0(3) When the final instruction is a load or store with an immediate offset of zero, the offset from the add-immediate can replace the zero, provided the relocation information is carried along: addis 3, 2, globalvar@toc@ha lwz 5, globalvar@toc@l(3) Since the addi can in general have multiple uses, we need to only delete the instruction when the last use is removed. llvm-svn: 175697	2013-02-21 00:38:25 +00:00
Bill Schmidt	3822ef2c0c	Relocation enablement for PPC DAG postprocessing pass llvm-svn: 175693	2013-02-21 00:05:29 +00:00
Jack Carter	1ac5322e61	ELF symbol table field st_other support, excluding visibility bits. Mips specific standalone assembler directive "set at". This directive changes the general purpose register that the assembler will use when given the symbolic register name $at. This does not include negative testing. That will come in a future patch. A side effect of this patch recognizes the different GPR register names for temporaries between old abi and new abi so a test case for that is included. Contributor: Vladimir Medic llvm-svn: 175686	2013-02-20 23:11:17 +00:00
Jim Grosbach	d2037eb1ee	MCParser: Update method names per coding guidelines. s/AddDirectiveHandler/addDirectiveHandler/ s/ParseMSInlineAsm/parseMSInlineAsm/ s/ParseIdentifier/parseIdentifier/ s/ParseStringToEndOfStatement/parseStringToEndOfStatement/ s/ParseEscapedString/parseEscapedString/ s/EatToEndOfStatement/eatToEndOfStatement/ s/ParseExpression/parseExpression/ s/ParseParenExpression/parseParenExpression/ s/ParseAbsoluteExpression/parseAbsoluteExpression/ s/CheckForValidSection/checkForValidSection/ http://llvm.org/docs/CodingStandards.html#name-types-functions-variables-and-enumerators-properly No functional change intended. llvm-svn: 175675	2013-02-20 22:21:35 +00:00
Jim Grosbach	d15cd2a11c	R600: Update for name changes from r175667. llvm-svn: 175668	2013-02-20 21:31:28 +00:00
Jim Grosbach	341ad3e72a	Update TargetLowering ivars for name policy. http://llvm.org/docs/CodingStandards.html#name-types-functions-variables-and-enumerators-properly ivars should be camel-case and start with an upper-case letter. A few in TargetLowering were starting with a lower-case letter. No functional change intended. llvm-svn: 175667	2013-02-20 21:13:59 +00:00
Bill Schmidt	c6cbecc2c7	Additional fixes for bug 15155. This handles the cases where the 6-bit splat element is odd, converting to a three-instruction sequence to add or subtract two splats. With this fix, the XFAIL in test/CodeGen/PowerPC/vec_constants.ll is removed. llvm-svn: 175663	2013-02-20 20:41:42 +00:00
Chad Rosier	a018cfd10c	[ms-inline asm] Make the comment a bit more verbose. llvm-svn: 175641	2013-02-20 18:03:44 +00:00
Bill Schmidt	6631e94838	Fix bug 14779 for passing anonymous aggregates [patch by Kai Nacke]. The PPC backend doesn't handle these correctly. This patch uses logic similar to that in the X86 and ARM backends to track these arguments properly. llvm-svn: 175635	2013-02-20 17:31:41 +00:00
Jyotsna Verma	7503a62bce	Hexagon: Move HexagonMCInst.h to MCTargetDesc/HexagonMCInst.h. Add HexagonMCInst class which adds various Hexagon VLIW annotations. In addition, this class also includes some APIs related to the constant extenders. llvm-svn: 175634	2013-02-20 16:13:27 +00:00
Bill Schmidt	51e7951e24	Fix PR15155: lost vadd/vsplat optimization. During lowering of a BUILD_VECTOR, we look for opportunities to use a vector splat. When the splatted value fits in 5 signed bits, a single splat does the job. When it doesn't fit in 5 bits but does fit in 6, and is an even value, we can splat on half the value and add the result to itself. This last optimization hasn't been working recently because of improved constant folding. To circumvent this, create a pseudo VADD_SPLAT that can be expanded during instruction selection. llvm-svn: 175632	2013-02-20 15:50:31 +00:00
Elena Demikhovsky	0ccdd1315b	I optimized the following patterns: sext <4 x i1> to <4 x i64> sext <4 x i8> to <4 x i64> sext <4 x i16> to <4 x i64> I'm running Combine on SIGN_EXTEND_IN_REG and revert SEXT patterns: (sext_in_reg (v4i64 anyext (v4i32 x )), ExtraVT) -> (v4i64 sext (v4i32 sext_in_reg (v4i32 x , ExtraVT))) The sext_in_reg (v4i32 x) may be lowered to shl+sar operations. The "sar" does not exist on 64-bit operation, so lowering sext_in_reg (v4i64 x) has no vector solution. I also added a cost of this operations to the AVX costs table. llvm-svn: 175619	2013-02-20 12:42:54 +00:00
Logan Chien	53c18d8ac7	Fix thumbv5e frame lowering assertion failure. It is possible that frame pointer is not found in the callee saved info, thus FramePtrSpillFI may be incorrect if we don't check the result of hasFP(MF). Besides, if we enable the stack coloring algorithm, there will be an assertion to ensure the slot is live. But in the test case, %var1 is not live in the prologue of the function, and we will get the assertion failure. Note: There is similar code in ARMFrameLowering.cpp. llvm-svn: 175616	2013-02-20 12:21:33 +00:00
David Blaikie	725fda1213	Fix the (clang -Werror) build by removing an unused member variable. llvm-svn: 175607	2013-02-20 07:39:18 +00:00
Reed Kotler	7b503c2b03	Expand pseudos/macros: SltCCRxRy16, SltiCCRxImmX16, SltiuCCRxImmX16, SltuCCRxRy16 $T8 shows up as register $24 when emitted from C++ code so we had to change some tests that were already there for this functionality. llvm-svn: 175593	2013-02-20 05:45:15 +00:00
Jakub Staszak	2be3832d50	Add missing #include. llvm-svn: 175583	2013-02-20 00:31:54 +00:00
Chad Rosier	45a52fa097	[ms-inline asm] Force the use of a base pointer if the MachineFunction includes MS-style inline assembly. This is a follow-on to r175334. Forcing a FP to be emitted doesn't ensure it will be used. Therefore, force the base pointer as well. We now treat MS inline assembly in the same way we treat functions with dynamic stack realignment and VLAs. This guarantees the BP will be used to reference parameters and locals. rdar://13218191 llvm-svn: 175576	2013-02-19 23:50:45 +00:00
Jack Carter	10c97e5ca0	ELF symbol table field st_other support, excluding visibility bits. Mips (o32 abi) specific e_header setting. EF_MIPS_ABI_O32 needs to be set in the ELF header flags for o32 abi output. Contributor: Reed Kotler llvm-svn: 175569	2013-02-19 22:29:00 +00:00
Jack Carter	1ba1f3cec8	ELF symbol table field st_other support, excluding visibility bits. Mips (Mips16) specific e_header setting. EF_MIPS_ARCH_ASE_M16 needs to be set in the ELF header flags for Mips16. Contributor: Reed Kotler llvm-svn: 175566	2013-02-19 22:14:34 +00:00
Jack Carter	ab3cb425aa	ELF symbol table field st_other support, excluding visibility bits. Mips (MicroMips) specific STO handling. The st_other field setting for STO_MIPS_MICROMIPS. Contributor: Zoran Jovanovic llvm-svn: 175564	2013-02-19 22:04:37 +00:00
Jakub Staszak	e167cf5c4d	Add obvious constantness. llvm-svn: 175560	2013-02-19 21:54:59 +00:00
Arnold Schwaighofer	e4df5eb34a	ARM NEON: Don't need COPY_TO_REGCLASS in pattern In my previous commit: "Merge a f32 bitcast of a v2i32 extractelt A vectorized sitofp on doubles will get scalarized to a sequence of an extract_element of <2 x i32>, a bitcast to f32 and a sitofp. Due to the extract_element and the bitcast, we will unnecessarily generate moves between scalar and vector registers." I added a pattern containing a copy_to_regclass. The copy_to_regclass is actually not needed. radar://13191881 llvm-svn: 175555	2013-02-19 20:16:45 +00:00
Jim Grosbach	3fa275e6f7	ARM: Allocation hints must make sure to be in the alloc order. When creating an allocation hint for a register pair, make sure the hint for the physical register reference is still in the allocation order. rdar://13240556 llvm-svn: 175541	2013-02-19 18:55:36 +00:00
Jyotsna Verma	e758da2080	Hexagon: Sync TSFlags in MCTargetDesc/HexagonBaseInfo.h with HexagonInstrFormats.td. llvm-svn: 175537	2013-02-19 18:18:36 +00:00
Benjamin Kramer	1cb826b0ad	Clean up HiPE prologue emission a bit and avoid signed arithmetic tricks. No intended functionality change. llvm-svn: 175536	2013-02-19 17:32:57 +00:00
Rafael Espindola	1c040b5788	Move LLVM_LIBRARY_VISIBILITY for consistency with what was done to PPCJITInfo.cpp in r175394. llvm-svn: 175531	2013-02-19 17:14:33 +00:00
Eli Bendersky	6aa4fc389e	Make ARMAsmPrinter pass name more precise and fix comment. llvm-svn: 175527	2013-02-19 16:47:59 +00:00
Eli Bendersky	b0b13b22a3	Make pass name more precise and fix comment. llvm-svn: 175525	2013-02-19 16:38:32 +00:00
Arnold Schwaighofer	e5083442b2	ARM NEON: Merge a f32 bitcast of a v2i32 extractelt A vectorized sitofp on doubles will get scalarized to a sequence of an extract_element of <2 x i32>, a bitcast to f32 and a sitofp. Due to the extract_element and the bitcast, we will unnecessarily generate moves between scalar and vector registers. The patch fixes this by using a COPY_TO_REGCLASS and an EXTRACT_SUBREG to extract the element from the vector instead. radar://13191881 llvm-svn: 175520	2013-02-19 15:27:05 +00:00
Tom Stellard	d4409e2cec	R600: Add AR_X to the R600_TReg_X register class. NOTE: This is a candidate for the Mesa stable branch. llvm-svn: 175519	2013-02-19 15:22:47 +00:00
Tom Stellard	a24a516737	R600: Mark all members of the TRegMem register class as reserved This stops the Machine Verifier from complaining about uses of undefined physical registers. NOTE: This is a candidate for the Mesa stable branch. llvm-svn: 175518	2013-02-19 15:22:45 +00:00
Tom Stellard	8d469edbe3	R600: Fix scheduler crash caused by invalid MachinePointerInfo Kernel function arguments are lowered to loads from the PARAM_I address space. When creating these load instructions, we were initializing their MachinePointerInfo with an Argument object that was not attached to any function. This was causing the MachineScheduler to crash when it tried to access the parent of the Argument. This has been fixed by initializing the MachinePointerInfo with an UndefValue instead. NOTE: This is a candidate for the Mesa stable branch. llvm-svn: 175517	2013-02-19 15:22:44 +00:00
Tom Stellard	0f965aaf9b	R600: Fix tracking of implicit defs in the IndirectAddressing pass In some cases, we were losing track of live implicit registers which was creating dead defs and causing the scheduler to produce invalid code. NOTE: This is a candidate for the Mesa stable branch. llvm-svn: 175516	2013-02-19 15:22:42 +00:00
Craig Topper	f371e89264	Fix capitalization in comment to match function name. llvm-svn: 175497	2013-02-19 07:43:59 +00:00
Reed Kotler	3e457f505e	Expand pseudos/macros BteqzT8SltiX16, BteqzT8SltiuX16, BtnezT8SltiX16, BtnezT8SltiuX16 . llvm-svn: 175486	2013-02-19 03:56:57 +00:00
Chandler Carruth	c2fd56a21d	Remove some unused private fields from the AArch64MCCodeEmitter. These fields were only ever set in the constructor. The create method retains its consistent interface so that these bits can be re-threaded through the emitter if they're ever needed. This was found by the -Wunused-private-field Clang warning. llvm-svn: 175482	2013-02-19 02:08:14 +00:00
Reed Kotler	d82171990f	Expand pseudos BteqzT8CmpiX16 and BtnezT8CmpiX16. llvm-svn: 175474	2013-02-19 00:20:58 +00:00
Jakub Staszak	1f199a0ef2	Use array_pod_sort instead of std::sort. llvm-svn: 175472	2013-02-18 23:18:22 +00:00
NAKAMURA Takumi	3a8002f61d	X86FrameLowering.cpp: Fixup. Sorry for the breakage. llvm-svn: 175467	2013-02-18 23:15:21 +00:00
David Blaikie	772d4f75f6	Use LLVM_DELETED_FUNCTION rather than '// do not implement' comments. Also removes some redundant DNI comments on function declarations already using the macro. llvm-svn: 175466	2013-02-18 23:11:17 +00:00
NAKAMURA Takumi	a614ec7e6f	X86FrameLowering.cpp: Fix a warning in -Asserts. [-Wunused-variable] llvm-svn: 175464	2013-02-18 23:08:49 +00:00
Chad Rosier	441e81287f	Remove a useless assert. llvm-svn: 175463	2013-02-18 22:20:16 +00:00
Chad Rosier	f3f8f443e1	[fast-isel] Remove an invalid assert. If the memcpy has an odd length with an alignment of 2, this would incorrectly assert on the last 1 byte copy. rdar://13202135 llvm-svn: 175459	2013-02-18 21:46:28 +00:00
Benjamin Kramer	5c6e653b72	Fix a 32/64 bit incompatibility in the HiPE prologue generation. llvm-svn: 175458	2013-02-18 21:45:01 +00:00
Benjamin Kramer	53bc37ca2a	Support for HiPE-compatible code emission, patch by Yiannis Tsiouris. llvm-svn: 175457	2013-02-18 20:55:12 +00:00
Vincent Lejeune	1ce13f553e	R600/SI: Use MULADD_IEEE/V_MAD_F32 instruction for mad pattern llvm-svn: 175446	2013-02-18 14:11:28 +00:00
Vincent Lejeune	685018009b	R600: Support for TBO NOTE: This is a candidate for the Mesa stable branch. Reviewed-by: Tom Stellard <thomas.stellard at amd.com> llvm-svn: 175445	2013-02-18 14:11:19 +00:00
Vincent Lejeune	4c1602b5c9	R600: Increase number of ArrayBase Reg to 32 Reviewed-by: Tom Stellard <thomas.stellard at amd.com> llvm-svn: 175443	2013-02-18 13:48:09 +00:00
Reed Kotler	1460738710	Expand macro/pseudo instructions BtnezT8SltX16 and BtnezT8SltuX16. llvm-svn: 175420	2013-02-18 05:43:03 +00:00
Reed Kotler	6879e56dc7	Expand pseudo/macro BteqzT8SltuX16 . There is no test case because at this time, llvm is generating a different but equivalent pattern that would lead to this instruction. I am trying to think of a way to get it to generate this. If I can't, I may just remove the pseudo. llvm-svn: 175419	2013-02-18 04:55:38 +00:00
Reed Kotler	c40f4e5899	Expand pseudo/macro BteqzT8SltX16. llvm-svn: 175417	2013-02-18 04:04:26 +00:00
Reed Kotler	7e4bc6067b	Expand macro/pseudo BteqzT8CmpX16. llvm-svn: 175416	2013-02-18 03:06:29 +00:00
Reed Kotler	cb37409b92	Beginning of expanding all current mips16 macro/pseudo instruction sequences. This expansion will be moved to expandISelPseudos as soon as I can figure out how to do that. There are other instructions which use this ExpandFEXT_T8I816_ins and as soon as I have finished expanding them all, I will delete the macro asm string text so it has no way to be used in the future. llvm-svn: 175413	2013-02-18 00:59:04 +00:00
Benjamin Kramer	189fc5819a	X86: Add a note. llvm-svn: 175408	2013-02-17 23:34:14 +00:00
Richard Osborne	53fff94527	[XCore] Add missing 2r instructions. These instructions are not targeted by the compiler but it is needed for the MC layer. llvm-svn: 175407	2013-02-17 22:38:05 +00:00
Richard Osborne	f5a3ffcba9	[XCore] Add TSETR instruction. This instruction is not targeted by the compiler but it is needed for the MC layer. llvm-svn: 175406	2013-02-17 22:32:41 +00:00
Richard Osborne	2192615d9f	[XCore] Add missing u10 / lu10 instructions. These instructions are not targeted by the compiler but they are needed for the MC layer. llvm-svn: 175404	2013-02-17 20:44:48 +00:00
Richard Osborne	3814491fb1	[XCore] Add missing u6 / lu6 instructions. These instructions are not targeted by the compiler but they are needed for the MC layer. llvm-svn: 175403	2013-02-17 20:43:17 +00:00
Jakub Staszak	74010cd9c2	Return false instead of 0. llvm-svn: 175402	2013-02-17 18:35:25 +00:00
Benjamin Kramer	a5dce35cba	AArch64: Avoid shifts by 64, that's undefined behavior. No functionality change. llvm-svn: 175400	2013-02-17 17:55:32 +00:00
Benjamin Kramer	de712b788b	Make the visibility of LLVMPPCCompilationCallback work with GCC. GCC warns about the attribute being ignored if it occurs after void. There seems to be some kind of incompatibility between clang and gcc here, but I can't fathom who's right. void LLVM_LIBRARY_VISIBILITY foo(); // clang: hidden, gcc: default LLVM_LIBRARY_VISIBILITY void *bar(); // clang: hidden, gcc: hidden void LLVM_LIBRARY_VISIBILITY qux(); // clang: hidden, gcc: hidden llvm-svn: 175394	2013-02-17 14:30:32 +00:00
Reed Kotler	61b474f97d	Clean up mips16 td file in preparation for massive pseudo lowering work. llvm-svn: 175379	2013-02-16 23:39:52 +00:00
Renato Golin	b2603ede95	Typo llvm-svn: 175371	2013-02-16 19:14:59 +00:00
Reed Kotler	188dad0eeb	One more try to make this look nice. I have lots of pseudo lowering as well as 16/32 bit variants to do and so I want this to look nice when I do it. I've been experimenting with this. No new test cases are needed. llvm-svn: 175369	2013-02-16 19:04:29 +00:00
NAKAMURA Takumi	f86d12caf0	[msvc x64] Update X86CompilationCallback_Win64.asm corresponding to r175267. llvm-svn: 175363	2013-02-16 16:04:29 +00:00
NAKAMURA Takumi	eddbc713e1	Target/R600/CMakeLists.txt: Prune SILowerLiteralConstants.cpp corresponding to r175354. llvm-svn: 175361	2013-02-16 15:30:28 +00:00
Jakub Staszak	d784d96074	Minor cleanups. No functionality change. llvm-svn: 175359	2013-02-16 13:34:26 +00:00
Christian Konig	b559b079b4	R600/SI: Add pattern to simplify i64 loading This is a candidate for the stable branch. Signed-off-by: Christian König <christian.koenig@amd.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 175356	2013-02-16 11:28:36 +00:00
Christian Konig	a881179ffe	R600/SI: nuke SReg_1 v3 It's completely unnecessary and can be replaced with proper SReg_64 handling instead. This actually fixes a piglit test on SI. v2: use correct register class in addRegisterClass, set special classes as not allocatable v3: revert setting special classes as not allocatable This is a candidate for the stable branch. Signed-off-by: Christian König <christian.koenig@amd.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 175355	2013-02-16 11:28:30 +00:00
Christian Konig	c756cb9901	R600/SI: cleanup literal handling v3 Seems to be a lot simpler, and also paves the way for further improvements. v2: rebased on master, use 0 in BUFFER_LOAD_FORMAT_XYZW, use VGPR0 in dummy EXP, avoid compiler warning, break after encoding the first literal. v3: correctly use V_ADD_F32_e64 This is a candidate for the stable branch. Signed-off-by: Christian König <christian.koenig@amd.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 175354	2013-02-16 11:28:22 +00:00
Christian Konig	b9e281a723	R600/SI: replace AllReg_* with [SV]Src_* v2 Mark all the operands that can also have an immediate. v2: SOFFSET is also an SSrc_32 operand This is a candidate for the stable branch. Signed-off-by: Christian König <christian.koenig@amd.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 175353	2013-02-16 11:28:13 +00:00
Christian Konig	3c3a7bfb06	R600/SI: fix VOPC encoding v2 Previously it only worked by coincidence. v2: fix 64bit versions, use 0x80 (inline 0) instead of SGPR0 for the unused SRC2 This is a candidate for the stable branch. Signed-off-by: Christian König <christian.koenig@amd.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 175352	2013-02-16 11:28:07 +00:00
Christian Konig	e3cba88714	R600/SI: move *_Helper definitions to SIInstrFormat.td This is a candidate for the stable branch. Signed-off-by: Christian König <christian.koenig@amd.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 175351	2013-02-16 11:28:02 +00:00
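
Note on the %hi()/%lo() entry (llvm-svn: 175707): the carry adjustment that message describes can be sketched in a few lines. This is only an illustration of the arithmetic, not code from the LLVM tree; the function names split_hi_lo, sign_extend16 and rebuild are hypothetical, and a 32-bit address is assumed.

```python
# Illustrative sketch only: hypothetical helper names, 32-bit addresses assumed.

def split_hi_lo(addr):
    """Return the (%hi, %lo) 16-bit fields for a 32-bit address."""
    addr &= 0xFFFFFFFF
    lo = addr & 0xFFFF
    # Adding 0x8000 before shifting bumps the high part by 1 exactly
    # when bit 15 of the low part is set, compensating for the fact
    # that the low part is later sign-extended (i.e. acts as negative).
    hi = ((addr + 0x8000) >> 16) & 0xFFFF
    return hi, lo


def sign_extend16(value):
    """Interpret a 16-bit field as a signed offset, as LW does."""
    return value - 0x10000 if value & 0x8000 else value


def rebuild(hi, lo):
    """Recombine the two fields the way a lui + lw pair would."""
    return ((hi << 16) + sign_extend16(lo)) & 0xFFFFFFFF


if __name__ == "__main__":
    for address in (0x12347FFF, 0x12348000, 0xFFFF8000):
        hi, lo = split_hi_lo(address)
        print(hex(address), hex(hi), hex(lo), hex(rebuild(hi, lo)))
```

For an address such as 0x12348000 the low half has bit 15 set, so the high half becomes 0x1235 rather than 0x1234, and the sign-extended low half (-0x8000) brings the sum back to the original address.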

23654 Commits