llvm-project

Commit Graph

Author	SHA1	Message	Date
Christian Konig	8590c1e371	R600/SI: remove some more unused code This is a candidate for the stable branch. Signed-off-by: Christian König <christian.koenig@amd.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 175350	2013-02-16 11:27:56 +00:00
Christian Konig	d886099f13	R600/structurizer: improve inverting conditions Stop adding more instructions than necessary. This is a candidate for the stable branch. Signed-off-by: Christian König <christian.koenig@amd.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 175349	2013-02-16 11:27:50 +00:00
Christian Konig	fc6a985c12	R600/structurizer: improve loop handling Generate more than one loop if it seems to make sense. This is a candidate for the stable branch. Signed-off-by: Christian König <christian.koenig@amd.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 175348	2013-02-16 11:27:45 +00:00
Christian Konig	b5d8866b84	R600/structurizer: improve finding condition values Using the new NearestCommonDominator class. This is a candidate for the stable branch. Signed-off-by: Christian König <christian.koenig@amd.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 175347	2013-02-16 11:27:40 +00:00
Christian Konig	0bccf9d60b	R600/structurizer: improve PHI value finding Using the new NearestCommonDominator class. This is a candidate for the stable branch. Signed-off-by: Christian König <christian.koenig@amd.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 175346	2013-02-16 11:27:35 +00:00
Christian Konig	d08e3d753e	R600/structurizer: add class to find the Nearest Common Dominator This is a candidate for the stable branch. Signed-off-by: Christian König <christian.koenig@amd.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 175345	2013-02-16 11:27:29 +00:00
Reed Kotler	8cf5103b2b	Use a different scheme to chose 16/32 variants. This scheme is more consistent with how BuildMI works. No new tests needed. All should work the same as before. llvm-svn: 175342	2013-02-16 09:47:57 +00:00
Bill Wendling	61375d8953	Reinitialize the ivars in the subtarget so that they can be reset with the new features. llvm-svn: 175336	2013-02-16 01:36:26 +00:00
Chad Rosier	925c9b499e	[ms-inline asm] Do not omit the frame pointer if we have ms-inline assembly. If the frame pointer is omitted, and any stack changes occur in the inline assembly, e.g.: "pusha", then any C local variable or C argument references will be incorrect. I pass no judgement on anyone who would do such a thing. ;) rdar://13218191 llvm-svn: 175334	2013-02-16 01:25:28 +00:00
Akira Hatanaka	a35bc832a0	[mips] Remove SDNPWantParent from the list of SDNodeProperties. No functionality change intended. llvm-svn: 175325	2013-02-16 00:14:37 +00:00
Bill Wendling	e9434778f7	Temporary revert of 175320. llvm-svn: 175322	2013-02-15 23:22:32 +00:00
Bill Wendling	a060d0efd8	Reinitialize the ivars in the subtarget. When we're recalculating the feature set of the subtarget, we need to have the ivars in their initial state. llvm-svn: 175320	2013-02-15 23:18:01 +00:00
Bill Wendling	5a92eeca6b	Support changing the subtarget features in ARM. llvm-svn: 175315	2013-02-15 22:41:25 +00:00
Bill Wendling	aef9c37c65	Use the 'target-features' and 'target-cpu' attributes to reset the subtarget features. If two functions require different features (e.g., `-mno-sse' vs. `-msse') then we want to honor that, especially during LTO. We can do that by resetting the subtarget's features depending upon the 'target-feature' attribute. llvm-svn: 175314	2013-02-15 22:31:27 +00:00
Chad Rosier	a915bbf4a1	[ms-inline asm] Adjust the EndLoc to account for the ']'. llvm-svn: 175312	2013-02-15 21:58:13 +00:00
Akira Hatanaka	5001be54ad	[mips] Clean up class MipsCCInfo. No functionality change intended. llvm-svn: 175310	2013-02-15 21:45:11 +00:00
Akira Hatanaka	69fb3d11ec	[mips] Split SelectAddr, which was used to match address patterns, into two functions. Set AddedComplexity to determine the order in which patterns are matched. This simplifies selection of floating point loads/stores. No functionality change intended. llvm-svn: 175300	2013-02-15 21:20:45 +00:00
Reed Kotler	76c9bcd43a	Remove a final dependency on the form field in tablegen; which is a remnant of the old jit and which we don't intend to support in mips16 or micromips. This dependency is for the testing of whether an instruction is a pseudo. llvm-svn: 175297	2013-02-15 21:05:58 +00:00
Jyotsna Verma	a556848131	Hexagon: Set appropriate TSFlags to the loads/stores with global address to support constant extension. This patch doesn't introduce any functionality changes. llvm-svn: 175280	2013-02-15 17:52:07 +00:00
Tim Northover	2e44769ed2	AArch64: add branch fixup pass. This is essentially a stripped-down version of the ConstandIslands pass (which always had these two functions), providing just the features necessary for correctness. In particular there needs to be a way to resolve the situation where a conditional branch's destination block ends up out of range. This issue crops up when self-hosting for AArch64. llvm-svn: 175269	2013-02-15 14:32:20 +00:00
Rafael Espindola	91cbcbb909	Give these callbacks hidden visibility. It is better to not export them more than we need to and some ELF linkers complain about directly accessing symbols with default visibility. llvm-svn: 175268	2013-02-15 14:15:59 +00:00
Rafael Espindola	9b7d4004bc	Don't make assumptions about the mangling of static functions in extern "C" blocks. We still don't have consensus if we should try to change clang or the standard, but llvm should work with compilers that implement the current standard and mangle those functions. llvm-svn: 175267	2013-02-15 14:08:43 +00:00
Benjamin Kramer	6ecb1e78a9	Make helpers static. Add missing include so LLVMInitializeObjCARCOpts gets C linkage. llvm-svn: 175264	2013-02-15 12:30:38 +00:00
Tim Northover	3533ad6bbd	AArch64: remove ConstantIsland pass & put literals in separate section. This implements the review suggestion to simplify the AArch64 backend. If we later discover that we really need the extra complexity of the ConstantIslands pass for performance reasons it can be resurrected. llvm-svn: 175258	2013-02-15 09:33:43 +00:00
Tim Northover	5466e36fb5	AArch64: refactor frame handling to use movz/movk for overlarge offsets. In the near future litpools will be in a different section, which means that any access to them is at least two instructions. This makes the case for a movz/movk pair (if total offset <= 32-bits) even more compelling. llvm-svn: 175257	2013-02-15 09:33:26 +00:00
Reed Kotler	f022147790	Fix minor mips16 issues in directives for function prologue. Probably this does not matter but makes it more gcc compatible which avoids possible subtle problems. Also, turned back on a disabled check in helloworld.ll. llvm-svn: 175237	2013-02-15 01:04:38 +00:00
Akira Hatanaka	30f05f3dc7	[mips] Disallow moving load/store instructions past volatile instructions. Unfortunately, I wasn't able to create a test case that demonstrates the problem I was trying to fix with this patch. llvm-svn: 175226	2013-02-14 23:54:40 +00:00
Akira Hatanaka	06bd138dad	[mips] Replace usage of SmallSet with BitVector, which is used to keep track of defined and used registers. Also add a few helper functions to simplify the code. llvm-svn: 175224	2013-02-14 23:40:57 +00:00
Akira Hatanaka	1083eb175c	[mips] Fix comments and coding style violations. Declare functions to be const. llvm-svn: 175222	2013-02-14 23:20:15 +00:00
Joel Jones	0f8617b17e	The ARM NEON vector compare instructions take three arguments. However, the assembler should also accept a two arg form, as the docuemntation specifies that the first (destination) register is optional. This patch uses TwoOperandAliasConstraint to add the two argument form. It also fixes an 80-column formatting problem in: test/MC/ARM/neon-bitwise-encoding <rdar://problem/12909419> Clang rejects ARM NEON assembly instructions llvm-svn: 175221	2013-02-14 23:18:40 +00:00
Eli Bendersky	a1c6635ca3	The operand listing is very much outdated. llvm-svn: 175220	2013-02-14 23:17:03 +00:00
Akira Hatanaka	dfd2f24d0c	[mips] Simplify code in function Filler::findDelayInstr. 1. Define and use function terminateSearch. 2. Use MachineBasicBlock::iterator instead of MachineBasicBlock::instr_iterator. 3. Delete the line which checks whether an instruction is a pseudo. llvm-svn: 175219	2013-02-14 23:11:24 +00:00
Jakub Staszak	701cc97e92	Simplify code. Remove "else after return". llvm-svn: 175212	2013-02-14 21:50:09 +00:00
Jyotsna Verma	de722193e5	Hexagon: Change insn class to support instruction encoding. This patch doesn't introduce any functionality changes. It adds some new fields to the Hexagon instruction classes and changes their layout to support instruction encoding. llvm-svn: 175205	2013-02-14 19:57:17 +00:00
Kay Tiong Khoo	f809c6491d	added basic support for Intel ADX instructions -feature flag, instructions definitions, test cases llvm-svn: 175196	2013-02-14 19:08:21 +00:00
Michel Danzer	e9bb18b555	R600/SI: Fix int_SI_fs_interp_constant The important fix is that the constant interpolation value is stored in the parameter slot P0, which is encoded as 2. In addition, drop the SI_INTERP_CONST pseudo instruction, pass the parameter slot as an operand to V_INTERP_MOV_F32 instead of hardcoding it there, and add a special operand class for the parameter slots for type checking and pretty printing. NOTE: This is a candidate for the Mesa stable branch. Reviewed-by: Christian König <christian.koenig@amd.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 175193	2013-02-14 19:03:25 +00:00
Nadav Rotem	accb0c747c	80-col llvm-svn: 175189	2013-02-14 18:20:48 +00:00
Jyotsna Verma	3545d2fc41	Hexagon: Use multiclass for absolute addressing mode loads. This patch doesn't introduce any functionality changes. llvm-svn: 175187	2013-02-14 18:15:29 +00:00
Weiming Zhao	c598700788	Re-apply r175088 for bug fix 13622: Add paired register support for inline asm with 64-bit data on ARM Update test case to use -mtriple=arm-linux-gnueabi llvm-svn: 175186	2013-02-14 18:10:21 +00:00
Vincent Lejeune	f940fd05bd	R600: Do not fold single instruction with more that 3 kcache read It fixes around 100 tfb piglit tests and 16 glean tests. NOTE: This is a candidate for the Mesa stable branch. Reviewed-by: Tom Stellard <thomas.stellard at amd.com> llvm-svn: 175183	2013-02-14 16:57:19 +00:00
Vincent Lejeune	ea710fe419	R600: Export instructions are no longer terminator This allows MachineInstScheduler to reorder them, and thus make scheduling more efficient. Reviewed-by: Tom Stellard <thomas.stellard at amd.com> llvm-svn: 175182	2013-02-14 16:55:11 +00:00
Vincent Lejeune	d80bc1561a	R600: Fold zero/one in export instructions Reviewed-by: Tom Stellard <thomas.stellard at amd.com> llvm-svn: 175181	2013-02-14 16:55:06 +00:00
Vincent Lejeune	f694c10c8e	R600: Do not fold modifier/litterals in vector inst This fixes a couple of regressions on (probably not just) cayman NOTE: This is a candidate for the Mesa stable branch. Reviewed-by: Tom Stellard <thomas.stellard at amd.com> llvm-svn: 175180	2013-02-14 16:55:01 +00:00
Tim Northover	d514fd59a6	AArch64: switch from neverHasSideEffects to hasSideEffects. llvm-svn: 175176	2013-02-14 16:31:12 +00:00
Tim Northover	d21ddb9042	AArch64: stop claiming that NEON registers are usable for now. If vector types have legal register classes, then LLVM bypasses LegalizeTypes on them, which causes faults currently since the code to handle them isn't in place. This fixes test failures when AArch64 is the default target. llvm-svn: 175172	2013-02-14 16:22:14 +00:00
Tim Northover	75f436c4ea	AArch64: add block comments where missing Only comments affected. No code change at all. llvm-svn: 175169	2013-02-14 16:17:01 +00:00
Kristof Beyls	2efb59a719	Make ARMAsmParser accept the correct alignment specifier syntax in instructions. The parser will now accept instructions with alignment specifiers written like vld1.8 {d16}, [r0:64] , while also still accepting the incorrect syntax vld1.8 {d16}, [r0, :64] llvm-svn: 175164	2013-02-14 14:46:12 +00:00
Elena Demikhovsky	d0a0cc80cd	Fixed a bug in X86TargetLowering::LowerVectorIntExtend() (assertion failure). Added a test. llvm-svn: 175144	2013-02-14 08:20:26 +00:00
Michel Danzer	ae0a403dab	R600/SI: Check for empty stack in SIAnnotateControlFlow::isTopOfStack Fixes assertion failure in newly added lit test. Might just be a bandaid that needs to be revisited. llvm-svn: 175139	2013-02-14 08:00:33 +00:00
Rafael Espindola	8868faac14	Revert r175120 and r175121. Clang is producing the expected asm names again. llvm-svn: 175133	2013-02-14 03:33:34 +00:00
Reed Kotler	ec8a54904e	Remove the form field from Mips16 instruction formats and set things up so that we can apply the direct object emitter patch. This patch should be a nop right now and it's test is to not break what is already there. llvm-svn: 175126	2013-02-14 03:05:25 +00:00
Rafael Espindola	3c818086f2	Don't assume the mangling of static functions. llvm-svn: 175121	2013-02-14 02:49:18 +00:00
Rafael Espindola	764993493c	Don't asume that a static function in an extern "C" block will not be mangled. Since functions with internal linkage don't have language linkage, it is valid to overload them: extern "C" { static int foo(); static int foo(int); } So we mangle them. llvm-svn: 175120	2013-02-14 01:58:08 +00:00
Weiming Zhao	090edf7e67	temporarily revert the patch due to some conflicts llvm-svn: 175107	2013-02-13 23:24:40 +00:00
Anshuman Dasgupta	e96f804eba	Hexagon: add support for predicate-GPR copies. llvm-svn: 175102	2013-02-13 22:56:34 +00:00
Tom Stellard	91da4e9199	R600: Add support for 128-bit parameters NOTE: This is a candidate for the Mesa stable branch. llvm-svn: 175096	2013-02-13 22:05:20 +00:00
Nick Lewycky	beba972659	Don't build tail calls to functions with three inreg arguments on x86-32 PIC. Fixes PR15250! llvm-svn: 175092	2013-02-13 21:59:15 +00:00
Weiming Zhao	0632a4b002	Bug fix 13622: Add paired register support for inline asm with 64-bit data on ARM llvm-svn: 175088	2013-02-13 21:43:02 +00:00
Jyotsna Verma	d92252469e	Hexagon: Use absolute addressing mode loads/stores for global+offset instead of redefining separate instructions for them. llvm-svn: 175086	2013-02-13 21:38:46 +00:00
Chad Rosier	282edd7caa	[ms-inline-asm] Add support for memory references that have non-immediate displacements. rdar://12974533 llvm-svn: 175083	2013-02-13 21:33:44 +00:00
Reed Kotler	f662cff689	For Mips 16, add the optimization where the 16 bit form of addiu sp can be used if the offset fits in 11 bits. This makes use of the fact that the abi requires sp to be 8 byte aligned so the actual offset can fit in 8 bits. It will be shifted left and sign extended before being actually used. The assembler or direct object emitter will shift right the 11 bit signed field by 3 bits. We don't need to deal with that here. llvm-svn: 175073	2013-02-13 20:28:27 +00:00
Andrew Trick	553e0fe365	MIsched: HazardRecognizers are created for each DAG. Free them. llvm-svn: 175067	2013-02-13 19:22:27 +00:00
Krzysztof Parzyszek	2680b53d90	Add registration for PPC-specific passes to allow the IR to be dumped via -print-after-all. llvm-svn: 175058	2013-02-13 17:40:07 +00:00
Benjamin Kramer	8e2637e2b0	X86: Disable generation of rep;movsl when %esi is used as a base pointer. This happens when there is both stack realignment and a dynamic alloca in the function. If we overwrite %esi (rep;movsl uses fixed registers) we'll lose the base pointer and the next register spill will write into oblivion. Fixes PR15249 and unbreaks firefox on i386/freebsd. Mozilla uses dynamic allocas and freebsd a 4 byte stack alignment. llvm-svn: 175057	2013-02-13 13:40:35 +00:00
Reed Kotler	9cb8e7b9f5	Make jumptables work for -static llvm-svn: 175044	2013-02-13 08:32:14 +00:00
Elena Demikhovsky	9e0df7cb01	Prevent insertion of "vzeroupper" before call that preserves YMM registers, since a caller uses preserved registers across the call. llvm-svn: 175043	2013-02-13 08:02:04 +00:00
Eric Christopher	389ee71b0a	Check i1 as well as i8 variables for 8 bit registers for x86 inline assembly. llvm-svn: 175036	2013-02-13 06:01:05 +00:00
David Peixotto	4299cf83a3	Test commit. Fixed typo. llvm-svn: 175020	2013-02-13 00:36:35 +00:00
Jyotsna Verma	39f7a2b7a0	Hexagon: Add support to generate predicated absolute addressing mode instructions. llvm-svn: 174973	2013-02-12 16:06:23 +00:00
Justin Holewinski	be8dc6499a	[NVPTX] Disable vector registers Vectors were being manually scalarized by the backend. Instead, let the target-independent code do all of the work. The manual scalarization was from a time before good target-independent support for scalarization in LLVM. However, this forces us to specially-handle vector loads and stores, which we can turn into PTX instructions that produce/consume multiple operands. llvm-svn: 174968	2013-02-12 14:18:49 +00:00
Michel Danzer	3bb17ebd93	R600: Fix regression with shadow array sampler on pre-SI GPUs. 'R600/SI: Use proper instructions for array/shadow samplers.' removed two cases from TEX_SHADOW. Vincent Lejeune reported on IRC that this broke some shadow array piglit tests with the r600g driver. Reinstating the removed cases should fix this, and still works with radeonsi as well. I will follow up with some lit tests which would have caught the regression. NOTE: This is a candidate for the Mesa stable branch. Tested-by: Vincent Lejeune <vljn@ovi.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 174963	2013-02-12 12:11:23 +00:00
Arnold Schwaighofer	89aef93841	ARM cost model: Add vector reverse shuffle costs A reverse shuffle is lowered to a vrev and possibly a vext instruction (quad word). radar://13171406 llvm-svn: 174933	2013-02-12 02:40:39 +00:00
Arnold Schwaighofer	1f3d3ca769	ARM NEON: Handle v16i8 and v8i16 reverse shuffles Lower reverse shuffles to a vrev64 and a vext instruction instead of the default legalization of storing and loading to the stack. This is important because we generate reverse shuffles in the loop vectorizer when we reverse store to an array. uint8_t Arr[N]; for (i = 0; i < N; ++i) Arr[N - i - 1] = ... radar://13171760 llvm-svn: 174929	2013-02-12 01:58:32 +00:00
Kay Tiong Khoo	ab588efe42	Added 0x0D to 2-byte opcode extension table for prefetch* variants Fixed decode of existing 3dNow prefetchw instruction Intel is scheduled to add a compatible prefetchw (same encoding) to future CPUs llvm-svn: 174920	2013-02-12 00:19:12 +00:00
Akira Hatanaka	bf1af1acc7	[mips] Expand pseudo instructions before they are emitted in MipsCodeEmitter.cpp. JALR and NOP are expanded by function emitPseudoExpansionLowering, which is not called when the old JIT is used. This fixes the following tests which have been failing on llvm-mips-linux builder: LLVM :: ExecutionEngine__2003-01-04-LoopTest.ll LLVM :: ExecutionEngine__2003-05-06-LivenessClobber.ll LLVM :: ExecutionEngine__2003-06-04-bzip2-bug.ll LLVM :: ExecutionEngine__2005-12-02-TailCallBug.ll LLVM :: ExecutionEngine__2003-10-18-PHINode-ConstantExpr-CondCode-Failure.ll LLVM :: ExecutionEngine__hello2.ll LLVM :: ExecutionEngine__stubs.ll LLVM :: ExecutionEngine__test-branch.ll LLVM :: ExecutionEngine__test-call.ll LLVM :: ExecutionEngine__test-common-symbols.ll LLVM :: ExecutionEngine__test-loadstore.ll LLVM :: ExecutionEngine__test-loop.ll llvm-svn: 174912	2013-02-11 22:35:40 +00:00
Akira Hatanaka	3d38609fdd	[mips] Fix indentation. llvm-svn: 174907	2013-02-11 22:03:52 +00:00
Krzysztof Parzyszek	9a278f108a	Extend Hexagon hardware loop generation to handle various additional cases: - variety of compare instructions, - loops with no preheader, - arbitrary lower and upper bounds. llvm-svn: 174904	2013-02-11 21:37:55 +00:00
Krzysztof Parzyszek	cfe285e604	Implement HexagonInstrInfo::analyzeCompare. llvm-svn: 174901	2013-02-11 20:04:29 +00:00
Kay Tiong Khoo	d30b1a2ac7	fixed disassembly of some i386 system insts with intel syntax added file for test cases for i386 intel syntax llvm-svn: 174900	2013-02-11 19:46:36 +00:00
Michel Danzer	10ed47f927	R600/SI: Use V_ADD_F32 instead of V_MOV_B32 for clamp/neg/abs modifiers. The modifiers don't seem to have any effect with V_MOV_B32, supposedly it's meant to just move bits untouched. Fixes 46 piglit tests with radeonsi, though unfortunately 11 of those had just regressed because they started using the clamp modifier. NOTE: This is a candidate for the Mesa stable branch. Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 174890	2013-02-11 15:58:21 +00:00
Tim Northover	be867971cb	AArch64: fix build on some MSVC versions This does two things: It removes a call to abs() which may have "long long" parameter on Windows, which is not necessarily available in C++03. It also corrects the signedness of Amount, which was relying on implementation-defined conversions previously. Code was already tested (albeit in an implemnetation defined way) so no extra tests. llvm-svn: 174885	2013-02-11 14:25:52 +00:00
Tim Northover	e206778833	AArch64: Simplify logic in deciding whether bfi is valid Previous code had a confusing comment which was mostly an implementation detail. This condition corresponds to "lsb up to register width" and "width not ridiculous". llvm-svn: 174877	2013-02-11 12:32:18 +00:00
Tim Northover	60baeb984f	Make use of DiagnosticType to provide better AArch64 diagnostics. This gives a DiagnosticType to all AsmOperands in sight. This replaces all "invalid operand" diagnostics with something more specific. The messages given should still be sufficiently vague that they're not usually actively misleading when LLVM guesses your instruction incorrectly. llvm-svn: 174871	2013-02-11 09:29:37 +00:00
Evan Cheng	615620c9e8	Currently, codegen may spent some time in SDISel passes even if an entire function is successfully handled by fast-isel. That's because function arguments are always handled by SDISel. Introduce FastLowerArguments to allow each target to provide hook to handle formal argument lowering. As a proof-of-concept, add ARMFastIsel::FastLowerArguments to handle functions with 4 or fewer scalar integer (i8, i16, or i32) arguments. It completely eliminates the need for SDISel for trivial functions. rdar://13163905 llvm-svn: 174855	2013-02-11 01:27:15 +00:00
Joel Jones	440d8e48ae	Spelling correction llvm-svn: 174852	2013-02-10 23:56:30 +00:00
Vincent Lejeune	44bf8158c5	Test Commit - Remove some trailing whitespace in R600Instructions.td llvm-svn: 174839	2013-02-10 17:57:33 +00:00
Justin Holewinski	36a50991e7	[NVPTX] Make address space errors more explicit (llvm_unreachable -> report_fatal_error) llvm-svn: 174808	2013-02-09 13:34:15 +00:00
Tom Stellard	47d4201348	R600: Dump the function name when TargetLowering::LowerCall() fails Also output a more useful error message. NOTE: This is a candidate for the Mesa stable branch llvm-svn: 174763	2013-02-08 22:24:40 +00:00
Tom Stellard	7370ede2cd	R600: rework flow creation in the structurizer v2 This fixes a couple of bugs and incorrect assumptions, in total four more piglit tests now pass. v2: fix small bug in the dominator updating Patch by: Christian König Signed-off-by: Christian König <christian.koenig@amd.com> llvm-svn: 174762	2013-02-08 22:24:38 +00:00
Tom Stellard	048f14fd3b	R600: fix loop analyses in the structurizer Patch by: Christian König Intersecting loop handling was wrong. Signed-off-by: Christian König <christian.koenig@amd.com> Tested-by: Michel Dänzer <michel.daenzer@amd.com> llvm-svn: 174761	2013-02-08 22:24:37 +00:00
Tom Stellard	7ec0e4fbe3	R600: fix PHI value adding in the structurizer Otherwise we sometimes produce invalid code. Patch by: Christian König Signed-off-by: Christian König <christian.koenig@amd.com> Tested-by: Michel Dänzer <michel.daenzer@amd.com> llvm-svn: 174760	2013-02-08 22:24:35 +00:00
Reed Kotler	b9bf8dca47	Add the 16 bit version of addiu. To the assembler, the 16 and 32 bit are the same so we put in the comment field an indicator when we think we are emitting the 16 bit version. For the direct object emitter, the difference is important as well as for other passes which need an accurate count of program size. There will be other similar putbacks to this for various instructions. llvm-svn: 174747	2013-02-08 21:42:56 +00:00
Bill Schmidt	62fe7a5b17	Refine fix to bug 15041. Thanks to help from Nadav and Hal, I have a more reasonable (and even correct!) approach. This specifically penalizes the insertelement and extractelement operations for the performance hit that will occur on PowerPC processors. llvm-svn: 174725	2013-02-08 18:19:17 +00:00
Arnold Schwaighofer	594fa2dc2b	ARM cost model: Address computation in vector mem ops not free Adds a function to target transform info to query for the cost of address computation. The cost model analysis pass now also queries this interface. The code in LoopVectorize adds the cost of address computation as part of the memory instruction cost calculation. Only there, we know whether the instruction will be scalarized or not. Increase the penality for inserting in to D registers on swift. This becomes necessary because we now always assume that address computation has a cost and three is a closer value to the architecture. radar://13097204 llvm-svn: 174713	2013-02-08 14:50:48 +00:00
Reed Kotler	66165c8f96	When Mips16 frames grow large, the immediate field may exceed the maximum allowed size for the instruction. This code uses RegScavenger to fix this. We sometimes need 2 registers for Mips16 so we must handle things differently than how register scavenger is normally used. llvm-svn: 174696	2013-02-08 03:57:41 +00:00
Akira Hatanaka	a061281556	[mips] Make Filler a class and reduce indentation. llvm-svn: 174666	2013-02-07 21:32:32 +00:00
Bill Schmidt	b3cece13cf	Constrain PowerPC autovectorization to fix bug 15041. Certain vector operations don't vectorize well with the current PowerPC implementation. Element insert/extract performs poorly without VSX support because Altivec requires going through memory. SREM, UREM, and VSELECT all produce bad scalar code. There's a lot of work to do for the cost model before autovectorization will be tuned well, and this is not an attempt to address the larger problem. llvm-svn: 174660	2013-02-07 20:33:57 +00:00
Akira Hatanaka	061d1ea5da	[mips] Add definition of JALR instruction which has two register operands. Change the original JALR instruction with one register operand to be a pseudo-instruction. llvm-svn: 174657	2013-02-07 19:48:00 +00:00
Tom Stellard	1c822a8929	R600/SI: cleanup VGPR encoding Remove all the unused code. Patch by: Christian König Signed-off-by: Christian König <christian.koenig@amd.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 174656	2013-02-07 19:39:45 +00:00
Tom Stellard	aac1889a84	R600/SI: Handle VGPR64 destination in copyPhysReg(). Allows nexuiz to run with radeonsi. Patch by: Michel Dänzer Signed-off-by: Michel Dänzer <michel.daenzer@amd.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 174655	2013-02-07 19:39:43 +00:00

1 2 3 4 5 ...

23354 Commits