llvm-project

Commit Graph

Author	SHA1	Message	Date
Rafael Espindola	15829c6c4c	Add test that was missing in my previous commit. llvm-svn: 114248	2010-09-18 00:37:27 +00:00
Dan Gohman	49c15c0f9f	Attempt to XFAIL this test on arm-linux, which is inexplicably failing. llvm-svn: 114241	2010-09-18 00:04:37 +00:00
Benjamin Kramer	de636ca9a8	Fix vmov.f64 disassembly on targets where sizeof(long) != 8. llvm-svn: 114240	2010-09-17 23:48:07 +00:00
Bob Wilson	cb6db98897	Add target-specific DAG combiner for BUILD_VECTOR and VMOVRRD. An i64 value should be in GPRs when it's going to be used as a scalar, and we use VMOVRRD to make that happen, but if the value is converted back to a vector we need to fold to a simple bit_convert. Radar 8407927. llvm-svn: 114233	2010-09-17 22:59:05 +00:00
Jim Grosbach	7a6c37d3e7	Teach the (non-MC) instruction printer to use the canonical names for push/pop, and shift instructions on ARM. Update the tests to match. llvm-svn: 114230	2010-09-17 22:36:38 +00:00
Evan Cheng	e53ab6dffc	Teach machine sink to 1) Do forward copy propagation. This makes it easier to estimate the cost of the instruction being sunk. 2) Break critical edges on demand, including cases where the value is used by PHI nodes. Critical edge splitting is not yet enabled by default. llvm-svn: 114227	2010-09-17 22:28:18 +00:00
Jim Grosbach	6d800f88da	Update tests to handle MC-inst instruction printing of shift operations. The legacy asm printer uses instructions of the form, "mov r0, r0, lsl #3", while the MC-instruction printer uses the form "lsl r0, r0, #3". The latter mnemonic is correct and preferred according to the ARM documentation (A8.6.98). The former are pseudo-instructions for the latter. llvm-svn: 114221	2010-09-17 21:58:46 +00:00
Jim Grosbach	4a5e54021a	FileCheck-ize llvm-svn: 114218	2010-09-17 21:46:16 +00:00
Jim Grosbach	20da4e360b	Move thumb2 tests to the thumb2 directory llvm-svn: 114206	2010-09-17 20:34:09 +00:00
Jim Grosbach	9b0cd20f72	tweak test to check instructions rather than relying on the comment string llvm-svn: 114204	2010-09-17 20:27:26 +00:00
Dan Gohman	f3a9c464b4	Fix this test to avoid an "inexact" fold. llvm-svn: 114202	2010-09-17 20:25:43 +00:00
Dan Gohman	534db8a5c8	Avoid emitting a PIC base register if no PIC addresses are needed. This fixes rdar://8396318. llvm-svn: 114201	2010-09-17 20:24:24 +00:00
Jim Grosbach	f3ceecec7e	tweak test to check instructions rather than relying on the comment string llvm-svn: 114200	2010-09-17 20:21:03 +00:00
Jim Grosbach	c18a460adc	tweak test to check instructions rather than relying on the comment string llvm-svn: 114199	2010-09-17 20:17:41 +00:00
Dan Gohman	695312637c	Fix this test so that folding doesn't depend on a potentially "inexact" result. llvm-svn: 114198	2010-09-17 20:15:53 +00:00
Chris Lattner	cea0a8d7ae	fix rdar://8444631 - encoder crash on 'enter' What a weird instruction. llvm-svn: 114190	2010-09-17 18:02:29 +00:00
Daniel Dunbar	35a7a0ee2e	MC/Mach-O/i386: Fix a crash in relocation handling. llvm-svn: 114176	2010-09-17 15:21:50 +00:00
Oscar Fuentes	d890a54353	tests/CMakeLists.txt: use `configure_file' instead of `sed'. The Windows users will appreciate this. llvm-svn: 114158	2010-09-17 03:22:21 +00:00
Daniel Dunbar	55f16678e4	MC/AsmParser: Add support for 'a + 4@GOTPCREL' and friends, by reconsing the expression to include the modifier. - Gross, but this is a corner case we don't expect to see often in practice, and it is worth accepting. - Also improves diagnostics on invalid modifiers. llvm-svn: 114154	2010-09-17 02:47:07 +00:00
Dan Gohman	18fa17cf3d	Fix the folding of floating-point math library calls, like sin(infinity), so that it detects errors on platforms where libm doesn't set errno. It's still subject to host libm details though. llvm-svn: 114148	2010-09-17 01:38:06 +00:00
Devang Patel	871d0b1b1c	If FE forgot to provide a file name (usually it uses "stdin" as name in such situation) then make one up to ensure that debug info is not malformed. llvm-svn: 114119	2010-09-16 20:57:49 +00:00
Chris Lattner	74d320db97	fix rdar://8438816 - unrecognized 'fildq' instruction llvm-svn: 114116	2010-09-16 20:46:38 +00:00
Rafael Espindola	44bf266111	Print the address of sections as 0 and create the metadata sections in the same order as gnu as. llvm-svn: 114109	2010-09-16 19:46:31 +00:00
Owen Anderson	20154b3ed4	Add missing RUN line to this test. llvm-svn: 114106	2010-09-16 18:46:23 +00:00
Dale Johannesen	f95f59a0c2	When substituting sunkaddrs into indirect arguments an asm, we were walking the asm arguments once and stashing their Values. This is wrong because the same memory location can be in the list twice, and if the first one has a sunkaddr substituted, the stashed value for the second one will be wrong (use-after-free). PR 8154. llvm-svn: 114104	2010-09-16 18:30:55 +00:00
Owen Anderson	140296f5c0	It is possible, under specific circumstances involving ptrtoint ConstantExpr's, for LVI to end up trying to merge a Constant into a ConstantRange. Handle this conservatively for now, rather than asserting. The testcase is more complex than I would like, but the manifestation of the problem is sensitive to iteration orders and the state of the LVI cache, and I have not been able to reproduce it with manually constructed or simplified cases. Fixes PR8162. llvm-svn: 114103	2010-09-16 18:28:33 +00:00
Owen Anderson	94532cb297	Fix PR8161, in which an unreachable loop causes recursive instruction simplification to try to replace an instruction with itself. Add a predicate to the simplifier to prevent this case. llvm-svn: 114097	2010-09-16 17:42:36 +00:00
Rafael Espindola	f7f433200b	Make sure that names like .note.GNU-stack are accepted as valid section names. llvm-svn: 114091	2010-09-16 17:05:55 +00:00
Rafael Espindola	922e3f454b	Add support for the .zero directive. llvm-svn: 114077	2010-09-16 15:03:59 +00:00
Kalle Raiskila	c0e9b8d8bb	Change SPU register re-interpretations from OR to COPY_TO_REGCLASS instruction. This cleans up after the mess r108567 left in the CellSPU backend. ORCvt-instruction were used to reinterpret registers, and the ORs were then removed by isMoveInstr(). This patch now removes 350 instructions of format: or $3, $3, $3 (from the 52 testcases in CodeGen/CellSPU). One case of a nonexistent or is checked for. Some moves of the form 'ori $., $., 0' and 'ai $., $., 0' still remain. llvm-svn: 114074	2010-09-16 12:29:33 +00:00
Jim Grosbach	196841144d	add a test of an edge case value for the FP immediate (needs all digits of precision) llvm-svn: 114028	2010-09-15 21:52:13 +00:00
Rafael Espindola	f667d929ce	Add a InitSections method to the streamer interface. The ELF implementation now creates text, data and bss to match the gnu as behavior. The text streamer still has the old MachO specific behavior since the testsuite checks that it will error when a directive is given before setting the current section for example. A nice benefit is that -n is not required anymore when producing ELF files. llvm-svn: 114027	2010-09-15 21:48:40 +00:00
Jim Grosbach	27ab5fbd2b	Teach the MC disassembler to handle vmov.f32 and vmov.f64 immediate to register moves. Previously, the immediate was printed as the encoded integer value, which is incorrect. llvm-svn: 114021	2010-09-15 21:04:54 +00:00
Eli Friedman	ab3a128582	PR7959: Handle negative scales in GEPs correctly in BasicAA for non-64-bit targets. llvm-svn: 114015	2010-09-15 20:08:03 +00:00
Bob Wilson	660d7ecf32	Reapply Gabor's 113839, 113840, and 113876 with a fix for a problem encountered while building llvm-gcc for arm. This is probably the same issue that the ppc buildbot hit. llvm::prior works on a MachineBasicBlock::iterator, not a plain MachineInstr. llvm-svn: 113983	2010-09-15 17:12:08 +00:00
Gabor Greif	9ae4b271f2	the darwin9-powerpc buildbot keeps consistently crashing, backing out following to get it back to green, so I can investigate in peace: svn merge -c -113840 llvm/test/CodeGen/ARM/arm-and-tst-peephole.ll svn merge -c -113876 -c -113839 llvm/lib/Target/ARM/ARMBaseInstrInfo.cpp llvm-svn: 113980	2010-09-15 16:53:07 +00:00
Mikhail Glushenkov	0c99d12208	llvmc: make -x work with unknown suffixes. llvm-svn: 113972	2010-09-15 15:20:41 +00:00
Chris Lattner	ee7e6f42f8	lcall and ljmp always default to lcalll and ljmpl. This finally wraps up rdar://8418316 llvm-svn: 113949	2010-09-15 05:30:20 +00:00
Chris Lattner	09bfe645f6	apparently jmpl $1,$2 is an alias for ljmpl, similarly for call. Add this. llvm-svn: 113948	2010-09-15 05:25:21 +00:00
Chris Lattner	6757eae45e	Disambiguate lcall/ljmp to the 32-bit version. This happens even in 64-bit mode apparently. llvm-svn: 113945	2010-09-15 05:14:54 +00:00
Chris Lattner	5be87c619b	fix the encoding of sldt GR16 to have the 0x66 prefix, and add sldt GR32, which isn't documented in the intel manual but which gas accepts. Part of rdar://8418316 llvm-svn: 113938	2010-09-15 04:45:10 +00:00
Chris Lattner	6b40b0def1	implement aliases for shld/shrd, part of rdar://8418316 llvm-svn: 113937	2010-09-15 04:37:18 +00:00
Chris Lattner	4bd21710b6	fix rdar://8431880 - rcl/rcr with no shift amount not recognized llvm-svn: 113936	2010-09-15 04:33:27 +00:00
Chris Lattner	81ce173860	add various broken forms of fnstsw. I didn't add the %rax version because it adds a prefix and makes even less sense than the other broken forms. This wraps up rdar://8431422 llvm-svn: 113932	2010-09-15 04:15:16 +00:00
Chris Lattner	7df35dbd19	add some aliases for f[u]comi, part of rdar://8431422 llvm-svn: 113930	2010-09-15 04:08:38 +00:00
Chris Lattner	4dbcba0082	add a bunch of aliases for fp operations with no operand, rdar://8431422 llvm-svn: 113929	2010-09-15 04:04:33 +00:00
Michael J. Spencer	46d4cc2ef7	test: Fix coff-dump section array indices to 1 based to match file format. llvm-svn: 113928	2010-09-15 03:58:51 +00:00
Michael J. Spencer	5d2c9acbdf	Tabs to spaces llvm-svn: 113927	2010-09-15 03:58:39 +00:00
Michael J. Spencer	8ae7922a89	Cleanup coff-dump.py llvm-svn: 113926	2010-09-15 03:58:24 +00:00
Chris Lattner	67e534505d	fix PR8144, a bug where constant merge would merge globals marked attribute(used). llvm-svn: 113911	2010-09-15 00:30:11 +00:00
Jim Grosbach	c7cf42d80b	Reapply r113875 with additional cleanups. "The register specified for a dregpair is the corresponding Q register, so to get the pair, we need to look up the sub-regs based on the qreg. Create a lookup function since we don't have access to TargetRegisterInfo here to be able to use getSubReg(ARM::dsub_[01])." Additionally, fix the NEON VLD1* and VST1* instruction patterns not to use the dregpair modifier for the 2xdreg versions. Explicitly specifying the two registers as operands is more correct and more consistent with the other instruction patterns. This enables further cleanup of special case code in the disassembler as a nice side-effect. llvm-svn: 113903	2010-09-14 23:54:06 +00:00
Chris Lattner	5f2311dc29	add a terrible hack to allow out with dx in parens, a gas bug. This fixes PR8114 llvm-svn: 113894	2010-09-14 23:34:29 +00:00
Owen Anderson	f6fe577e88	Remove dead option from tests. llvm-svn: 113855	2010-09-14 21:03:40 +00:00
Gabor Greif	00e34f4b32	forgot the testcase change for r113839 llvm-svn: 113840	2010-09-14 09:30:17 +00:00
Gabor Greif	5dbe800203	test for and-tst peephole optimization documents the status-quo with its opportunities llvm-svn: 113838	2010-09-14 08:50:43 +00:00
Chris Lattner	f1144f0929	fix PR8102, a case where we'd copyValue from a value that we already deleted. Fix this by doing the copyValue's before we delete stuff! The testcase only repros the problem on my system with valgrind. llvm-svn: 113820	2010-09-14 00:19:00 +00:00
Dale Johannesen	b1248ffe9d	Basic smoke test for new x86mmx type. llvm-svn: 113783	2010-09-13 21:01:36 +00:00
Owen Anderson	fe12f23024	Add a reduced testcase for the infinite loop fixed in r113763. llvm-svn: 113770	2010-09-13 18:28:40 +00:00
Owen Anderson	c237a849e3	Re-apply r113679, which was reverted in r113720, which added a pair of new instcombine transforms to expose greater opportunities for store narrowing in codegen. This patch fixes a potential infinite loop in instcombine caused by one of the introduced transforms being overly aggressive. llvm-svn: 113763	2010-09-13 17:59:27 +00:00
Duncan Sands	df65e14397	Spelling fixes in comments. llvm-svn: 113746	2010-09-13 13:32:22 +00:00
Eric Christopher	26abd3e0c2	Revert 113679, it was causing an infinite loop in a testcase that I've sent on to Owen. llvm-svn: 113720	2010-09-12 06:09:23 +00:00
Chris Lattner	1bbb14ab8f	add a missed cmov alias, part of rdar://8416805 llvm-svn: 113693	2010-09-11 17:08:22 +00:00
Chris Lattner	3340c3e86c	add support for all the setCC aliases. Part of rdar://8416805 llvm-svn: 113692	2010-09-11 17:06:05 +00:00
Rafael Espindola	12d73d1f18	Add support for leb128 of absolute expressions. llvm-svn: 113691	2010-09-11 16:45:15 +00:00
Chris Lattner	b47c042e09	add support for pushfd/popfd which are aliases for pushfl/popfl. This fixes rdar://8408129 - pushfd and popfd get invalid instruction mnemonic errors llvm-svn: 113690	2010-09-11 16:39:16 +00:00
Chris Lattner	30561aba20	implement rdar://8407928 - support for in/out with a missing "a" register. llvm-svn: 113689	2010-09-11 16:32:12 +00:00
Rafael Espindola	6e321507b6	Add missing single quotes. llvm-svn: 113687	2010-09-11 15:45:48 +00:00
Rafael Espindola	2833e392ab	Change section_data dumping to print hex numbers instead of using python's %r. llvm-svn: 113685	2010-09-11 15:25:58 +00:00
Owen Anderson	70f4524427	Invert and-of-or into or-of-and when doing so would allow us to clear bits of the and's mask. This can result in increased opportunities for store narrowing in code generation. Update a number of tests for this change. This fixes <rdar://problem/8285027>. Additionally, because this inverts the order of ors and ands, some patterns for optimizing or-of-and-of-or no longer fire in instances where they did originally. Add a simple transform which recaptures most of these opportunities: if we have an or-of-constant-or and have failed to fold away the inner or, commute the order of the two ors, to give the non-constant or a chance for simplification instead. llvm-svn: 113679	2010-09-11 05:48:06 +00:00
Benjamin Kramer	8c35fb0739	Teach InstructionSimplify to fold (A & B) & A -> A & B and (A | B) | A -> A | B. Reassociate does this but it doesn't catch all cases (e.g. if the operands are i1). llvm-svn: 113651	2010-09-10 22:39:55 +00:00
Bill Wendling	e26fffc597	Auto-upgrade the magic ".llvm.eh.catch.all.value" global to "llvm.eh.catch.all.value". Only the name needs to be changed. llvm-svn: 113600	2010-09-10 18:51:56 +00:00
Evan Cheng	1d6aa46cd7	Fix test so it passes on non-Darwin hosts. llvm-svn: 113577	2010-09-10 06:20:01 +00:00
Bob Wilson	8617234658	Fix merging base-updates for VLDM/VSTM: Before I switched these instructions to use AddrMode4, there was a count of the registers stored in one of the operands. I changed that to just count the operands but forgot to adjust for the size of D registers. This was noticed by Evan as a performance problem but it is a potential correctness bug as well, since it is possible that this could merge a base update with a non-matching immediate. llvm-svn: 113576	2010-09-10 05:15:04 +00:00
Evan Cheng	bf4070756f	Teach if-converter to be more careful with predicating instructions that would take multiple cycles to decode. For the current if-converter clients (actually only ARM), the instructions that are predicated on false are not nops. They would still take machine cycles to decode. Micro-coded instructions such as LDM / STM can potentially take multiple cycles to decode. If-converter should take treat them as non-micro-coded simple instructions. llvm-svn: 113570	2010-09-10 01:29:16 +00:00
Daniel Dunbar	e5444a88cd	llvm-mc: Don't crash when using -n and we see a directive before the initial section. - This is annoying, because we have to scatter this check everywhere that could emit real data, but I see no better solution. llvm-svn: 113552	2010-09-09 22:42:59 +00:00
Daniel Dunbar	43325c4a68	llvm-mc: Make sure we exit != 0 if any errors are encountered. llvm-svn: 113551	2010-09-09 22:42:56 +00:00
Jakob Stoklund Olesen	728941fabc	XFAIL test under valgrind. It is not really our problem if sh is leaking. llvm-svn: 113550	2010-09-09 22:02:13 +00:00
Owen Anderson	6270515918	Revert r113439, which relaxed the requirement that loops containing calls cannot be unrolled. After some discussion, there seems to be a better way to achieve the same effect. llvm-svn: 113528	2010-09-09 20:02:23 +00:00
Bruno Cardoso Lopes	e8501a468c	Add one more pattern to fallback movddup llvm-svn: 113522	2010-09-09 18:48:34 +00:00
Daniel Dunbar	db0ddaa50b	tests: XFAIL a handful of tests on the vg_leak builder, so we can get back to green. llvm-svn: 113491	2010-09-09 15:50:19 +00:00
Benjamin Kramer	0a96578ac0	Add an elf-dumper utility. - Output format and some of the code stolen from macho-dump. - Somewhat incomplete and probably buggy. - Comes with a very basic test. llvm-svn: 113488	2010-09-09 15:00:41 +00:00
Duncan Sands	78617ea13a	Get rid of the last use of -m64 in FrontendC. This solution of checking for either 4 or 8 is not very satisfactory, but it would catch the original problem (an alignment of 1). llvm-svn: 113485	2010-09-09 12:57:29 +00:00
Duncan Sands	11d56c309a	Another test that uses -m64. Here too it looks like it can be removed. Note that the XTARGET wasn't doing anything since it does nothing without an accompanying XFAIL. llvm-svn: 113484	2010-09-09 12:48:04 +00:00
Duncan Sands	e194e0c077	On i386, llvm-gcc cannot be assumed to support -m64. Since these tests pass here (i686-linux and x86-64-linux) without -m64, simply remove the -m64. llvm-svn: 113483	2010-09-09 12:43:44 +00:00
Bob Wilson	4adbaf1843	Fix NEON VLD pseudo instruction itineraries that were incorrectly copied from the VST pseudos. The VLD/VST scheduling still needs work (see pr6722), but at least we shouldn't confuse the loads with the stores. llvm-svn: 113473	2010-09-09 05:40:26 +00:00
Owen Anderson	8084dbaf8e	Relax the "don't unroll loops containing calls" rule. Instead, when a loop contains a call, lower the unrolling threshold to the optimize-for-size threshold. Basically, for loops containing calls, unrolling can still be profitable as long as the loop is REALLY small. llvm-svn: 113439	2010-09-08 23:10:07 +00:00
Chris Lattner	28a9c2f89a	fix rdar://8407548, I missed the commuted form of xchg/test without a suffix. llvm-svn: 113427	2010-09-08 22:27:05 +00:00
Owen Anderson	3fe002dfb5	Generalize instcombine's support for combining multiple bit checks into a single test. Patch by Dirk Steinke! llvm-svn: 113423	2010-09-08 22:16:17 +00:00
Chris Lattner	8ead237758	fix bugs in push/pop segment support, rdar://8407242 llvm-svn: 113422	2010-09-08 22:13:08 +00:00
Jim Grosbach	504d23bd05	Re-enable usage of the ARM base pointer. r113394 fixed the known failures. Re-running some nightly testers w/ it enabled to verify. llvm-svn: 113399	2010-09-08 20:12:02 +00:00
Eric Christopher	ca2ec95154	Remove ssp from this test. llvm-svn: 113392	2010-09-08 19:32:34 +00:00
Kalle Raiskila	e542972828	Fix CellSPU vector shuffles, again. Some cases of lowering to rotate were miscompiled. llvm-svn: 113355	2010-09-08 11:53:38 +00:00
Chris Lattner	2907d2e419	add support for the commuted form of the test instruction, rdar://8018260. llvm-svn: 113352	2010-09-08 05:51:12 +00:00
Chris Lattner	a9ca7837e4	implement proper support for sysret{,l,q}, rdar://8403907 llvm-svn: 113350	2010-09-08 05:45:34 +00:00
Chris Lattner	063363fa80	implement the iret suite of instructions properly, fixing rdar://8403974 llvm-svn: 113349	2010-09-08 05:38:31 +00:00
Chris Lattner	086a83afb1	add support for instruction prefixes on the same line as the instruction, implementing rdar://8033482 and PR7254. llvm-svn: 113348	2010-09-08 05:17:37 +00:00
Chris Lattner	8caea68a4f	gas accepts xchg <mem>, <reg> as a synonym for xchg <reg>, <mem>. Add this to the mc assembler, fixing PR8061 llvm-svn: 113346	2010-09-08 04:53:27 +00:00
Chris Lattner	4703cb4a96	fix the encoding of the "jump on *cx" family of instructions, rdar://8061602 llvm-svn: 113343	2010-09-08 04:30:51 +00:00
Jim Grosbach	261df12f64	disable for the moment while tracking down a few Thumb2-O0 failures that look related. (attempt deux, complete w/ test update this time) llvm-svn: 113333	2010-09-08 02:00:34 +00:00
Devang Patel	3f4abf397c	remove these tests for now. llvm-svn: 113293	2010-09-07 22:03:44 +00:00
Devang Patel	b0af23a1f6	There is no need to force target if the test is going to run on other x86 platforms. llvm-svn: 113285	2010-09-07 20:59:09 +00:00
Stuart Hastings	420c8a604f	Typo. Thanks to BillW for pointing it out! llvm-svn: 113281	2010-09-07 20:39:07 +00:00
Chris Lattner	6e27b3e004	Fix a serious performance regression introduced by r108687 on linux: turning (fptrunc (sqrt (fpext x))) -> (sqrtf x) is great, but we have to delete the original sqrt as well. Not doing so causes us to do two sqrt's when building with -fmath-errno (the default on linux). llvm-svn: 113260	2010-09-07 20:01:38 +00:00
Chris Lattner	29570bd695	rename test. llvm-svn: 113257	2010-09-07 19:57:06 +00:00
Stuart Hastings	a3188a81c0	Test case for r113248. Radar 8361341. llvm-svn: 113249	2010-09-07 18:43:57 +00:00
Devang Patel	e50b23e223	Fix command line used to link these test cases. llvm-svn: 113237	2010-09-07 18:17:56 +00:00
Devang Patel	9dc0e5be58	Reintroduce dbg-declare tests. llvm-svn: 113232	2010-09-07 18:01:49 +00:00
Devang Patel	688338eec3	Remove last three tests. I need to make them independent of my setup. llvm-svn: 113213	2010-09-07 17:08:57 +00:00
Devang Patel	55a3bab0d2	Add a test case to check handling of dbg-declare during hybrid mode where we begin using fast-isel but switch back to DAG building at some point. llvm-svn: 113210	2010-09-07 17:03:44 +00:00
Devang Patel	29a775adf1	Add a test case to check handling of dbg-declare by selection DAG builder. llvm-svn: 113209	2010-09-07 16:56:35 +00:00
Devang Patel	184c81c3e2	Add a test case to check handling of dbg-declare by fast-isel. llvm-svn: 113208	2010-09-07 16:40:53 +00:00
Chris Lattner	30bb384944	add missing cmov aliases, this resolves rdar://8208499 llvm-svn: 113189	2010-09-07 00:05:45 +00:00
Chris Lattner	7ece716da2	"sldt <mem>" is ambiguous in 64-bit mode, but should always be disambiguated as sldtw. sldtw and sldtq with a mem operands have the same effect, but sldtw is more compact. Force it to sldtw, resolving rdar://8017530 llvm-svn: 113186	2010-09-06 23:51:44 +00:00
Chris Lattner	415e04fad2	fix rdar://8017621 - llvm-mc can't guess encoding for "push $(1000)" llvm-svn: 113184	2010-09-06 23:40:56 +00:00
Chris Lattner	34e366b45c	fix the operand constraints of the immediate form of in/out, allowing unsigned 8-bit operands. This fixes rdar://8208481 llvm-svn: 113182	2010-09-06 23:29:05 +00:00
Chris Lattner	be9019090e	fix PR8067, an over-aggressive assertion in LICM. llvm-svn: 113146	2010-09-06 05:11:24 +00:00
Chris Lattner	b01c24a945	Teach loop rotate to hoist trivially invariant instructions in the duplicated block instead of duplicating them. Duplicating them into the end of the loop and the preheader means that we got a phi node in the header of the loop, which prevented LICM from hoisting them. GVN would usually come around later and merge the duplicated instructions so we'd get reasonable output... except that anything dependent on the shoulda-been-hoisted value can't be hoisted. In PR5319 (which this fixes), a memory value didn't get promoted. llvm-svn: 113134	2010-09-06 01:10:22 +00:00
Chris Lattner	72d283c826	fix PR8063, a crash in globalopt in the malloc analysis code. llvm-svn: 113109	2010-09-05 17:20:46 +00:00
Chris Lattner	eeba0c73e5	implement rdar://6653118 - fastisel should fold loads where possible. Since mem2reg isn't run at -O0, we get a ton of reloads from the stack, for example, before, this code: int foo(int x, int y, int z) { return x+y+z; } used to compile into: _foo: ## @foo subq $12, %rsp movl %edi, 8(%rsp) movl %esi, 4(%rsp) movl %edx, (%rsp) movl 8(%rsp), %edx movl 4(%rsp), %esi addl %edx, %esi movl (%rsp), %edx addl %esi, %edx movl %edx, %eax addq $12, %rsp ret Now we produce: _foo: ## @foo subq $12, %rsp movl %edi, 8(%rsp) movl %esi, 4(%rsp) movl %edx, (%rsp) movl 8(%rsp), %edx addl 4(%rsp), %edx ## Folded load addl (%rsp), %edx ## Folded load movl %edx, %eax addq $12, %rsp ret Fewer instructions and less register use = faster compiles. llvm-svn: 113102	2010-09-05 02:18:34 +00:00
Dan Gohman	487e250109	Fix LoopSimplify to notify ScalarEvolution when splitting a loop backedge into an inner loop, as the new loop iteration may differ substantially. This fixes PR8078. llvm-svn: 113057	2010-09-04 02:42:48 +00:00
Chris Lattner	50506787d1	fix a bug in my licm rewrite when a load from the promoted memory location is being re-stored to the memory location. We would get a dangling pointer from the SSAUpdate data structure and miss a use. This fixes PR8068 llvm-svn: 113042	2010-09-04 00:12:30 +00:00
Owen Anderson	c91c1a205a	Propagate non-local comparisons. Fixes PR1757. llvm-svn: 113025	2010-09-03 22:47:08 +00:00
Dale Johannesen	367afb5a00	Remove the rest of the nonexistent 64-bit AVX instructions. Bruno, please review. llvm-svn: 113014	2010-09-03 21:23:00 +00:00
David Greene	2a9de4d828	Generalize getFieldType to work on all TypedInits. Add a couple of testcases from Amaury Pouly. llvm-svn: 113010	2010-09-03 21:00:49 +00:00
Owen Anderson	c725462245	Add support for simplifying a load from a computed value to a load from a global when it is provable that they're equivalent. This fixes PR4855. llvm-svn: 112994	2010-09-03 19:08:37 +00:00
Jim Grosbach	03f4be86ba	Re-apply r112883: "For ARM stack frames that utilize variable sized objects and have either large local stack areas or require dynamic stack realignment, allocate a base register via which to access the local frame. This allows efficient access to frame indices not accessible via the FP (either due to being out of range or due to dynamic realignment) or the SP (due to variable sized object allocation). In particular, this greatly improves efficiency of access to spill slots in Thumb functions which contain VLAs." r112986 fixed a latent bug exposed by the above. llvm-svn: 112989	2010-09-03 18:37:12 +00:00
Owen Anderson	064cb4c807	Add a test for PR4413, which was apparently fixed at some point in the past. llvm-svn: 112987	2010-09-03 18:33:08 +00:00
Owen Anderson	50d8c8888c	Add PR number to test. llvm-svn: 112971	2010-09-03 16:58:25 +00:00
Daniel Dunbar	2ac3386ef3	Revert "For ARM stack frames that utilize variable sized objects and have either", it is breaking oggenc with Clang for ARMv6. This reverts commit 8d6e29cfda270be483abf638850311670829ee65. llvm-svn: 112962	2010-09-03 15:26:42 +00:00
NAKAMURA Takumi	24d039ebe3	test/CodeGen/X86: Add explicit -mtriple=(i686|x86_64)-linux for Win32 host. llvm-svn: 112947	2010-09-03 03:24:08 +00:00
Bruno Cardoso Lopes	d6634a5b2e	AVX supports neither mm operations nor their intrinsics. The AVX versions of PALIGN and PABS* should only exist for 128-bit. Remove the unnecessary stuff. llvm-svn: 112944	2010-09-03 02:08:45 +00:00
Bob Wilson	f65c9ef720	Replace NEON vabdl, vaba, and vabal intrinsics with combinations of the vabd intrinsic and add and/or zext operations. In the case of vaba, this also avoids the need for a DAG combine pattern to combine vabd with add. Update tests. Auto-upgrade the old intrinsics. llvm-svn: 112941	2010-09-03 01:35:08 +00:00
Chris Lattner	7bf4b82e97	update one more test llvm-svn: 112910	2010-09-02 23:32:55 +00:00
Chris Lattner	7f2f0930a7	add a new "llvm-dis -show-annotations" option, which causes it to print #uses comments, with a testcase. llvm-svn: 112906	2010-09-02 23:21:44 +00:00
Anton Korobeynikov	a5a645559c	Properly emit __chkstk call instead of __alloca on non-mingw windows targets. Patch by Cameron Esfahani! llvm-svn: 112902	2010-09-02 23:03:46 +00:00
Chris Lattner	65fb25a257	more test cleanup llvm-svn: 112892	2010-09-02 22:38:56 +00:00
Chris Lattner	bb451461ec	remove some noise from tests. llvm-svn: 112889	2010-09-02 22:35:33 +00:00
Chris Lattner	a18d7ec4fb	we are past the point where these tests are useful. llvm-svn: 112887	2010-09-02 22:32:02 +00:00
Jim Grosbach	7fd9aea67c	For ARM stack frames that utilize variable sized objects and have either large local stack areas or require dynamic stack realignment, allocate a base register via which to access the local frame. This allows efficient access to frame indices not accessible via the FP (either due to being out of range or due to dynamic realignment) or the SP (due to variable sized object allocation). In particular, this greatly improves efficiency of access to spill slots in Thumb functions which contain VLAs. rdar://7352504 rdar://8374540 rdar://8355680 llvm-svn: 112883	2010-09-02 22:29:01 +00:00
Chris Lattner	affc0e42f0	fix more AST updating bugs, correcting miscompilation in PR8041 llvm-svn: 112878	2010-09-02 22:19:10 +00:00
Dan Gohman	3c9b5f394b	Don't narrow the load and store in a load+twiddle+store sequence unless there are clearly no stores between the load and the store. This fixes this miscompile reported as PR7833. This breaks the test/CodeGen/X86/narrow_op-2.ll optimization, which is safe, but awkward to prove safe. Move it to X86's README.txt. llvm-svn: 112861	2010-09-02 21:18:42 +00:00
Sandeep Patel	0ca17f7e8a	Fix an unnecessary XFAIL llvm-svn: 112853	2010-09-02 20:19:24 +00:00
Owen Anderson	67dee4dcac	Fix typo. I accidentally edited the wrong file before my last commit. llvm-svn: 112851	2010-09-02 19:52:06 +00:00
Benjamin Kramer	e39017cb97	Add AsmParser support for the ELF .previous directive. Patch by Roman Divacky. llvm-svn: 112849	2010-09-02 18:53:37 +00:00
Owen Anderson	a8c896b704	Fix a bug in LazyValueInfo that CorrelatedValuePropagation exposed: In the LVI lattice, undef and the full set ConstantRange should not be treated as equivalent. llvm-svn: 112843	2010-09-02 18:23:58 +00:00
Jim Grosbach	66c681a644	Now that register allocation properly considers reserved regs, simplify the ARM register class allocation order functions to take advantage of that. llvm-svn: 112841	2010-09-02 18:14:29 +00:00
Bob Wilson	75a6408f88	Convert VLD1 and VLD2 instructions to use pseudo-instructions until after regalloc. llvm-svn: 112825	2010-09-02 16:00:54 +00:00
Duncan Sands	8dda07428a	Print the number of uses of a function in the .ll since it can be informative and there seems to be no reason not to. llvm-svn: 112812	2010-09-02 08:52:23 +00:00
NAKAMURA Takumi	a224e5563e	test/loop-strength-reduce4: Add explicit triplet for Win32 host. llvm-svn: 112802	2010-09-02 03:45:58 +00:00
NAKAMURA Takumi	54ce546865	test/twoaddr-coalesce: Do not use @main. Win32 codegen emits an implicit call to __main into it, which makes the test fail. llvm-svn: 112801	2010-09-02 03:45:51 +00:00
Bob Wilson	38ab35a911	Remove NEON vmull, vmlal, and vmlsl intrinsics, replacing them with multiply, add, and subtract operations with zero-extended or sign-extended vectors. Update tests. Add auto-upgrade support for the old intrinsics. llvm-svn: 112773	2010-09-01 23:50:19 +00:00
Chris Lattner	8af45a889d	deepen my MMX/SRoA hack to avoid hurting non-x86 codegen. llvm-svn: 112763	2010-09-01 23:09:27 +00:00
Bruno Cardoso Lopes	fea81b4831	Using target specific nodes for shuffle nodes makes the mask check more strict, breaking some cases not checked in the testsuite, but also exposes some foldings not done before, as this example: movaps (%rdi), %xmm0 movaps (%rax), %xmm1 movaps %xmm0, %xmm2 movss %xmm1, %xmm2 shufps $36, %xmm2, %xmm0 now is generated as: movaps (%rdi), %xmm0 movaps %xmm0, %xmm1 movlps (%rax), %xmm1 shufps $36, %xmm1, %xmm0 llvm-svn: 112753	2010-09-01 22:33:20 +00:00
Jakob Stoklund Olesen	4b6fd48bba	Teach RemoveCopyByCommutingDef to check all aliases, not just subregisters. This caused a miscompilation in WebKit where %RAX had conflicting defs when RemoveCopyByCommutingDef was commuting a %EAX use. llvm-svn: 112751	2010-09-01 22:15:35 +00:00
Dale Johannesen	78d95e0089	Apparently only Darwin passes long double misaligned. Compensate. llvm-svn: 112748	2010-09-01 21:57:20 +00:00
Dan Gohman	0ad7d9c24e	Fix loop unswitching's assumption that a code path which either infinite loops or exits will eventually exit. This fixes PR5373. llvm-svn: 112745	2010-09-01 21:46:45 +00:00
Bill Wendling	6456efaffd	The output of opt -stats must be sent to stderr. Patch by NAKAMURA Takumi! llvm-svn: 112724	2010-09-01 18:32:56 +00:00
Chris Lattner	39eccb4754	temporarily revert r112664, it is causing a decoding conflict, and the testcases should be merged. llvm-svn: 112711	2010-09-01 16:00:50 +00:00
Michael J. Spencer	d8e5dfccc1	COFF: Update tests to reflect changes in last commit. llvm-svn: 112704	2010-09-01 14:15:31 +00:00
Dale Johannesen	e13c04d6da	Attempt to fix buildbot. llvm-svn: 112697	2010-09-01 05:19:06 +00:00
Chris Lattner	34e5361eb5	add a gross hack to work around a problem that Argiris reported on llvmdev: SRoA is introducing MMX datatypes like <1 x i64>, which then cause random problems because the X86 backend is producing mmx stuff without inserting proper emms calls. In the short term, force off MMX datatypes. In the long term, the X86 backend should not select generic vector types to MMX registers. This is being worked on, but won't be done in time for 2.8. rdar://8380055 llvm-svn: 112696	2010-09-01 05:14:33 +00:00
Chris Lattner	b9ed4f252f	filecheckize llvm-svn: 112695	2010-09-01 05:10:14 +00:00
Dan Gohman	110ed64fbb	Revert 112442 and 112440 until the compile time problems introduced by 112440 are resolved. llvm-svn: 112692	2010-09-01 01:45:53 +00:00
Dale Johannesen	52bd0dc3bb	Testcase for llvm checkin 112674. llvm-svn: 112675	2010-08-31 23:43:55 +00:00
Chris Lattner	030f02021b	licm is wasting time hoisting constant foldable operations, instead of hoisting them, just fold them away. This occurs in the testcase for PR8041, for example. llvm-svn: 112669	2010-08-31 23:00:16 +00:00
Bill Wendling	6789f8b6ae	We have a chance for an optimization. Consider this code: int x(int t) { if (t & 256) return -26; return 0; } We generate this: tst.w r0, #256 mvn r0, #25 it eq moveq r0, #0 while gcc generates this: ands r0, r0, #256 it ne mvnne r0, #25 bx lr Scandalous really! During ISel time, we can look for this particular pattern. One where we have a "MOVCC" that uses the flag off of a CMPZ that itself is comparing an AND instruction to 0. Something like this (greatly simplified): %r0 = ISD::AND ... ARMISD::CMPZ %r0, 0 @ sets [CPSR] %r0 = ARMISD::MOVCC 0, -26 @ reads [CPSR] All we have to do is convert the "ISD::AND" into an "ARM::ANDS" that sets [CPSR] when it's zero. The zero value will already be in the %r0 register and we only need to change it if the AND wasn't zero. Easy! llvm-svn: 112664	2010-08-31 22:41:22 +00:00
Devang Patel	86ec8b3a3f	Reapply r112623. Included additional check for unused byval argument. llvm-svn: 112659	2010-08-31 22:22:42 +00:00
Owen Anderson	a5e6b3eca4	Merge 2010-08-31-InfiniteRecursion.ll into crash.ll. llvm-svn: 112635	2010-08-31 20:27:17 +00:00
Devang Patel	529f248eb4	Revert r112623. It is causing self host build failures. llvm-svn: 112631	2010-08-31 19:41:03 +00:00
Devang Patel	8559932d36	Remember byval argument's frame index during argument lowering and use this info to emit debug info. Fixes Radar 8367011. llvm-svn: 112623	2010-08-31 18:50:09 +00:00
Owen Anderson	799a08ae48	Add a test for the duplicated-conditional situation illustrated by PR5652. llvm-svn: 112621	2010-08-31 18:49:12 +00:00
Chris Lattner	e2295f1c80	merge two tests. llvm-svn: 112617	2010-08-31 18:44:03 +00:00
Owen Anderson	3931c85956	Manually reduce this testcase. llvm-svn: 112615	2010-08-31 18:16:29 +00:00
Chris Lattner	fbcd165b59	merge two tests and convert to filecheck. llvm-svn: 112613	2010-08-31 18:05:08 +00:00
Owen Anderson	ada0623725	Add a micro-test for the transforms I added to JumpThreading. I have not been able to find a way to test each in isolation, for a few reasons: 1) The ability to look-through non-i1 BinaryOperator's requires the ability to look through non-constant ICmps in order for it to ever trigger. 2) The ability to do LVI-powered PHI value determination only matters in cases that ProcessBranchOnPHI can't handle. Since it already handles all the cases without other instructions in the def-use chain between the PHI and the branch, it requires the ability to look through ICmps and/or BinaryOperators as well. llvm-svn: 112611	2010-08-31 17:59:07 +00:00
Jim Grosbach	ad9b6de3b6	Update test for 112609 llvm-svn: 112610	2010-08-31 17:58:47 +00:00
Owen Anderson	064b139c8d	Rename test directory to reflect new pass name. llvm-svn: 112592	2010-08-31 07:50:31 +00:00
Owen Anderson	48d58ad64c	Rename ValuePropagation to a more descriptive CorrelatedValuePropagation. llvm-svn: 112591	2010-08-31 07:48:34 +00:00
Owen Anderson	3997a07fb9	More Chris-inspired JumpThreading fixes: use ConstantExpr to correctly constant-fold undef, and be more careful with its return value. This actually exposed an infinite recursion bug in ComputeValueKnownInPredecessors which theoretically already existed (in JumpThreading's handling of and/or of i1's), but never manifested before. This patch adds a tracking set to prevent this case. llvm-svn: 112589	2010-08-31 07:36:34 +00:00
Owen Anderson	376597c13e	Remove r111665, which implemented store-narrowing in InstCombine. Chris discovered a miscompilation in it, and it's not easily fixable at the optimizer level. I'll investigate reimplementing it in DAGCombine. llvm-svn: 112575	2010-08-31 04:41:06 +00:00
Anton Korobeynikov	3a1d87a7ba	Fix borken test llvm-svn: 112555	2010-08-30 23:41:49 +00:00
Owen Anderson	70b17c50e2	Combine these two tests, and make sure there's a newline at the end of the file. llvm-svn: 112554	2010-08-30 23:37:41 +00:00
Bob Wilson	4cd8a126c3	Remove NEON vmovn intrinsic, replacing it with vector truncate operations. Auto-upgrade the old intrinsic and update tests. llvm-svn: 112507	2010-08-30 20:02:30 +00:00
Chris Lattner	34bfab0ad5	two changes: 1) nuke ConstDataCoalSection, which is dead. 2) revise my previous patch for rdar://8018335, which was completely wrong. Specifically, it doesn't make sense to mark __TEXT,__const_coal as PURE_INSTRUCTIONS, because it is for readonly data. templates (it turns out) go to const_coal_nt. The real fix for rdar://8018335 was to give ConstTextCoalSection a section kind of ReadOnly instead of Text. llvm-svn: 112496	2010-08-30 18:12:35 +00:00
Michael J. Spencer	2f997cdedf	Partially revert r112480. Caused test failures. llvm-svn: 112486	2010-08-30 15:34:08 +00:00
NAKAMURA Takumi	e53cf6f85d	coff-dump.py: Fix PR7996. Now it is compatible with Python 2.4. llvm-svn: 112485	2010-08-30 15:19:56 +00:00
Michael J. Spencer	7983340465	Fix constant-over-index.ll test on windows. llvm-svn: 112483	2010-08-30 15:08:02 +00:00
Michael J. Spencer	41c18853c8	Test: Fix LLVMC tests on CMake. The CMake build didn't define TEST_COMPILE_CXX_CMD. The tests assumed gcc. llvm-svn: 112480	2010-08-30 14:49:00 +00:00
Duncan Sands	68c30907cc	Correct bogus module triple specifications. llvm-svn: 112469	2010-08-30 10:48:29 +00:00
Chris Lattner	263f804699	LICM does get dead instructions input to it. Instead of sinking them out of loops, just delete them. llvm-svn: 112451	2010-08-29 18:22:25 +00:00
Dan Gohman	3a08ed7904	Make IVUsers iterative instead of recursive. This has the side effect of reversing the order of most of IVUser's results. llvm-svn: 112442	2010-08-29 16:40:03 +00:00
Dan Gohman	6665550bca	Make this test less dependent on register allocation choices. llvm-svn: 112426	2010-08-29 14:49:42 +00:00
Dan Gohman	883fa863f8	Use exec. llvm-svn: 112425	2010-08-29 14:49:00 +00:00
Kalle Raiskila	1e616572d9	Fix lowering of INSERT_VECTOR_ELT in SPU. The IDX was treated as byte index, not element index. llvm-svn: 112422	2010-08-29 12:41:50 +00:00
Bob Wilson	d0c054886c	Remove NEON vaddl, vaddw, vsubl, and vsubw intrinsics. Instead, use llvm IR add/sub operations with one or both operands sign- or zero-extended. Auto-upgrade the old intrinsics. llvm-svn: 112416	2010-08-29 05:57:34 +00:00
Chris Lattner	c2887bc283	merge a bunch of shuffle tests into sse2.ll llvm-svn: 112398	2010-08-29 03:19:04 +00:00
Chris Lattner	b1ff978406	add some nounwind's llvm-svn: 112396	2010-08-29 03:07:47 +00:00
Chris Lattner	112b6ee3f2	fixme accomplished llvm-svn: 112386	2010-08-28 20:40:28 +00:00
Chris Lattner	94656b1c8c	fix the buildvector->insertp[sd] logic to not always create a redundant insertp[sd] $0, which is a noop. Before: _f32: ## @f32 pshufd $1, %xmm1, %xmm2 pshufd $1, %xmm0, %xmm3 addss %xmm2, %xmm3 addss %xmm1, %xmm0 ## kill: XMM0<def> XMM0<kill> XMM0<def> insertps $0, %xmm0, %xmm0 insertps $16, %xmm3, %xmm0 ret after: _f32: ## @f32 movdqa %xmm0, %xmm2 addss %xmm1, %xmm2 pshufd $1, %xmm1, %xmm1 pshufd $1, %xmm0, %xmm3 addss %xmm1, %xmm3 movdqa %xmm2, %xmm0 insertps $16, %xmm3, %xmm0 ret The extra movs are due to a random (poor) scheduling decision. llvm-svn: 112379	2010-08-28 17:59:08 +00:00
Chris Lattner	bcb6090ad0	fix the BuildVector -> unpcklps logic to not do pointless shuffles when the top elements of a vector are undefined. This happens all the time for X86-64 ABI stuff because only the low 2 elements of a 4 element vector are defined. For example, on: _Complex float f32(_Complex float A, _Complex float B) { return A+B; } We used to produce (with SSE2, SSE4.1+ uses insertps): _f32: ## @f32 movdqa %xmm0, %xmm2 addss %xmm1, %xmm2 pshufd $16, %xmm2, %xmm2 pshufd $1, %xmm1, %xmm1 pshufd $1, %xmm0, %xmm0 addss %xmm1, %xmm0 pshufd $16, %xmm0, %xmm1 movdqa %xmm2, %xmm0 unpcklps %xmm1, %xmm0 ret We now produce: _f32: ## @f32 movdqa %xmm0, %xmm2 addss %xmm1, %xmm2 pshufd $1, %xmm1, %xmm1 pshufd $1, %xmm0, %xmm3 addss %xmm1, %xmm3 movaps %xmm2, %xmm0 unpcklps %xmm3, %xmm0 ret This implements rdar://8368414 llvm-svn: 112378	2010-08-28 17:28:30 +00:00
Benjamin Kramer	2e5c14713c	Update ocaml test. llvm-svn: 112364	2010-08-28 10:29:41 +00:00
Chris Lattner	13ee795c42	remove unions from LLVM IR. They are severely buggy and not being actively maintained, improved, or extended. llvm-svn: 112356	2010-08-28 04:09:24 +00:00
Chris Lattner	504e5100d3	remove the ABCD and SSI passes. They don't have any clients that I'm aware of, aren't maintained, and LVI will be replacing their value. nlewycky approved this on irc. llvm-svn: 112355	2010-08-28 03:51:24 +00:00
Chris Lattner	d0214f3efe	handle the constant case of vector insertion. For something like this: struct S { float A, B, C, D; }; struct S g; struct S bar() { struct S A = g; ++A.B; A.A = 42; return A; } we now generate: _bar: ## @bar ## BB#0: ## %entry movq _g@GOTPCREL(%rip), %rax movss 12(%rax), %xmm0 pshufd $16, %xmm0, %xmm0 movss 4(%rax), %xmm2 movss 8(%rax), %xmm1 pshufd $16, %xmm1, %xmm1 unpcklps %xmm0, %xmm1 addss LCPI1_0(%rip), %xmm2 pshufd $16, %xmm2, %xmm2 movss LCPI1_1(%rip), %xmm0 pshufd $16, %xmm0, %xmm0 unpcklps %xmm2, %xmm0 ret instead of: _bar: ## @bar ## BB#0: ## %entry movq _g@GOTPCREL(%rip), %rax movss 12(%rax), %xmm0 pshufd $16, %xmm0, %xmm0 movss 4(%rax), %xmm2 movss 8(%rax), %xmm1 pshufd $16, %xmm1, %xmm1 unpcklps %xmm0, %xmm1 addss LCPI1_0(%rip), %xmm2 movd %xmm2, %eax shlq $32, %rax addq $1109917696, %rax ## imm = 0x42280000 movd %rax, %xmm0 ret llvm-svn: 112345	2010-08-28 01:50:57 +00:00
Chris Lattner	dd6601048e	optimize bitcasts from large integers to vector into vector element insertion from the pieces that feed into the vector. This handles a pattern that occurs frequently due to code generated for the x86-64 abi. We now compile something like this: struct S { float A, B, C, D; }; struct S g; struct S bar() { struct S A = g; ++A.A; ++A.C; return A; } into all nice vector operations: _bar: ## @bar ## BB#0: ## %entry movq _g@GOTPCREL(%rip), %rax movss LCPI1_0(%rip), %xmm1 movss (%rax), %xmm0 addss %xmm1, %xmm0 pshufd $16, %xmm0, %xmm0 movss 4(%rax), %xmm2 movss 12(%rax), %xmm3 pshufd $16, %xmm2, %xmm2 unpcklps %xmm2, %xmm0 addss 8(%rax), %xmm1 pshufd $16, %xmm1, %xmm1 pshufd $16, %xmm3, %xmm2 unpcklps %xmm2, %xmm1 ret instead of icky integer operations: _bar: ## @bar movq _g@GOTPCREL(%rip), %rax movss LCPI1_0(%rip), %xmm1 movss (%rax), %xmm0 addss %xmm1, %xmm0 movd %xmm0, %ecx movl 4(%rax), %edx movl 12(%rax), %esi shlq $32, %rdx addq %rcx, %rdx movd %rdx, %xmm0 addss 8(%rax), %xmm1 movd %xmm1, %eax shlq $32, %rsi addq %rax, %rsi movd %rsi, %xmm1 ret This resolves rdar://8360454 llvm-svn: 112343	2010-08-28 01:20:38 +00:00
Dan Gohman	e06905d1f0	Completely disable tail calls when fast-isel is enabled, as fast-isel doesn't currently support dealing with this. llvm-svn: 112341	2010-08-28 00:51:03 +00:00
Owen Anderson	cf7f941121	Add a prototype of a new peephole optimizing pass that uses LazyValue info to simplify PHIs and select's. This pass addresses the missed optimizations from PR2581 and PR4420. llvm-svn: 112325	2010-08-27 23:31:36 +00:00
Bob Wilson	13ce07fa92	Change ARM VFP VLDM/VSTM instructions to use addressing mode #4, just like all the other LDM/STM instructions. This fixes asm printer crashes when compiling with -O0. I've changed one of the NEON tests (vst3.ll) to run with -O0 to check this in the future. Prior to this change VLDM/VSTM used addressing mode #5, but not really. The offset field was used to hold a count of the number of registers being loaded or stored, and the AM5 opcode field was expanded to specify the IA or DB mode, instead of the standard ADD/SUB specifier. Much of the backend was not aware of these special cases. The crashes occurred when rewriting a frameindex caused the AM5 offset field to be changed so that it did not have a valid submode. I don't know exactly what changed to expose this now. Maybe we've never done much with -O0 and NEON. Regardless, there's no longer any reason to keep a count of the VLDM/VSTM registers, so we can use addressing mode #4 and clean things up in a lot of places. llvm-svn: 112322	2010-08-27 23:18:17 +00:00
Chris Lattner	954e9557e3	tidy up test. llvm-svn: 112321	2010-08-27 23:15:21 +00:00
Chris Lattner	b8b7d52631	no really, fix the test. llvm-svn: 112317	2010-08-27 23:05:54 +00:00
Chris Lattner	c8908b4cdb	fix this test. It's not clear what it's really testing. llvm-svn: 112316	2010-08-27 23:05:27 +00:00
Chris Lattner	6c1395f62a	Enhance the shift propagator to handle the case when you have: A = shl x, 42 ... B = lshr ..., 38 which can be transformed into: A = shl x, 4 ... iff we can prove that the would-be-shifted-in bits are already zero. This eliminates two shifts in the testcase and allows elimination of the whole i128 chain in the real example. llvm-svn: 112314	2010-08-27 22:53:44 +00:00
Chris Lattner	18d7fc8fc6	Implement a pretty general logical shift propagation framework, which is good at ripping through bitfield operations. This generalizes a bunch of the existing xforms that instcombine does, such as (x << c) >> c -> and to handle intermediate logical nodes. This is useful for ripping up the "promote to large integer" code produced by SRoA. llvm-svn: 112304	2010-08-27 22:24:38 +00:00
Chris Lattner	606b76eba6	merge and filecheckize test llvm-svn: 112289	2010-08-27 20:44:45 +00:00
Chris Lattner	c665156e9f	merge two tests llvm-svn: 112288	2010-08-27 20:42:10 +00:00
Chris Lattner	7398434675	teach the truncation optimization that an entire chain of computation can be truncated if it is fed by a sext/zext that doesn't have to be exactly equal to the truncation result type. llvm-svn: 112285	2010-08-27 20:32:06 +00:00
Chris Lattner	7413e87b6d	get this test passing on linux builders. llvm-svn: 112280	2010-08-27 18:49:08 +00:00
Chris Lattner	90cd746e63	Add an instcombine to clean up a common pattern produced by the SRoA "promote to large integer" code, eliminating some type conversions like this: %94 = zext i16 %93 to i32 ; <i32> [#uses=2] %96 = lshr i32 %94, 8 ; <i32> [#uses=1] %101 = trunc i32 %96 to i8 ; <i8> [#uses=1] This also unblocks other xforms from happening, now clang is able to compile: struct S { float A, B, C, D; }; float foo(struct S A) { return A.A + A.B+A.C+A.D; } into: _foo: ## @foo ## BB#0: ## %entry pshufd $1, %xmm0, %xmm2 addss %xmm0, %xmm2 movdqa %xmm1, %xmm3 addss %xmm2, %xmm3 pshufd $1, %xmm1, %xmm0 addss %xmm3, %xmm0 ret on x86-64, instead of: _foo: ## @foo ## BB#0: ## %entry movd %xmm0, %rax shrq $32, %rax movd %eax, %xmm2 addss %xmm0, %xmm2 movapd %xmm1, %xmm3 addss %xmm2, %xmm3 movd %xmm1, %rax shrq $32, %rax movd %eax, %xmm0 addss %xmm3, %xmm0 ret This seems pretty close to optimal to me, at least without using horizontal adds. This also triggers in lots of other code, including SPEC. llvm-svn: 112278	2010-08-27 18:31:05 +00:00
Bob Wilson	edf722add3	Add alignment arguments to all the NEON load/store intrinsics. Update all the tests using those intrinsics and add support for auto-upgrading bitcode files with the old versions of the intrinsics. llvm-svn: 112271	2010-08-27 17:13:24 +00:00
Owen Anderson	6ebbd92380	Use LVI to eliminate conditional branches where we've tested a related condition previously. Update tests for this change. This fixes PR5652. llvm-svn: 112270	2010-08-27 17:12:29 +00:00
Daniel Dunbar	1844a71e66	X86: Fix an encoding issue with LOCK_ADD64mr, which could lead to very hard to find miscompiles with the integrated assembler. llvm-svn: 112250	2010-08-27 01:30:14 +00:00
Chris Lattner	c188b96bbe	filecheckize llvm-svn: 112235	2010-08-26 22:23:39 +00:00
Chris Lattner	387d6bcdcb	rename test. llvm-svn: 112234	2010-08-26 22:20:47 +00:00
Chris Lattner	bfd2228182	optimize "integer extraction out of the middle of a vector" as produced by SRoA. This is part of rdar://7892780, but needs another xform to expose this. llvm-svn: 112232	2010-08-26 22:14:59 +00:00
Chris Lattner	d4ebd6df5a	optimize bitcast(trunc(bitcast(x))) where the result is a float and 'x' is a vector to be a vector element extraction. This allows clang to compile: struct S { float A, B, C, D; }; float foo(struct S A) { return A.A + A.B+A.C+A.D; } into: _foo: ## @foo ## BB#0: ## %entry movd %xmm0, %rax shrq $32, %rax movd %eax, %xmm2 addss %xmm0, %xmm2 movapd %xmm1, %xmm3 addss %xmm2, %xmm3 movd %xmm1, %rax shrq $32, %rax movd %eax, %xmm0 addss %xmm3, %xmm0 ret instead of: _foo: ## @foo ## BB#0: ## %entry movd %xmm0, %rax movd %eax, %xmm0 shrq $32, %rax movd %eax, %xmm2 addss %xmm0, %xmm2 movd %xmm1, %rax movd %eax, %xmm1 addss %xmm2, %xmm1 shrq $32, %rax movd %eax, %xmm0 addss %xmm1, %xmm0 ret ... eliminating half of the horribleness. llvm-svn: 112227	2010-08-26 21:55:42 +00:00
Chris Lattner	3c19d3d5c3	filecheckize llvm-svn: 112225	2010-08-26 21:51:41 +00:00
Chris Lattner	7717c616bd	rename test llvm-svn: 112224	2010-08-26 21:50:56 +00:00
Owen Anderson	bd2ecc7e68	Make JumpThreading smart enough to properly thread StrSwitch when it's compiled with clang++. llvm-svn: 112198	2010-08-26 17:40:24 +00:00
Dan Gohman	ca26f79051	Reapply r112091 and r111922, support for metadata linking, with a fix: add a flag to MapValue and friends which indicates whether any module-level mappings are being made. In the common case of inlining, no module-level mappings are needed, so MapValue doesn't need to examine non-function-local metadata, which can be very expensive in the case of a large module with really deep metadata (e.g. a large C++ program compiled with -g). This flag is a little awkward; perhaps eventually it can be moved into the ClonedCodeInfo class. llvm-svn: 112190	2010-08-26 15:41:53 +00:00
Chris Lattner	af23e9a798	Add a hackaround for PR7993 which is causing failures on x86 builders that lack sse2. llvm-svn: 112175	2010-08-26 06:57:07 +00:00
Chris Lattner	66afba7aa4	I think enough general codegen bugs are fixed to allow this to work on random hosts, let's see! llvm-svn: 112172	2010-08-26 05:52:42 +00:00
Chris Lattner	eb2cc0ce0e	implement SplitVecOp_CONCAT_VECTORS, fixing the included testcase with SSE1. llvm-svn: 112171	2010-08-26 05:51:22 +00:00
Chris Lattner	825294b85f	Make sure this forces the x86 targets llvm-svn: 112169	2010-08-26 05:25:05 +00:00
Chris Lattner	cc60609cb4	fix sse1 only codegen in x86-64 mode, which is something we apparently try to support. llvm-svn: 112168	2010-08-26 05:24:29 +00:00
Daniel Dunbar	95fe13c720	Revert r112091, "Remap metadata attached to instructions when remapping individual ...", which depends on r111922, which I am reverting. llvm-svn: 112157	2010-08-26 03:48:08 +00:00
Jim Grosbach	08da771ec3	Enable pre-RA virtual frame base register allocation. rdar://8277890 llvm-svn: 112127	2010-08-26 00:58:06 +00:00
Bob Wilson	4629f423f8	Revert svn 107892 (with changes to work with trunk). It caused a crash if a VLD result was not used (Radar 8355607). It should also fix pr7988, but I haven't verified that yet. llvm-svn: 112118	2010-08-26 00:13:36 +00:00
Chris Lattner	c7fb446a9d	temporarily disable this, which started failing on the llvm-i686-linux builder. I will investigate tonight. llvm-svn: 112113	2010-08-25 23:43:14 +00:00
Chris Lattner	75ff053497	Change handling of illegal vector types to widen when possible instead of expanding: e.g. <2 x float> -> <4 x float> instead of -> 2 floats. This affects two places in the code: handling cross block values and handling function return and arguments. Since vectors are already widened by legalizetypes, this gives us much better code and unblocks x86-64 abi and SPU abi work. For example, this (which is a silly example of a cross-block value): define <4 x float> @test2(<4 x float> %A) nounwind { %B = shufflevector <4 x float> %A, <4 x float> undef, <2 x i32> <i32 0, i32 1> %C = fadd <2 x float> %B, %B br label %BB BB: %D = fadd <2 x float> %C, %C %E = shufflevector <2 x float> %D, <2 x float> undef, <4 x i32> <i32 0, i32 1, i32 undef, i32 undef> ret <4 x float> %E } Now compiles into: _test2: ## @test2 ## BB#0: addps %xmm0, %xmm0 addps %xmm0, %xmm0 ret previously it compiled into: _test2: ## @test2 ## BB#0: addps %xmm0, %xmm0 pshufd $1, %xmm0, %xmm1 ## kill: XMM0<def> XMM0<kill> XMM0<def> insertps $0, %xmm0, %xmm0 insertps $16, %xmm1, %xmm0 addps %xmm0, %xmm0 ret This implements rdar://8230384 llvm-svn: 112101	2010-08-25 22:49:25 +00:00
Dan Gohman	fd824487a3	Remap metadata attached to instructions when remapping individual instructions, not when remapping modules. llvm-svn: 112091	2010-08-25 21:36:50 +00:00
Daniel Dunbar	3d148ac089	X86: Fix misencode of RI64mi8. This fixes OpenSSL / x86_64-apple-darwin10 / clang -O3. llvm-svn: 112089	2010-08-25 21:11:02 +00:00
Devang Patel	01262e129e	DIGlobalVariable can be used to encode debug info for globals that are directly folded into a constant by FE. llvm-svn: 112072	2010-08-25 18:52:02 +00:00
Daniel Dunbar	a54a1b0edf	ARM/Thumb2: Fix a misselect in getARMCmp, when attempting to adjust a signed comparison that would overflow. - The other under/overflow cases can't actually happen because the immediates which would trigger them are legal (so we don't enter this code), but adjusted the style to make it clear the transform is always valid. llvm-svn: 112053	2010-08-25 16:58:05 +00:00
Eric Christopher	6b1533a1a9	Add another basic test cribbed from the x86 fast-isel tests. llvm-svn: 112036	2010-08-25 07:57:29 +00:00
Eric Christopher	37d547aee6	Run this on thumb and arm. llvm-svn: 112035	2010-08-25 07:53:15 +00:00
Eric Christopher	e58c03698e	Make this testcase actually executed with fast-isel on arm. llvm-svn: 112033	2010-08-25 07:47:00 +00:00
Bruno Cardoso Lopes	0bc919fa35	Convert test to use filecheck and make it more specific llvm-svn: 112016	2010-08-25 01:47:16 +00:00
Owen Anderson	4afea9e3c6	In the default address space, any GEP off of null results in a trap value if you try to load it. Thus, any load in the default address space that completes implies that the base value that it GEP'd from was not null. llvm-svn: 112015	2010-08-25 01:16:47 +00:00
Michael J. Spencer	ccd28d0665	Fix COFF x86-64 relocations. PR7960. Multiple symbol reloc handling part of the patch by Cameron Esfahani. llvm-svn: 111963	2010-08-24 21:04:52 +00:00
Dan Gohman	c1a8958f76	XFAIL this on mingw, following remove_arguments_test.ll. llvm-svn: 111962	2010-08-24 20:54:50 +00:00
Dan Gohman	b2f29edc30	Add a testcase for basic bugpointing in the presence of metadata. llvm-svn: 111955	2010-08-24 20:23:51 +00:00
Daniel Dunbar	1c8d777c93	MC/X86: Tweak imul recognition, previous hack only applies for the imul form taking immediates. llvm-svn: 111950	2010-08-24 19:37:56 +00:00
Daniel Dunbar	09392785b4	MC/X86: Add custom hack for recognizing "imul $12, %eax" and friends. llvm-svn: 111947	2010-08-24 19:24:18 +00:00
Daniel Dunbar	2476432639	MC/AsmParser: Change ParseExpression to use ParseIdentifier(), to support dollars in identifiers. llvm-svn: 111946	2010-08-24 19:13:42 +00:00
Daniel Dunbar	94b84a19b9	MC/X86: Warn on scale factors > 1 without index register, instead of erroring, for 'as' compatibility. llvm-svn: 111945	2010-08-24 19:13:38 +00:00
Daniel Dunbar	3b96ffdac1	MC/Parser: Accept leading dollar signs in identifiers. - Implemented by manually splicing the tokens. If this turns out to be problematically platform specific, a more elegant solution would be to implement some context dependent lexing support. llvm-svn: 111934	2010-08-24 18:12:12 +00:00
Dan Gohman	c88fda477a	Fix X86's isLegalAddressingMode to recognize that static addresses need not be RIP-relative in small mode. llvm-svn: 111917	2010-08-24 15:55:12 +00:00
Kalle Raiskila	7e25bc4145	Fix SPU BE to use all the available return registers. llc used to assert on the added testcase. llvm-svn: 111911	2010-08-24 11:50:48 +00:00
Dan Gohman	c828c5465d	Extend function-local metadata to be usable as attachments. llvm-svn: 111895	2010-08-24 02:24:03 +00:00
Chris Lattner	02db8f6415	fix rdar://7997827 - Accept and ignore LL and ULL suffixes on integer literals. Also fix 0b010 syntax to actually work while we're at it :-) llvm-svn: 111876	2010-08-24 00:43:25 +00:00
Mikhail Glushenkov	aaed5ea9b7	llvmc: Make syntax more consistent. CompilationGraph and LanguageMap definitions do not use special syntax anymore. llvm-svn: 111862	2010-08-23 23:21:23 +00:00
Chris Lattner	58bd73a5a7	Add a new llvm.x86.int intrinsic, allowing access to the x86 int and int3 instructions. Patch by Peter Housel! llvm-svn: 111831	2010-08-23 19:39:25 +00:00
Chandler Carruth	ebf42ac831	Try to escape the '$'s in these so they reach the underlying 'sh' invocation. I have no idea how lit did the right thing here, but other test runners don't. llvm-svn: 111805	2010-08-23 08:54:19 +00:00
Dan Gohman	42ef669d81	Fix x86 fast-isel's cmp+branch folding to avoid folding when the comparison is in a different basic block from the branch. In such cases, the comparison's operands may not have initialized virtual registers available. llvm-svn: 111709	2010-08-21 02:32:36 +00:00
Bob Wilson	be745d8c00	Replace some NEON vmovl intrinsics that I missed earlier. llvm-svn: 111696	2010-08-20 23:22:43 +00:00
Bill Wendling	578ee4070c	Create the new linker type "linker_private_weak_def_auto". It's similar to "linker_private_weak", but it's known that the address of the object is not taken. For instance, functions that had an inline definition, but the compiler decided not to inline it. Note, unlike linker_private and linker_private_weak, linker_private_weak_def_auto may have only default visibility. The symbols are removed by the linker from the final linked image (executable or dynamic library). llvm-svn: 111684	2010-08-20 22:05:50 +00:00
Dale Johannesen	74c1f8ed7b	Test should pass on non-Darwin x86. llvm-svn: 111678	2010-08-20 21:18:55 +00:00
Dale Johannesen	bdc237c2ca	Don't run test on PPC darwin. llvm-svn: 111668	2010-08-20 18:29:27 +00:00
Owen Anderson	84c29a096b	Re-apply r111568 with a fix for the clang self-host. llvm-svn: 111665	2010-08-20 18:24:43 +00:00
Erick Tryzelaar	fb4c5012eb	Fix vmcore.ml test. llvm-svn: 111664	2010-08-20 18:24:35 +00:00
Mikhail Glushenkov	18277eafb0	llvmc: Fix alias generation. llvm-svn: 111662	2010-08-20 18:16:26 +00:00
Dan Gohman	a931605647	Convert DbgInfoPrinter to use errs() instead of outs(). llvm-svn: 111659	2010-08-20 18:03:05 +00:00
Erick Tryzelaar	8264a68b4c	Fix the running of ocaml tests. llvm-svn: 111626	2010-08-20 14:51:26 +00:00
Erick Tryzelaar	b4d48706ca	Expose LLVMSetOperand and LLVMGetNumOperands to llvm-c and ocaml. llvm-svn: 111625	2010-08-20 14:51:22 +00:00
Bob Wilson	21b62ac673	Fix some Ocaml tests: the %t substitution now returns an absolute path. llvm-svn: 111623	2010-08-20 14:20:17 +00:00
Bob Wilson	6c66144eb3	The %ocamlopt setting has embedded quotes. Copy the entire value instead of stopping at the first embedded quote. llvm-svn: 111622	2010-08-20 14:19:38 +00:00
Benjamin Kramer	18f47c7105	Update LLVMC tests for r111620. llvm-svn: 111621	2010-08-20 13:03:33 +00:00
Bob Wilson	9a511c07e4	Replace the arm.neon.vmovls and vmovlu intrinsics with vector sign-extend and zero-extend operations. llvm-svn: 111614	2010-08-20 04:54:02 +00:00
Owen Anderson	3323651ec7	Previous revert failed to remove this file. llvm-svn: 111582	2010-08-19 23:45:15 +00:00
Owen Anderson	43057cd56a	Revert r111568 to unbreak clang self-host. llvm-svn: 111571	2010-08-19 23:25:16 +00:00
Owen Anderson	bb723b228a	When a set of bitmask operations, typically from a bitfield initialization, only modifies the low bytes of a value, we can narrow the store to only over-write the affected bytes. llvm-svn: 111568	2010-08-19 22:15:40 +00:00
Evan Cheng	361b9be7c6	It's possible to sink a def if its local uses are PHI's. llvm-svn: 111537	2010-08-19 18:33:29 +00:00
Daniel Dunbar	0d7e9538db	tests: Haste makes waste. llvm-svn: 111525	2010-08-19 16:47:54 +00:00
Daniel Dunbar	471a649c6b	tests: Ignore whitespace in llvm_supports_binding() and llvm_gcc_supports(). llvm-svn: 111524	2010-08-19 16:46:52 +00:00
Kenneth Uildriks	d4b6ab9888	Fixed and reactivated a partial specialization test llvm-svn: 111516	2010-08-19 12:42:38 +00:00
Chris Lattner	f547740d3f	fix PR7465, mishandling of lcall and ljmp: intersegment long call and jumps. llvm-svn: 111496	2010-08-19 01:18:43 +00:00
Dale Johannesen	8d5f0208f2	Testcase for llvm-gcc checkin 111482. llvm-svn: 111483	2010-08-19 00:09:07 +00:00
Chris Lattner	3decde9305	refix PR1143 by making basicaa analyze zexts of indices aggressively, which I broke with a recent patch. llvm-svn: 111452	2010-08-18 23:09:49 +00:00
Dan Gohman	492c2ea31e	Add a testcase to verify that commands don't crash when they hit errors on stderr. llvm-svn: 111440	2010-08-18 22:35:56 +00:00
Dan Gohman	82656fb0e1	When sending stats output to stdout for grepping, don't emit normal output to standard output also. llvm-svn: 111435	2010-08-18 22:22:44 +00:00
Dan Gohman	2470818942	When sending stats output to stdout for grepping, don't emit normal output to standard output also. llvm-svn: 111401	2010-08-18 20:32:46 +00:00
Daniel Dunbar	8e92d9b68d	MC/ELF: Allow null values in virtual sections, ELF doesn't use special directives for putting contents in .bss, for example. llvm-svn: 111376	2010-08-18 18:22:37 +00:00
Kalle Raiskila	e60b5161d1	Fix a bug with insertelement on SPU. The previous algorithm in LowerVECTOR_SHUFFLE didn't check all requirements for "monotonic" shuffles. llvm-svn: 111361	2010-08-18 10:20:29 +00:00
Kalle Raiskila	ab49360f59	Remove all traces of v2[i,f]32 on SPU. The "half vectors" are now widened to full size by the legalizer. The only exception is in parameter passing, where half vectors are expanded. This causes changes to some dejagnu tests. llvm-svn: 111360	2010-08-18 10:04:39 +00:00
Kalle Raiskila	f3984d1ef6	Change SPU C calling convention to match that described in "SPU Application Binary Interface Specification, v1.9" by IBM. Specifically: use r3-r74 to pass parameters and the return value. llvm-svn: 111358	2010-08-18 09:50:30 +00:00
Chris Lattner	a25c05ed15	fix a buggy test llvm-svn: 111354	2010-08-18 04:55:12 +00:00
Chris Lattner	a33edcb56c	fix PR7589: In brief: gep P, (zext x) != gep P, (sext x) DecomposeGEPExpression was getting this wrong, confusing basicaa. llvm-svn: 111352	2010-08-18 04:28:19 +00:00
Chris Lattner	c8e38eb60b	filecheckize and detrivialize. llvm-svn: 111350	2010-08-18 04:25:43 +00:00
Chris Lattner	3c603024bb	Fix PR7755: knowing something about an inval for a pred from the LHS should disable reconsidering that pred on the RHS. However, knowing something about the pred on the RHS shouldn't disable subsequent additions on the RHS from happening. llvm-svn: 111349	2010-08-18 03:14:36 +00:00
Bob Wilson	fb7eaff759	Expand ZERO_EXTEND operations for NEON vector types. Testcase from Nick Lewycky. llvm-svn: 111341	2010-08-18 01:45:52 +00:00
Eric Christopher	51edc7b7e1	Temporarily revert r110987 as it's causing some miscompares in vector heavy code. I'll re-enable when we've tracked down the problem. llvm-svn: 111318	2010-08-17 22:55:27 +00:00
Dan Gohman	ed2b005842	Tweak IVUsers' concept of "interesting" to exclude add recurrences where the step value is an induction variable from an outer loop, to avoid trouble trying to re-expand such expressions. This effectively hides such expressions from indvars and lsr, which prevents them from getting into trouble. llvm-svn: 111317	2010-08-17 22:50:37 +00:00
Evan Cheng	efdc74ea59	Add nounwind. llvm-svn: 111312	2010-08-17 22:35:20 +00:00
Dale Johannesen	16f96445c3	Make fast scheduler handle asm clobbers correctly. PR 7882. Follows suggestion by Amaury Pouly, thanks. llvm-svn: 111306	2010-08-17 22:17:24 +00:00
Anton Korobeynikov	14be4dff8e	Add some win64 coff goodness. Patch by Cameron Esfahani! llvm-svn: 111287	2010-08-17 21:05:54 +00:00
Dan Gohman	5047ca0c02	When rotating loops, put the original header at the bottom of the loop, making the resulting loop significantly less ugly. Also, zap its trivial PHI nodes, since it's easy. llvm-svn: 111255	2010-08-17 17:39:21 +00:00
Bob Wilson	942b10f511	Change ARM PKHTB and PKHBT instructions to use a shift_imm operand to avoid printing "lsl #0". This fixes the remaining parts of pr7792. Make corresponding changes for encoding/decoding these instructions. llvm-svn: 111251	2010-08-17 17:23:19 +00:00
Bob Wilson	411dfad981	Allow more cases of undef shuffle indices and add tests for them. llvm-svn: 111226	2010-08-17 05:54:34 +00:00
Evan Cheng	f259efde47	PHI elimination should not break back edge. It can cause some significant code placement issues. rdar://8263994 good: LBB0_2: mov r2, r0 . . . mov r1, r2 bne LBB0_2 bad: LBB0_2: mov r2, r0 . . . @ BB#3: mov r1, r2 b LBB0_2 llvm-svn: 111221	2010-08-17 01:20:36 +00:00
Bob Wilson	eee4824f74	Add a testcase for svn 111208. llvm-svn: 111212	2010-08-16 23:44:29 +00:00
Bob Wilson	804f6159f1	Generalize a pattern for PKHTB: an SRL of 16-31 bits will guarantee that the high halfword is zero. The shift need not be exactly 16 bits. llvm-svn: 111196	2010-08-16 22:26:55 +00:00
Bob Wilson	3fd1e0dcda	Convert test to FileCheck. llvm-svn: 111195	2010-08-16 22:21:13 +00:00
Bob Wilson	8f553757c4	Convert a test to use FileCheck. llvm-svn: 111153	2010-08-16 17:05:27 +00:00
Dan Gohman	250b754428	Instead, teach SimplifyCFG to trim non-address-taken blocks from indirectbr destination lists. llvm-svn: 111122	2010-08-16 14:41:14 +00:00
Dan Gohman	fb83b043eb	Revert r111058, the lint check for indirectbr successors that aren't address-taken. This can occur normally, if the code which took the address got DCEd. llvm-svn: 111121	2010-08-16 14:39:19 +00:00
Benjamin Kramer	cbc55d9dc0	Test expects SSE, give him SSE. llvm-svn: 111115	2010-08-15 23:32:03 +00:00
Benjamin Kramer	4566466b7f	Restore arch on these tests, they fail on arm. llvm-svn: 111109	2010-08-15 20:42:56 +00:00
Dale Johannesen	339423c460	Mark as XFAIL on darwin 8. PR 7886. llvm-svn: 111108	2010-08-15 19:40:29 +00:00
Mikhail Glushenkov	b1ec90bcf4	Update tests. llvm-svn: 111096	2010-08-15 07:07:24 +00:00
Dan Gohman	aa445c0751	LoopSimplify shouldn't split loop backedges that use indirectbr. PR7867. llvm-svn: 111061	2010-08-14 00:43:09 +00:00
Dan Gohman	4a63fad976	Teach SimplifyCFG how to simplify indirectbr instructions. - Eliminate redundant successors. - Convert an indirectbr with one successor into a direct branch. Also, generalize SimplifyCFG to be able to be run on a function entry block. It knows quite a few simplifications which are applicable to the entry block, and it only needs a few checks to avoid trouble with the entry block. llvm-svn: 111060	2010-08-14 00:29:42 +00:00
Dan Gohman	21e6dc6aa3	Add a lint check for an indirectbr destination which has not had its address taken. llvm-svn: 111058	2010-08-13 23:56:28 +00:00
Bob Wilson	4577f37d49	Add a Thumb2 t2RSBrr instruction for disassembly only. This fixes another part of PR7792. llvm-svn: 111057	2010-08-13 23:24:25 +00:00
Bob Wilson	3c9ed76ba5	Temporarily disable tail calls on ARM to work around some linker problems. llvm-svn: 111050	2010-08-13 22:43:33 +00:00
Bob Wilson	15b3c3d0ac	Move the Thumb2 SSAT and USAT optional shift operator out of the instruction opcode. This fixes part of PR7792. llvm-svn: 111047	2010-08-13 21:48:10 +00:00
Dale Johannesen	8d3c89e765	Revert 110491. While not wrong, it was based on a misanalysis and is undesirable. llvm-svn: 111028	2010-08-13 18:43:45 +00:00
Mikhail Glushenkov	1d54a4ea1d	One more XFAIL. llvm-svn: 111010	2010-08-13 07:03:56 +00:00
Mikhail Glushenkov	49fd7d3a5f	More XFAILs. llvm-svn: 111008	2010-08-13 07:01:55 +00:00
Mikhail Glushenkov	143a33758c	Add an XFAIL. llvm-svn: 111004	2010-08-13 04:15:45 +00:00
Mikhail Glushenkov	ee1ef8c402	Remove -fexceptions from llvmc tests. llvm-svn: 110999	2010-08-13 02:29:35 +00:00
Mikhail Glushenkov	d2cc5fb971	llvmc: fix two tests, remove XFAILs. Tested on Linux and Darwin; please add platform-specific XFAILs/mail me a bug report if this still fails. llvm-svn: 110998	2010-08-13 02:29:24 +00:00
Nate Begeman	2a0ca3e937	Reapply this transformation now that it is passing the external test which it previously failed. llvm-svn: 110987	2010-08-13 00:17:53 +00:00
Chris Lattner	363226dfe8	fix PR7876: If ipsccp decides that a function's address is taken before it rewrites the code, we need to use that in the post-rewrite pass. llvm-svn: 110962	2010-08-12 22:25:23 +00:00
Johnny Chen	8e8f1c133a	Cleaned up the for-disassembly-only entries in the arm instruction table so that the memory barrier variants (other than 'SY' full system domain read and write) are treated as one instruction with option operand. llvm-svn: 110951	2010-08-12 20:46:17 +00:00
Bruno Cardoso Lopes	7f704b31a9	- Teach SSEDomainFix to switch between different levels of AVX instructions. Here we guess that AVX will have domain issues, so just implement them for consistency and in the future we remove if it's unnecessary. - Make foldMemoryOperandImpl aware of 256-bit zero vectors folding and support the 128-bit counterparts of AVX too. - Make sure MOV[AU]PS instructions are only selected when SSE1 is enabled, and duplicate the patterns to match AVX. - Add a testcase for a simple 128-bit zero vector creation. llvm-svn: 110946	2010-08-12 20:20:53 +00:00
Bob Wilson	86fa07ea05	Add a test for llvm-gcc svn 110632. llvm-svn: 110935	2010-08-12 17:31:41 +00:00
Eric Christopher	ac40d49c70	Temporarily revert 110737 and 110734, they were causing failures in an external testsuite. llvm-svn: 110905	2010-08-12 07:01:22 +00:00
Bruno Cardoso Lopes	7306c86886	Begin to support some vector operations for AVX 256-bit instructions. The long term goal here is to be able to match enough of vector_shuffle and build_vector so all avx intrinsics which aren't mapped to their own built-ins but to shufflevector calls can be codegen'd. This is the first (baby) step, support building zeroed vectors. llvm-svn: 110897	2010-08-12 02:06:36 +00:00
Johnny Chen	74491bb52c	The autogened decoder was confusing the ARM STRBT for ARM USAT, because the .td entry for ARM STRBT is actually a super-instruction for A8.6.199 STRBT A1 & A2. Recover by looking for ARM:USAT encoding pattern before delegating to the autogened decoder. Added a "usat" test case to arm-tests.txt. llvm-svn: 110894	2010-08-12 01:40:54 +00:00
Daniel Dunbar	7d7b4d1b0f	MC/X86/AsmParser: Give an explicit error message when we reject an instruction because it could have an ambiguous suffix. llvm-svn: 110890	2010-08-12 00:55:42 +00:00
Devang Patel	48595bf2bc	This is x86 only test. llvm-svn: 110887	2010-08-12 00:17:38 +00:00
Johnny Chen	d59c73f998	Changed the format of DMBsy, DSBsy, and friends from Pseudo to MiscFrm. Added two test cases to arm-tests.txt. llvm-svn: 110880	2010-08-11 23:35:12 +00:00
Bob Wilson	add513112a	Move the ARM SSAT and USAT optional shift amount operand out of the instruction opcode. This also fixes part of PR7792. llvm-svn: 110875	2010-08-11 23:10:46 +00:00
Bruno Cardoso Lopes	1675ee7a02	Add testcases for all AVX 256-bit intrinsics added in the last couple days llvm-svn: 110854	2010-08-11 21:12:09 +00:00
Bruno Cardoso Lopes	29c8818ad9	Reapply r109881 using a more strict command line for llc. llvm-svn: 110833	2010-08-11 17:39:23 +00:00
Jim Grosbach	a5f923b1a1	fix silly typo llvm-svn: 110831	2010-08-11 17:32:46 +00:00
Jim Grosbach	2bf8bd1e19	Add a target triple, as the runtime library invocation varies a bit by platform. It's apparently "bl __muldf3" on linux, for example. Since that's not what we're checking here, it's more robust to just force a triple. We just want to check that the inline FP instructions are only generated on cpus that have them. llvm-svn: 110830	2010-08-11 17:31:12 +00:00
Evan Cheng	b0276814d5	Fix test and re-enable it. llvm-svn: 110829	2010-08-11 17:25:51 +00:00
Dan Gohman	4df4114870	Temporarily disable some failing tests, until they can be properly investigated. llvm-svn: 110825	2010-08-11 16:36:07 +00:00
Jim Grosbach	4d5dc3e7e5	cortex m4 has floating point support, but only single precision. llvm-svn: 110810	2010-08-11 15:44:15 +00:00
Dan Gohman	f3d783a6d2	Temporarily disable some failing tests, until they can be properly investigated. llvm-svn: 110808	2010-08-11 15:09:00 +00:00
Bill Wendling	6a98131468	Consider this code snippet: float t1(int argc) { return (argc == 1123) ? 1.234f : 2.38213f; } We would generate truly awful code on ARM (those with a weak stomach should look away): _t1: movw r1, #1123 movs r2, #1 movs r3, #0 cmp r0, r1 mov.w r0, #0 it eq moveq r0, r2 movs r1, #4 cmp r0, #0 it ne movne r3, r1 adr r0, #LCPI1_0 ldr r0, [r0, r3] bx lr The problem was that legalization was creating a cascade of SELECT_CC nodes, for the comparison of "argc == 1123" which was fed into a SELECT node for the ?: statement which was itself converted to a SELECT_CC node. This is because the ARM back-end doesn't have custom lowering for SELECT nodes, so it used the default "Expand". I added a fairly simple "LowerSELECT" to the ARM back-end. It takes care of this testcase, but can obviously be expanded to include more cases. Now we generate this, which looks optimal to me: _t1: movw r1, #1123 movs r2, #0 cmp r0, r1 adr r0, #LCPI0_0 it eq moveq r2, #4 ldr r0, [r0, r2] bx lr .align 2 LCPI0_0: .long 1075344593 @ float 2.382130e+00 .long 1067316150 @ float 1.234000e+00 llvm-svn: 110799	2010-08-11 08:43:16 +00:00
Evan Cheng	5190f09291	Report error if codegen tries to instantiate an ARM target when the cpu does not support it, e.g. cortex-m* processors. llvm-svn: 110798	2010-08-11 07:17:46 +00:00
Evan Cheng	40921a4e62	Add ARM Archv6M and let it imply FeatureDB (having dmb, etc.) llvm-svn: 110795	2010-08-11 06:51:54 +00:00
Daniel Dunbar	188b47b214	MC/ARM: Add basic support for handling predication by parsing it out of the mnemonic into a separate operand form. llvm-svn: 110794	2010-08-11 06:37:20 +00:00
Evan Cheng	49e02fc414	Add Cortex-M0 support. It's an ARMv6m device (no ARM mode) with some 32-bit instructions: dmb, dsb, isb, msr, and mrs. llvm-svn: 110786	2010-08-11 06:30:38 +00:00
Evan Cheng	6e809de90c	- Add subtarget feature -mattr=+db which determines whether an ARM cpu has the memory and synchronization barrier dmb and dsb instructions. - Change instruction names to something more sensible (matching name of actual instructions). - Added tests for memory barrier codegen. llvm-svn: 110785	2010-08-11 06:22:01 +00:00
Bill Wendling	79937dfc5b	Update test to match output of optimize compares for ARM. llvm-svn: 110765	2010-08-11 01:05:02 +00:00
Dan Gohman	f7495f286a	When analyzing loop exit conditions combined with and and or, don't make any assumptions about when the two conditions will agree on when to permit the loop to exit. This fixes PR7845. llvm-svn: 110758	2010-08-11 00:12:36 +00:00
Bill Wendling	871d4e1170	The optimize comparisons pass removes the "cmp" instruction this is checking for. llvm-svn: 110739	2010-08-10 22:16:05 +00:00
Nate Begeman	3ec892c167	Add test for recent instcombine vector shuffle enhancement llvm-svn: 110737	2010-08-10 21:58:00 +00:00
Daniel Dunbar	18cc4acb00	tests: Don't error out if HOME isn't present in the environment. llvm-svn: 110711	2010-08-10 19:36:25 +00:00
Evan Cheng	3f251fb26e	Re-apply r110655 with fixes. Epilogue must restore sp from fp if the function stack frame has a var-sized object. Also added a test case to check for the added benefit of this patch: it's optimizing away the unnecessary restore of sp from fp for some non-leaf functions. llvm-svn: 110707	2010-08-10 19:30:19 +00:00
Daniel Dunbar	0dd47bfca3	Revert r110655, "Fix ARM hasFP() semantics. It should return true whenever FP register is", it breaks a couple test-suite tests. llvm-svn: 110701	2010-08-10 18:32:02 +00:00
Daniel Dunbar	d215976208	MC/AsmParser: Fix a bug in macro argument parsing, which was dropping parentheses from argument lists. llvm-svn: 110692	2010-08-10 17:38:52 +00:00
Jakob Stoklund Olesen	5730846c2f	Fix test for more architectures. Patch by Tobias Grosser. llvm-svn: 110685	2010-08-10 16:48:24 +00:00
Tobias Grosser	7fbe6cb429	RegionInfo: Do not assert if a BB is not part of the dominance tree. llvm-svn: 110665	2010-08-10 09:54:35 +00:00
Tobias Grosser	fedeff8015	Fix failing testcase. Those look like typos to me. llvm-svn: 110664	2010-08-10 09:54:29 +00:00
Devang Patel	b219746c80	Handle TAG_constant for integers. llvm-svn: 110656	2010-08-10 07:11:13 +00:00
Evan Cheng	8d5d1c1331	Fix ARM hasFP() semantics. It should return true whenever FP register is reserved, not available for general allocation. This eliminates all the extra checks for Darwin. This change also fixes the use of FP to access frame indices in leaf functions and cleaned up some confusing code in epilogue emission. llvm-svn: 110655	2010-08-10 06:26:49 +00:00
Eli Friedman	f99e7e6643	PR7853: fix a silly mistake introduced in r101899, and add a test to make sure it doesn't regress again. llvm-svn: 110597	2010-08-09 20:49:43 +00:00
Kalle Raiskila	999da1f3a0	Have SPU handle halfvec stores aligned by 8 bytes. llvm-svn: 110576	2010-08-09 16:33:00 +00:00
Rafael Espindola	cc4a9670d3	XFAIL for mingw that has no plugins. llvm-svn: 110574	2010-08-09 15:14:06 +00:00
Nick Lewycky	7f36ac54d7	Reject unrepresentable pointer types in intrinsics. Fixes PR7316. llvm-svn: 110541	2010-08-08 06:12:09 +00:00
Rafael Espindola	8aa19b05ee	Use %shlibext instead of .so llvm-svn: 110529	2010-08-08 00:55:59 +00:00
Rafael Espindola	92a4a833f9	Move the bugpoint test passes to a plugin in preparation for having bugpoint use opt. llvm-svn: 110520	2010-08-07 21:48:09 +00:00
Dale Johannesen	a3bd31a923	Use sdmem and sse_load_f64 (etc.) for the vector form of CMPSD (etc.) Matching a 128-bit memory operand is wrong, the instruction uses only 64 bits (same as ADDSD etc.) 8193553. llvm-svn: 110491	2010-08-07 00:33:42 +00:00
Stuart Hastings	5afa738d7f	Test case for r110459. Radar 8264751. Test case by Fariborz Jahanian! llvm-svn: 110467	2010-08-06 19:02:24 +00:00
Dan Gohman	e68958fcdf	Implement a proper getModRefInfo for va_arg. llvm-svn: 110458	2010-08-06 18:24:38 +00:00
Rafael Espindola	027d5bcf89	Fix eabi calling convention when a 64 bit value shadows r3. Without this what was happening was: * R3 is not marked as "used" * ARM backend thinks it has to save it to the stack because of vaarg * Offset computation correctly ignores it * Offsets are wrong llvm-svn: 110446	2010-08-06 15:35:32 +00:00
Eric Christopher	e1fb772aa5	Add an option to always emit realignment code for a particular module. llvm-svn: 110404	2010-08-05 23:57:43 +00:00
Dan Gohman	884dd752c3	Implement AccessesArguments checking in the two-callsite form of BasicAA::getModRefInfo. This allows BasicAA to say that two memset calls to non-aliasing memory locations don't interfere. llvm-svn: 110393	2010-08-05 23:34:50 +00:00
Dan Gohman	26ef7c7ab7	Fix memdep's code for reasoning about dependences between two calls. A Ref response from getModRefInfo is not useful here. Instead, check for identical calls only in the NoModRef case. Reapply r110270, and strengthen it to compensate for the memdep changes. When both calls are readonly, there is no dependence between them. llvm-svn: 110382	2010-08-05 22:09:15 +00:00
Devang Patel	cc3f3b341d	Move x86 specific tests into test/CodeGen/X86. llvm-svn: 110372	2010-08-05 20:25:37 +00:00
Bob Wilson	72de307116	Add an ARM RSCrr instruction for disassembly only. Partial fix for PR7792. llvm-svn: 110361	2010-08-05 18:59:36 +00:00
Bob Wilson	adb93e56a3	Add an ARM RSBrr instruction for disassembly only. Partial fix for PR7792. llvm-svn: 110358	2010-08-05 18:23:43 +00:00
Dan Gohman	c53ee449a5	Move x86-specific tests out of test/Transforms/LoopStrengthReduce and into test/CodeGen/X86, so that they aren't run when the x86 target is not enabled. Fix uglygep.ll to not be x86-specific. llvm-svn: 110343	2010-08-05 17:04:15 +00:00
Daniel Dunbar	e62e664656	tests: CodeGen/X86/GC tests require X86. llvm-svn: 110338	2010-08-05 15:45:33 +00:00
Daniel Dunbar	57e3f71538	tests: Mark MC/AsmParser tests as requiring x86 for now -- almost all of them rely on using a specific x86 triple to test what they want to test. llvm-svn: 110337	2010-08-05 15:44:15 +00:00
Rafael Espindola	5bca58a290	check-lit was failing again on F13 64 bits :-( llvm-svn: 110311	2010-08-05 03:35:01 +00:00
Dan Gohman	554b012f67	Revert r110270 for now. It appears to uncover a memdep bug. llvm-svn: 110293	2010-08-05 00:43:10 +00:00
Bob Wilson	97886d59d1	ARM "rrx" shift operands do not have an immediate. PR7790. llvm-svn: 110292	2010-08-05 00:34:42 +00:00
Dan Gohman	109561845b	The trouble with testing for "ModRef" and "NoModRef" is that one is a suffix of the other, and FileCheck accepts superstrings. Adjust the output to avoid this problem. llvm-svn: 110280	2010-08-04 23:37:55 +00:00
Bill Wendling	ca1cb13646	The lower invoke pass needs to have unreachable code elimination run after it because it could create such things. This fixes a MingW buildbot test failure. llvm-svn: 110279	2010-08-04 23:36:02 +00:00
Dan Gohman	bd33dab633	The two-callsite form of AliasAnalysis::getModRefInfo is documented to return Ref if the left callsite only reads memory read or written by the right callsite; fix BasicAliasAnalysis to implement this. Add AliasAnalysisEvaluator support for testing the two-callsite form of getModRefInfo. llvm-svn: 110270	2010-08-04 22:56:29 +00:00
Eli Friedman	39d0f57cab	PR7814: Truncates cannot be ignored for signed comparisons. llvm-svn: 110268	2010-08-04 22:40:58 +00:00
Stuart Hastings	49af1ebf2e	Test case for r110250. Radar 8264670. Test case by Fariborz Jahanian! llvm-svn: 110254	2010-08-04 22:05:38 +00:00
Bill Wendling	26feb849a4	Testcase for r110248. llvm-svn: 110249	2010-08-04 21:56:30 +00:00
Devang Patel	5c1f56b78f	Test case for combination of r110234 & r110235. llvm-svn: 110238	2010-08-04 18:42:46 +00:00
Dan Gohman	6786a04d0d	These tests are no longer stored in CVS. llvm-svn: 110201	2010-08-04 15:58:01 +00:00
Stuart Hastings	cba0d06b7c	call-imm.ll test case regex fix. Patch by Dimitry Andric! llvm-svn: 110199	2010-08-04 15:31:35 +00:00
Kalle Raiskila	8b2f70125f	Make SPU backend handle insertelement and store for "half vectors" llvm-svn: 110198	2010-08-04 13:59:48 +00:00
Bob Wilson	79daf7e0ae	Combine NEON VABD (absolute difference) intrinsics with ADDs to make VABA (absolute difference with accumulate) intrinsics. Radar 8228576. llvm-svn: 110170	2010-08-04 00:12:08 +00:00
Dan Gohman	3619660529	Make instcombine set explicit alignments on load or store instructions with alignment 0, so that subsequent passes don't need to bother checking the TargetData ABI size manually. llvm-svn: 110128	2010-08-03 18:20:32 +00:00
Jakob Stoklund Olesen	011ff9bec9	OK, that's it. This test is going away now. But don't worry, I am taking it to a nice farm in the country where it can play with other tests. And bunnies. It is not clear what is being tested, and the revision history shows a bunch of random changes to the expected instruction count. Clearly, we are just fudging it to pass whenever it fails. llvm-svn: 110118	2010-08-03 17:21:14 +00:00
Peter Collingbourne	ddaaf40d24	Add an atomic lowering pass llvm-svn: 110113	2010-08-03 16:19:16 +00:00
Michael J. Spencer	54cfd42c33	MC: Fix symbol fragment offsets in COFF. Patch by Cameron Esfahani! llvm-svn: 110104	2010-08-03 05:02:46 +00:00
Michael J. Spencer	d32764c8a0	Revert "MC: Fix symbol fragment offsets in COFF." This reverts commit r110100 Wrong path caps. llvm-svn: 110103	2010-08-03 04:53:28 +00:00
Michael J. Spencer	cf3d8b4ec4	MC: Fix symbol fragment offsets in COFF. Patch by Cameron Esfahani! llvm-svn: 110100	2010-08-03 04:43:24 +00:00
Stuart Hastings	460a356bf6	Diabolical hack to make a test compatible with clang. (Thanks to Dale!) Radar 8246180. llvm-svn: 110081	2010-08-02 23:29:03 +00:00
Dan Gohman	d8968da2c5	Add a lint check for indirectbr with no successors. llvm-svn: 110074	2010-08-02 23:06:43 +00:00
Stuart Hastings	0e6e8858ff	Testcase for r110043. Radar 8246180. llvm-svn: 110070	2010-08-02 22:09:53 +00:00
Kalle Raiskila	77558b7d13	More SPU v2f32 stuff added: insertelement and shuffle. llvm-svn: 110038	2010-08-02 11:22:10 +00:00
Kalle Raiskila	68b3886678	Add preliminary v2f32 support for SPU. Like with v2i32, we just duplicate the instructions and operate on half vectors. Also reorder code in SPUInstrInfo.td for better coherency. llvm-svn: 110037	2010-08-02 10:25:47 +00:00
Owen Anderson	8f306a779b	Re-apply the infamous r108614, with a fix pointed out by Dirk Steinke. llvm-svn: 110036	2010-08-02 09:32:13 +00:00
Kalle Raiskila	622f8eb981	Add preliminary v2i32 support for SPU backend. As there are no such registers in SPU, this support boils down to "emulating" them by duplicating instructions on the general purpose registers. This adds the most basic operations on v2i32: passing parameters, addition, subtraction, multiplication and a few others. llvm-svn: 110035	2010-08-02 08:54:39 +00:00
Daniel Dunbar	1465d7cffa	Fix comment. llvm-svn: 110006	2010-08-02 01:25:20 +00:00
Daniel Dunbar	5eeae48783	tests: Kill off custom targets which were just there for TestRunner.sh. llvm-svn: 110003	2010-08-02 00:52:44 +00:00
Daniel Dunbar	4b77d23d40	tests: Deprecate TestRunner.sh, and have it just invoke 'llvm-lit' (which will need to be in your path). Please move to using 'llvm-lit' if you are still using TestRunner.sh. llvm-svn: 110002	2010-08-02 00:52:41 +00:00
Eli Friedman	7595ce05a2	PR7781: Fix incorrect shifting in PPCTargetLowering::LowerBUILD_VECTOR. llvm-svn: 109998	2010-08-02 00:18:19 +00:00
Daniel Dunbar	b1af605e58	tests: Make 'lit' the default test tool. You can still use 'make check-dg' to run the tests using DejaGNU, but not for much longer. This is a last call for DejaGNU supporters, if no one complains soon the DejaGNU support is going to die. llvm-svn: 109997	2010-08-02 00:05:18 +00:00
Eli Friedman	1b2bc1b844	PR7774: Fix undefined shifts in Alpha backend. As a bonus, this actually improves the generated code in some cases. llvm-svn: 109985	2010-08-01 21:13:28 +00:00
Bob Wilson	66161f5eb4	Revert new AVX intrinsic tests. They are breaking buildbots and Bruno is away from a computer now. --- Reverse-merging r109881 into '.': D test/CodeGen/X86/avx-intrinsics-x86.ll D test/CodeGen/X86/avx-intrinsics-x86_64.ll llvm-svn: 109959	2010-07-31 22:36:03 +00:00
Daniel Dunbar	0b636a24c7	Speculatively revert r108614, "Another attempt at getting the clang self-host to like my instcombine patch.", in an attempt to fix Clang i386 bootstrap. - Also PR7719. llvm-svn: 109953	2010-07-31 19:51:11 +00:00
Bob Wilson	cd5fc7bef1	Add support for disassembling VMVN (immediate) instructions. PR7747. llvm-svn: 109946	2010-07-31 05:57:44 +00:00
Dale Johannesen	cf0287e56d	PPC doesn't support VLA with large alignment. This was formerly rejected by the FE, so asserted in the BE; now the FE only warns, so we treat it as a legitimate fatal error in PPC BE. This means the test for the feature won't pass, so it's xfail'd. llvm-svn: 109892	2010-07-30 21:09:48 +00:00
Bruno Cardoso Lopes	92941fdb26	A bunch of tests for AVX intrinsics llvm-svn: 109881	2010-07-30 19:57:56 +00:00
Bob Wilson	964179cb58	Attempt to fix the llvm-gcc-powerpc-darwin9 buildbot. llvm-svn: 109876	2010-07-30 18:52:47 +00:00
Eli Friedman	ffe64c06ef	Fix for bug reported by Evzen Muller on llvm-commits: make sure to correctly check the range of the constant when optimizing a comparison between a constant and a sign_extend_inreg node. llvm-svn: 109854	2010-07-30 06:44:31 +00:00
Jim Grosbach	d343166a0b	Many Thumb2 instructions can reference the full ARM register set (i.e., have 4 bits per register in the operand encoding), but have undefined behavior when the operand value is 13 or 15 (SP and PC, respectively). The trivial coalescer in linear scan sometimes will merge a copy from SP into a subsequent instruction which uses the copy, and if that instruction cannot legally reference SP, we get bad code such as: mls r0,r9,r0,sp instead of: mov r2, sp mls r0, r9, r0, r2 This patch adds a new register class for use by Thumb2 that excludes the problematic registers (SP and PC) and is used instead of GPR for those operands which cannot legally reference PC or SP. The trivial coalescer explicitly requires that the register class of the destination for the COPY instruction contain the source register for the COPY to be considered for coalescing. This prevents errant instructions like that above. PR7499 llvm-svn: 109842	2010-07-30 02:41:01 +00:00
Eric Christopher	2e276485cb	Fix this up per llvm-gcc r109819. llvm-svn: 109820	2010-07-29 23:20:29 +00:00
Benjamin Kramer	d9624e2d2e	Remove XFAIL, test doesn't leak anymore. llvm-svn: 109801	2010-07-29 20:36:36 +00:00
Dale Johannesen	2bff50546c	Implement vector constants which are splat of integers with mov + vdup. 8003375. This is currently disabled by default because LICM will not hoist a VDUP, so it pessimizes the code if the construct occurs inside a loop (8248029). llvm-svn: 109799	2010-07-29 20:10:08 +00:00
Dan Gohman	390914cbe8	Make GlobalValue alignment consistent with load, store, and alloca alignment, fixing silent truncation of alignment values. llvm-svn: 109653	2010-07-28 20:56:48 +00:00
Dan Gohman	a7e5a24093	Define a maximum supported alignment value for load, store, and alloca instructions (constrained by their internal encoding), and add error checking for it. Fix an instcombine bug which generated huge alignment values (null is infinitely aligned). This fixes undefined behavior noticed by John Regehr. llvm-svn: 109643	2010-07-28 20:12:04 +00:00
Nate Begeman	53afc8f06a	Implement a vectorized algorithm for <16 x i8> << <16 x i8> This is about 4x faster and smaller than the existing scalarization. llvm-svn: 109566	2010-07-28 00:21:48 +00:00
Stuart Hastings	a7f1d4a2ba	Testcase for r109556. Radar 8198362. llvm-svn: 109557	2010-07-27 23:15:25 +00:00
Nate Begeman	269a6da023	~40% faster vector shl <4 x i32> on SSE 4.1 Larger improvements for smaller types coming in future patches. For: define <2 x i64> @shl(<4 x i32> %r, <4 x i32> %a) nounwind readnone ssp { entry: %shl = shl <4 x i32> %r, %a ; <<4 x i32>> [#uses=1] %tmp2 = bitcast <4 x i32> %shl to <2 x i64> ; <<2 x i64>> [#uses=1] ret <2 x i64> %tmp2 } We get: _shl: ## @shl pslld $23, %xmm1 paddd LCPI0_0, %xmm1 cvttps2dq %xmm1, %xmm1 pmulld %xmm1, %xmm0 ret Instead of: _shl: ## @shl pshufd $3, %xmm0, %xmm2 movd %xmm2, %eax pshufd $3, %xmm1, %xmm2 movd %xmm2, %ecx shll %cl, %eax movd %eax, %xmm2 pshufd $1, %xmm0, %xmm3 movd %xmm3, %eax pshufd $1, %xmm1, %xmm3 movd %xmm3, %ecx shll %cl, %eax movd %eax, %xmm3 punpckldq %xmm2, %xmm3 movd %xmm0, %eax movd %xmm1, %ecx shll %cl, %eax movd %eax, %xmm2 movhlps %xmm0, %xmm0 movd %xmm0, %eax movhlps %xmm1, %xmm1 movd %xmm1, %ecx shll %cl, %eax movd %eax, %xmm0 punpckldq %xmm0, %xmm2 movdqa %xmm2, %xmm0 punpckldq %xmm3, %xmm0 ret llvm-svn: 109549	2010-07-27 22:37:06 +00:00
Devang Patel	bd32256e25	Update tests to not rely on input file's absolute path. llvm-svn: 109521	2010-07-27 18:13:53 +00:00
Nate Begeman	317b969ac5	Fix a crash in the dag combiner caused by ConstantFoldBIT_CONVERTofBUILD_VECTOR calling itself recursively and returning a SCALAR_TO_VECTOR node, but assuming the input was always a BUILD_VECTOR. llvm-svn: 109519	2010-07-27 18:02:18 +00:00
Tobias Grosser	731b079edb	Make coff-dump.py executable and add python as executable for this script. This fixes the MC/COFF/basic-coff.ll test case. llvm-svn: 109497	2010-07-27 09:01:26 +00:00
Michael J. Spencer	f8270bdb2d	Make MC use Windows COFF on Windows and add tests. llvm-svn: 109494	2010-07-27 06:46:15 +00:00
Anton Korobeynikov	6bcea068db	Currently EH lowering code expects typeinfo to be global only. This assumption is not satisfied due to global merging. Work around the issue by temporarily disabling merging of const globals. Also, ignore LLVM "special" globals. This fixes PR7716 llvm-svn: 109423	2010-07-26 18:45:39 +00:00
Owen Anderson	bb4c4b59a4	Fix a test with malformed IR. Not sure why this didn't fail before. llvm-svn: 109422	2010-07-26 18:44:56 +00:00
Dan Gohman	cd83870faf	Fix SCEVExpander::visitAddRecExpr so that it remembers the induction variable it inserted rather than using LoopInfo::getCanonicalInductionVariable to rediscover it, since that doesn't work on non-canonical loops. This fixes infinite recursion on such loops; PR7562. llvm-svn: 109419	2010-07-26 18:28:14 +00:00
Dan Gohman	b0961f2443	Avoid depending on LCSSA implicitly pulling in LoopSimplify. llvm-svn: 109410	2010-07-26 18:00:43 +00:00
Bruno Cardoso Lopes	306a1f9721	Support x86 "eiz" and "riz" pseudo index registers in the assembler. llvm-svn: 109295	2010-07-24 00:06:39 +00:00
Matt Fleming	fbd7f65248	Consolidate the ELF section directive tests into a single file as suggested by Chris Lattner. llvm-svn: 109290	2010-07-23 23:40:41 +00:00
Evan Cheng	df907f4594	- Allow target to specify when is register pressure "too high". In most cases, it's too late to start backing off aggressive latency scheduling when most of the registers are in use so the threshold should be a bit tighter. - Correctly handle live out's and extract_subreg etc. - Enable register pressure aware scheduling by default for hybrid scheduler. For ARM, this is almost always a win on # of instructions. It's runtime neutral for most of the tests. But for some kernels with high register pressure it can be a huge win. e.g. 464.h264ref reduced number of spills by 54 and sped up by 20%. llvm-svn: 109279	2010-07-23 22:39:59 +00:00
Bruno Cardoso Lopes	6f38011196	Move AVX encoding tests to different files llvm-svn: 109269	2010-07-23 21:25:26 +00:00
Dan Gohman	55e244698a	Use the proper type for shift counts. This fixes a bootstrap error. llvm-svn: 109265	2010-07-23 21:08:12 +00:00
Stuart Hastings	caf8e3a2db	Test case to ensure template function declaration refers to correct filename. Radar 8063111. llvm-svn: 109258	2010-07-23 20:15:49 +00:00
Bruno Cardoso Lopes	ea0e05a3ce	Add AVX version of CLMUL instructions llvm-svn: 109248	2010-07-23 18:41:12 +00:00
Dan Gohman	0818684a70	DAGCombine (shl (anyext x, c)) to (anyext (shl x, c)) if the high bits are not demanded. This often allows the anyext to be folded away. llvm-svn: 109242	2010-07-23 18:03:30 +00:00
Bruno Cardoso Lopes	acd9230b1b	Add complete assembler support for FMA3 instructions, with descriptions and encodings taken from the AVX manual llvm-svn: 109204	2010-07-23 00:54:35 +00:00
Bruno Cardoso Lopes	0710c74f29	Add remaining AVX instructions (most of them dealing with GR64 destinations). This completes the assembler support for the general AVX ISA. But we still miss instructions from FMA3 and CLMUL specific feature flags, which are now the next step llvm-svn: 109168	2010-07-22 21:18:49 +00:00
Tobias Grosser	336734aca6	Add new RegionInfo pass. The RegionInfo pass detects single entry single exit regions in a function, where a region is defined as any subgraph that is connected to the remaining graph at only two spots. Furthermore an hierarchical region tree is built. Use it by calling "opt -regions analyze" or "opt -view-regions". llvm-svn: 109089	2010-07-22 07:46:31 +00:00
Eric Christopher	9a77382685	Custom lower the memory barrier instructions and add support for lowering without sse2. Add a couple of new testcases. Fixes a few libgomp tests and latent bugs. Remove a few todos. llvm-svn: 109078	2010-07-22 02:48:34 +00:00
Evan Cheng	285903853f	More register pressure aware scheduling work. llvm-svn: 109064	2010-07-21 23:53:58 +00:00
Bruno Cardoso Lopes	e3acfd4d58	Add more 256-bit forms for a bunch of regular AVX instructions Add 64-bit (GR64) versions of some instructions (which are not described in their SSE forms, but are described in AVX) llvm-svn: 109063	2010-07-21 23:53:50 +00:00
Eric Christopher	84bdfd80df	Baby steps towards ARM fast-isel. llvm-svn: 109047	2010-07-21 22:26:11 +00:00
Bruno Cardoso Lopes	6238c1d102	Add missing AVX convert instructions. Those instructions are not described in their SSE forms (although they exist), but add the AVX forms anyway, so the assembler can benefit from it llvm-svn: 109039	2010-07-21 21:37:59 +00:00
Dan Gohman	093cb79d4b	Disallow null as a named metadata operand. Make MDNode::destroy private. Fix the one thing that used MDNode::destroy, outside of MDNode itself. One should never delete or destroy an MDNode explicitly. MDNodes implicitly go away when there are no references to them (implementation details aside). llvm-svn: 109028	2010-07-21 18:54:18 +00:00
Rafael Espindola	4277e14dc4	Fix calling convention on ARM if vfp2+ is enabled. llvm-svn: 109009	2010-07-21 11:38:30 +00:00
Bruno Cardoso Lopes	cdbec62510	Add AVX only vzeroall and vzeroupper instructions llvm-svn: 109002	2010-07-21 08:56:24 +00:00
Eric Christopher	690aa72437	Turn this test on again after the llvm-gcc change in r108986. llvm-svn: 108987	2010-07-21 04:54:06 +00:00
Eric Christopher	8d95d26eb1	Update this to use a "valid" alignment. llvm-svn: 108985	2010-07-21 04:51:24 +00:00
Bruno Cardoso Lopes	3499934da6	Add new AVX vpermilps, vpermilpd and vperm2f128 instructions llvm-svn: 108984	2010-07-21 03:07:42 +00:00
Bruno Cardoso Lopes	3ceaf7a0a2	Add new AVX vmaskmov instructions, and also fix the VEX encoding bits to support it llvm-svn: 108983	2010-07-21 02:46:58 +00:00
Bruno Cardoso Lopes	e706501975	Add new AVX vextractf128 instructions llvm-svn: 108964	2010-07-20 23:19:02 +00:00
Matt Fleming	c3eb5e3d4b	Include some tests for the recently committed ELF section directive handlers. llvm-svn: 108938	2010-07-20 21:37:30 +00:00
Eric Christopher	3f696ff489	Testcase for llvm-gcc commit r108910. llvm-svn: 108918	2010-07-20 20:32:47 +00:00
Bruno Cardoso Lopes	3b505848fd	Add new AVX instruction vinsertf128 llvm-svn: 108892	2010-07-20 19:44:51 +00:00
Dan Gohman	625fd2292d	Fix SCEV denormalization of expressions where the exit value from one loop is involved in the increment of an addrec for another loop. This fixes rdar://8168938. llvm-svn: 108863	2010-07-20 17:06:20 +00:00
Jim Grosbach	badf087e45	update tests for smarter BIC usage llvm-svn: 108846	2010-07-20 16:16:48 +00:00
Duncan Sands	2e839de377	The same problem was being tracked in PR7652. llvm-svn: 108843	2010-07-20 15:52:32 +00:00
Bruno Cardoso Lopes	160695fecb	Fix PR7174, a couple of Mips fixes: - Fix a typo for PIC check during jmp table lowering - Also fix the "first jump table basic block is not considered only reachable by fall through" problem, use this ad-hoc solution until I come up with something better. Patch by stetorvs@gmail.com llvm-svn: 108820	2010-07-20 08:37:04 +00:00
Bruno Cardoso Lopes	ea7863647b	Fix Mips PR7473. Patch by stetorvs@gmail.com llvm-svn: 108816	2010-07-20 07:58:51 +00:00
Bruno Cardoso Lopes	6c8041ea34	x86_32 tests for vbroadcast llvm-svn: 108789	2010-07-20 00:11:50 +00:00
Bruno Cardoso Lopes	14c5fd437c	Add AVX vbroadcast new instruction llvm-svn: 108788	2010-07-20 00:11:13 +00:00
Bruno Cardoso Lopes	9de0ca73d4	Add 256-bit vaddsub, vhadd, vhsub, vblend and vdpp instructions! llvm-svn: 108769	2010-07-19 23:32:44 +00:00
Dan Gohman	b5e918dc05	After a custom inserter, in a block which has constant instructions, update the current basic block in addition to the current insert position, so that they remain consistent. This fixes rdar://8204072. llvm-svn: 108765	2010-07-19 22:48:56 +00:00
Daniel Dunbar	9db7d0addd	X86: Mark JMP{32,64}[mr] as requires 32-bit/64-bit mode. They are the same instruction, we only want to allow the one for the current subtarget. - This also fixes suffix matching for jmp instructions, because it eliminates the ambiguity between 'jmpl' and 'jmpq'. llvm-svn: 108746	2010-07-19 20:44:16 +00:00
Dale Johannesen	d4e389441d	Testcase for 108732 (8195660). llvm-svn: 108733	2010-07-19 18:22:40 +00:00
Devang Patel	18efced1a2	Fix PR 7662. Do not try to insert local variable info to a DIE used for function declaration. llvm-svn: 108731	2010-07-19 17:53:55 +00:00
Owen Anderson	3ccd81864f	Testcase for r108687. llvm-svn: 108689	2010-07-19 08:14:26 +00:00
Owen Anderson	9c271e2835	Remove r108639 now that it is handled by InstCombine instead. llvm-svn: 108688	2010-07-19 08:10:24 +00:00
Daniel Dunbar	9aefb8ee4c	X86-64: Mark WINCALL and more tail call instructions as code gen only. llvm-svn: 108685	2010-07-19 07:21:07 +00:00
Daniel Dunbar	b82cd9319b	MC/X86: We now match instructions like "incl %eax" correctly for the arch we are assembling; remove crufty custom cleanup code. llvm-svn: 108681	2010-07-19 06:14:54 +00:00
Daniel Dunbar	af75e1923c	tests: Force another triple. llvm-svn: 108666	2010-07-19 00:43:58 +00:00
Daniel Dunbar	3b4621103a	tests: Force triples. llvm-svn: 108658	2010-07-18 21:16:10 +00:00
Daniel Dunbar	40a564f09f	MC/AsmParser: Fix .abort and .secure_log_unique to accept arbitrary token sequences, not just strings. llvm-svn: 108655	2010-07-18 20:15:59 +00:00
Daniel Dunbar	6fb1c3ad8a	MC/AsmParser: Add macro argument substitution support. llvm-svn: 108654	2010-07-18 19:00:10 +00:00
Daniel Dunbar	4323571efb	MC/AsmParser: Add basic support for macro instantiation. llvm-svn: 108653	2010-07-18 18:54:11 +00:00
Daniel Dunbar	c1f58ec83c	MC/AsmParser: Add basic parsing support for .macro definitions. llvm-svn: 108652	2010-07-18 18:47:21 +00:00
Chris Lattner	ede90a2a58	daniel doesn't hate me, he hates macpython 2.5, which is a very reasonable position on life! llvm-svn: 108650	2010-07-18 18:42:18 +00:00
Daniel Dunbar	828984ff4e	MC/AsmParser: Add .macros_{off,on} support, not that it makes sense since we don't support macros. llvm-svn: 108649	2010-07-18 18:38:02 +00:00
Owen Anderson	41670a11a8	Add a testcase for r108639. llvm-svn: 108640	2010-07-18 08:57:19 +00:00
Owen Anderson	7d2818b073	Another attempt at getting the clang self-host to like my instcombine patch. llvm-svn: 108614	2010-07-17 06:56:35 +00:00
Jim Grosbach	b97e2bbe32	Add combiner patterns to more effectively utilize the BFI (bitfield insert) instruction for non-constant operands. This includes the case referenced in the README.txt regarding a bitfield copy. llvm-svn: 108608	2010-07-17 03:30:54 +00:00
Eli Friedman	ceb16a5ce9	Test for ELF .size directive. llvm-svn: 108607	2010-07-17 03:15:24 +00:00
Jim Grosbach	11013eda5a	Add basic support to code-gen the ARM/Thumb2 bit-field insert (BFI) instruction and a combine pattern to use it for setting a bit-field to a constant value. More to come for non-constant stores. llvm-svn: 108570	2010-07-16 23:05:05 +00:00
Bill Wendling	bf8370ff36	Consider this function: void foo() { __builtin_unreachable(); } It will output the following on Darwin X86: _func1: Leh_func_begin0: pushq %rbp Ltmp0: movq %rsp, %rbp Ltmp1: Leh_func_end0: This prolog adds a new Call Frame Information (CFI) row to the FDE with an address that is not within the address range of the code it describes -- part is equal to the end of the function -- and therefore results in an invalid EH frame. If we emit a nop in this situation, then the CFI row is now within the address range. llvm-svn: 108568	2010-07-16 22:51:10 +00:00
Jakob Stoklund Olesen	c30b4ddc58	Remove the X86::FP_REG_KILL pseudo-instruction and the X86FloatingPointRegKill pass that inserted it. It is no longer necessary to limit the live ranges of FP registers to a single basic block. llvm-svn: 108536	2010-07-16 17:41:44 +00:00
Benjamin Kramer	50729ad717	Feed the right output into FileCheck. llvm-svn: 108523	2010-07-16 10:58:02 +00:00
Nick Lewycky	375efe3157	Arrays and vectors with different numbers of elements are not equivalent. llvm-svn: 108517	2010-07-16 06:31:12 +00:00
Tobias Grosser	3d84c9c793	LoopSimplify does not update domfrontier correctly. This fixes PR7649. llvm-svn: 108513	2010-07-16 05:59:45 +00:00
Jakob Stoklund Olesen	37c42a3d02	Remove many calls to TII::isMoveInstr. Targets should be producing COPY anyway. TII::isMoveInstr is going to be completely removed. llvm-svn: 108507	2010-07-16 04:45:42 +00:00
Jakob Stoklund Olesen	b1671271ab	Add forgotten test case. llvm-svn: 108506	2010-07-16 04:45:35 +00:00
Dan Gohman	103c4ebea5	Use the source-order scheduler instead of the "fast" scheduler at -O0, because it's more likely to keep debug line information in its original order. llvm-svn: 108496	2010-07-16 02:01:19 +00:00
Eric Christopher	15a81cddb4	Also revert 108422, it's causing some test failures. Working on testcases for Owen. llvm-svn: 108494	2010-07-16 01:36:12 +00:00
Dan Gohman	c6eefe4d4e	Fix this test. llvm-svn: 108491	2010-07-16 01:28:45 +00:00
Dale Johannesen	bfd4fd7bb7	The SelectionDAGBuilder's handling of debug info, on rare occasions, caused code to be generated in a different order. All cases I've seen involved float softening in the type legalizer, and this could perhaps be fixed there, but it's better not to generate things differently in the first place. 7797940 (6/29/2010..7/15/2010). llvm-svn: 108484	2010-07-16 00:02:08 +00:00
Bill Wendling	4bda1c8e68	Revert. This isn't the correct way to go. llvm-svn: 108478	2010-07-15 23:42:21 +00:00
Dan Gohman	fbbdfcaea7	Fix the order that SCEVExpander considers add operands in so that it doesn't miss an opportunity to form a GEP, regardless of the relative loop depths of the operands. This fixes rdar://8197217. llvm-svn: 108475	2010-07-15 23:38:13 +00:00
Bill Wendling	973dc3b1d8	Handle code gen for the unreachable instruction if it's the only instruction in the function. We'll just turn it into a "trap" instruction instead. The problem with not handling this is that it might generate a prologue without the equivalent epilogue to go with it: $ cat t.ll define void @foo() { entry: unreachable } $ llc -o - t.ll -relocation-model=pic -disable-fp-elim -unwind-tables .section __TEXT,__text,regular,pure_instructions .globl _foo .align 4, 0x90 _foo: ## @foo Leh_func_begin0: ## BB#0: ## %entry pushq %rbp Ltmp0: movq %rsp, %rbp Ltmp1: Leh_func_end0: ... The unwind tables then have bad data in them causing all sorts of problems. Fixes <rdar://problem/8096481>. llvm-svn: 108473	2010-07-15 23:32:40 +00:00
Evan Cheng	55f0c6b9fc	Split -enable-finite-only-fp-math to two options: -enable-no-nans-fp-math and -enable-no-infs-fp-math. All of the current codegen fp math optimizations only care whether the fp arithmetics arguments and results can never be NaN. llvm-svn: 108465	2010-07-15 22:07:12 +00:00
Chris Lattner	60b131654b	fix the definitions of ConstTextCoalSection/ConstDataCoalSection to keep "Text" in sync with the "pure instructions" section attribute. Lack of this attribute was preventing the assembler from emitting multibyte noops instructions for templates (and inlines, and other coalesced stuff) and was causing the assembler to mismatch .o files. This fixes rdar://8018335 llvm-svn: 108461	2010-07-15 21:22:00 +00:00
Devang Patel	df09db62e2	Fix crash reported in PR7653. llvm-svn: 108441	2010-07-15 18:45:27 +00:00
Dan Gohman	4afd412d6b	Watch out for a constant offset cancelling out a base register, forming a zero. This situation arises in Fortran code with induction variables that start at 1 instead of 0. This fixes PR7651. llvm-svn: 108424	2010-07-15 15:14:45 +00:00
Owen Anderson	7151dfd48a	Reapply r108378, with bugfixes, testcase, and improved comment formatting. This now passes LIT, nightly test, and llvm-gcc bootstrap on my machine. llvm-svn: 108422	2010-07-15 15:00:23 +00:00
Chris Lattner	19eff2a9f6	Fix PR7647, handling the case when 'To' ends up being mutated by recursive simplification. This also enhances ReplaceAndSimplifyAllUses to actually do a real RAUW at the end of it, which updates any value handles pointing to "From" to start pointing to "To". This seems useful for debug info and random other VH users. llvm-svn: 108415	2010-07-15 06:36:08 +00:00
Chris Lattner	e985a63bbf	see comment. llvm-svn: 108409	2010-07-15 05:17:36 +00:00
Eric Christopher	25e72a8920	Temporarily disable this test. llvm-svn: 108371	2010-07-14 23:12:58 +00:00
Devang Patel	29168baf4b	Make it a .ll test case. llvm-svn: 108370	2010-07-14 23:12:52 +00:00
Eric Christopher	e34b383e71	Add a testcase for the vla and stack realignment warning. llvm-svn: 108365	2010-07-14 22:26:35 +00:00
Dale Johannesen	6fe8c37a01	Tests for llvm-gcc commit 108360. llvm-svn: 108362	2010-07-14 21:22:35 +00:00
Jim Grosbach	a90af1ba38	Improve 64-bit subtraction of immediates when parts of the immediate can fit in the literal field of an instruction. E.g., long long foo(long long a) { return a - 734439407618LL; } rdar://7038284 llvm-svn: 108339	2010-07-14 17:45:16 +00:00
Dan Gohman	042523340b	Delete fast-isel's trivial load optimization; it breaks debugging because it can look past points where a debugger might modify user variables. llvm-svn: 108336	2010-07-14 17:25:37 +00:00
Bob Wilson	bb57896f8e	Fix test to appease the buildbots. llvm-svn: 108334	2010-07-14 16:43:47 +00:00
Evan Cheng	a8e8874552	Fix for PR7193 was overly conservative. The only case where sibcall callee address cannot be allocated a register is in 32-bit mode where the first three arguments are marked inreg. In that case EAX, EDX, and ECX will be used for argument passing. This fixes PR7610. llvm-svn: 108327	2010-07-14 06:44:01 +00:00
Bob Wilson	bad47f62f6	Add support for NEON VMVN immediate instructions. llvm-svn: 108324	2010-07-14 06:31:50 +00:00
Chris Lattner	ec0e7b1643	revert r108320, I see the failures now... llvm-svn: 108322	2010-07-14 06:16:35 +00:00
Chris Lattner	658680b2f5	reapply benjamin's instcombine patch, I don't see anything wrong with it and can't repro any problems with a manual self-host. llvm-svn: 108320	2010-07-14 05:59:13 +00:00
Evan Cheng	c893115312	Re-enable the test with fix. llvm-svn: 108319	2010-07-14 05:49:23 +00:00
Chris Lattner	711338fb04	temporarily disable the test to fix buildbots. llvm-svn: 108310	2010-07-14 02:21:59 +00:00
Evan Cheng	d542414945	Teach ProcessImplicitDefs to transform more COPY instructions into IMPLICIT_DEF (and subsequently eliminate them). This allows machine LICM to hoist IMPLICIT_DEF's. PR7620. llvm-svn: 108304	2010-07-14 01:22:19 +00:00
Bob Wilson	103a0dcfe1	Add an ARM-specific DAG combining to avoid redundant VDUPLANE nodes. Radar 7373643. llvm-svn: 108303	2010-07-14 01:22:12 +00:00
Bruno Cardoso Lopes	6c6c14a55c	Add AVX 256-bit compare instructions and a bunch of testcases llvm-svn: 108286	2010-07-13 22:06:38 +00:00
Bob Wilson	a3f1901531	Use a target-specific VMOVIMM DAG node instead of BUILD_VECTOR to represent NEON VMOV-immediate instructions. This simplifies some things. llvm-svn: 108275	2010-07-13 21:16:48 +00:00
Bruno Cardoso Lopes	fd8bfcd6e1	AVX 256-bit conversion instructions Add the x86 VEX_L form to handle special cases where VEX_L must be set. llvm-svn: 108274	2010-07-13 21:07:28 +00:00
Dale Johannesen	caca5488dc	In inline asm treat indirect 'X' constraint as 'm'. This may not be right in all cases, but it's better than asserting which it was doing before. PR 7528. llvm-svn: 108268	2010-07-13 20:17:05 +00:00
Dan Gohman	afd69cf5b7	Add support for empty named metadata too. This isn't particularly useful, but it is nice for consistency. llvm-svn: 108262	2010-07-13 19:42:44 +00:00
Dan Gohman	1e0213a758	Add support for empty metadata nodes: !{}. llvm-svn: 108259	2010-07-13 19:33:27 +00:00
Evan Cheng	0cc4ad983d	Extend the r107852 optimization which turns some fp compares into code sequences using only i32 operations. It now optimizes some f64 compares when fp compare is exceptionally slow (e.g. cortex-a8). It also catches comparison against 0.0. llvm-svn: 108258	2010-07-13 19:27:42 +00:00
Evan Cheng	f43961007c	-enable-unsafe-fp-math should not imply -enable-finite-only-fp-math. llvm-svn: 108254	2010-07-13 18:46:14 +00:00
Dale Johannesen	f241d4626c	Fix PR number. llvm-svn: 108251	2010-07-13 18:14:47 +00:00
Duncan Sands	f88a284579	Handle the case of a tail recursion in which the tail call is followed by a return that returns a constant, while elsewhere in the function another return instruction returns a different constant. This is a special case of accumulator recursion, so just generalize the existing logic a bit. llvm-svn: 108241	2010-07-13 15:41:41 +00:00
Chris Lattner	55595fb291	my work on adding segment registers to LEA missed the disassembler. Remove some code from the disassembler to compensate, unbreaking disassembly of lea's. llvm-svn: 108226	2010-07-13 04:23:55 +00:00
Bruno Cardoso Lopes	dff283e146	Add AVX 256-bit packed logical forms llvm-svn: 108224	2010-07-13 02:38:35 +00:00
Bruno Cardoso Lopes	36b32aeaa5	Add AVX 256-bit unop arithmetic instructions llvm-svn: 108223	2010-07-13 01:53:31 +00:00
Bruno Cardoso Lopes	8e67a0482e	Add AVX 256 binary arithmetic instructions llvm-svn: 108207	2010-07-12 23:04:15 +00:00
Dan Gohman	51e6d9bbf6	Apply the SSE dependence idiom for SSE unary operations to SD instructions too, in addition to SS instructions. And add a comment about it. llvm-svn: 108191	2010-07-12 20:46:04 +00:00
Bruno Cardoso Lopes	f9bcaad76d	Add AVX 256-bit MOVMSK forms llvm-svn: 108184	2010-07-12 20:06:32 +00:00
Daniel Dunbar	d388c93f87	MC/AsmParser: Move .tbss and .zerofill parsing to Darwin specific parser. llvm-svn: 108180	2010-07-12 19:37:35 +00:00
Daniel Dunbar	63a379dd5c	MC/AsmParser: Move .desc parsing to Darwin specific parser. llvm-svn: 108179	2010-07-12 19:22:53 +00:00
Daniel Dunbar	ae9da1481a	MC/AsmParser: Move some misc. Darwin directive handling to DarwinAsmParser. llvm-svn: 108174	2010-07-12 18:49:22 +00:00
Dan Gohman	c128e70ff2	Add a lint check for mismatched return types, inspired by PR6944. llvm-svn: 108162	2010-07-12 18:02:04 +00:00
Benjamin Kramer	8f36402ac2	Nope, still breaks the release selfhost bots :( llvm-svn: 108153	2010-07-12 16:38:48 +00:00
Benjamin Kramer	07b695e052	Reapply the "or" half of r108136, which seems to be less problematic. llvm-svn: 108152	2010-07-12 16:15:48 +00:00
Benjamin Kramer	c719e8ae9e	Revert r108141 again, sigh. llvm-svn: 108148	2010-07-12 14:42:04 +00:00
Benjamin Kramer	f578c36035	Reapply 108136 with an ugly pasto fixed. llvm-svn: 108141	2010-07-12 13:44:00 +00:00
Benjamin Kramer	9675e759cf	Revert r108136 until I figure out why it broke selfhost. llvm-svn: 108139	2010-07-12 12:35:49 +00:00
Benjamin Kramer	35473faa50	instcombine: fold (x & y) \| (~x & z) and (x & y) ^ (~x & z) into ((y ^ z) & x) ^ z which is one instruction shorter. (PR6773) before: %and = and i32 %y, %x %neg = xor i32 %x, -1 %and4 = and i32 %z, %neg %xor = xor i32 %and4, %and after: %xor1 = xor i32 %z, %y %and2 = and i32 %xor1, %x %xor = xor i32 %and2, %z llvm-svn: 108136	2010-07-12 11:54:45 +00:00
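The fold in r108136 above is a bitwise identity, so it can be checked exhaustively on a small bit width. The following is a minimal sketch of such a check, assuming 4-bit values for speed; the function names are illustrative and do not come from InstCombine.

```python
from itertools import product

WIDTH = 4                  # small width; the identity holds independently per bit
MASK = (1 << WIDTH) - 1


def select_or(x, y, z):
    """(x & y) | (~x & z): pick bits of y where x is 1, bits of z where x is 0."""
    return ((x & y) | (~x & z)) & MASK


def select_xor(x, y, z):
    """(x & y) ^ (~x & z): same result, since the two AND terms never overlap."""
    return ((x & y) ^ (~x & z)) & MASK


def folded(x, y, z):
    """((y ^ z) & x) ^ z: the form that is one instruction shorter."""
    return (((y ^ z) & x) ^ z) & MASK


def fold_matches_everywhere():
    """Compare all three forms for every (x, y, z) triple of WIDTH-bit values."""
    for x, y, z in product(range(1 << WIDTH), repeat=3):
        expected = folded(x, y, z)
        if select_or(x, y, z) != expected or select_xor(x, y, z) != expected:
            return False
    return True
```

The original forms need an AND, a NOT, a second AND and a final OR or XOR; the folded form needs only XOR, AND, XOR, which matches the before and after IR quoted in the commit message.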
Chris Lattner	25eea4db66	fix PR7311 by avoiding breaking casts when a bitcast from scalar->vector is involved. llvm-svn: 108117	2010-07-12 01:19:22 +00:00
Chris Lattner	bbc25ff5cc	if jump threading is able to infer interesting values on both the LHS and RHS of an and/or instruction, don't multiply add known predecessor values. This fixes the crash on testcase from PR7498 llvm-svn: 108114	2010-07-12 00:47:34 +00:00
Chris Lattner	fd4a09fc0a	fix PR7429, a crash turning a load from a string into a float. llvm-svn: 108113	2010-07-12 00:22:51 +00:00
Chris Lattner	f8feba368c	convert to filecheck llvm-svn: 108112	2010-07-12 00:21:10 +00:00
Chris Lattner	9338b0a1e2	merge two tests. llvm-svn: 108111	2010-07-12 00:19:47 +00:00
Jakob Stoklund Olesen	c4227f1362	Remove TargetInstrInfo::copyRegToReg entirely. Targets must now implement TargetInstrInfo::copyPhysReg instead. There is no longer a default implementation forwarding to copyRegToReg. llvm-svn: 108095	2010-07-11 17:01:17 +00:00
Rafael Espindola	a76eccf815	Fix va_arg for doubles. With this patch VAARG nodes always contain the correct alignment information, which simplifies ExpandRes_VAARG a bit. The patch introduces a new alignment information to TargetLoweringInfo. This is needed since the two natural candidates cannot be used: * The 's' in target data: If this is set to the minimal alignment of any argument, getCallFrameTypeAlignment would return 4 for doubles on ARM for example. * The getTransientStackAlignment method. It is possible for an architecture to have argument less aligned than what we maintain the stack pointer. llvm-svn: 108072	2010-07-11 04:01:49 +00:00
Dan Gohman	79be2b9be5	Fix this test. llvm-svn: 108059	2010-07-10 22:42:12 +00:00
Jakob Stoklund Olesen	c4b3bcc051	FileCheckize inline asm FP stack tests llvm-svn: 108046	2010-07-10 16:30:25 +00:00
Dan Gohman	30933b3bdb	Add an explicit triple to make this test behave consistently. llvm-svn: 108041	2010-07-10 09:01:35 +00:00
Dan Gohman	367b65b56e	Fix this XTARGET so that this doesn't XPASS on non-darwin hosts. llvm-svn: 108040	2010-07-10 09:01:03 +00:00
Dan Gohman	d7b5ce3312	Reapply bottom-up fast-isel, with several fixes for x86-32: - Check getBytesToPopOnReturn(). - Eschew ST0 and ST1 for return values. - Fix the PIC base register initialization so that it doesn't ever fail to end up at the top of the entry block. llvm-svn: 108039	2010-07-10 09:00:22 +00:00
Bruno Cardoso Lopes	2419606bfb	Add AVX 256-bit packed MOVNT variants llvm-svn: 108021	2010-07-09 21:42:42 +00:00
Bruno Cardoso Lopes	6bc772eec7	Add AVX 256-bit unpack and interleave llvm-svn: 108017	2010-07-09 21:20:35 +00:00
Jakob Stoklund Olesen	51702ec46b	Fix a few tests llvm-svn: 108011	2010-07-09 20:43:09 +00:00
Jim Grosbach	2a5725b1a3	In the presence of variable sized objects, allocate an emergency spill slot. rdar://8131327 llvm-svn: 108008	2010-07-09 20:27:06 +00:00
Dan Gohman	ea9ae3e6ed	Add a target triple. llvm-svn: 108003	2010-07-09 19:17:36 +00:00
Dan Gohman	7929c448fc	Fix MachineLICM to actually visit inner loops. llvm-svn: 108001	2010-07-09 18:49:45 +00:00
Bruno Cardoso Lopes	792e906bef	Start the support for AVX instructions with 256-bit %ymm registers. A couple of notes: - The instructions are being added with dummy placeholder patterns using some 256 specifiers, this is not meant to work now, but since there are some multiclasses generic enough to accept them, when we go for codegen, the stuff will be already there. - Add VEX encoding bits to support YMM - Add MOVUPS and MOVAPS in the first round - Use "Y" as suffix for those Instructions: MOVUPSYrr, ... - All AVX instructions in X86InstrSSE.td will move soon to a new X86InstrAVX file. llvm-svn: 107996	2010-07-09 18:27:43 +00:00
Bob Wilson	6586e9b203	--- Reverse-merging r107947 into '.': U utils/TableGen/FastISelEmitter.cpp --- Reverse-merging r107943 into '.': U test/CodeGen/X86/fast-isel.ll U test/CodeGen/X86/fast-isel-loads.ll U include/llvm/Target/TargetLowering.h U include/llvm/Support/PassNameParser.h U include/llvm/CodeGen/FunctionLoweringInfo.h U include/llvm/CodeGen/CallingConvLower.h U include/llvm/CodeGen/FastISel.h U include/llvm/CodeGen/SelectionDAGISel.h U lib/CodeGen/LLVMTargetMachine.cpp U lib/CodeGen/CallingConvLower.cpp U lib/CodeGen/SelectionDAG/SelectionDAGBuilder.cpp U lib/CodeGen/SelectionDAG/FunctionLoweringInfo.cpp U lib/CodeGen/SelectionDAG/FastISel.cpp U lib/CodeGen/SelectionDAG/SelectionDAGISel.cpp U lib/CodeGen/SelectionDAG/ScheduleDAGSDNodes.cpp U lib/CodeGen/SelectionDAG/InstrEmitter.cpp U lib/CodeGen/SelectionDAG/TargetLowering.cpp U lib/Target/XCore/XCoreISelLowering.cpp U lib/Target/XCore/XCoreISelLowering.h U lib/Target/X86/X86ISelLowering.cpp U lib/Target/X86/X86FastISel.cpp U lib/Target/X86/X86ISelLowering.h llvm-svn: 107987	2010-07-09 16:37:18 +00:00
Jakob Stoklund Olesen	a57965827f	Fix test to be less sensitive of regalloc accidents llvm-svn: 107951	2010-07-09 01:32:11 +00:00
Bob Wilson	88a4e6dc0e	Print "dregpair" NEON operands with a space between them, for readability and consistency with other instructions that have lists of register operands. llvm-svn: 107944	2010-07-09 00:47:20 +00:00
Dan Gohman	0b5aa1cdd3	Re-apply bottom-up fast-isel, with fixes. Be very careful to avoid emitting a DBG_VALUE after a terminator, or emitting any instructions before an EH_LABEL. llvm-svn: 107943	2010-07-09 00:39:23 +00:00
Bob Wilson	21eed476e8	Reenable DAG combining for vector shuffles. It looks like it was temporarily disabled and then never turned back on again. Adjust some tests, one because this change avoids an unnecessary instruction, and the other to make it continue testing what it was intended to test. llvm-svn: 107941	2010-07-09 00:38:12 +00:00
Bill Wendling	a992445ff2	Extension of r107506. Make sure that we don't mark a function as having a call if the inline ASM doesn't need a stack frame. llvm-svn: 107922	2010-07-08 22:38:02 +00:00
Chris Lattner	9f034c1e5d	Rework segment prefix emission code to handle segments in memory operands at the same time as hard coded segments. This fixes problems where we'd emit the segment override after the REX prefix on instructions like: mov %gs:(%rdi), %rax This fixes rdar://8127102. I have several cleanup patches coming next. llvm-svn: 107917	2010-07-08 22:28:12 +00:00
Stuart Hastings	aa246f5687	Test case for r107843. Radar 8152866. llvm-svn: 107907	2010-07-08 20:31:05 +00:00
Evan Cheng	0f54854a1d	Check for FiniteOnlyFPMath as well. llvm-svn: 107904	2010-07-08 20:12:24 +00:00
Benjamin Kramer	2321e6a4d4	Teach instcombine to transform (X >s -1) ? C1 : C2 and (X <s 0) ? C2 : C1 into ((X >>s 31) & (C2 - C1)) + C1, avoiding the conditional. This optimization could be extended to take non-const C1 and C2 but we better stay conservative to avoid code size bloat for now. for int sel(int n) { return n >= 0 ? 60 : 100; } we now generate sarl $31, %edi andl $40, %edi leal 60(%rdi), %eax instead of testl %edi, %edi movl $60, %ecx movl $100, %eax cmovnsl %ecx, %eax llvm-svn: 107866	2010-07-08 11:39:10 +00:00
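The select transform in r107866 above can be modelled with ordinary integer arithmetic. The sketch below assumes 32-bit signed values and emulates wraparound with a hypothetical `to_i32` helper; it illustrates the arithmetic only and is not the InstCombine implementation.

```python
def to_i32(v):
    """Wrap an arbitrary Python int to a signed 32-bit value."""
    v &= 0xFFFFFFFF
    return v - (1 << 32) if v & 0x80000000 else v


def select_branchy(x, c1, c2):
    """(X >s -1) ? C1 : C2, written as a conditional."""
    return c1 if x > -1 else c2


def select_branchless(x, c1, c2):
    """((X >>s 31) & (C2 - C1)) + C1, with no conditional."""
    sign = x >> 31                      # arithmetic shift: 0 if x >= 0, -1 if x < 0
    return to_i32((sign & to_i32(c2 - c1)) + c1)


def sel(n):
    """The example from the commit message: n >= 0 ? 60 : 100."""
    return select_branchless(n, 60, 100)
```

For `sel`, the mask `C2 - C1` is 40 and the added constant is 60, which corresponds to the `andl $40` and `leal 60(%rdi)` in the generated code quoted above.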
Eric Christopher	e796253217	A slight reworking of the custom patterns for x86-64 tpoff codegen and correct the testcase for valid assembly. Needs more tests. llvm-svn: 107860	2010-07-08 07:36:46 +00:00
Evan Cheng	be1f7a931e	r107852 is only safe with -enable-unsafe-fp-math to account for +0.0 == -0.0. llvm-svn: 107856	2010-07-08 06:01:49 +00:00
Evan Cheng	25f9364cbd	Optimize some vfp comparisons to integer ones. This patch implements the simplest case when the following conditions are met: 1. The arguments are f32. 2. The arguments are loads and they have no uses other than the comparison. 3. The comparison code is EQ or NE. e.g. vldr.32 s0, [r1] vldr.32 s1, [r0] vcmpe.f32 s1, s0 vmrs apsr_nzcv, fpscr beq LBB0_2 => ldr r1, [r1] ldr r0, [r0] cmp r0, r1 beq LBB0_2 More complicated cases will be implemented in subsequent patches. llvm-svn: 107852	2010-07-08 02:08:50 +00:00
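The idea behind r107852 above, and the caveat added by r107856, can be illustrated by comparing the raw bit patterns of two f32 values. This is a minimal sketch using the standard `struct` module; the helper names are assumptions made for illustration.

```python
import struct


def f32_bits(value):
    """Return the IEEE-754 single-precision bit pattern of value as an unsigned int."""
    return struct.unpack("<I", struct.pack("<f", value))[0]


def eq_as_float(a, b):
    """Floating-point equality, as a vcmpe.f32-style comparison would decide it."""
    return a == b


def eq_as_int(a, b):
    """Equality decided by comparing the 32-bit patterns, as the integer cmp does."""
    return f32_bits(a) == f32_bits(b)
```

The two predicates agree for ordinary values but differ in two cases: `+0.0` and `-0.0` compare equal as floats while their bit patterns differ, and a NaN never compares equal to itself as a float while its bit pattern does. The first case is the reason r107856 gives for restricting the optimization to `-enable-unsafe-fp-math`.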
Dale Johannesen	e2289285ae	Changes to ARM tail calls, mostly cosmetic. Add explicit testcases for tail calls within the same module. Duplicate some code to humor those who think .w doesn't apply on ARM. Leave this disabled on Thumb1, and add some comments explaining why it's hard and won't gain much. llvm-svn: 107851	2010-07-08 01:18:23 +00:00
Dan Gohman	e75704369d	Revert 107840 107839 107813 107804 107800 107797 107791. Debug info intrinsics win for now. llvm-svn: 107850	2010-07-08 01:00:56 +00:00
Chris Lattner	efa3c824cc	Fix the second half of PR7437: scalarrepl wasn't preserving address spaces when SRoA'ing memcpy's. llvm-svn: 107846	2010-07-08 00:27:05 +00:00
Chris Lattner	ac5881295c	Implement the major chunk of PR7195: support for 'callw' in the integrated assembler. Still some discussion to be done. llvm-svn: 107825	2010-07-07 22:27:31 +00:00
Bruno Cardoso Lopes	6c61451011	Add more assembly opcodes for SSE compare instructions llvm-svn: 107823	2010-07-07 22:24:03 +00:00
Jakob Stoklund Olesen	ddaf0099a5	Allow copies between GR8_ABCD_L and GR8_ABCD_H. This fixes PR7540. llvm-svn: 107809	2010-07-07 20:33:27 +00:00
Dan Gohman	e7ccc51cc1	Implement bottom-up fast-isel. This has the advantage of not requiring a separate DCE pass over MachineInstrs. llvm-svn: 107804	2010-07-07 19:20:32 +00:00
Dan Gohman	2d4d01d0de	Add X86FastISel support for return statements. This entails refactoring a bunch of stuff, to allow the target-independent calling convention logic to be employed. llvm-svn: 107800	2010-07-07 18:32:53 +00:00
Bruno Cardoso Lopes	fd8060335b	Add AVX AES instructions llvm-svn: 107798	2010-07-07 18:24:20 +00:00
Dan Gohman	00ef93258a	Remove interprocedural-basic-aa and associated code. The AliasAnalysis interface needs implementations to be consistent, so any code which wants to support different semantics must use a different interface. It's not currently worthwhile to add a new interface for this new concept. Document that AliasAnalysis doesn't support cross-function queries. llvm-svn: 107776	2010-07-07 14:27:09 +00:00
Bruno Cardoso Lopes	6d122aef97	Add AVX SSE4.2 instructions llvm-svn: 107752	2010-07-07 03:39:29 +00:00
Bruno Cardoso Lopes	8f5472a8e8	Add AVX SSE4.1 insertps, ptest and movntdqa instructions llvm-svn: 107747	2010-07-07 01:14:56 +00:00
Bruno Cardoso Lopes	6430c7350d	Add AVX SSE4.1 extractps and pinsr instructions llvm-svn: 107746	2010-07-07 01:01:13 +00:00
Bruno Cardoso Lopes	f3116ebe96	Add AVX SSE4.1 Extract Integer instructions llvm-svn: 107740	2010-07-07 00:07:24 +00:00
Dale Johannesen	ce65663330	Accept RIP-relative symbols with 'i' constraint, and print the (%rip) only if the 'a' modifier is present. PR 7528. llvm-svn: 107727	2010-07-06 23:27:00 +00:00
Bruno Cardoso Lopes	1f9ad516c6	Add the rest of AVX SSE4.1 packed move with sign/zero extend instructions llvm-svn: 107723	2010-07-06 23:15:17 +00:00
Dale Johannesen	6f01541ae6	Make test not hang waiting for input. llvm-svn: 107721	2010-07-06 23:06:58 +00:00
Bruno Cardoso Lopes	35702d27c4	Add part of AVX SSE4.1 packed move with sign/zero extend instructions llvm-svn: 107720	2010-07-06 23:01:41 +00:00
Bruno Cardoso Lopes	e2bd058d32	Add AVX vblendvpd, vblendvps and vpblendvb instructions Update VEX encoding to support those new instructions llvm-svn: 107715	2010-07-06 22:36:24 +00:00
Jakob Stoklund Olesen	a64c0a3d22	Be more forgiving when calculating alias interference for physreg coalescing. It is OK for an alias live range to overlap if there is a copy to or from the physical register. CoalescerPair can work out if the copy is coalescable independently of the alias. This means that we can join with the actual destination interval instead of using the getOrigDstReg() hack. It is no longer necessary to merge clobber ranges into subregisters. llvm-svn: 107695	2010-07-06 20:31:51 +00:00
Devang Patel	23a7593534	Fix PR7545 crash. llvm-svn: 107678	2010-07-06 18:18:32 +00:00
Rafael Espindola	7c510aa7bc	Don't create neon moves in CopyRegToReg. NEONMoveFixPass will do the conversion if profitable. llvm-svn: 107673	2010-07-06 16:24:34 +00:00
Eric Christopher	8f06b4a294	Remove mistakenly added test. llvm-svn: 107641	2010-07-06 05:20:13 +00:00
Eric Christopher	2ad0c779c3	Fix up -fstack-protector on linux to use the segment registers. Split out testcases per architecture and os now. Patch from Nelson Elhage. llvm-svn: 107640	2010-07-06 05:18:56 +00:00
Chris Lattner	60db4557cd	another v2f32 case, in this case showing poor codegen. llvm-svn: 107614	2010-07-05 05:52:56 +00:00
Chris Lattner	431e81f2fb	fix test on non-x86 hosts. llvm-svn: 107608	2010-07-05 03:56:55 +00:00
Chris Lattner	45cc4d74a3	Just rip v2f32 support completely out of the X86 backend. In the example in the testcase, we now generate: _test1: ## @test1 movss 4(%esp), %xmm0 addss 8(%esp), %xmm0 movl 12(%esp), %eax movss %xmm0, (%eax) ret instead of: _test1: ## @test1 subl $20, %esp movl 24(%esp), %eax movq %mm0, (%esp) movq %mm0, 8(%esp) movss (%esp), %xmm0 addss 12(%esp), %xmm0 movss %xmm0, (%eax) addl $20, %esp ret v2f32 support did not work reliably because most of the X86 backend didn't know it was legal. It was apparently only added to support returning source-level v2f32 values in MMX registers in x86-32 mode. If ABI compatibility is important on this GCC-extended-vector type for some reason, then the frontend should generate IR that returns v2i32 instead of v2f32. However, we generally don't try very hard to be abi compatible on gcc extended vectors. llvm-svn: 107601	2010-07-04 23:07:25 +00:00
Chris Lattner	681b926d54	fix PR7518 - terrible codegen of <2 x float>, by only marking v2f32 as legal in 32-bit mode. It is just as terrible there, but I just care about x86-64 and no one claims it is valuable in 64-bit mode. llvm-svn: 107600	2010-07-04 22:57:10 +00:00
Bruno Cardoso Lopes	ca99012ac0	Add AVX SSE4.1 blend, mpsadbw and vdp llvm-svn: 107560	2010-07-03 01:37:03 +00:00
Bruno Cardoso Lopes	bc75502f09	Add AVX SSE4.1 binop (some forms of packed max,min,mul,pack,cmp) instructions llvm-svn: 107558	2010-07-03 01:15:47 +00:00
Bruno Cardoso Lopes	fc9cdc4d61	Add AVX SSE4.1 Horizontal Minimum and Position instruction llvm-svn: 107552	2010-07-03 00:49:21 +00:00
Bruno Cardoso Lopes	621c85b038	Add AVX SSE4.1 round instructions llvm-svn: 107549	2010-07-03 00:37:44 +00:00
Bruno Cardoso Lopes	c7111fd355	- Add support for the rest of AVX SSE3 instructions - Fix VEX prefix to be emitted with 3 bytes whenever VEX_5M represents a REX equivalent two byte leading opcode llvm-svn: 107523	2010-07-02 22:06:54 +00:00
Evan Cheng	0ce84486c3	- Two-address pass should not assume unfolding is always successful. - X86 unfolding should check if the instruction being unfolded has memoperands. If there are no memoperands, then it must assume conservative alignment. If this would introduce an expensive sse unaligned load / store, then unfoldMemoryOperand etc. should not unfold the instruction. llvm-svn: 107509	2010-07-02 20:36:18 +00:00
Dale Johannesen	4d887f7ca7	Propagate the AlignStack bit in InlineAsm's to the PrologEpilog code, and use it to determine whether the asm forces stack alignment or not. gcc consistently does not do this for GCC-style asms; Apple gcc inconsistently sometimes does it for asm blocks. There is no convenient place to put a bit in either the SDNode or the MachineInstr form, so I've added an extra operand to each; unlovely, but it does allow for expansion for more bits, should we need it. PR 5125. Some existing testcases are affected. The operand lists of the SDNode and MachineInstr forms are indexed with awesome mnemonics, like "2"; I may fix this someday, but not now. I'm not making it any worse. If anyone is inspired I think you can find all the right places from this patch. llvm-svn: 107506	2010-07-02 20:16:09 +00:00
Bob Wilson	771d04b969	Fix incorrect asm-printing of some NEON immediates. Fix weak testcase so that it checks the immediate values, not just the instructions opcodes. Radar 8110263. llvm-svn: 107487	2010-07-02 17:23:44 +00:00
Dale Johannesen	744c74c444	Prevent test from hanging waiting for input. llvm-svn: 107446	2010-07-01 22:57:11 +00:00
Bob Wilson	8a99b730a9	ARM function alignments were off by a power of two. svn 83242 changed getFunctionAlignment and the corresponding use of that value in the ARM asm printer, but now we're using the standard asm printer. The result of this was that function alignments were dropped completely for Thumb functions. Radar 8143571. llvm-svn: 107435	2010-07-01 22:26:26 +00:00
Bill Wendling	03bcd6ecc8	Implement the "linker_private_weak" linkage type. This will be used for Objective-C metadata types which should be marked as "weak", but which the linker will remove upon final linkage. However, this linkage isn't specific to Objective-C. For example, the "objc_msgSend_fixup_alloc" symbol is defined like this: .globl l_objc_msgSend_fixup_alloc .weak_definition l_objc_msgSend_fixup_alloc .section __DATA, __objc_msgrefs, coalesced .align 3 l_objc_msgSend_fixup_alloc: .quad _objc_msgSend_fixup .quad L_OBJC_METH_VAR_NAME_1 This is different from the "linker_private" linkage type, because it can't have the metadata defined with ".weak_definition". Currently only supported on Darwin platforms. llvm-svn: 107433	2010-07-01 21:55:59 +00:00
Dan Gohman	84f90a387d	Remove context sensitivity concerns from interprocedural-basic-aa, and make it more aggressive in cases where both pointers are known to live in the same function. llvm-svn: 107420	2010-07-01 20:08:40 +00:00
Devang Patel	2b434e12cd	Debugging information is encoded in llvm IR using metadata. This is designed in such a way that debug info for symbols is preserved even if symbols are optimized away by the optimizer. Add new special pass to remove debug info for such symbols. llvm-svn: 107416	2010-07-01 19:49:20 +00:00
Bruno Cardoso Lopes	5e88700f28	Move SSE3 Move patterns to a more appropriate section Add AVX SSE3 packed horizontal and & sub instructions llvm-svn: 107405	2010-07-01 17:35:02 +00:00
Bruno Cardoso Lopes	886ee33a38	Add AVX SSE3 packed addsub instructions llvm-svn: 107404	2010-07-01 17:08:18 +00:00
Dan Gohman	d2965c10a1	Temporarily disable on-demand fast-isel. llvm-svn: 107393	2010-07-01 12:15:30 +00:00
Dan Gohman	aef3d140b7	Teach fast-isel to avoid loading a value from memory when it's already available in a register. This is pretty primitive, but it reduces the number of instructions in common testcases by 4%. llvm-svn: 107380	2010-07-01 03:49:38 +00:00
Dan Gohman	722f5fc567	Enable on-demand fast-isel. llvm-svn: 107377	2010-07-01 02:58:57 +00:00
Bruno Cardoso Lopes	a7a0c83563	Add AVX SSE3 replicate and convert instructions llvm-svn: 107375	2010-07-01 02:33:39 +00:00
Dan Gohman	7937d5606d	Teach X86FastISel to fold constant offsets and scaled indices in the same address. llvm-svn: 107373	2010-07-01 02:27:15 +00:00
Bruno Cardoso Lopes	05166740eb	- Add AVX SSE2 Move doubleword and quadword instructions. - Add encode bits for VEX_W - All 128-bit SSE 1 & SSE2 instructions that are described in the .td file now have a AVX encoded form already working. llvm-svn: 107365	2010-07-01 01:20:06 +00:00
Mikhail Glushenkov	0354891d98	Test for the -filelist fix. llvm-svn: 107363	2010-07-01 01:00:37 +00:00
Devang Patel	db735cbbab	Remove all debug info related named mdnodes. llvm-svn: 107323	2010-06-30 21:29:00 +00:00
Bruno Cardoso Lopes	cbcebe2950	Add AVX SSE2 mask creation and conditional store instructions llvm-svn: 107306	2010-06-30 18:38:10 +00:00
Dan Gohman	c0cca7fdda	Revert the part of r107257 which introduced new logic for using nsw and nuw flags from IR Instructions. On further consideration, this isn't valid. llvm-svn: 107298	2010-06-30 17:27:11 +00:00
Bruno Cardoso Lopes	d079c91683	Add AVX SSE2 packed integer extract/insert instructions llvm-svn: 107293	2010-06-30 17:03:03 +00:00
Dan Gohman	725ed0364b	Add a testcase for scev-aa's new capability. llvm-svn: 107258	2010-06-30 07:17:47 +00:00
Bruno Cardoso Lopes	e82689fea2	Add AVX SSE2 integer unpack instructions llvm-svn: 107246	2010-06-30 04:06:39 +00:00
Bruno Cardoso Lopes	ec0115c9b7	Add AVX SSE2 packed integer shuffle instructions llvm-svn: 107245	2010-06-30 03:47:56 +00:00
Bruno Cardoso Lopes	be792feb8b	Add AVX SSE2 pack with saturation integer instructions llvm-svn: 107241	2010-06-30 02:30:25 +00:00
Bruno Cardoso Lopes	2686ea4555	Add AVX SSE2 integer packed compare instructions llvm-svn: 107240	2010-06-30 02:21:09 +00:00
Bruno Cardoso Lopes	2e2caefff9	- Add AVX form of all SSE2 logical instructions - Add VEX encoding bits to x86 MRM0r-MRM7r llvm-svn: 107238	2010-06-30 01:58:37 +00:00
Devang Patel	648df7bf64	Add variables into a scope before constructing scope DIE, otherwise variables won't be included in the DIE tree. llvm-svn: 107228	2010-06-30 00:11:08 +00:00
Bruno Cardoso Lopes	3f71ddfaad	Add several AVX integer packed binop instructions llvm-svn: 107225	2010-06-29 23:47:49 +00:00
Dan Gohman	ae36b1ed42	Fix ScalarEvolution's tripcount computation for chains of loops where each loop's induction variable's start value is the exit value of a preceding loop. llvm-svn: 107224	2010-06-29 23:43:06 +00:00
Bruno Cardoso Lopes	30689a3a7f	Add AVX ld/st XCSR register. Add VEX encoding bits for MRMXm x86 form llvm-svn: 107204	2010-06-29 20:35:48 +00:00
Jakob Stoklund Olesen	dadea5b178	Fix the handling of partial redefines in the fast register allocator. A partial redefine needs to be treated like a tied operand, and the register must be reloaded while processing use operands. This fixes a bug where partially redefined registers were processed as normal defs with a reload added. The reload could clobber another use operand if it was a kill that allowed register reuse. llvm-svn: 107193	2010-06-29 19:15:30 +00:00
Bob Wilson	d91d5bfc95	Fix a register scavenger crash when dealing with undefined subregs. The LowerSubregs pass needs to preserve implicit def operands attached to EXTRACT_SUBREG instructions when it replaces those instructions with copies. llvm-svn: 107189	2010-06-29 18:42:49 +00:00
Bruno Cardoso Lopes	a4575f5b31	Add AVX non-temporal stores llvm-svn: 107178	2010-06-29 18:22:01 +00:00
Dan Gohman	9bbd007f15	Add a few more interesting testcases. llvm-svn: 107177	2010-06-29 18:17:11 +00:00
Bruno Cardoso Lopes	21a9433e9e	Add sqrt, rsqrt and rcp AVX instructions llvm-svn: 107166	2010-06-29 17:26:30 +00:00
Rafael Espindola	38a7d7cbc3	Add a VT argument to getMinimalPhysRegClass and replace the copy related uses of getPhysicalRegisterRegClass with it. If we want to make a copy (or estimate its cost), it is better to use the smallest class as more efficient operations might be possible. llvm-svn: 107140	2010-06-29 14:02:34 +00:00
Duncan Sands	dddf876e96	Looks like this test is missing an XFAIL line. llvm-svn: 107134	2010-06-29 13:18:50 +00:00
Evan Cheng	b59dd8f10a	PR7503: uxtb16 is not available for ARMv7-M. Patch by Brian G. Lucas. llvm-svn: 107122	2010-06-29 05:38:36 +00:00
Bob Wilson	1e5da550e5	Reapply my if-conversion cleanup from svn r106939 with fixes. There are 2 changes relative to the previous version of the patch: 1) For the "simple" if-conversion case, there's no need to worry about RemoveExtraEdges not handling an unanalyzable branch. Predicated terminators are ignored in this context, so RemoveExtraEdges does the right thing. This might break someday if we ever treat indirect branches (BRIND) as predicable, but for now, I just removed this part of the patch, because in the case where we do not add an unconditional branch, we rely on keeping the fall-through edge to CvtBBI (which is empty after this transformation). The change relative to the previous patch is: @@ -1036,10 +1036,6 @@ IterIfcvt = false; } - // RemoveExtraEdges won't work if the block has an unanalyzable branch, - // which is typically the case for IfConvertSimple, so explicitly remove - // CvtBBI as a successor. - BBI.BB->removeSuccessor(CvtBBI->BB); RemoveExtraEdges(BBI); // Update block info. BB can be iteratively if-converted. 2) My patch exposed a bug in the code for merging the tail of a "diamond", which had previously never been exercised. The code was simply checking that the tail had a single predecessor, but there was a case in MultiSource/Benchmarks/VersaBench/dbms where that single predecessor was neither edge of the diamond. I added the following change to check for that: @@ -1276,7 +1276,18 @@ // tail, add a unconditional branch to it. if (TailBB) { BBInfo TailBBI = BBAnalysis[TailBB->getNumber()]; - if (TailBB->pred_size() == 1 && !TailBBI.HasFallThrough) { + bool CanMergeTail = !TailBBI.HasFallThrough; + // There may still be a fall-through edge from BBI1 or BBI2 to TailBB; + // check if there are any other predecessors besides those. 
+ unsigned NumPreds = TailBB->pred_size(); + if (NumPreds > 1) + CanMergeTail = false; + else if (NumPreds == 1 && CanMergeTail) { + MachineBasicBlock::pred_iterator PI = TailBB->pred_begin(); + if (*PI != BBI1->BB && *PI != BBI2->BB) + CanMergeTail = false; + } + if (CanMergeTail) { MergeBlocks(BBI, TailBBI); TailBBI.IsDone = true; } else { With these fixes, I was able to run all the SingleSource and MultiSource tests successfully. llvm-svn: 107110	2010-06-29 00:55:23 +00:00
Dan Gohman	0824affeff	Add an Interprocedural form of BasicAliasAnalysis, which aims to properly handle instructions and arguments defined in different functions, or across recursive function iterations. llvm-svn: 107109	2010-06-29 00:50:39 +00:00
Bruno Cardoso Lopes	d6a091a4d4	Described the missing AVX forms of SSE2 convert instructions llvm-svn: 107108	2010-06-29 00:36:02 +00:00
Devang Patel	1575e9f5ce	The comment string does not match for all targets. PowerPC uses ;. llvm-svn: 107103	2010-06-29 00:04:40 +00:00
Bob Wilson	269a89fd3a	Unlike other targets, ARM now uses BUILD_VECTORs post-legalization so they can't be changed arbitrarily by the DAGCombiner without checking if it is running after legalization. llvm-svn: 107097	2010-06-28 23:40:25 +00:00
Dale Johannesen	764b056c30	Refix XTARGET. Previous attempt matches on powerpc-apple-darwin, although I don't see why. llvm-svn: 107090	2010-06-28 22:45:33 +00:00
Dale Johannesen	65cd5ba74d	Attempt to fix XTARGET. llvm-svn: 107088	2010-06-28 22:31:52 +00:00
Devang Patel	1de21ec498	Use DW_FORM_addr for DW_AT_entry_pc. llvm-svn: 107085	2010-06-28 22:22:47 +00:00
Dale Johannesen	17feb07c53	In asm's, output operands with matching input constraints have to be registers, per gcc documentation. This affects the logic for determining what "g" should lower to. PR 7393. A couple of existing testcases are affected. llvm-svn: 107079	2010-06-28 22:09:45 +00:00
Dan Gohman	e697a6f24f	Constant fold x == undef to undef. llvm-svn: 107074	2010-06-28 21:30:07 +00:00
Dan Gohman	7c34ece501	Fix Value::stripPointerCasts and BasicAA to avoid trouble on code in unreachable blocks, which have use-def cycles. This fixes PR7514. llvm-svn: 107071	2010-06-28 21:16:52 +00:00
Devang Patel	68c81196f9	Remove this weak test. llvm-svn: 107059	2010-06-28 20:24:35 +00:00
Dale Johannesen	0e4d964bfe	Testcase for llvm-gcc fix 107051. llvm-svn: 107052	2010-06-28 20:07:30 +00:00
Jakob Stoklund Olesen	fde9c348e9	Don't write temporary files in test directory llvm-svn: 107049	2010-06-28 20:01:15 +00:00
Jakob Stoklund Olesen	0117091c16	Add a triple so test runs on Linux as well. llvm-svn: 107045	2010-06-28 19:31:15 +00:00
Jakob Stoklund Olesen	0d94d7af78	Add more special treatment for inline asm in RegAllocFast. When an instruction has tied operands and physreg defines, we must take extra care that the tied operands conflict with neither physreg defs nor uses. The special treatment is given to inline asm and instructions with tied operands / early clobbers and physreg defines. This fixes PR7509. llvm-svn: 107043	2010-06-28 18:34:34 +00:00
Devang Patel	f3b2db68c6	Preserve deleted function's local variables' debug info. Radar 8122864. llvm-svn: 107027	2010-06-28 18:25:03 +00:00
Devang Patel	6e34f19b17	Make this test darwin specific. llvm-svn: 107025	2010-06-28 18:04:03 +00:00
Chris Lattner	93e63a0218	this test is failing nondeterministically and blaming me, just disable it for now. llvm-svn: 106960	2010-06-26 22:08:30 +00:00
Benjamin Kramer	c1ecfd86a3	Fix test weirdness. llvm-svn: 106959	2010-06-26 22:06:50 +00:00
Benjamin Kramer	3bbc52ce3e	Fix some tests that didn't test anything. llvm-svn: 106954	2010-06-26 20:05:06 +00:00
Kenneth Uildriks	7228d98b85	Partial specialization test should not depend on the order of specialization operations or the names assigned to the specialized functions llvm-svn: 106953	2010-06-26 18:47:40 +00:00
Rafael Espindola	2041abd958	When splitting a VAARG, remember its alignment. This produces terrible but correct code. llvm-svn: 106952	2010-06-26 18:22:20 +00:00
Bob Wilson	418e64a385	Revert my if-conversion cleanup since it caused a bunch of nightly test regressions. --- Reverse-merging r106939 into '.': U test/CodeGen/Thumb2/thumb2-ifcvt3.ll U lib/CodeGen/IfConversion.cpp llvm-svn: 106951	2010-06-26 17:47:06 +00:00
Duncan Sands	3a5cb69cb8	Fix PR7328: when turning a tail recursion into a loop, need to preserve the returned value after the tail call if it differs from other return values. The optimal thing to do would be to introduce a phi node for the return value, but for the moment just fix the miscompile. llvm-svn: 106947	2010-06-26 12:53:31 +00:00
Eli Friedman	b9bdc5a52d	Remove bogus test. llvm-svn: 106941	2010-06-26 04:59:56 +00:00
Bob Wilson	c72da6bb56	Clean up some problems with extra CFG edges being introduced during if-conversion. The RemoveExtraEdges function doesn't work for blocks that end with unanalyzable branches, so in those cases, the "extra" edges must be explicitly removed. The CopyAndPredicateBlock and MergeBlocks methods can also avoid copying successor edges due to branches that have already been removed. The latter case is especially helpful when MergeBlocks is called for handling "diamond" if-conversions, where otherwise you can end up with some weird intermediate states in the CFG. Unfortunately I've been unable to find cases where this cleanup actually makes a significant difference in the code. There is one test where we manage to remove an empty block at the end of a function. Radar 6911268. llvm-svn: 106939	2010-06-26 04:27:33 +00:00
Jakob Stoklund Olesen	d7d0d4e882	When creating X86 MUL8 and DIV8 instructions, make sure we don't produce CopyFromReg nodes for aliasing registers (AX and AL). This confuses the fast register allocator. Instead of CopyFromReg(AL), use ExtractSubReg(CopyFromReg(AX), sub_8bit). This fixes PR7312. llvm-svn: 106934	2010-06-26 00:39:23 +00:00
Bruno Cardoso Lopes	74d716b9cd	Add AVX convert CVTSS2SI{rr,rm} and CVTDQ2PS{rr,rm} instructions llvm-svn: 106917	2010-06-25 23:47:23 +00:00
Bruno Cardoso Lopes	83651094ad	Reapply r106896: Add several AVX MOV flavors Support VEX encoding for MRMDestReg llvm-svn: 106912	2010-06-25 23:33:42 +00:00
Daniel Dunbar	acbdf53db4	Thumb2ITBlockPass: Fix a possible dereference of an invalid iterator. This was introduced in r106343, but only showed up recently (with a particular compiler & linker combination) because of the particular check, and because we have no builtin checking for dereferencing the end of an array, which is truly unfortunate. llvm-svn: 106908	2010-06-25 23:14:54 +00:00
Bruno Cardoso Lopes	4530fed87e	revert this now, it's using avx instead of sse :) llvm-svn: 106906	2010-06-25 23:04:29 +00:00
Evan Cheng	02b184de5b	Change if-conversion block size limit checks to add some flexibility. llvm-svn: 106901	2010-06-25 22:42:03 +00:00
Bruno Cardoso Lopes	a34d9b6d84	Add several AVX MOV flavors Support VEX encoding for MRMDestReg llvm-svn: 106896	2010-06-25 22:27:51 +00:00
Dale Johannesen	ce97d55ad9	The hasMemory argument is irrelevant to how the argument for an "i" constraint should get lowered; PR 6309. While this argument was passed around a lot, this is the only place it was used, so it goes away from a lot of other places. llvm-svn: 106893	2010-06-25 21:55:36 +00:00
Dan Gohman	8de1fe3ccf	pcmpeqd and friends are Commutable. llvm-svn: 106886	2010-06-25 21:05:35 +00:00
Bill Wendling	e41e40f689	- Reapply r106066 now that the bzip2 build regression has been fixed. - 2010-06-25-CoalescerSubRegDefDead.ll is the testcase for r106878. llvm-svn: 106880	2010-06-25 20:48:10 +00:00
Devang Patel	27510cc623	XFAIL this test on powerpc for now. llvm-svn: 106862	2010-06-25 17:32:23 +00:00
Bruno Cardoso Lopes	cbdcce6478	Add some AVX convert instructions llvm-svn: 106815	2010-06-25 00:39:30 +00:00
Dan Gohman	600658a4ba	Don't write an output file to cwd, and put an rdar prefix on an rdar number. llvm-svn: 106810	2010-06-24 23:45:15 +00:00
Dan Gohman	9a2f0473b2	Teach EmitLiveInCopies to omit copies for unused virtual registers, and to clean up unused incoming physregs from the live-in list. llvm-svn: 106805	2010-06-24 22:23:02 +00:00
Bill Wendling	2d3c490026	It's possible that a flag is added to the SDNode that points back to the original SDNode. This is badness. Also, this function allows one SDNode to point multiple flags to another SDNode. Badness as well. llvm-svn: 106793	2010-06-24 22:00:37 +00:00
Devang Patel	c657c621b7	A DBG_VALUE machine instruction pointing to an undefined register for a variable justifies a separate scope if the variable is an inlined function's argument. Radar 8122864. llvm-svn: 106792	2010-06-24 21:51:19 +00:00
Bruno Cardoso Lopes	4398fd7b83	- Add AVX COMI{SS,SD}{rr,rm} and UCOMI{SS,SD}{rr,rm}. - Fix a small VEX encoding issue. - Move compare instructions to their appropriate place. llvm-svn: 106787	2010-06-24 20:48:23 +00:00
Dale Johannesen	5ad5226c58	Disallow matching "i" constraint to symbol addresses when address requires a register or secondary load to compute (most PIC modes). This improves "g" constraint handling. 8015842. The test from 2007 is attempting to test the fix for PR1761, but since -relocation-model=static doesn't work on Darwin x86-64, it was not testing what it was supposed to be testing and was passing erroneously. Fixed to use Linux x86-64. llvm-svn: 106779	2010-06-24 20:14:51 +00:00
Jakob Stoklund Olesen	45230239e4	Replace a big gob of old coalescer logic with the new CoalescerPair class. CoalescerPair can determine if a copy can be coalesced, and which register gets merged away. The old logic in SimpleRegisterCoalescing had evolved into something a bit too convoluted. This second attempt fixes some crashes that only occurred on Linux. llvm-svn: 106769	2010-06-24 18:15:01 +00:00
Bob Wilson	279e55fb2e	PR7458: Try commuting Thumb2 instruction operands to put them into 2-address form so they can be narrowed to 16-bit instructions. llvm-svn: 106762	2010-06-24 16:50:20 +00:00
Dan Gohman	463f26b4be	Eliminate the other half of the BRCOND optimization, and update as many tests as possible. llvm-svn: 106749	2010-06-24 15:24:03 +00:00
Dan Gohman	df6b33e778	Eliminate the first half of the optimization which eliminates BRCOND when the condition is constant. This optimization shouldn't be necessary, because codegen shouldn't be able to find dead control paths that the IR-level optimizer can't find. And it's undesirable, because it encourages bugpoint to leave "br i1 false" branches in its output. And it wasn't updating the CFG. I updated all the tests I could, but some tests are too reduced and I wasn't able to meaningfully preserve them. llvm-svn: 106748	2010-06-24 15:04:11 +00:00
Dan Gohman	600f62b3ba	Reapply r106634, now that the bug it exposed is fixed. llvm-svn: 106746	2010-06-24 14:30:44 +00:00
Chris Lattner	8048662539	Teach the x86 mc assembler that %dr6 = %db6, this implements rdar://8013734 llvm-svn: 106725	2010-06-24 07:29:18 +00:00
Dan Gohman	0695e09b09	Optimize the "bit test" code path for switch lowering in the case where the bit mask has exactly one bit. llvm-svn: 106716	2010-06-24 02:06:24 +00:00
Jakob Stoklund Olesen	dbb58d2974	Revert "Replace a big gob of old coalescer logic with the new CoalescerPair class." Whiny buildbots. llvm-svn: 106710	2010-06-24 00:52:22 +00:00
Bruno Cardoso Lopes	191a1cd2bb	Add AVX CMP{SS,SD}{rr,rm} instructions and encoding testcases llvm-svn: 106705	2010-06-24 00:32:06 +00:00
Jakob Stoklund Olesen	f38e6720cc	Replace a big gob of old coalescer logic with the new CoalescerPair class. CoalescerPair can determine if a copy can be coalesced, and which register gets merged away. The old logic in SimpleRegisterCoalescing had evolved into something a bit too convoluted. llvm-svn: 106701	2010-06-24 00:12:39 +00:00
Bill Wendling	f470747a36	We are missing opportunities to use ldm. Take code like this: void t(int cp0, int cp1, int dp, int fmd) { int c0, c1, d0, d1, d2, d3; c0 = (cp0++ & 0xffff) | ((cp1++ << 16) & 0xffff0000); c1 = (cp0++ & 0xffff) | ((cp1++ << 16) & 0xffff0000); /* ... */ } It code gens into something pretty bad. But with this change (analogous to the X86 back-end), it will use ldm and generate few instructions. llvm-svn: 106693	2010-06-23 23:00:16 +00:00
Bruno Cardoso Lopes	05220c9a0d	Add AVX MOVMSK{PS,PD}rr instructions llvm-svn: 106683	2010-06-23 21:30:27 +00:00
Bruno Cardoso Lopes	3183dd5692	Add tests for different AVX cmp opcodes, also teach the x86 asm parser to understand the vcmp instruction llvm-svn: 106678	2010-06-23 21:10:57 +00:00
Bruno Cardoso Lopes	360d6fe299	Add AVX SHUF{PS,PD}{rr,rm} instructions llvm-svn: 106672	2010-06-23 20:07:15 +00:00
Nico Weber	337e8db712	Add support for the x86 instructions "pusha" and "popa". llvm-svn: 106671	2010-06-23 20:00:58 +00:00
Bruno Cardoso Lopes	30a28d6588	Fix a tblgen bug. Given the pattern below as an example: list<dag> Pattern = [(set RC:$dst, (v4f32 (shufp:src3 RC:$src1, (mem_frag addr:$src2))))]; The right reference resolving should lead to: list<dag> Pattern = [(set VR128:$dst, (v4f32 (shufp:src3 VR128:$src1, (mem_frag addr:$src2))))]; But was yielding: list<dag> Pattern = [(set VR128:$dst, (v4f32 (shufp VR128:$src1, (mem_frag addr:$src2))))]; Fix this by passing the right name when creating a new DagInit node. llvm-svn: 106670	2010-06-23 19:50:39 +00:00
Dale Johannesen	fc40f0a1ab	Reinstate correct test, remove the real invalidated test. llvm-svn: 106664	2010-06-23 18:56:06 +00:00
Dale Johannesen	6effb503f5	Remove tests invalidated by previous checkin. llvm-svn: 106663	2010-06-23 18:53:12 +00:00
Bill Wendling	a136521a17	MorphNodeTo doesn't preserve the memory operands. Because we're morphing a node into the same node, but with different non-memory operands, we need to replace the memory operands after it's finished morphing. llvm-svn: 106643	2010-06-23 18:16:24 +00:00
Daniel Dunbar	f643aa86b0	tests: Tweak lit.cfg to fix breakage with out-of-dir lookup. llvm-svn: 106638	2010-06-23 18:06:16 +00:00
Daniel Dunbar	4df321b7ad	Revert r106263, "Fold the ShrinkDemandedOps pass into the regular DAGCombiner pass,"... it was causing both 'file' (with clang) and 176.gcc (with llvm-gcc) to be miscompiled. llvm-svn: 106634	2010-06-23 17:09:26 +00:00
Daniel Dunbar	ef5a4383ad	Revert r106066, "Create a more targeted fix for not sinking instructions into a range where it"... it causes bzip2 to be miscompiled by Clang. Conflicts: lib/CodeGen/MachineSink.cpp llvm-svn: 106614	2010-06-23 00:48:25 +00:00
Stuart Hastings	c0efbd5b31	Less incorrect handling of zero-length bitfields. Radars 7992077 and 8093043. llvm-svn: 106611	2010-06-23 00:31:14 +00:00
Bruno Cardoso Lopes	1e13c17a55	Add AVX compare packed instructions llvm-svn: 106600	2010-06-22 23:37:59 +00:00
Dan Gohman	f1cf963c64	Loosen up this test so that it doesn't depend as much on register allocation details. llvm-svn: 106599	2010-06-22 23:32:47 +00:00
Dan Gohman	1081f1a0f5	Fix OptimizeMax to handle an odd case where one of the max operands is another max which folds. This fixes PR7454. llvm-svn: 106594	2010-06-22 23:07:13 +00:00
Bruno Cardoso Lopes	535aa8ea91	Reapply support for AVX unpack and interleave instructions, with testcases this time. llvm-svn: 106593	2010-06-22 23:02:38 +00:00
Bruno Cardoso Lopes	1a890f9dc0	Add AVX MOV{SS,SD}{rr,rm} instructions llvm-svn: 106588	2010-06-22 22:38:56 +00:00
Bob Wilson	c5d712232d	Thumb1 functions using @llvm.returnaddress were not saving the incoming LR. Radar 8031193. llvm-svn: 106582	2010-06-22 22:04:24 +00:00
Eric Christopher	6250bd9e3c	Move a 64-bit test to the 64-bit file. Fixes an llvm-mc assertion during test runs. llvm-svn: 106577	2010-06-22 21:11:51 +00:00
Dale Johannesen	6d4802ba6c	Add SSE so these actually pass on non-X86 hosts. llvm-svn: 106575	2010-06-22 20:54:03 +00:00
Bruno Cardoso Lopes	dc883cf45a	Fix a subtle multiclass bug: when using class inheritance on a toplevel 'defm', make sure to properly resolve references. llvm-svn: 106570	2010-06-22 20:30:50 +00:00
Bill Wendling	7e35d39fee	Corresponding test changes for r106564. llvm-svn: 106569	2010-06-22 20:30:14 +00:00
Mon P Wang	825639e849	Move v-binop-widen tests to X86 since they don't work on all platforms llvm-svn: 106562	2010-06-22 19:40:50 +00:00
Jakob Stoklund Olesen	9c47dac677	Remove the SimpleJoin optimization from SimpleRegisterCoalescing. Measurements show that it does not speed up coalescing, so there is no reason to keep the added complexity around. Also clean out some unused methods and static functions. llvm-svn: 106548	2010-06-22 16:13:57 +00:00
Dan Gohman	f820bd327d	Allow "exhaustive" trip count evaluation on phi nodes with all constant operands. llvm-svn: 106537	2010-06-22 13:15:46 +00:00
Evan Cheng	37bb617f8a	Tail merging pass shall not break up IT blocks. rdar://8115404 llvm-svn: 106517	2010-06-22 01:18:16 +00:00
Dan Gohman	3c1b3c61e9	Teach two-address lowering how to unfold a load to open up commuting opportunities. For example, this lets it emit this: movq (%rax), %rcx addq %rdx, %rcx instead of this: movq %rdx, %rcx addq (%rax), %rcx in the case where %rdx has subsequent uses. It's the same number of instructions, and usually the same encoding size on x86, but it appears faster, and in general, it may allow better scheduling for the load. llvm-svn: 106493	2010-06-21 22:17:20 +00:00
Evan Cheng	1fb4de8ec5	Fix PR7421: bug in kill transferring logic. It was ignoring loads / stores which have already been processed. llvm-svn: 106481	2010-06-21 21:21:14 +00:00
Dan Gohman	2dd1d3d182	Make this test more robust in case LLVM ever decides to align the global variable differently. llvm-svn: 106454	2010-06-21 19:56:27 +00:00
Dale Johannesen	dd471bbb10	Add missing FileCheck call. llvm-svn: 106443	2010-06-21 18:46:08 +00:00
Devang Patel	c8bceaa418	test case for r106438. llvm-svn: 106439	2010-06-21 18:37:23 +00:00
Dale Johannesen	d5c58b76ab	Fix PR 7433. Silly typo in non-Darwin ARM tail call handling, plus correct R9 handling in that mode. llvm-svn: 106434	2010-06-21 18:21:49 +00:00
Eric Christopher	bf572c7cea	Add some codegen patterns for x86_64-linux-gnu tls codegen matching. Based on a patch by Patrick Marlier! llvm-svn: 106433	2010-06-21 18:21:27 +00:00
Kalle Raiskila	df071b7e42	Add the check to the testcase of r106419. llvm-svn: 106421	2010-06-21 15:11:51 +00:00
Kalle Raiskila	0ab5a02579	Mark the SPU 'lr' instruction to never have side effects. This allows the fast register allocator to remove redundant register moves. Update a set of tests that depend on the register allocator to be linear scan. llvm-svn: 106420	2010-06-21 15:08:16 +00:00
Kalle Raiskila	d7f50c118a	Fix the lowering of VECTOR_SHUFFLE on SPU to handle splats. llvm-svn: 106419	2010-06-21 14:42:19 +00:00
Kalle Raiskila	6f58190f6f	Fix lowering of VECTOR_SHUFFLE on SPU. Old algorithm used to choke llc with the attached test. llvm-svn: 106411	2010-06-21 10:17:36 +00:00
Evan Cheng	884a8fe5fa	Fix a crash caused by dereference of MBB.end(). rdar://8110842 llvm-svn: 106399	2010-06-20 00:54:38 +00:00
Dan Gohman	51d00092b6	Include the use kind along with the expression in the key of the use sharing map. The reconcileNewOffset logic already forces a separate use if the kinds differ, so incorporating the kind in the key means we can track more sharing opportunities. More sharing means fewer total uses to track, which means smaller problem sizes, which means the conservative throttles don't kick in as often. llvm-svn: 106396	2010-06-19 21:29:59 +00:00
Dan Gohman	866971ed3d	Fix ScalarEvolution's "exhaustive" trip count evaluation code to avoid assuming that loops are in canonical form, as ScalarEvolution doesn't depend on LoopSimplify itself. Also, with indirectbr not all loops can be simplified. This fixes PR7416. llvm-svn: 106389	2010-06-19 14:17:24 +00:00
Bruno Cardoso Lopes	8737b7d73d	Refactor aliased packed logical instructions, also add AVX AND,OR,XOR,NAND{P}{S,D}{rr,rm} instructions. llvm-svn: 106374	2010-06-19 02:44:01 +00:00
Evan Cheng	f3c01f3ef6	Disable sibcall optimization for Thumb1 for now since Thumb1RegisterInfo::emitEpilogue is not expecting them. llvm-svn: 106368	2010-06-19 01:01:32 +00:00
Bruno Cardoso Lopes	1e205f6b1c	Shrink down code and add for free AVX {MIN,MAX}P{S,D}{rm,rr} instructions llvm-svn: 106366	2010-06-19 00:37:31 +00:00
Chris Lattner	e808a78ac1	fix rdar://7873482 by teaching the instruction encoder to emit segment prefixes. Daniel wrote most of this patch. llvm-svn: 106364	2010-06-19 00:34:00 +00:00
Evan Cheng	119824ed4d	Move ARM if-conversion before post-ra scheduling. llvm-svn: 106355	2010-06-18 23:32:07 +00:00
Evan Cheng	2d51c7c592	Allow ARM if-converter to be run after post allocation scheduling. - This fixed a number of bugs in if-converter, tail merging, and post-allocation scheduler. If-converter now runs branch folding / tail merging first to maximize if-conversion opportunities. - Also changed the t2IT instruction slightly. It now defines the ITSTATE register which is read by instructions in the IT block. - Added Thumb2 specific hazard recognizer to ensure the scheduler doesn't change the instruction ordering in the IT block (since IT mask has been finalized). It also ensures no other instructions can be scheduled between instructions in the IT block. This is not yet enabled. llvm-svn: 106344	2010-06-18 23:09:54 +00:00
Jakob Stoklund Olesen	07f4fa8198	TwoAddressInstructionPass::CoalesceExtSubRegs can insert INSERT_SUBREG instructions, but it doesn't really understand live ranges, so the first INSERT_SUBREG uses an implicitly defined register. Fix it in LiveVariableAnalysis by adding the <undef> flag. llvm-svn: 106333	2010-06-18 22:29:44 +00:00
Evan Cheng	cf9e8a987f	Fix an inverted condition. llvm-svn: 106330	2010-06-18 22:17:13 +00:00
Jakob Stoklund Olesen	22a212f97c	When using ADDri to get the address of a stack object, 255 is a conservative limit on the offset that can be materialized without using the register scavenger. llvm-svn: 106312	2010-06-18 20:59:25 +00:00
Dan Gohman	24ceda8eb0	Revert r106304 (105548 and friends), which are the SCEVComplexityCompare optimizations. There is still some nondeterminism remaining. llvm-svn: 106306	2010-06-18 19:54:20 +00:00
Bruno Cardoso Lopes	23f8321cbc	Teach tablegen how to inherit from classes in 'defm' definitions. The rule is simple: only inherit from a class list if they come in the end, after the last multiclass. llvm-svn: 106305	2010-06-18 19:53:41 +00:00
Dan Gohman	4c807fca97	Reapply 105540, 105542, and 105548, and revert r105732. llvm-svn: 106304	2010-06-18 19:26:04 +00:00
Dale Johannesen	c1570dda5c	Enable tail calls on ARM by default, with some basic tests. This has been well tested on Darwin but not elsewhere. It should work provided the linker correctly resolves B.W <label in other function> which it has not seen before, at least from llvm-based compilers. I'm leaving the arm-tail-calls switch in until I see if there's any problems because of that; it might need to be disabled for some environments. llvm-svn: 106299	2010-06-18 19:00:18 +00:00
Jakob Stoklund Olesen	b9f91667e1	Treat the ARM inline asm {cc} constraint as a physreg (%CPSR), just like X86 does for {flags}. If we create virtual registers of the CCR class, RegAllocFast may try to spill them, and we can't do that. llvm-svn: 106289	2010-06-18 16:49:33 +00:00
Dan Gohman	559020df1d	Don't write a file named "&1". llvm-svn: 106269	2010-06-18 01:49:17 +00:00
Dan Gohman	f3aea7aecf	Disable indvars on loops when LoopSimplify form is not available. This fixes PR7333. llvm-svn: 106267	2010-06-18 01:35:11 +00:00
Dan Gohman	99ba4dac59	Don't maintain a set of deleted nodes; instead, use a HandleSDNode to track a node over CSE events. This fixes PR7368. llvm-svn: 106266	2010-06-18 01:24:29 +00:00
Bruno Cardoso Lopes	2323168705	Add {min,max}{ss,sd}{rr,rm} AVX forms. llvm-svn: 106264	2010-06-18 01:12:56 +00:00
Dan Gohman	b92156d5e4	Fold the ShrinkDemandedOps pass into the regular DAGCombiner pass, which is faster, simpler, and less surprising. llvm-svn: 106263	2010-06-18 01:05:21 +00:00
Dan Gohman	30d7a51d6c	Make this test less fragile. llvm-svn: 106255	2010-06-18 00:06:03 +00:00
Dale Johannesen	1f8e5fbc7a	Testcase for llvm-gcc 106225. llvm-svn: 106226	2010-06-17 17:43:14 +00:00
Rafael Espindola	29dda21e96	Remove arm_apcscc from the test files. It is the default and doing this matches what llvm-gcc and clang now produce. llvm-svn: 106221	2010-06-17 15:18:27 +00:00
Bruno Cardoso Lopes	4d1d798736	For a tablegen expression such as !if(a,b,c), let 'a' be evaluated for 'bit' operators llvm-svn: 106185	2010-06-17 00:31:36 +00:00
Bruno Cardoso Lopes	77a4a56251	let the '!eq' expression support 'int' and 'bit' types llvm-svn: 106171	2010-06-16 23:24:12 +00:00
Jakob Stoklund Olesen	207cd4bbd7	Allow a register to be redefined multiple times in a basic block. LiveVariableAnalysis was a bit picky about a register only being redefined once, but that really isn't necessary. Here is an example of chained INSERT_SUBREGs that we can handle now: 68 %reg1040<def> = INSERT_SUBREG %reg1040, %reg1028<kill>, 14 register: %reg1040 +[70,134:0) 76 %reg1040<def> = INSERT_SUBREG %reg1040, %reg1029<kill>, 13 register: %reg1040 replace range with [70,78:1) RESULT: %reg1040,0.000000e+00 = [70,78:1)[78,134:0) 0@78-(134) 1@70-(78) 84 %reg1040<def> = INSERT_SUBREG %reg1040, %reg1030<kill>, 12 register: %reg1040 replace range with [78,86:2) RESULT: %reg1040,0.000000e+00 = [70,78:1)[78,86:2)[86,134:0) 0@86-(134) 1@70-(78) 2@78-(86) 92 %reg1040<def> = INSERT_SUBREG %reg1040, %reg1031<kill>, 11 register: %reg1040 replace range with [86,94:3) RESULT: %reg1040,0.000000e+00 = [70,78:1)[78,86:2)[86,94:3)[94,134:0) 0@94-(134) 1@70-(78) 2@78-(86) 3@86-(94) rdar://problem/8096390 llvm-svn: 106152	2010-06-16 21:29:40 +00:00
Jim Grosbach	2c8b829238	modify so the test doesn't drop an output file in the test source directory. The test should also likely have some FileCheck bits to validate the output(?). llvm-svn: 106146	2010-06-16 21:07:06 +00:00
Devang Patel	79b0da30fb	Be specific. Use FileCheck. llvm-svn: 106135	2010-06-16 19:39:45 +00:00
Rafael Espindola	a20e2dfe86	Make sure that simplify libcalls does not replace a call with one calling convention with a new call with a different calling convention. llvm-svn: 106134	2010-06-16 19:34:01 +00:00
Devang Patel	e3721dd27c	This requires more investigation. Unblock buildbots for now. llvm-svn: 106122	2010-06-16 18:19:49 +00:00
Devang Patel	37e4f98cb6	Update test to explicitly capture llc output. llvm-svn: 106121	2010-06-16 18:04:12 +00:00
Benjamin Kramer	a13bd20396	simplify-libcalls: fold strncmp(x, y, 1) -> memcmp(x, y, 1) The memcmp will be optimized further and even the pathological case 'strstr(x, "x") == x' generates optimal code now. llvm-svn: 106097	2010-06-16 10:30:29 +00:00
Evan Cheng	f128bdcb55	Make post-ra scheduling, anti-dep breaking, and register scavenger (conservatively) aware of predicated instructions. This enables ARM to move if-conversion before post-ra scheduler. llvm-svn: 106091	2010-06-16 07:35:02 +00:00
Bill Wendling	8c0cf0994d	Create a more targeted fix for not sinking instructions into a range where it will conflict with another live range. The place which creates this scenario is the code in X86 that lowers a select instruction by splitting the MBBs. This eliminates the need to check from the bottom up in an MBB for live pregs. llvm-svn: 106066	2010-06-15 23:46:31 +00:00
Rafael Espindola	1115afb092	Update test to match recent llvm-gcc change. llvm-svn: 106056	2010-06-15 22:16:40 +00:00
Jakob Stoklund Olesen	ec2e964fd6	Remove the local register allocator. Please use the fast allocator instead. llvm-svn: 106051	2010-06-15 21:58:33 +00:00
Benjamin Kramer	1118860e3a	simplify-libcalls: fold strstr(a, b) == a -> strncmp(a, b, strlen(b)) == 0 llvm-svn: 106047	2010-06-15 21:34:25 +00:00
Rafael Espindola	ae591be4e9	Set the mtriple in some tests so that they use AAPCS. llvm-svn: 106041	2010-06-15 20:42:00 +00:00
Mon P Wang	7a84689cc5	Fixed vector widening of binary instructions that can trap. Patch by Visa Putkinen! llvm-svn: 106038	2010-06-15 20:29:05 +00:00
Chris Lattner	874c92bd47	fix fastisel to handle GS and FS relative pointers. Patch by Nelson Elhage! llvm-svn: 106031	2010-06-15 19:08:40 +00:00
Rafael Espindola	5a24a56e1e	Remove the arm_aapcscc marker from the tests. It is the default for the linux targets. llvm-svn: 106029	2010-06-15 19:04:29 +00:00
Jakob Stoklund Olesen	246e9a07a2	Avoid processing early clobbers twice in RegAllocFast. Early clobbers defining a virtual register were first allocated to a physreg and then processed as a physreg EC, spilling the virtreg. This fixes PR7382. llvm-svn: 105998	2010-06-15 16:20:57 +00:00
Jakob Stoklund Olesen	82eca35b3e	Add CoalescerPair helper class. Given a copy instruction, CoalescerPair can determine which registers to coalesce in order to eliminate the copy. It deals with all the subreg fun to determine a tuple (DstReg, SrcReg, SubIdx) such that: - SrcReg is a virtual register that will disappear after coalescing. - DstReg is a virtual or physical register whose live range will be extended. - SubIdx is 0 when DstReg is a physical register. - SrcReg can be joined with DstReg:SubIdx. CoalescerPair::isCoalescable() determines if another copy instruction is compatible with the same tuple. This fixes some NEON miscompilations where shuffles are getting coalesced as if they were copies. The CoalescerPair class will replace a lot of the spaghetti logic in JoinCopy later. llvm-svn: 105997	2010-06-15 16:04:21 +00:00
Bob Wilson	a55b8877e6	Generalize the pre-coalescing of extract_subregs feeding reg_sequences, replacing the overly conservative checks that I had introduced recently to deal with correctness issues. This makes a pretty noticeable difference in our testcases where reg_sequences are used. I've updated one test to check that we no longer emit the unnecessary subreg moves. llvm-svn: 105991	2010-06-15 05:56:31 +00:00
Chris Lattner	00ab615406	apparently lots of dupes. llvm-svn: 105956	2010-06-14 20:19:03 +00:00
Chris Lattner	faa7bdccbf	fix a nasty bug where we were not treating available_externally symbols as declarations in the X86 backend. This would manifest on darwin x86-32 as errors like this with -fvisibility=hidden: symbol '__ZNSbIcED1Ev' can not be undefined in a subtraction expression This fixes PR7353. llvm-svn: 105954	2010-06-14 20:11:56 +00:00
Chris Lattner	bbb798c7d1	remove old test. llvm-svn: 105953	2010-06-14 20:07:43 +00:00
Chris Lattner	b30f87b74e	rename test llvm-svn: 105952	2010-06-14 20:07:34 +00:00
Chris Lattner	329ea064ed	jump threading can't split a critical edge from an indirectbr. This fixes PR7356. llvm-svn: 105950	2010-06-14 19:45:43 +00:00
Stuart Hastings	37b827fd11	Test case for Radar 8004649. llvm-svn: 105949	2010-06-14 18:37:04 +00:00
Benjamin Kramer	6e42d53cb3	Test case for r105914. llvm-svn: 105915	2010-06-13 16:16:54 +00:00
Daniel Dunbar	250a21b79b	tests: Run macho-dump with binary unbuffered streams on Windows, I can't find a Python 2.6 way to change stdin to binary. llvm-svn: 105894	2010-06-12 17:05:28 +00:00
Daniel Dunbar	edcc628289	tests: Make macho-dump.bat actually work. llvm-svn: 105891	2010-06-12 16:21:54 +00:00
Daniel Dunbar	12225eb687	tests: Propagate LLVM_SRC_ROOT and PYTHON_EXECUTABLE environment variables to tests. llvm-svn: 105890	2010-06-12 16:21:19 +00:00
Bruno Cardoso Lopes	a714ea0f7d	More AVX: {ADD,SUB,MUL,DIV}{PD,PS}rm llvm-svn: 105870	2010-06-12 01:53:48 +00:00
Bruno Cardoso Lopes	b06f54b852	More AVX: {ADD,SUB,MUL,DIV}{PD,PS}rr Handle OpSize TSFlag for AVX llvm-svn: 105869	2010-06-12 01:23:26 +00:00
Bruno Cardoso Lopes	fd5458d4bd	More AVX instructions ({ADD,SUB,MUL,DIV}{SS,SD}rm) Introduce the VEX_X field llvm-svn: 105859	2010-06-11 23:50:47 +00:00
Daniel Dunbar	56b093f572	tests: Add wrapper script for calling macho-dump on Win32. llvm-svn: 105856	2010-06-11 23:29:48 +00:00
Bob Wilson	f07d33d8f1	Add a missing bitcast. This code used to only handle conversions between i64 and f64 types, but now it also handle Neon vector types, so the f64 result of VMOVDRR may need to be converted to a Neon type. Radar 8084742. llvm-svn: 105845	2010-06-11 22:45:25 +00:00
Stuart Hastings	afe54f1625	Support for nested functions/classes in debug output. (Again.) Radar 7424645. llvm-svn: 105828	2010-06-11 20:08:44 +00:00
Bruno Cardoso Lopes	5f2adccc1b	Teach tablegen to allow "let" expressions inside multiclasses, providing more ways to factor out commonality from the records. llvm-svn: 105776	2010-06-10 02:42:59 +00:00
Bill Wendling	d53a2cb4ac	Testcase for r105741. llvm-svn: 105750	2010-06-09 20:30:22 +00:00
Jakob Stoklund Olesen	8bc5eca331	Mark physregs defined by inline asm as implicit. This is a bit of a hack to make inline asm look more like call instructions. It would be better to produce correct dead flags during isel. llvm-svn: 105749	2010-06-09 20:05:00 +00:00
Daniel Dunbar	e16d569932	Workaround SCEV non-determinism on this test, for now, to get buildbots back to green. Dan, please revert this once the real problem is fixed. llvm-svn: 105732	2010-06-09 17:54:40 +00:00
Kalle Raiskila	5e0862f7f5	Fix SPU to cope with vector insertelement to an undef position. We default to inserting to lane 0. llvm-svn: 105722	2010-06-09 09:58:17 +00:00
Kalle Raiskila	056113a211	Handle loading from/storing to undef pointers on SPU by inserting a random load/store, rather than crashing llc. llvm-svn: 105710	2010-06-09 08:29:41 +00:00
Bruno Cardoso Lopes	c2f87b7bb2	Reapply r105521, this time appending "LLU" to 64 bit immediates to avoid breaking the build. llvm-svn: 105652	2010-06-08 22:51:23 +00:00
Rafael Espindola	efac7f5e90	Add more virtual memory to lit. The python in x86-64 fedora 13 needs it to run the llvm tests :-( It was failing with -- Testing: 5324 tests, 8 threads -- Fatal Python error: PyEval_AcquireThread: NULL new thread state llvm-svn: 105610	2010-06-08 16:17:58 +00:00
Stuart Hastings	8612940357	Tweak test for debug/metadata change, update to FileCheck. Radar 7424645. llvm-svn: 105559	2010-06-07 21:50:54 +00:00
Dan Gohman	22e1adbb11	Fix this test to work under lit. llvm-svn: 105553	2010-06-07 20:58:11 +00:00
Dan Gohman	fa9ad13002	Run dead type elimination after dead argument elimination. llvm-svn: 105552	2010-06-07 20:28:37 +00:00
Dan Gohman	fb8ed43349	Make bugpoint dead-argument-hacking actually work, and actually test it. llvm-svn: 105551	2010-06-07 20:20:33 +00:00
Dan Gohman	70910a6ab6	Optimize ScalarEvolution's SCEVComplexityCompare predicate: don't go scrounging through SCEVUnknown contents and SCEVNAryExpr operands; instead just do a simple deterministic comparison of the precomputed hash data. Also, since this is more precise, it eliminates the need for the slow N^2 duplicate detection code. llvm-svn: 105540	2010-06-07 19:06:13 +00:00
Kenneth Uildriks	1850444000	Partial specialization was not checking the callsite to make sure it was using the same constants as the specialization, leading to calls to the wrong specialization. Patch by Takumi Nakamura! llvm-svn: 105528	2010-06-05 14:50:21 +00:00
Chris Lattner	fdd2614330	revert r105521, which is breaking the buildbots with stuff like this: In file included from X86InstrInfo.cpp:16: X86GenInstrInfo.inc:2789: error: integer constant is too large for 'long' type X86GenInstrInfo.inc:2790: error: integer constant is too large for 'long' type X86GenInstrInfo.inc:2792: error: integer constant is too large for 'long' type X86GenInstrInfo.inc:2793: error: integer constant is too large for 'long' type X86GenInstrInfo.inc:2808: error: integer constant is too large for 'long' type X86GenInstrInfo.inc:2809: error: integer constant is too large for 'long' type X86GenInstrInfo.inc:2816: error: integer constant is too large for 'long' type X86GenInstrInfo.inc:2817: error: integer constant is too large for 'long' type llvm-svn: 105524	2010-06-05 04:17:30 +00:00
Bruno Cardoso Lopes	594fa26317	Initial AVX support for some instructions. No patterns matched yet, only assembly encoding support. llvm-svn: 105521	2010-06-05 03:53:24 +00:00
Bruno Cardoso Lopes	c4f614870f	Teach tablegen to support 'defm' inside multiclasses. llvm-svn: 105519	2010-06-05 02:11:52 +00:00
Stuart Hastings	3ca391027f	Revert 105492 & 105493 due to a testcase regression. Radar 7424645. llvm-svn: 105511	2010-06-05 00:39:29 +00:00
Dan Gohman	bbfb6aca92	LSR needs to remember inserted instructions even in postinc mode, because there could be multiple subexpressions within a single expansion which require insert point adjustment. This fixes PR7306. llvm-svn: 105510	2010-06-05 00:33:07 +00:00
Devang Patel	3eed2cf587	test case for r105504. Radar 8055687. llvm-svn: 105505	2010-06-04 23:47:41 +00:00
Evan Cheng	a03e6f85fe	Re-apply 105308 with fix. llvm-svn: 105502	2010-06-04 23:28:13 +00:00
Stuart Hastings	7c015988fe	Support for nested functions/classes in debug output. Radar 7424645. llvm-svn: 105492	2010-06-04 22:36:03 +00:00
Devang Patel	36da24b546	Copy location info for current function argument from dbg.declare if respective store instruction does not have any location info. llvm-svn: 105490	2010-06-04 22:27:30 +00:00
Dale Johannesen	065d6fd537	More tail call removal. llvm-svn: 105485	2010-06-04 21:14:24 +00:00
Dan Gohman	538b413ccb	Fix normalization and de-normalization of non-affine SCEVs. llvm-svn: 105480	2010-06-04 19:16:34 +00:00
Mon P Wang	622cdd2297	Fixed a bug during widening where we would avoid legalizing a node. When we replace an OpA with a widened OpB, it is possible to get new uses of OpA due to CSE when recursively updating nodes. Since OpA has been processed, the new uses are not examined again. The patch checks if this occurred and if it did, updates the new uses of OpA to use OpB. llvm-svn: 105453	2010-06-04 01:20:10 +00:00
Dale Johannesen	b3780b1103	Remove more tail calls. llvm-svn: 105450	2010-06-04 01:01:24 +00:00
Dale Johannesen	e7b392dca9	Remove a tail call, and move some CHECKs to the functions where they belong. llvm-svn: 105449	2010-06-04 01:01:04 +00:00
Dan Gohman	8fdda8a655	This test doesn't need the ssp attribute. llvm-svn: 105440	2010-06-04 00:14:48 +00:00
Dale Johannesen	e288fee959	Remove tail call. A tail call version will follow. llvm-svn: 105438	2010-06-04 00:03:37 +00:00
Dale Johannesen	9f71f7f70c	Remove tail call to preserve this test. A tail call version will follow. llvm-svn: 105422	2010-06-03 21:57:48 +00:00
Dale Johannesen	41528aeb0b	Make this test not use tail calls. A tail call version will follow. llvm-svn: 105419	2010-06-03 21:53:01 +00:00
Dan Gohman	d83e3e7750	Fix SimplifyDemandedBits' AssertZext logic to demand all the bits. It needs to demand the high bits because it's asserting that they're zero. llvm-svn: 105406	2010-06-03 20:21:33 +00:00
Bob Wilson	30093b5d8b	Revert 105308. llvm-svn: 105399	2010-06-03 18:28:31 +00:00
Bill Wendling	f82aea634c	Machine sink could potentially sink instructions into a block where the physical registers it defines then interfere with an existing preg live range. For instance, if we had something like these machine instructions: BB#0 ... = imul ... EFLAGS<imp-def,dead> test ..., EFLAGS<imp-def> jcc BB#2 EFLAGS<imp-use> BB#1 ... ; fallthrough to BB#2 BB#2 ... ; No code that defines EFLAGS jcc ... EFLAGS<imp-use> Machine sink will come along, see that imul implicitly defines EFLAGS, but because it's "dead", it assumes that it can move imul into BB#2. But when it does, imul's "dead" imp-def of EFLAGS is raised from the dead (a zombie) and messes up the condition code for the jump (and pretty much anything else which relies upon it being correct). The solution is to know which pregs are live going into a basic block. However, that information isn't calculated at this point. Nor does the LiveVariables pass take into account non-allocatable physical registers. In lieu of this, we do a very conservative pass through the basic block to determine if a preg is live coming out of it. llvm-svn: 105387	2010-06-03 07:54:20 +00:00
Eric Christopher	f67fe3b1e8	One underscore, not two. llvm-svn: 105379	2010-06-03 04:02:59 +00:00
Eli Friedman	dbbbf73c96	Implement expansion in type legalization for add/sub with overflow. The expansion is the same as that used by LegalizeDAG. The resulting code sucks in terms of performance/codesize on x86-32 for a 64-bit operation; I haven't looked into whether different expansions might be better in general. llvm-svn: 105378	2010-06-03 03:49:50 +00:00
Evan Cheng	a2da22734f	Enable machine cse of instructions which define physical registers. llvm-svn: 105308	2010-06-02 01:08:27 +00:00
Devang Patel	89f2db6b67	DwarfWrite is now smart enough to drop debug value pointing to undefined register. Update this test to avoid this. iSel not properly lowering argument into a well formed DBG_VALUE in some cases is a separate issue and not related to the test in this testcase. llvm-svn: 105295	2010-06-01 23:01:43 +00:00
Devang Patel	b0c76394a3	Keep track of incoming debug value of unused argument. Radar 7927666. llvm-svn: 105285	2010-06-01 19:59:01 +00:00
Dan Gohman	b782caa393	Fill in missing support for ISD::FEXP, ISD::FPOWI, and friends. llvm-svn: 105283	2010-06-01 18:35:14 +00:00
Kalle Raiskila	8916358f97	Fix handling of 'load' nodes. llvm-svn: 105269	2010-06-01 13:34:47 +00:00
Bill Wendling	1a764f93a0	Debreak test for non-Darwin. llvm-svn: 105257	2010-05-31 21:47:24 +00:00
Duncan Sands	4c904fa797	Fix PR7272: when inlining through a callsite with byval arguments, the newly created allocas may be used by inlined calls, so these need to have their tail call flags cleared. Fixes PR7272. llvm-svn: 105255	2010-05-31 21:00:26 +00:00
Eric Christopher	24efc63000	Add a test for the llvm-gcc commit in r90200. llvm-svn: 105253	2010-05-31 20:39:10 +00:00
Chris Lattner	14c46517b5	fix PR6623: when optimizing for size, don't inline memcpy/memsets that are too large. This causes the freebsd bootloader to be too large apparently. It's unclear if this should be an -Os or -Oz thing. Thoughts welcome. llvm-svn: 105228	2010-05-31 17:30:14 +00:00
Chris Lattner	291a189cda	upgrade and filecheckize this test. llvm-svn: 105227	2010-05-31 17:27:17 +00:00
Nick Lewycky	aee2632be3	The memcpy intrinsic only takes i8* for %src and %dst, so cast them to that first. Fixes PR7265. llvm-svn: 105206	2010-05-31 06:16:35 +00:00
Evan Cheng	707b7cc429	Remove schedule-livein-copies. It's not being used. llvm-svn: 105095	2010-05-29 02:23:39 +00:00
Evan Cheng	27c4933e02	Fix PR7193: if sibling call address can take a register, make sure there are enough registers available by counting inreg arguments. llvm-svn: 105092	2010-05-29 01:35:22 +00:00
Evan Cheng	cc2efe11db	Fix some latency computation bugs: if the use is not a machine opcode do not just return zero. llvm-svn: 105061	2010-05-28 23:26:21 +00:00
Dan Gohman	0fa67e479a	Add lint checks for function attributes. llvm-svn: 105009	2010-05-28 21:43:57 +00:00
Kevin Enderby	4c71e08ed8	MC/X86: Add alias for movzx. llvm-svn: 105005	2010-05-28 21:20:21 +00:00
Kevin Enderby	b29228905f	MC/X86: Add alias for fwait. llvm-svn: 105001	2010-05-28 20:59:10 +00:00
Kevin Enderby	76413597a9	Fix the use of x86 control and debug registers so that the assertion failure in getX86RegNum() does not happen. Patch by Shantonu Sen! llvm-svn: 104994	2010-05-28 19:01:27 +00:00
Dale Johannesen	526bd59aaf	Add missing space; works for me. llvm-svn: 104992	2010-05-28 18:45:59 +00:00
Dan Gohman	c575ec61ea	Fix lint's memcpy and memmove checks, and its basic block traversal. llvm-svn: 104970	2010-05-28 17:44:00 +00:00
Jakob Stoklund Olesen	2085089c49	Fix more tests that depended on the default register allocator choice. llvm-svn: 104961	2010-05-28 17:06:30 +00:00
Dan Gohman	862f034188	Detect self-referential values. llvm-svn: 104957	2010-05-28 16:45:33 +00:00
Dan Gohman	672393f6c7	Remove this va_arg test, which is no longer applicable. llvm-svn: 104956	2010-05-28 16:44:04 +00:00
Stuart Hastings	c1e216583f	Revert 104841, 104842, 104876 due to buildbot failures. Radar 7424645. llvm-svn: 104953	2010-05-28 16:41:07 +00:00
Dan Gohman	cef9fc37f4	Eli pointed out that va_arg instruction result values don't reference the stack. llvm-svn: 104951	2010-05-28 16:34:49 +00:00
Dan Gohman	54d7aaa819	Teach lint how to look through simple store+load pairs and other effective no-op constructs, to make it more effective on unoptimized IR. llvm-svn: 104950	2010-05-28 16:21:24 +00:00
Dan Gohman	df5d7dcef1	Teach instcombine to promote alloca array sizes. llvm-svn: 104945	2010-05-28 15:09:00 +00:00
Dan Gohman	71505aa4de	Add a testcase for getelementptr index promotion. llvm-svn: 104944	2010-05-28 15:07:59 +00:00
Dan Gohman	ddba4b725a	Add a lint check for returning the address of stack memory. llvm-svn: 104936	2010-05-28 04:33:42 +00:00
Dan Gohman	2140a74979	Eliminate the restriction that the array size in an alloca must be i32. This will help reduce the amount of casting required on 64-bit targets. llvm-svn: 104911	2010-05-28 01:14:11 +00:00
Jakob Stoklund Olesen	b613ae2c89	Add a -regalloc=default option that chooses a register allocator based on the -O optimization level. This only really affects llc for now because both the llvm-gcc and clang front ends override the default register allocator. I intend to remove that code later. llvm-svn: 104904	2010-05-27 23:57:25 +00:00
Evan Cheng	3d3ee87d4e	llvm can't correctly support 'H', 'Q' and 'R' modifiers. Just mark it an error. llvm-svn: 104891	2010-05-27 22:08:38 +00:00
Kevin Enderby	9738f64bd9	MC/X86: Add aliases for Jcc variants. llvm-svn: 104890	2010-05-27 21:33:19 +00:00
Devang Patel	7a9dedf0ab	Do not drop location info for inlined function args. llvm-svn: 104884	2010-05-27 20:25:04 +00:00
Stuart Hastings	bf132360a8	Adjust test case for lexical block pruning. Follow-on to r104842 and Radar 7424645. llvm-svn: 104876	2010-05-27 19:57:51 +00:00
Devang Patel	91ad65e8b7	Let's try one more time to match patterns. The goal is to match following 3 lines. In other words, a temp. label between two DEBUG_VALUE comments. ;DEBUG_VALUE: bar:x <- undef ## 2010-01-18-Inlined-Debug.c:7 Ltmp1: ;DEBUG_VALUE: foo:__x <- undef ## 2010-01-18-Inlined-Debug.c:5 llvm-svn: 104872	2010-05-27 19:46:38 +00:00
Duncan Sands	f162eace49	Teach instCombine to remove malloc+free if malloc's only uses are comparisons to null. Patch by Matti Niemenmaa. llvm-svn: 104871	2010-05-27 19:09:06 +00:00
Devang Patel	da01e5e907	Temp. labels number may not match for all configurations. llvm-svn: 104858	2010-05-27 17:51:08 +00:00
Devang Patel	5e6b71ce34	inlined function's arguments need a label to mark the start point because they are not directly attached to current function. llvm-svn: 104848	2010-05-27 16:47:30 +00:00
Stuart Hastings	8e99e50d08	Support for nested functions/classes in debug output. Radar 7424645. llvm-svn: 104841	2010-05-27 16:16:54 +00:00
Gabor Greif	38303d7e3b	rename test to represent meaningful date llvm-svn: 104831	2010-05-27 09:32:38 +00:00
Bob Wilson	ebdc772457	Add a test for llvm-gcc svn r104726. llvm-svn: 104805	2010-05-27 05:30:36 +00:00
Eric Christopher	8ae57895f5	Add a quick test of relocations. llvm-svn: 104794	2010-05-27 00:53:40 +00:00
Devang Patel	6b9a9fe207	Simplify. Eliminate unneeded debug_loc entry. llvm-svn: 104785	2010-05-26 23:55:23 +00:00
Dan Gohman	a20a5cd24f	Reinstate checking of stackrestore, with checking for both Read and Write, and add a comment explaining this. llvm-svn: 104756	2010-05-26 22:21:25 +00:00
Dan Gohman	1249adf160	Implement checking of the tail keyword. llvm-svn: 104744	2010-05-26 21:46:36 +00:00
Devang Patel	1b08572a66	Update debug info when live-in reg is copied into a vreg. llvm-svn: 104732	2010-05-26 20:18:50 +00:00
Kevin Enderby	70e34983e8	Fix the x86 move to/from segment register instructions. llvm-svn: 104731	2010-05-26 20:10:45 +00:00
Dale Johannesen	053dd21c84	Testcase for 104624/104619/PR7191/8023512. Reduced from one provided by Duncan Sands, thanks! llvm-svn: 104710	2010-05-26 17:55:45 +00:00
Devang Patel	9fc11706e3	First cut at supporting .debug_loc section. This is used to track variable information. llvm-svn: 104649	2010-05-25 23:40:22 +00:00
Benjamin Kramer	9439084cea	Properly promote operands when optimizing a single-character memcmp. llvm-svn: 104648	2010-05-25 22:53:43 +00:00
Eric Christopher	19a4b843cc	Add support for initialized global data for darwin tls. Update comments and testcases accordingly. llvm-svn: 104635	2010-05-25 21:28:50 +00:00
Kevin Enderby	492d4f409a	Changed the encoding of X86 floating point stack operations where both operands are st(0). These can be encoded using an opcode for storing in st(0) or using an opcode for storing in st(i), where i can also be 0. To allow testing with the darwin assembler and get a matching binary the opcode for storing in st(0) is now used. To do this the same logical trick is used from the darwin assembler in converting things like this: fmul %st(0), %st into this: fmul %st(0) by looking for the second operand being X86::ST0 for specific floating point mnemonics then removing the second X86::ST0 operand. This also has the added benefit of allowing things like: fmul %st(1), %st that llvm-mc did not assemble. llvm-svn: 104634	2010-05-25 20:52:34 +00:00
Dale Johannesen	cd4ba6caba	Removing test; Chris thinks it's better to have the bug go untested than have a testcase this large. So be it. llvm-svn: 104632	2010-05-25 20:40:10 +00:00
Daniel Dunbar	0e767d7364	MC/X86: Add a hack to allow recognizing 'cmpltps' and friends. llvm-svn: 104626	2010-05-25 19:49:32 +00:00
Dale Johannesen	60fe2cdc4f	Fix another variant of PR 7191. Also add a testcase Mon Ping provided; unfortunately bugpoint failed to reduce it, but I think it's important to have a test for this in the suite. 8023512. llvm-svn: 104624	2010-05-25 18:47:23 +00:00
Daniel Dunbar	4a5b2c597b	MC/X86: Define explicit immediate forms of cmp{ss,sd,ps,pd}. llvm-svn: 104622	2010-05-25 18:40:53 +00:00
Kevin Enderby	c798965e63	The BT64ri8 record in X86Instr64bit.td was missing a REX_W which is required for the 64-bit version of the Bit Test instruction. llvm-svn: 104621	2010-05-25 18:16:58 +00:00
Eric Christopher	f6562d35ac	Make sure aeskeygenassist uses an unsigned immediate field. Fixes rdar://8017638 llvm-svn: 104617	2010-05-25 17:33:22 +00:00
Dan Gohman	79b6a0f140	Fix an mmx movd encoding. llvm-svn: 104552	2010-05-24 20:51:08 +00:00
Kevin Enderby	dc71cc794b	MC/X86: Add aliases for CMOVcc variants. llvm-svn: 104549	2010-05-24 20:32:23 +00:00
Bob Wilson	3eb7691858	Thumb2 RSBS instructions were being printed without the 'S' suffix. Fix it by changing the T2I_rbin_s_is multiclass to handle the CPSR output and 'S' suffix in the same way as T2I_bin_s_irs. llvm-svn: 104531	2010-05-24 18:44:06 +00:00
Evan Cheng	755d45be43	LR is in GPR, not tGPR even in Thumb1 mode. llvm-svn: 104518	2010-05-24 18:00:18 +00:00
Daniel Dunbar	b52fcd6304	MC/X86: Subdivide immediates a bit more, so that we properly recognize immediates based on the width of the target instruction. For example: addw $0xFFFF, %ax should match the same as addw $-1, %ax but we used to match it to the longer encoding. llvm-svn: 104453	2010-05-22 21:02:33 +00:00
Daniel Dunbar	d459e29a0a	MC/X86: Add alias for setz, setnz, jz, jnz. llvm-svn: 104435	2010-05-22 06:37:33 +00:00
Evan Cheng	168ced94d8	Implement @llvm.returnaddress. rdar://8015977. llvm-svn: 104421	2010-05-22 01:47:14 +00:00
Eric Christopher	64087cd346	This test is darwin only. Make it so(tm). llvm-svn: 104418	2010-05-22 00:55:55 +00:00
Bob Wilson	91fdf68516	Recognize more BUILD_VECTORs and VECTOR_SHUFFLEs that can be implemented by copying VFP subregs. This exposed a bunch of dead code in the *spill-q.ll tests, so I tweaked those tests to keep that code from being optimized away. Radar 7872877. llvm-svn: 104415	2010-05-22 00:23:12 +00:00
Eric Christopher	6fdea1bda8	Add full bss data support for darwin tls variables. llvm-svn: 104414	2010-05-22 00:10:22 +00:00
Kevin Enderby	7e7482c80f	Added retl for 32-bit x86 and added retq for 64-bit x86. llvm-svn: 104394	2010-05-21 23:01:38 +00:00
Bob Wilson	51d9ee3ff6	Change CodeGen/ARM/2009-11-02-NegativeLane.ll to use 16-bit vector elements so that it will continue to test what it was meant to test when I commit a separate change for better support of BUILD_VECTOR and VECTOR_SHUFFLE for Neon. Fix a DAG combiner crash exposed by this test change. llvm-svn: 104380	2010-05-21 21:05:32 +00:00
Chris Lattner	0735ecfe17	now that fp reg kill insertion stuff happens as a separate pass after isel instead of being interlaced with it, we can trust that all the code for a function has been isel'd before it is run. The practical impact of this is that we can scan for machine instr phis instead of doing a fuzzy match on the LLVM BB for phi nodes. Doing the fuzzy match required knowing when isel would produce an fp reg stack phi which was gross. It was also wrong in cases where select got lowered to a branch tree because cmovs aren't available (PR6828). Just do the scan on machine phis which is simpler, faster and more correct. This fixes PR6828. llvm-svn: 104333	2010-05-21 18:17:54 +00:00
Jakob Stoklund Olesen	a648c6a757	Teach VirtRegRewriter to handle spilling in instructions that have multiple definitions of the virtual register. This happens when spilling the registers produced by REG_SEQUENCE: %reg1047:5<def>, %reg1047:6<def>, %reg1047:7<def> = VLD3d8 %reg1033, 0, pred:14, pred:%reg0 The rewriter would spill the register multiple times, dead store elimination tried to keep up, but ended up cutting the branch it was sitting on. llvm-svn: 104321	2010-05-21 16:36:13 +00:00
Dale Johannesen	b3b9c8ac48	Fix i64->f64 conversion, x86-64, -no-sse. A bit tricky since there's a 3rd 64-bit type, MMX vectors. PR 7135. llvm-svn: 104308	2010-05-21 00:52:33 +00:00
Evan Cheng	34c260458a	Change ARM scheduling default to list-hybrid if the target supports floating point instructions (and is not using soft float). llvm-svn: 104307	2010-05-21 00:43:17 +00:00
Daniel Dunbar	baf2eea6f4	MC/X86: Add movq alias for movabsq, to allow matching 64-bit immediates with movq. llvm-svn: 104275	2010-05-20 20:36:29 +00:00
Dan Gohman	ee2fea3cd7	When canonicalizing icmp operand order to put the loop invariant operand on the left, the interesting operand is on the right. This fixes a bug where LSR was failing to recognize ICmpZero uses, which led it to be unable to reverse the induction variable in the attached testcase. Delete test/CodeGen/X86/stack-color-with-reg-2.ll, because its test is extremely fragile and hard to meaningfully update. llvm-svn: 104262	2010-05-20 19:26:52 +00:00
Bob Wilson	5954994bba	Handle Neon v2f64 and v2i64 vector shuffles as register copies. This fixes the remaining issue with pr7167. llvm-svn: 104257	2010-05-20 18:39:53 +00:00
Dan Gohman	29790edb93	Fix assembly parsing and encoding of the pushf and popf family of instructions. llvm-svn: 104231	2010-05-20 16:16:00 +00:00
Dan Gohman	1e19eab963	Define the x86 pause instruction. llvm-svn: 104204	2010-05-20 01:35:50 +00:00
Dan Gohman	a3b7570a3a	Fix the sfence instruction to use MRM_F8 instead of MRM7r, since it doesn't have a register operand. Also, use I instead of PSI, for consistency with mfence and lfence. llvm-svn: 104203	2010-05-20 01:23:41 +00:00
Bill Wendling	de852faef9	Match "4" or "8" depending upon if it's 32- or 64-bit. llvm-svn: 104196	2010-05-20 00:27:10 +00:00
Eric Christopher	4b4446be7c	Once more, with feeling. llvm-svn: 104190	2010-05-20 00:07:13 +00:00
Dan Gohman	20fab456da	Teach LSR how to cope better with unrolled loops on targets where the addressing modes don't make this trivially easy. This allows it to avoid falling into the less precise heuristics in more cases. llvm-svn: 104186	2010-05-19 23:43:12 +00:00
Chris Lattner	7cbfa4462f	fix rdar://7986634 - match instruction opcodes case insensitively. llvm-svn: 104183	2010-05-19 23:34:33 +00:00
Bill Wendling	1c4687e350	Testcase for r104181. llvm-svn: 104182	2010-05-19 23:33:26 +00:00
Eric Christopher	63476ddae6	A more combo tls testcase. llvm-svn: 104163	2010-05-19 21:19:42 +00:00
Eric Christopher	b95493c495	Few more simple tls testcases. llvm-svn: 104148	2010-05-19 20:35:15 +00:00
Jakob Stoklund Olesen	e11cdf8cc8	TwoAddressInstructionPass doesn't really know how to merge live intervals when lowering REG_SEQUENCE instructions. Insert copies for REG_SEQUENCE sources not killed to avoid breaking later passes. llvm-svn: 104146	2010-05-19 20:08:00 +00:00
Eric Christopher	6304da132f	Attempt to run this test on x86 only. llvm-svn: 104143	2010-05-19 18:59:37 +00:00
Bob Wilson	f070b1b571	Testcase to go with 104141. llvm-svn: 104142	2010-05-19 18:58:37 +00:00
Evan Cheng	daeca2d156	t2LEApcrel and tLEApcrel are re-materializable. This makes it possible to hoist more loads during machine LICM. llvm-svn: 104115	2010-05-19 07:28:01 +00:00
Evan Cheng	abd0ad54a4	Intrinsics which do a vector compare (results are all zero or all ones) are modeled as icmp / fcmp + sext. This is turned into a vsetcc by dag combine (yes, not a good long term solution). The targets can then isel the vsetcc to the appropriate instruction. The trouble arises when the result of a vector cmp + sext is then and'ed with all ones. Instcombine will turn it into a vector cmp + zext, dag combiner will miss turning it into a vsetcc and hell breaks loose after that. Teach dag combine to turn a vector cmp + zext into a vsetcc + and 1. This fixes rdar://7923010. llvm-svn: 104094	2010-05-19 01:08:17 +00:00
Eric Christopher	c09d5a29d8	Add a test to make sure that we're lowering the shift amount correctly. llvm-svn: 104090	2010-05-19 00:22:04 +00:00
Jakob Stoklund Olesen	430b6e40ab	Remember to update VirtRegLastUse when spilling without killing before a call. llvm-svn: 104074	2010-05-18 22:20:09 +00:00
Dan Gohman	887dd1cd31	When converting a test to a cmp to fold a load, use the cmp that has an 8-bit immediate field rather than one with a wider immediate field. llvm-svn: 104064	2010-05-18 21:42:03 +00:00
Eric Christopher	7f173d1d27	Quick test to make sure we're emitting the tbss section correctly. llvm-svn: 104063	2010-05-18 21:40:20 +00:00
Evan Cheng	f19384d54a	Sink dag combine's post index load / store code that swap base ptr and index into the target hook. Only the target knows whether the swap is safe. In Thumb2 mode, the offset must be an immediate. rdar://7998649 llvm-svn: 104060	2010-05-18 21:31:17 +00:00
Dale Johannesen	6338d15939	Test passed on ppc, to my surprise; if it worked there it may work everywhere... llvm-svn: 104053	2010-05-18 20:47:04 +00:00
Evan Cheng	e7fc64a5c9	Fix PR7162: Use source register classes and sub-indices to determine the correct register class of the definitions of REG_SEQUENCE. llvm-svn: 104050	2010-05-18 20:03:28 +00:00
Dale Johannesen	fb7df5317a	Testcase for llvm-gcc checkin 104042. llvm-svn: 104043	2010-05-18 19:03:51 +00:00
Kevin Enderby	53e0631516	Fixed the problem with a branch to "0b" that was not parsed by llvm-mc correctly. The Lexer was incorrectly eating the newline, causing it to branch to address 0. Updated the test case to use a "0:" label and a branch to "0b". llvm-svn: 104038	2010-05-18 17:51:35 +00:00
Daniel Dunbar	d5563f420a	MC/Mach-O: Implement support for setting indirect symbol table offset in section header. Also, create symbol data for LHS of assignment, to match 'as' symbol ordering better. llvm-svn: 104033	2010-05-18 17:28:24 +00:00
Daniel Dunbar	a4820fcc78	MC/X86: Implement custom lowering to make sure we match things like X86::ADC32ri $0, %eax to X86::ADC32i32 $0 llvm-svn: 104030	2010-05-18 17:22:24 +00:00
Evan Cheng	48f0de96d6	FIX PR7158. SimplifyVBinOp was asserting when it fails to constant fold (op (build_vector), (build_vector)). llvm-svn: 104004	2010-05-18 00:03:40 +00:00
Evan Cheng	1e4f55200d	Fix PR7175. Insert copies of a REG_SEQUENCE source if it is used by other REG_SEQUENCE instructions. llvm-svn: 103994	2010-05-17 23:24:12 +00:00
Kevin Enderby	0510b48fd9	Added support in MC for Directional Local Labels. llvm-svn: 103989	2010-05-17 23:08:19 +00:00
Eric Christopher	9635b3da6b	More data/parsing support for tls directives. Add a few more testcases and cleanup comments as well. llvm-svn: 103985	2010-05-17 22:53:55 +00:00
Evan Cheng	f2c9a96f3c	Fix PR7156. If the sources of a REG_SEQUENCE are all IMPLICIT_DEF's. Replace it with an IMPLICIT_DEF rather than deleting it or else it would be left without a def. llvm-svn: 103984	2010-05-17 22:09:49 +00:00
Daniel Dunbar	bb166bed40	MC/Mach-O/x86: Optimal nop sequences should only be used for the .text sections, not all sections in the text segment. llvm-svn: 103981	2010-05-17 21:54:30 +00:00
Daniel Dunbar	b7b796cc11	MC/Mach-O: Reverse order of SymbolData scanning when emitting instructions. - This fixes a string table mismatch with 'as' when two new symbols are defined in a single instruction. llvm-svn: 103979	2010-05-17 21:19:59 +00:00
Evan Cheng	29c463862e	Careful with reg_sequence coalescing not to overwrite sub-register indices. llvm-svn: 103971	2010-05-17 20:57:12 +00:00
Daniel Dunbar	0211a96989	MC/Mach-O: Fix some differences in symbol flag handling. - Don't clear weak reference flag, 'as' was only "trying" to do this, it wasn't actually succeeding. - Clear the "lazy bound" bit when we mark something external. This corresponds roughly to the lazy clearing of the bit that 'as' implements in symbol_table_lookup. - The exact meaning of these flags appears pretty loose, since 'as' isn't very consistent. For now we just try to match 'as', we will clean this up one day hopefully. llvm-svn: 103964	2010-05-17 20:12:31 +00:00
Evan Cheng	3d98b996ff	Turn on -neon-reg-sequence by default. Using NEON load / store multiple instructions will no longer create gobs of vmov of D registers! llvm-svn: 103960	2010-05-17 19:51:20 +00:00
Daniel Dunbar	9b4a824217	llvm-mc: Support reassignment of variables in one special case, when the variable has not yet been used in an expression. This allows us to support a few cases that show up in real code (mostly because gcc generates it for Objective-C on Darwin), without giving up a reasonable semantic model for assignment. llvm-svn: 103950	2010-05-17 17:46:23 +00:00
Jakob Stoklund Olesen	176a9c4272	Avoid allocating the same physreg to multiple virtregs in one instruction. While that approach works wonders for register pressure, it tends to break everything. This should unbreak the arm-linux builder and fix a number of miscompilations. llvm-svn: 103946	2010-05-17 17:18:59 +00:00
Jakob Stoklund Olesen	7d22a81b61	Only use clairvoyance when defining a register, and then only if it has one use. This makes allocation independent on the ordering of use-def chains. llvm-svn: 103935	2010-05-17 04:50:57 +00:00
Eric Christopher	68b1bbe66a	Assume that we'll handle mangling the symbols earlier and just put the symbol to the file as we have it. Simplifies our tbss handling. llvm-svn: 103928	2010-05-17 02:13:02 +00:00
Dale Johannesen	f92c344167	Removing as part of previous reversion. llvm-svn: 103915	2010-05-16 20:19:40 +00:00
Dale Johannesen	2ef974ee0e	Revert 103911; it broke a test that expects bitconvert <1xi64> -> i64 to work in MMX registers on hosts where -no-sse is the default (not mine). The right thing is to accept this and make i64->f64 conversions go through memory, but I don't have time right now. llvm-svn: 103914	2010-05-16 20:19:04 +00:00
Dale Johannesen	fc1492d71b	Make x86-64 64-bit bitconvert work when SSE is not available. (This worked as of about 6 months ago and I didn't track down exactly what broke it; I think this fix is appropriate.) llvm-svn: 103911	2010-05-16 18:22:38 +00:00
Anton Korobeynikov	8f35fabbc1	Add support for thiscall calling convention. Patch by Charles Davis and Steven Watanabe! llvm-svn: 103902	2010-05-16 09:08:45 +00:00
Anton Korobeynikov	1bf28a128b	Some cheap DAG combine goodness for multiplication with a particular constant. This can be extended later on to handle more "complex" constants. llvm-svn: 103881	2010-05-15 18:16:59 +00:00
Evan Cheng	4cad68eb34	Allow TargetLowering::getRegClassFor() to be called on illegal types. Also allow target to override it in order to map register classes to illegal but synthesizable types. e.g. v4i64, v8i64 for ARM / NEON. llvm-svn: 103854	2010-05-15 02:18:07 +00:00
Bill Wendling	0160e55893	SystemZ really does mean "has calls" and not just "adjusts stack." Go ahead and replace the check with the appropriate predicate. Modify the testcase to reflect the correct code. (It should be saving callee-saved registers on the stack allocated by the calling function.) llvm-svn: 103829	2010-05-14 22:17:42 +00:00
Devang Patel	c87e867111	Test case for r103800. llvm-svn: 103801	2010-05-14 21:04:45 +00:00
Kevin Enderby	7bc111f5a9	Fix so "int3" is correctly accepted, added "into" and fixed "int" with an argument, like "int $4", to not get an Assertion error. llvm-svn: 103791	2010-05-14 19:16:02 +00:00
Daniel Dunbar	2493ddfe42	MC/Mach-O/x86_64: Darwin's special "signed_N" relocation types should only be used to replace a normal relocation, not a reference to a GOT entry. llvm-svn: 103789	2010-05-14 18:53:40 +00:00
Jakob Stoklund Olesen	4d5c1061e3	Simplify the handling of physreg defs and uses in RegAllocFast. This adds extra security against using clobbered physregs, and it adds kill markers to physreg uses. llvm-svn: 103784	2010-05-14 18:03:25 +00:00
Daniel Dunbar	148e876ac2	XFAIL the test I added with vg_leak, apparently it is the first and only llc -filetype=obj test, and -filetype=obj leaks a few objects. Added a FIXME, we need to sort out the ownership model for the various MC objects. llvm-svn: 103769	2010-05-14 07:47:51 +00:00
Daniel Dunbar	3439ed6324	Inline Asm: Ensure buffer is newline terminated to match how the text is printed. - This is a hack, but I can't decide the best place to handle this. Chris? llvm-svn: 103765	2010-05-14 04:31:50 +00:00
Eric Christopher	9fb6bb07ca	Add AsmParser support for darwin tbss directive. Nothing uses this yet. llvm-svn: 103757	2010-05-14 01:50:28 +00:00
Nick Lewycky	23b545ca4b	Actually run the test. Thanks Daniel Dunbar! llvm-svn: 103720	2010-05-13 17:41:06 +00:00
Nick Lewycky	3230f0ac25	Add testcase for r103653. llvm-svn: 103699	2010-05-13 06:00:14 +00:00
Daniel Dunbar	e35c88d5ad	MC/Mach-O: Add another zerofill test to improve coverage. llvm-svn: 103691	2010-05-13 01:10:28 +00:00
Jakob Stoklund Olesen	0ba2e2a568	Take allocation hints from copy instructions to/from physregs. This causes way more identity copies to be generated, ripe for coalescing. llvm-svn: 103686	2010-05-13 00:19:43 +00:00
Chris Lattner	8cb4728a15	fix rdar://7965971 and a fixme: use ParseIdentifier in ParseDirectiveDarwinZerofill instead of hard coding the check for identifier. This allows quoted symbol names to be used. llvm-svn: 103682	2010-05-13 00:10:34 +00:00
Chris Lattner	9efef006cf	reapply r103668 with a fix. Never make "minor syntax changes" after testing before committing. llvm-svn: 103681	2010-05-13 00:02:47 +00:00
Chris Lattner	e354235512	revert r103668 for now, it is apparently breaking things. llvm-svn: 103677	2010-05-12 23:40:59 +00:00
Chris Lattner	a6df4650fd	moffset forms of moves are x86-32 only, make the parser lower them to the correct x86-64 instructions since we don't have a clean way to handle this in td files yet. rdar://7947184 llvm-svn: 103668	2010-05-12 23:13:36 +00:00
Chris Lattner	e132b0a92c	fix the encoding of the obscure "moffset" forms of moves, i386 part first. rdar://7947184 llvm-svn: 103660	2010-05-12 22:48:24 +00:00
Jakob Stoklund Olesen	955a0e71e9	Make sure to add kill flags to the last use of a virtreg when it is redefined. The X86 floating point stack pass and others depend on good kill flags. llvm-svn: 103635	2010-05-12 18:46:03 +00:00
Devang Patel	0bcbcbd23e	Test case for r103633. llvm-svn: 103634	2010-05-12 18:31:04 +00:00
Dale Johannesen	352117adf5	Testcase for llvm 103572 (7898991). llvm-svn: 103574	2010-05-12 05:04:20 +00:00
Daniel Dunbar	059379a9d7	MC/X86: Extend suffix matching hack to match 'q' suffix. llvm-svn: 103535	2010-05-12 00:54:20 +00:00
Daniel Dunbar	ba2f4c3884	MC/Mach-O/x86_64: Add a new hook for checking whether a particular section can be diced into atoms, and adjust getAtom() to take this into account. - This fixes relocations to symbols in fixed size literal sections, for example. llvm-svn: 103532	2010-05-12 00:38:17 +00:00
Jakob Stoklund Olesen	e6e39dc310	Enable a bunch more -regalloc=fast tests llvm-svn: 103531	2010-05-12 00:11:24 +00:00
Daniel Dunbar	53ce0e12d8	MC/Mach-O/x86_64: Fix PCrel adjustment for x86_64, which was using the fixup offset instead of the fixup address as intended. llvm-svn: 103527	2010-05-11 23:53:11 +00:00
Jakob Stoklund Olesen	132668102e	Keep track of the last place a live virtreg was used. This allows us to add accurate kill markers, something the scavenger likes. Add some more tests from ARM that needed this. llvm-svn: 103521	2010-05-11 23:24:45 +00:00
Jakob Stoklund Olesen	84c881e593	One more -regalloc=fast test llvm-svn: 103509	2010-05-11 20:51:07 +00:00
Jakob Stoklund Olesen	3f0241e0f9	Simplify the tracking of used physregs to a bulk bitor followed by a transitive closure after allocating all blocks. Add a few more test cases for -regalloc=fast. llvm-svn: 103500	2010-05-11 20:30:28 +00:00
Jakob Stoklund Olesen	f1b3029a54	Mostly rewrite RegAllocFast. Sorry for the big change. The path leading up to this patch had some TableGen changes that I didn't want to commit before I knew they were useful. They weren't, and this version does not need them. The fast register allocator now does no liveness calculations. Instead it relies on kill flags provided by isel. (Currently those kill flags are also ignored due to isel bugs). The allocation algorithm is supposed to work with any subset of valid kill flags. More kill flags simply means fewer spills inserted. Registers are allocated from a working set that contains no aliases. That means most allocations can be done directly without expensive alias checks. When the working set runs out of registers we do the full alias check to find new free registers. llvm-svn: 103488	2010-05-11 18:54:45 +00:00
Daniel Dunbar	3937e28da0	MC/Mach-O x86_64: Switch to using fragment atom symbol. - This eliminates getAtomForAddress() (which was a linear search) and simplifies getAtom(). - This also fixes some correctness problems where local labels at the same address as non-local labels could be assigned to the wrong atom. llvm-svn: 103480	2010-05-11 17:22:50 +00:00
Kalle Raiskila	9dd3ef8d01	Make SPU backend not assert on jump tables. llvm-svn: 103466	2010-05-11 11:00:02 +00:00
Evan Cheng	2fa5a7e7e4	Select @llvm.trap to the special B with 1111 condition (i.e. trap) instruction. llvm-svn: 103459	2010-05-11 07:26:32 +00:00
Daniel Dunbar	75778984f9	MC/Mach-O: Fix another mismatch with .weak_definition, we shouldn't use a scattered relocation entry with a .weak_definition. llvm-svn: 103443	2010-05-10 23:15:20 +00:00
Devang Patel	1a0df9a80e	Enable multiple Compile Units in one module. This means now 'llvm-ld a.bc b.bc' will preserve debug info appropriately. llvm-svn: 103439	2010-05-10 22:49:55 +00:00
Chris Lattner	d86a5a5e45	this really is needed. :( llvm-svn: 103434	2010-05-10 21:23:48 +00:00
Chris Lattner	ba44bf052a	just remove this, it isn't needed. llvm-svn: 103432	2010-05-10 21:01:47 +00:00
Chris Lattner	58aff8fb57	fix PR7105 by enumerating MDNodes on all @llvm.foo function calls, not just recognized intrinsics. llvm-svn: 103428	2010-05-10 20:53:17 +00:00
Chris Lattner	05b4caff3e	fix a pretty obvious typo. We test things before committing them, right? llvm-svn: 103427	2010-05-10 20:51:06 +00:00
David Greene	103d4b43e9	Fix PR6875: This includes a patch by Roman Divacky to fix the initial crash. Move the actual addition of passes from PassManager::add to PassManager::addImpl. That way, when adding printer passes we won't recurse infinitely. Finally, check to make sure that we are actually adding a FunctionPass to a FunctionPassManager before doing a print before or after it. Immutable passes are strange in this way because they aren't FunctionPasses yet they can be and are added to the FunctionPassManager. llvm-svn: 103425	2010-05-10 20:24:27 +00:00
Evan Cheng	02947a4551	Be careful with operand promotion. For a binary operation, the source operands may be the same. PR7018. rdar://7939869. llvm-svn: 103419	2010-05-10 19:03:57 +00:00
Devang Patel	fbc75d039a	Test case for 103414. llvm-svn: 103415	2010-05-10 17:49:40 +00:00
Kalle Raiskila	92ea401d8f	Fix encoding of 'sf' and 'sfh' instructions. llvm-svn: 103399	2010-05-10 08:13:49 +00:00
Chris Lattner	84d4618659	make simplifycfg insert an llvm.trap before the 'unreachable' it introduces when it detects undefined behavior. llvm.trap generally codegens into something really small (e.g. a 2 byte ud2 instruction on x86) and debugging this sort of thing is "nontrivial". For example, we now compile: void foo() { *(int*)0 = 42; } into: _foo: pushl %ebp movl %esp, %ebp ud2 Some may even claim that this is a security hole, though that seems dubious to me. This addresses rdar://7958343 - Optimizing away null dereference potentially allows arbitrary code execution llvm-svn: 103356	2010-05-08 22:15:59 +00:00
Chris Lattner	02b0df5338	Teach instcombine to transform a bitcast/(zext|trunc)/bitcast sequence with a vector input and output into a shuffle vector. This sort of sequence happens when the input code stores with one type and reloads with another type and then SROA promotes to i96 integers, which make everyone sad. This fixes rdar://7896024 llvm-svn: 103354	2010-05-08 21:50:26 +00:00
Chris Lattner	5a62d6e578	Fix PR7052, patch by Jakub Staszak! llvm-svn: 103347	2010-05-08 20:01:44 +00:00
Bill Wendling	cd476b6760	Readd testcase. llvm-svn: 103335	2010-05-08 04:47:54 +00:00
Dan Gohman	d0800241d2	When pruning candidate formulae out of an LSRUse, update the LSRUse's Regs set after all pruning is done, rather than trying to do it on the fly, which can produce an incomplete result. This fixes a case where heuristic pruning was stripping all formulae from a use, which led the solver to enter an infinite loop. Also, add a few asserts to diagnose this kind of situation. llvm-svn: 103328	2010-05-07 23:36:59 +00:00
Bill Wendling	6b5897b4de	Remove. Don't XFAIL. llvm-svn: 103321	2010-05-07 23:09:17 +00:00
Bill Wendling	32d8981ec0	Temporarily revert r101984. llvm-svn: 103314	2010-05-07 22:45:36 +00:00
Dan Gohman	7de01ec2c9	SDDbgValues are apparently not being legalized. Fix a symptom of the problem, and not the real problem itself, by dropping debug info for i128 values. rdar://7958162. llvm-svn: 103310	2010-05-07 22:19:08 +00:00
Kevin Enderby	51bed9c870	Fix i386 relocations to Weak Definitions. The relocation entries should be external and the item to be relocated should not have the address of the symbol added in. llvm-svn: 103302	2010-05-07 21:44:23 +00:00
Dale Johannesen	51c1695a0a	Fix PR 7087, and probably other things, by extending getConstantFP to accept the two supported long double target types. This was not the original intent, but there are other places that assume this works and it's easy enough to do. llvm-svn: 103299	2010-05-07 21:35:53 +00:00
Devang Patel	be8ee1a09e	Update test to use valid debug info. llvm-svn: 103287	2010-05-07 20:34:00 +00:00
Jim Grosbach	2a41cad900	Clean up the conditional for handling of sign_extend_inreg based on whether the extract instructions are available. rdar://7956878 llvm-svn: 103277	2010-05-07 18:34:55 +00:00
Duncan Sands	ebf838274f	Correct some bogus target triples. llvm-svn: 103265	2010-05-07 17:03:48 +00:00
Dan Gohman	5d5b8b1b8c	Add an LLVM IR version of code sinking. This uses the same simple algorithm as MachineSink, but it isn't constrained by MachineInstr-level details. llvm-svn: 103257	2010-05-07 15:40:13 +00:00
Nick Lewycky	45f530db39	Revert r103133 and add testcase from PR7066. llvm-svn: 103233	2010-05-07 01:45:38 +00:00
Dale Johannesen	bbfa3067bd	Adjust tests affected by llvm-gcc 103229. All results here match gcc-4.2. llvm-svn: 103230	2010-05-07 01:11:31 +00:00
Dan Gohman	7421ae48bf	Disable the new unknown-location code for now. It causes a major increase in the debug line info section, and it's causing regressions in a gdb testsuite. llvm-svn: 103226	2010-05-07 01:08:53 +00:00
Daniel Dunbar	21aa523c28	MC/X86: X86AbsMemAsmOperand is subclass of X86NoSegMemAsmOperand. - This fixes "leal 0, %eax", for example. llvm-svn: 103205	2010-05-06 22:39:14 +00:00
Chris Lattner	348dc9b15a	fix rdar://7947167 - llvm-mc doesn't match movsq llvm-svn: 103199	2010-05-06 21:48:14 +00:00
Sean Callanan	e7e1cf9fbd	Eliminated the classification of control registers into %ecr_ and %rcr_, leaving just %cr_ which is what people expect. Updated the disassembler to support this unified register set. Added a testcase to verify that the registers continue to be decoded correctly. llvm-svn: 103196	2010-05-06 20:59:00 +00:00
Dan Gohman	779c69bbc5	Add a DebugLoc argument to TargetInstrInfo::copyRegToReg, so that it doesn't have to guess. llvm-svn: 103194	2010-05-06 20:33:48 +00:00
Dan Gohman	cb4e3e51a9	Add a testcase for r103135, explicitly representing unknown locations in debug line info. llvm-svn: 103189	2010-05-06 17:49:17 +00:00
Daniel Dunbar	b0ceb764b8	Revert r103137, fix for $ in labels. It looks like we can't actually handle this at the token level. Consider the following horrible test case: a = 1 .globl $a movl ($a), %eax movl $a, %eax movl $$a, %eax llvm-svn: 103178	2010-05-06 14:46:38 +00:00
Chris Lattner	35096e82c5	Fix PR7054 - Assertion `Symbol->isUndefined() && "Cannot define a symbol twice!"' failed. Users can write broken code that emits the same label twice with asm renaming, detect this and emit a fatal backend error instead of aborting. llvm-svn: 103140	2010-05-06 00:05:37 +00:00
Chris Lattner	482fa218d4	fix rdar://7946934 - in some limited cases, the assembler should allow $ at the start of a symbol name. llvm-svn: 103137	2010-05-05 23:51:28 +00:00
Jim Grosbach	151cd8f159	Cleanup of ARMv7M support. Move hardware divide and Thumb2 extract/pack instructions to subtarget features and update tests to reflect. PR5717. llvm-svn: 103136	2010-05-05 23:44:43 +00:00
Jakob Stoklund Olesen	1b6f698e85	Fix PR6520. An earlyclobber physreg must not be allocated to anything else. llvm-svn: 103133	2010-05-05 23:07:41 +00:00
Stuart Hastings	7e60a6bd71	Test case for pr2394 and r102979. llvm-svn: 103129	2010-05-05 22:49:33 +00:00
Jim Grosbach	245b169212	fix copy/paste oops. llvm-svn: 103122	2010-05-05 21:07:46 +00:00
Jim Grosbach	44d7f49887	Add tests for ARMV7M divide instruction use llvm-svn: 103120	2010-05-05 20:47:15 +00:00
Jim Grosbach	e36cd72e38	remove unneeded underscores. llvm-svn: 103114	2010-05-05 19:55:58 +00:00
Jim Grosbach	5ced648ba8	Convert to filecheck llvm-svn: 103113	2010-05-05 19:41:11 +00:00
Daniel Dunbar	f3a53baf00	MC/Mach-O: Mark absolute variables appropriately, and add Mach-O support for writing them. - <rdar://problem/7885351> integrated assembler broken for i386 objc code llvm-svn: 103112	2010-05-05 19:01:05 +00:00
Daniel Dunbar	027fa5f31c	MC/Mach-O/x86_64: Relocations in debug sections should use local relocations when possible. - <rdar://problem/7934873> llvm-svn: 103092	2010-05-05 17:22:39 +00:00
Duncan Sands	687900ed83	Use llvm.foo as the intrinsic, rather than llvm.dbg.value. Since the values passed to llvm.dbg.value were not valid for the intrinsic, it might have caused trouble one day if the verifier ever started checking for valid debug info. llvm-svn: 103038	2010-05-04 20:09:25 +00:00
Chris Lattner	0185047b3f	"on the rare occasion the SPU BE produces illegal assembly - it tries to emit an add instruction of the form 'a reg, reg, imm'." Patch by Kalle Raiskila! llvm-svn: 103021	2010-05-04 17:58:46 +00:00
Daniel Dunbar	c3e0bafc6d	MC/X86: Chris pointed out that 'as' isn't consistent in accepting the long form of instructions which have no direct register usage. Darwin 'as' accepts: add $0, (%rax) but rejects mov $0, (%rax) for example. Given that, only accept suffix matches which match exactly one form. We still need to emit nice diagnostics for failures... llvm-svn: 103015	2010-05-04 17:31:02 +00:00
Daniel Dunbar	9b816a1bb3	MC/X86: Add "support" for matching ATT style mnemonic prefixes. - The idea is that when a match fails, we just try to match each of +'b', +'w', +'l'. If exactly one matches, we assume this is a mnemonic prefix and accept it. If all match, we assume it is width generic, and take the 'l' form. - This would be a horrible hack, if it weren't so simple. Therefore it is an elegant solution! Chris gets the credit for this particular elegant solution. :) - Next step to making this more robust is to have the X86 matcher generate the mnemonic prefix information. Ideally we would also compute up-front exactly which mnemonic to attempt to match, but this may require more custom code in the matcher than is really worth it. llvm-svn: 103012	2010-05-04 16:12:42 +00:00
Duncan Sands	c2928c6ef5	Fix a variant of PR6112 found by thinking about it: when doing RAUW of a global variable with a local variable in function F, if function local metadata M in function G was using the global then M would become function-local to both F and G, which is not allowed. See the testcase for an example. Fixed by detecting this situation and zapping the metadata operand when it occurs. llvm-svn: 103007	2010-05-04 12:43:36 +00:00
Devang Patel	075e9b5d66	Set DW_AT_APPLE_omit_frame_ptr in endFunction() where MachineFunction is available all the time. llvm-svn: 103001	2010-05-04 06:15:30 +00:00
Devang Patel	801b8ea42a	Do not ignore debug loc attached with llvm.dbg.declare while collecting debug info used by a module. llvm-svn: 102995	2010-05-04 01:05:02 +00:00
Dale Johannesen	81bfca7bde	Implement builtin_return_address(x) and builtin_frame_address(x) on PPC for x!=0. 7624113. llvm-svn: 102972	2010-05-03 22:59:34 +00:00
Jakob Stoklund Olesen	f4e4e84115	Check that subregisters don't have independent values in RemoveCopyByCommutingDef(). This fixes PR6941. llvm-svn: 102970	2010-05-03 22:40:32 +00:00
Dan Gohman	0553acff5e	Fix tests to use fadd, fsub, and fmul, instead of add, sub, and mul, when the type is floating-point. llvm-svn: 102969	2010-05-03 22:36:46 +00:00
Bill Wendling	06bf470104	Revert r102948. llvm-svn: 102964	2010-05-03 21:51:21 +00:00
Kevin Enderby	6f2f8d0798	Changed llvm-mc to use the same suffixes with floating point compare instructions as the Mac OS X darwin assembler. Some of which like 'fcoml' assembled to different opcodes. While some of the suffixes were just different. llvm-svn: 102958	2010-05-03 21:31:40 +00:00
Kevin Enderby	e3a1726034	Fixed the encoding of two of the X86 movq instructions. The Move quadword from mm to mm/m64 and the Move quadword from xmm2/mem64 to xmm1 had the incorrect encodings. llvm-svn: 102952	2010-05-03 21:03:31 +00:00
Kevin Enderby	1a51d4cec9	Fixed the encoding of the x86 push instructions. Using a 32-bit immediate value caused a pushl instruction to be incorrectly encoded using only two bytes of immediate, causing the following 2 instruction bytes to be part of the 32-bit immediate value. Also fixed the one byte form of push to be used when the immediate would fit in a sign-extended byte. Lastly changed the names to not include the 32 of PUSH32 since they actually push the size of the stack pointer. llvm-svn: 102951	2010-05-03 20:45:05 +00:00
Bill Wendling	88c734e8ae	Testcase for r102947. llvm-svn: 102948	2010-05-03 20:39:35 +00:00
Devang Patel	9f5200a122	Check for side effects before splitting loop. Patch by Jakub Staszak! llvm-svn: 102928	2010-05-03 18:06:58 +00:00
Dan Gohman	2ad68de4aa	Fix a bug which prevented tail merging of return instructions in beneficial cases. See the changes in test/CodeGen/X86/tail-opts.ll and test/CodeGen/ARM/ifcvt2.ll for details. The fix is to change HashEndOfMBB to hash at most one instruction, instead of trying to apply heuristics about when it will be profitable to consider more than one instruction. The regular tail-merging heuristics are already prepared to handle the same cases, and they're more precise. Also, make test/CodeGen/ARM/ifcvt5.ll and test/CodeGen/Thumb2/thumb2-branch.ll slightly more complex so that they continue to test what they're intended to test. And, this eliminates the problem in test/CodeGen/Thumb2/2009-10-15-ITBlockBranch.ll, the testcase from PR5204. Update it accordingly. llvm-svn: 102907	2010-05-03 14:35:47 +00:00
Duncan Sands	211427bda9	Remove the -enable-sjlj-eh option, which doesn't do anything. Remove the -enable-eh option which is only used by the JIT, and replace it with -jit-enable-eh. llvm-svn: 102865	2010-05-02 15:36:26 +00:00
Chris Lattner	b49a622fe9	revert r102831. We already delete dead readonly calls in other places, killing a valid transformation is not the right answer. llvm-svn: 102850	2010-05-01 17:19:38 +00:00
Anton Korobeynikov	737718d4f4	Insert ANY_EXTEND node instead of invalid truncate during DAG Combining (X & 1), when needed. This fixes PR7001 llvm-svn: 102838	2010-05-01 12:52:34 +00:00
Anton Korobeynikov	319d71f44f	Do folding for indirect branches, where possible llvm-svn: 102836	2010-05-01 12:28:21 +00:00
Anton Korobeynikov	ebbdfef2fc	Implement indirect branches on MSP430 llvm-svn: 102835	2010-05-01 12:04:32 +00:00
Owen Anderson	550986ea90	Disable the call-deletion transformation introduced in r86975. Without halting analysis, it is illegal to delete a call to a read-only function. The correct solution is almost certainly to add a "must halt" attribute and only allow deletions in its presence. XFAIL the relevant testcase for now. llvm-svn: 102831	2010-05-01 08:34:28 +00:00
Chris Lattner	532112b98a	fix PR5009 by making CGSCCPM realize that a call was devirtualized if an indirect call site was removed and a direct one was added, not just if an indirect call site was modified to be direct. llvm-svn: 102830	2010-05-01 06:38:43 +00:00
Chris Lattner	c3bc80a082	rename test llvm-svn: 102829	2010-05-01 06:34:13 +00:00
Chris Lattner	fc8d9ee6c3	Implement rdar://6295824 and PR6724 with two tiny changes that can have a big effect :). The first is to enable the iterative SCC passmanager juice that kicks in when the scc passmgr detects that a function pass has devirtualized a call. In this case, it will rerun all the passes it manages on the SCC, up to the iteration count limit (4). This is useful because a function pass may devirtualize a call, and we want the inliner to inline it, or pruneeh to infer stuff about it, etc. The second patch is to add all call sites to the DevirtualizedCalls list the inliner uses. This list is about to get renamed, but the gist of this is that the inliner now reconsiders all inlined call sites as candidates for further inlining. The intuition is that in cases like this: f() { g(1); } g(int x) { h(x); } We analyze this bottom up, and may decide that it isn't profitable to inline H into G. Next step, we decide that it is profitable to inline G into F, and do so, which means that F now calls H. Even though the call from G -> H may not have been profitable to inline, the call from F -> H may be (in this case because a constant allows folding etc). In my spot checks, this doesn't have a big impact on code. For example, the LLC output for 252.eon grew by 0.02% (from 317252 to 317308) and 176.gcc actually shrank by 0.3% (from 1525612 to 1520964 bytes). 252.eon never iterated in the SCC Passmgr, 176.gcc iterated at most 1 time. llvm-svn: 102823	2010-05-01 01:15:56 +00:00
Chris Lattner	e8262675a3	The inliner has traditionally not considered call sites that appear due to inlining a callee as candidates for further inlining, but a recent patch made it do this if those call sites were indirect and became direct. Unfortunately, in bizarre cases (see testcase) doing this can cause us to infinitely inline mutually recursive functions into callers not in the cycle. Fix this by keeping track of the inline history from which callsite inline candidates got inlined. This shouldn't affect any "real world" code, but is required for a follow on patch that is coming up next. llvm-svn: 102822	2010-05-01 01:05:10 +00:00
Bill Wendling	02bc6787ca	Test failing too much on too many platforms. llvm-svn: 102812	2010-05-01 00:12:33 +00:00
Bill Wendling	06cacb1291	Maybe it needs sse2? llvm-svn: 102802	2010-04-30 23:19:29 +00:00
Bill Wendling	613fb7daa6	Force 64-bit. llvm-svn: 102800	2010-04-30 22:45:20 +00:00
Chris Lattner	a9bac86d16	Dan recently disabled recursive inlining within a function, but we were still inlining self-recursive functions into other functions. Inlining a recursive function into itself has the potential to reduce recursion depth by a factor of 2; inlining a recursive function into something else reduces recursion depth by exactly 1. Since inlining a recursive function into something else is a weird form of loop peeling, turn this off. The deleted testcase was added by Dale in r62107; since then we're leaning towards not inlining recursive stuff ever. In any case, if we like inlining recursive stuff, it should be done within the recursive function itself to get the algorithm recursion depth win. llvm-svn: 102798	2010-04-30 22:37:22 +00:00
Bill Wendling	de4b225093	EXTRACT_VECTOR_ELT of an INSERT_VECTOR_ELT may have the same index, but the indexes could be of a different value type, or might not even use the same SDNode for the constant (weird, I know). Compare the actual values instead of the pointers. llvm-svn: 102791	2010-04-30 22:19:17 +00:00
Jakob Stoklund Olesen	9afed0f98b	The local register allocator has to spill dirty callee saved registers before a call that might throw. The landing pad assumes that all registers are in stack slots. We used to spill those dirty CSRs after the call, and the stack slots would be wrong when arriving at the landing pad. llvm-svn: 102770	2010-04-30 21:19:29 +00:00
Devang Patel	3ca9a9b59c	Preserve debug info attached with call instruction while eliminating dead argument. Radar 7927803 llvm-svn: 102760	2010-04-30 20:23:54 +00:00
Devang Patel	cde3576e0d	New test. llvm-svn: 102746	2010-04-30 19:39:29 +00:00
Dan Gohman	299e7b93ac	Add lint checks for invalid uses of memory. llvm-svn: 102733	2010-04-30 19:05:00 +00:00
Dan Gohman	6221b85680	Add -o /dev/null to some tests which don't care about their output. llvm-svn: 102722	2010-04-30 17:42:30 +00:00
Evan Cheng	5f2314f3a3	Fix test. llvm-svn: 102694	2010-04-30 06:00:56 +00:00
Evan Cheng	5117a555e0	Another sibcall bug. If caller and callee calling conventions differ, then it's only safe to do a tail call if the results are returned in the same way. llvm-svn: 102683	2010-04-30 01:12:32 +00:00
Jakob Stoklund Olesen	8d4214578d	Reject really weird coalescer case when trying to merge identical subregisters of different register classes, e.g. %reg1048:3<def> = EXTRACT_SUBREG %RAX<kill>, 3 where %reg1048 is a GR32 register. This is not impossible to handle, but it is pretty hard and very rare. This should unbreak the dragonegg builder. llvm-svn: 102672	2010-04-29 23:47:46 +00:00
Evan Cheng	38dfa5cf20	Load folding tail call should not use ebp / rbp after it's popped. PEI should use esp / rsp to reference frame instead. llvm-svn: 102596	2010-04-29 05:08:22 +00:00
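
The push-immediate fix in 1a51d4cec9 above turns on choosing between the one-byte and four-byte immediate forms. The following is a minimal sketch of that choice, not LLVM's encoder: the helper name `encode_push_imm` is hypothetical, it assumes 32-bit operand size, and the opcodes (0x6A for a sign-extended byte immediate, 0x68 for a full 32-bit immediate) are the ones given in the Intel instruction set reference.

```python
import struct


def encode_push_imm(value: int) -> bytes:
    """Encode `push imm` for 32-bit operand size (hypothetical helper)."""
    if -128 <= value <= 127:
        # Short form: opcode 0x6A followed by one sign-extended immediate byte.
        return b"\x6a" + struct.pack("<b", value)
    if -2**31 <= value <= 2**32 - 1:
        # Long form: opcode 0x68 followed by four little-endian immediate bytes.
        # Emitting only two of these bytes would make the decoder swallow the
        # next two instruction bytes as part of the immediate.
        return b"\x68" + struct.pack("<I", value & 0xFFFFFFFF)
    raise ValueError("immediate does not fit in 32 bits")


if __name__ == "__main__":
    for imm in (1, -1, 128, 0x12345678):
        print(imm, encode_push_imm(imm).hex())
```

The long form always carries the full four immediate bytes, which is the property the commit message says the earlier encoding got wrong.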


11897 Commits