llvm-project

Commit Graph

Author	SHA1	Message	Date
Nuno Lopes	20ea62527a	move the bounds checking pass to the instrumentation folder, where it belongs. I dunno why in the world I dropped it in the Scalar folder in the first place. No functionality change. llvm-svn: 160587	2012-07-20 22:39:33 +00:00
Benjamin Kramer	5be8f60126	Remove unused private member variables uncovered by the recent changes to clang's -Wunused-private-field. llvm-svn: 160583	2012-07-20 22:05:57 +00:00
Galina Kistanova	434efb29b5	Fix few warnings. llvm-svn: 160576	2012-07-20 21:30:52 +00:00
Daniel Dunbar	04b4583c9b	raw_ostream: Add a has_colors() method. llvm-svn: 160558	2012-07-20 18:29:41 +00:00
Daniel Dunbar	712de82154	Process: Add sys::Process::FileDescriptorHasColors(). llvm-svn: 160557	2012-07-20 18:29:38 +00:00
Owen Anderson	3a8bdb5677	Make RegisterOperand a subclass of DAGOperand so that RegisterOperands can be passed into multiclasses that take DAGOperands as multiclass parameters. llvm-svn: 160540	2012-07-20 03:38:19 +00:00
Benjamin Kramer	347d559323	Pull the simple parts of DenseMapInfo<DebugLoc> inline and prune includes. llvm-svn: 160507	2012-07-19 15:00:34 +00:00
Alexey Samsonov	e16e16add6	DebugInfo library: add support for fetching absolute paths to source files (instead of basenames) from DWARF. Use this behavior in llvm-dwarfdump tool. Reviewed by Benjamin Kramer. llvm-svn: 160496	2012-07-19 07:03:58 +00:00
Galina Kistanova	aaf9735951	Fixed few warnings. llvm-svn: 160493	2012-07-19 04:50:12 +00:00
Bill Wendling	0de5913855	Remove tabs. llvm-svn: 160473	2012-07-19 00:01:33 +00:00
Bill Wendling	a88946e21a	Remove tabs. llvm-svn: 160472	2012-07-19 00:01:00 +00:00
Bill Wendling	efe80cb87e	Remove tabs. llvm-svn: 160471	2012-07-18 23:58:37 +00:00
Jordan Rose	82632bffbc	Allow PointerIntPairs to be created from const void *. For a measure of safety, this conversion is only permitted if the stored pointer type can also be created from a const void *. llvm-svn: 160456	2012-07-18 21:58:49 +00:00
Simon Atanasyan	8856ef886a	Add some missed ELF constants definitions: - section types - dynamic table entries tags - state flags for DT_FLAGS_1 entry The patch reviewed by Rafael Espindola. llvm-svn: 160433	2012-07-18 14:12:32 +00:00
NAKAMURA Takumi	5f8d8eb692	Update config.h.cmake corresponding to config.h.in. llvm-svn: 160431	2012-07-18 09:17:02 +00:00
Andrew Trick	0d10225fa2	SCEVTraversal: Add a visited set. Expression trees may be DAGs. Make sure traversal has linear complexity. llvm-svn: 160426	2012-07-18 05:14:03 +00:00
Jakob Stoklund Olesen	6ca05ebd50	Fix broken ipo_ext_iterator constructors. These functions have obviously never been used before. They should be identical to the idf_ext_iterator counterparts. llvm-svn: 160381	2012-07-17 17:57:25 +00:00
Jakob Stoklund Olesen	c92bde7ba9	Allow for customized graph edge pruning in PostOrderIterator.h Make it possible to prune individual graph edges from a post-order traversal by specializing the po_iterator_storage template. Previously, it was only possible to prune full graph nodes. Edge pruning makes it possible to remove loop back-edges, for example. Also replace the existing DFSetTraits customization hook with a po_iterator_storage method for observing the post-order. DFSetTraits was only used by LoopIterator.h which now provides a po_iterator_storage specialization. Thanks to Sean and Chandler for reviewing. llvm-svn: 160366	2012-07-17 15:35:40 +00:00
Simon Atanasyan	bb02d8de47	Revert commit r160307. We decide to move builtins selection to the backend. llvm-svn: 160352	2012-07-17 08:14:45 +00:00
Jim Grosbach	514410ba07	TableGen: Allow conditional instruction pattern in multiclass. Define a 'null_frag' SDPatternOperator node, which if referenced in an instruction Pattern, results in the pattern being collapsed to be as-if '[]' had been specified instead. This allows supporting a multiclass definition where some instantiations have ISel patterns associated and others do not. For example, multiclass myMulti<RegisterClass rc, SDPatternOperator OpNode = null_frag> { def _x : myI<(outs rc:), (ins rc:), []>; def _r : myI<(outs rc:), (ins rc:), [(set rc:, (OpNode rc:))]>; } defm foo : myMulti<GRa, not>; defm bar : myMulti<GRb>; llvm-svn: 160333	2012-07-17 00:47:06 +00:00
Simon Atanasyan	ef2128c12c	MIPS: Create two definitions for __builtin_mips_shll_qb builtin. The first variant accepts immediate number as the second argument. The second variant accepts register operand as the second argument. llvm-svn: 160307	2012-07-16 18:51:39 +00:00
Tom Stellard	adf452260f	Revert "include/llvm: Add R600 Intrinsics v6" This reverts commit 600f7a90f3eef4c5108179b43e27cfd9e5de7cdc. llvm-svn: 160302	2012-07-16 18:19:48 +00:00
Tom Stellard	ee1812b94f	include/llvm: Add R600 Intrinsics v6 llvm-svn: 160271	2012-07-16 14:17:14 +00:00
Chandler Carruth	f5fe556c70	Add support for attaching branch weight metadata directly from the IRBuilder. Added a basic unit test for this with CreateCondBr. I didn't go all the way and test the switch side as the boilerplate for setting up the switch IRBuilder unit tests is a lot more. Fortunately, the two share all the interesting code paths. llvm-svn: 160251	2012-07-16 07:45:06 +00:00
Chandler Carruth	36e2ecf528	Move llvm/Support/TypeBuilder.h -> llvm/TypeBuilder.h. This completes the move of *Builder classes into the Core library. No uses of this builder in Clang or DragonEgg I could find. If there is a desire to have an IR-building-support library that contains all of these builders, that can be easily added, but currently it seems likely that these add no real overhead to VMCore. llvm-svn: 160243	2012-07-15 23:45:24 +00:00
Chandler Carruth	d9d363f8d7	Update the header guard I missed when moving the header. llvm-svn: 160242	2012-07-15 23:45:20 +00:00
Chandler Carruth	ec7ad6561f	Move llvm/Support/MDBuilder.h to llvm/MDBuilder.h, to live with IRBuilder, DIBuilder, etc. This is the proper layering as MDBuilder can't be used (or implemented) without the Core Metadata representation. Patches to Clang and Dragonegg coming up. llvm-svn: 160237	2012-07-15 23:26:50 +00:00
Nadav Rotem	a62368c965	Refactor the code that checks that all operands of a node are UNDEFs. Add a micro-optimization to getNode of CONCAT_VECTORS when both operands are undefs. Can't find a testcase for this because VECTOR_SHUFFLE already handles undef operands, but Duncan suggested that we add this. Together with Michael Kuperstein <michael.m.kuperstein@intel.com> llvm-svn: 160229	2012-07-15 08:38:23 +00:00
Eric Christopher	abb6ffd9b3	Move IsSameValue from clang's ASTImporter to be methods on the APInt/APSInt classes. Part of rdar://11875995 llvm-svn: 160223	2012-07-15 00:23:36 +00:00
Andrew Trick	653513b8dd	LSR Fix: check SCEV expression safety before expansion. All SCEV expressions used by LSR formulae must be safe to expand. i.e. they may not contain UDiv unless we can prove nonzero denominator. Fixes PR11356: LSR hoists UDiv. llvm-svn: 160205	2012-07-13 23:33:10 +00:00
Andrew Trick	365e31c36c	Factor SCEV traversal code so I can use it elsewhere. No functionality. llvm-svn: 160203	2012-07-13 23:33:03 +00:00
Galina Kistanova	8aded18c5d	Fixed few warnings. llvm-svn: 160192	2012-07-13 21:06:54 +00:00
Alexander Kornienko	73221f5624	Initializers for some fields were missing in Option::Option llvm-svn: 160170	2012-07-13 12:55:23 +00:00
Eric Christopher	54c39e0688	Regenerate. llvm-svn: 160134	2012-07-12 17:59:12 +00:00
Benjamin Kramer	0ab2794eda	Add intrinsics for Ivy Bridge's rdrand instruction. The rdrand/cmov sequence is the same that is emitted by both GCC and ICC. Fixes PR13284. llvm-svn: 160117	2012-07-12 09:31:43 +00:00
Chandler Carruth	2207f76cd4	Teach the LiveInterval::join function to use the fast merge algorithm, generalizing its implementation sufficiently to support this value number scenario as well. This cuts out another significant performance hit in large functions (over 10k basic blocks, etc), especially those with "natural" CFG structures. llvm-svn: 160026	2012-07-10 22:25:21 +00:00
Chad Rosier	97c2214277	Move [get|set]BasePtrStackAdjustment() from MachineFrameInfo to X86MachineFunctionInfo as this is currently only used by X86. If this ever becomes an issue on another arch (e.g., ARM) then we can hoist it back out. llvm-svn: 160009	2012-07-10 18:27:15 +00:00
Chad Rosier	bdb08ac50a	Add support for dynamic stack realignment in the presence of dynamic allocas on X86. Basically, this is a reapplication of r158087 with a few fixes. Specifically, (1) the stack pointer is restored from the base pointer before popping callee-saved registers and (2) in obscure cases (see comments in patch) we must cache the value of the original stack adjustment in the prologue and apply it in the epilogue. rdar://11496434 llvm-svn: 160002	2012-07-10 17:45:53 +00:00
Chandler Carruth	e18614dd17	Add an efficient merge operation to LiveInterval and use it to avoid quadratic behavior when performing pathological merges. Fixes the core element of PR12652. There is only one user of addRangeFrom left: join. I'm hoping to refactor further in a future patch and have join use this merge operation as well. llvm-svn: 159982	2012-07-10 05:16:17 +00:00
Chandler Carruth	ac766b9b42	Teach LiveIntervals how to verify themselves and start using it in some of the trick merge routines. This adds a layer of testing that was necessary when implementing more efficient (and complex) merge logic for this datastructure. No functionality changed here. llvm-svn: 159981	2012-07-10 05:06:03 +00:00
Jim Grosbach	700068206f	Allow intrinsics to be used in place of node matchables. TableGen has support for using an intrinsic's name directly in a DAG, but this breaks down when referring to just a node, as that's handled initializer list stuff entirely via subclassing in the parser. That is, using an intrinsic like "(int_my_intrinsic ...)" works fine. Using it standalone for parameterizing the operator in such a DAG does not. Fixing this is simple enough, as we simply declare Intrinsic as deriving from SDPatternOperator, which is the class name intended for exactly this purpose in TargetSelectionDAG.td. When the intrinsic is actually used in the DAG pattern, it will be recognized and expanded to an intrinsic_wo_chain (et al.) just like when it's used directly. Incoming ARM NEON cleanup based on this and a bit of functionality improvement after that. llvm-svn: 159973	2012-07-10 00:51:11 +00:00
Benjamin Kramer	a5e136b613	Remove some trivial copy ctors so the classes become trivially copyable and get the optimized SmallVector implementation. llvm-svn: 159916	2012-07-08 19:47:51 +00:00
Benjamin Kramer	c810a68923	SmallVector: Make use of move semantics to speed up moving objects in erase() and insert() llvm-svn: 159914	2012-07-08 12:06:35 +00:00
Andrew Trick	87255e340e	I'm introducing a new machine model to simultaneously allow simple subtarget CPU descriptions and support new features of MachineScheduler. MachineModel has three categories of data: 1) Basic properties for coarse grained instruction cost model. 2) Scheduler Read/Write resources for simple per-opcode and operand cost model (TBD). 3) Instruction itineraries for detailed per-cycle reservation tables. These will all live side-by-side. Any subtarget can use any combination of them. Instruction itineraries will not change in the near term. In the long run, I expect them to only be relevant for in-order VLIW machines that have complex constraints and require a precise scheduling/bundling model. Once itineraries are only actively used by VLIW-ish targets, they could be replaced by something more appropriate for those targets. This tablegen backend rewrite sets things up for introducing MachineModel type #2: per opcode/operand cost model. llvm-svn: 159891	2012-07-07 04:00:00 +00:00
Andrew Trick	91118a6155	whitespace llvm-svn: 159890	2012-07-07 03:59:51 +00:00
Andrew Trick	030e2f8f1a	Tweak spelling. llvm-svn: 159889	2012-07-07 03:59:48 +00:00
Chad Rosier	73b02825d0	Fix the naming of ensureAlignment. Per the coding standard function names should be camel case, and start with a lower case letter. llvm-svn: 159877	2012-07-06 23:13:38 +00:00
Bill Wendling	aa02e36fa8	Add a print method to the ObjC property object. llvm-svn: 159848	2012-07-06 19:12:31 +00:00
Dmitri Gribenko	aa4f47f266	Revert r159789. llvm-svn: 159834	2012-07-06 16:42:25 +00:00
NAKAMURA Takumi	b8c7dada33	llvm/include/llvm/CMakeLists.txt: Cut dependency to intrinsics_gen. llvm-svn: 159831	2012-07-06 15:55:39 +00:00
Dmitri Gribenko	d5200f1bc4	Enable new[] on llvm::BumpPtrAllocator. llvm-svn: 159789	2012-07-06 00:25:39 +00:00
Owen Anderson	00da236f7e	Fix an overzealous assertion. It is legitimate for a target to have multiple fixups on a single instruction that target the same byte, so long as their bit-offsets are coordinated appropriately. llvm-svn: 159785	2012-07-05 22:30:42 +00:00
Chandler Carruth	853d14b7b6	Remove dead infrastructure for building DenseMaps with a SlotIndex as the key -- they are now stored in an IntervalMap. I noticed this while looking into PR12652. llvm-svn: 159745	2012-07-05 11:40:23 +00:00
Chandler Carruth	264854f9a0	Finish fixing the MachineOperand hashing, providing a nice modern hash_value overload for MachineOperands. This addresses a FIXME sufficient for me to remove it, and cleans up the code nicely too. The important changes to the hashing logic: - TargetFlags are now included in all of the hashes. These were completely missed. - Register operands have their subregisters and whether they are a def included in the hash. - We now actually hash all of the operand types. Previously, many operand types were simply dropped on the floor. For example: - Floating point immediates - Large integer immediates (>64-bit) - External globals! - Register masks - Metadata operands - It removes the offset from the block-address hash; I'm a bit suspicious of this, but isIdenticalTo doesn't consider the offset for block addresses. Any patterns involving these entities could have triggered extreme slowdowns in MachineCSE or PHIElimination. Let me know if there are PRs you think might be closed now... I'm looking myself, but I may miss them. llvm-svn: 159743	2012-07-05 11:06:22 +00:00
Stepan Dyatkovskiy	a3b11bdbea	Reverted r159658: Optimized diff operation: implemented the case when LHS and RHS subsets contains single numbers only. llvm-svn: 159704	2012-07-04 06:07:06 +00:00
Stepan Dyatkovskiy	7ff588f986	Reverted r156659, due to probable performance regressions, DenseMap should be used here: IntegersSubsetMapping - Replaced type of Items field from std::list with std::map. In the near future I'll test it with DenseMap and do the corresponding replacement if possible. llvm-svn: 159703	2012-07-04 05:53:05 +00:00
Jakob Stoklund Olesen	f8a63a1507	Add an experimental early if-conversion pass, off by default. This pass performs if-conversion on SSA form machine code by speculatively executing both sides of the branch and using a cmov instruction to select the result. This can help lower the number of branch mispredictions on architectures like x86 that don't have predicable instructions. The current implementation is very aggressive, and causes regressions on most tests. It needs good heuristics that have yet to be implemented. llvm-svn: 159694	2012-07-04 00:09:54 +00:00
Nuno Lopes	9291ff4078	fold PHI nodes in SizeOffsetEvaluator whenever possible. Unfortunately this change requires the cache map to hold WeakVHs instead llvm-svn: 159667	2012-07-03 17:13:25 +00:00
Stepan Dyatkovskiy	9f3d5d6d5f	IntegersSubsetMapping: cosmetic changes in diff operation. llvm-svn: 159661	2012-07-03 14:29:26 +00:00
Stepan Dyatkovskiy	f2127fb741	Part of r159527. Splitted into series of patches and gone with fixed PR13256: IntegersSubsetMapping Added new methods - add(self& RHS, SuccessorClass *S) - detachCase - removeCase - findSuccessor - getCases - getCaseSingleNumber - isOverlapped llvm-svn: 159660	2012-07-03 14:15:36 +00:00
Stepan Dyatkovskiy	8b0c97e0dd	Part of r159527. Splitted into series of patches and gone with fixed PR13256: IntegersSubsetMapping - Replaced type of Items field from std::list with std::map. In the near future I'll test it with DenseMap and do the corresponding replacement if possible. llvm-svn: 159659	2012-07-03 13:46:45 +00:00
Stepan Dyatkovskiy	438ba5f0bd	Part of r159527. Splitted into series of patches and gone with fixed PR13256: Optimized diff operation: implemented the case when LHS and RHS subsets contains single numbers only. llvm-svn: 159658	2012-07-03 13:29:14 +00:00
Chandler Carruth	9f0e4a2f18	Micro-optimize this function a bit. This shrinks the generated code some, and allows the routine to be inlined into common callers. The various bits that hit this code in their hotpath seem slightly lower on the profile, but I can't really measure a performance improvement as everything seems to still be bottlenecked on likely cache misses. =/ llvm-svn: 159648	2012-07-03 07:16:13 +00:00
Eric Christopher	b65acc61a5	Revert "IntRange:" as it appears to be breaking self hosting. This reverts commit b2833d9dcba88c6f0520cad760619200adc0442c. llvm-svn: 159618	2012-07-02 23:22:21 +00:00
Evan Cheng	39e90029a2	Target option DisableJumpTables is a gross hack. Move it to TargetLowering instead. llvm-svn: 159611	2012-07-02 22:39:56 +00:00
David Blaikie	6b8b051b47	Fix -Wstring-conversion warning. Patch by Matt Beaumont-Gay. llvm-svn: 159583	2012-07-02 21:00:00 +00:00
Bob Wilson	cac3b90633	Extend TargetPassConfig to allow running only a subset of the normal passes. This is still a work in progress but I believe it is currently good enough to fix PR13122 "Need unit test driver for codegen IR passes". For example, you can run llc with -stop-after=loop-reduce to have it dump out the IR after running LSR. Serializing machine-level IR is not yet supported but we have some patches in progress for that. The plan is to serialize the IR to a YAML file, containing separate sections for the LLVM IR, machine-level IR, and whatever other info is needed. Chad suggested that we stash the stop-after pass in the YAML file and use that instead of the start-after option to figure out where to restart the compilation. I think that's a great idea, but since it's not implemented yet I put the -start-after option into this patch for testing purposes. llvm-svn: 159570	2012-07-02 19:48:45 +00:00
Bob Wilson	b9b693650a	Consistently use AnalysisID types in TargetPassConfig. This makes it possible to just use a zero value to represent "no pass", so the phony NoPassID global variable is no longer needed. llvm-svn: 159568	2012-07-02 19:48:37 +00:00
Bob Wilson	bbd38dd9c0	Add all codegen passes to the PassManager via TargetPassConfig. This is a preliminary step toward having TargetPassConfig be able to start and stop the compilation at specified passes for unit testing and debugging. No functionality change. llvm-svn: 159567	2012-07-02 19:48:31 +00:00
Bob Wilson	36e31cca78	Add a missing forward declaration of PassManagerBase. llvm-svn: 159566	2012-07-02 19:48:18 +00:00
Andrew Trick	f161e391f8	Reapply "Make NumMicroOps a variable in the subtarget's instruction itinerary." Reapplies r159406 with minor cleanup. The regressions appear to have been spurious. llvm-svn: 159541	2012-07-02 18:10:42 +00:00
Stepan Dyatkovskiy	1698d50aac	Fixed switch in IntRange::isSingleNumber method. llvm-svn: 159540	2012-07-02 17:42:46 +00:00
Stepan Dyatkovskiy	0373bbc8d6	IntRange, fixed warning in isSingleNumber method llvm-svn: 159532	2012-07-02 14:10:46 +00:00
Stepan Dyatkovskiy	8b9ecca42d	IntRange: - Changed isSingleNumber method behaviour. Now this flag is calculated on demand. IntegersSubsetMapping - Optimized diff operation. - Replaced type of Items field from std::list with std::map. - Added new methods: bool isOverlapped(self &RHS) void add(self& RHS, SuccessorClass *S) void detachCase(self& NewMapping, SuccessorClass *Succ) void removeCase(SuccessorClass *Succ) SuccessorClass *findSuccessor(const IntTy& Val) const IntTy* getCaseSingleNumber(SuccessorClass *Succ) IntegersSubsetTest - DiffTest: Added checks for successors. SimplifyCFG Updated SwitchInst usage (now it is case-ranges compatible) for - SimplifyEqualityComparisonWithOnlyPredecessor - FoldValueComparisonIntoPredecessors llvm-svn: 159527	2012-07-02 13:02:18 +00:00
Alexey Samsonov	f4462fa3ca	This patch extends the libLLVMDebugInfo which contains a minimalistic DWARF parser: 1) DIContext is now able to return function name for a given instruction address (besides file/line info). 2) llvm-dwarfdump accepts flag --functions that prints the function name (if address is specified by --address flag). 3) test case that checks the basic functionality of llvm-dwarfdump added llvm-svn: 159512	2012-07-02 05:54:45 +00:00
Benjamin Kramer	6846d47abb	Avoid sign compare warning. llvm-svn: 159481	2012-06-30 10:02:08 +00:00
Manman Ren	6fa76dc0e0	Add SrcReg2 to analyzeCompare and optimizeCompareInstr to handle Compare instructions with two register operands. llvm-svn: 159465	2012-06-29 21:33:59 +00:00
Manman Ren	c146589aa4	Add getUniqueVRegDef to MachineRegisterInfo. This comes in handy during peephole optimization. llvm-svn: 159453	2012-06-29 19:16:05 +00:00
Chandler Carruth	aafe0918bc	Move llvm/Support/IRBuilder.h -> llvm/IRBuilder.h This was always part of the VMCore library out of necessity -- it deals entirely in the IR. The .cpp file in fact was already part of the VMCore library. This is just a mechanical move. I've tried to go through and re-apply the coding standard's preferred header sort, but at 40-ish files, I may have gotten some wrong. Please let me know if so. I'll be committing the corresponding updates to Clang and Polly, and Duncan has DragonEgg. Thanks to Bill and Eric for giving the green light for this bit of cleanup. llvm-svn: 159421	2012-06-29 12:38:19 +00:00
Bill Wendling	f799efdedc	The DIBuilder class is just a wrapper around debug info creation (a.k.a. MDNodes). The module doesn't belong in Analysis. Move it to the VMCore instead. llvm-svn: 159414	2012-06-29 08:32:07 +00:00
Andrew Trick	51a8cf77b8	Revert "Make NumMicroOps a variable in the subtarget's instruction itinerary." This reverts commit r159406. I noticed a performance regression so I'll back out for now. llvm-svn: 159411	2012-06-29 07:10:41 +00:00
Andrew Trick	ce27bb999d	misched: count micro-ops toward the issue limit. llvm-svn: 159407	2012-06-29 03:23:22 +00:00
Andrew Trick	1f50152b2d	Make NumMicroOps a variable in the subtarget's instruction itinerary. The TargetInstrInfo::getNumMicroOps API does not change, but soon it will be used by MachineScheduler. Now each subtarget can specify the number of micro-ops per itinerary class. For ARM, this is currently always dynamic (-1), because it is used for load/store multiple which depends on the number of register operands. Zero is now a valid number of micro-ops. This can be used for nop pseudo-instructions or instructions that the hardware can squash during dispatch. llvm-svn: 159406	2012-06-29 03:23:18 +00:00
Manman Ren	98a5bf24a9	X86: add more GATHER intrinsics in LLVM Corrected type for index of llvm.x86.avx2.gather.d.pd.256 from 256-bit to 128-bit. Corrected types for src|dst|mask of llvm.x86.avx2.gather.q.ps.256 from 256-bit to 128-bit. Support the following intrinsics: llvm.x86.avx2.gather.d.q, llvm.x86.avx2.gather.q.q llvm.x86.avx2.gather.d.q.256, llvm.x86.avx2.gather.q.q.256 llvm.x86.avx2.gather.d.d, llvm.x86.avx2.gather.q.d llvm.x86.avx2.gather.d.d.256, llvm.x86.avx2.gather.q.d.256 llvm-svn: 159402	2012-06-29 00:54:20 +00:00
Nuno Lopes	ec9653b363	add a new @llvm.donothing intrinsic that, well, does nothing, and teach CodeGen to ignore calls to it llvm-svn: 159383	2012-06-28 22:30:12 +00:00
Benjamin Kramer	17523ebbff	Fix hexagon gcc builtin names to use '_' instead of '.'. This way the generated GCC builtin to LLVM intrinsic converter actually works. llvm-svn: 159370	2012-06-28 20:08:47 +00:00
Simon Atanasyan	7f3bdb37da	Define MIPS DSP Rev1 intrinsics. That allows frontend to emit a correct IR. This patch was reviewed in the llvm-commits list by Jim Grosbach. llvm-svn: 159364	2012-06-28 18:20:28 +00:00
Nuno Lopes	181d67ecb1	MemoryBuiltins: - recognize C++ new(std::nothrow) friends - ignore ExtractElement and ExtractValue instructions in size/offset analysis (all easy cases are probably folded away before we get here) - also recognize realloc as noalias llvm-svn: 159356	2012-06-28 16:34:03 +00:00
Nuno Lopes	5020db2a8c	add ConstantRange::difference (to perform set difference/relative complement) llvm-svn: 159352	2012-06-28 16:10:13 +00:00
Benjamin Kramer	92658b8149	Devirtualize DIScope and subclasses. Nothing in here makes use of the virtuality. llvm-svn: 159349	2012-06-28 14:25:45 +00:00
Hal Finkel	f2dcb9a9c4	Allow BBVectorize to form non-2^n-length vectors. The original algorithm only used recursive pair fusion of equal-length types. This is now extended to allow pairing of any types that share the same underlying scalar type. Because we would still generally prefer the 2^n-length types, those are formed first. Then a second set of iterations form the non-2^n-length types. Also, a call to SimplifyInstructionsInBlock has been added after each pairing iteration. This takes care of DCE (and a few other things) that make the following iterations execute somewhat faster. For the same reason, some of the simple shuffle-combination cases are now handled internally. There is some additional refactoring work to be done, but I've had many requests for this feature, so additional refactoring will come soon in future commits (as will additional test cases). llvm-svn: 159330	2012-06-28 05:42:42 +00:00
Hal Finkel	74e5225c92	Refactor operation equivalence checking in BBVectorize by extending Instruction::isSameOperationAs. Maintaining this kind of checking in different places is dangerous, extending Instruction::isSameOperationAs consolidates this logic into one place. Here I've added an optional flags parameter and two flags that are important for vectorization: CompareIgnoringAlignment and CompareUsingScalarTypes. llvm-svn: 159329	2012-06-28 05:42:26 +00:00
Bill Wendling	e38859dc8e	Move lib/Analysis/DebugInfo.cpp to lib/VMCore/DebugInfo.cpp and include/llvm/Analysis/DebugInfo.h to include/llvm/DebugInfo.h. The reasoning is because the DebugInfo module is simply an interface to the debug info MDNodes and has nothing to do with analysis. llvm-svn: 159312	2012-06-28 00:05:13 +00:00
Jack Carter	8ad0c272af	The ELF relocation record format is different for N64 which many Mips 64 ABIs use than for O64 which many if not all other target ABIs use. Most architectures have the following 64 bit relocation record format: typedef struct { Elf64_Addr r_offset; /* Address of reference */ Elf64_Xword r_info; /* Symbol index and type of relocation */ } Elf64_Rel; typedef struct { Elf64_Addr r_offset; Elf64_Xword r_info; Elf64_Sxword r_addend; } Elf64_Rela; Whereas N64 has the following format: typedef struct { Elf64_Addr r_offset; /* Address of reference */ Elf64_Word r_sym; /* Symbol index */ Elf64_Byte r_ssym; /* Special symbol */ Elf64_Byte r_type3; /* Relocation type */ Elf64_Byte r_type2; /* Relocation type */ Elf64_Byte r_type; /* Relocation type */ } Elf64_Rel; typedef struct { Elf64_Addr r_offset; /* Address of reference */ Elf64_Word r_sym; /* Symbol index */ Elf64_Byte r_ssym; /* Special symbol */ Elf64_Byte r_type3; /* Relocation type */ Elf64_Byte r_type2; /* Relocation type */ Elf64_Byte r_type; /* Relocation type */ Elf64_Sxword r_addend; } Elf64_Rela; The structure is the same size, but the r_info data element is now 5 separate elements. Besides the content aspects, endian byte reordering will be different for the area with each element being endianized separately. I treat this as generic and continue to pass r_type as an integer masking and unmasking the byte sized N64 values for N64 mode. I've implemented this and it has no effect on other current targets. This passes make check. Jack llvm-svn: 159299	2012-06-27 22:28:30 +00:00
Matt Beaumont-Gay	a58862310c	Revert r159136 due to PR13124. Original commit message: If a constant or a function has linkonce_odr linkage and unnamed_addr, mark it hidden. Being linkonce_odr guarantees that it is available in every dso that needs it. Being a constant/function with unnamed_addr guarantees that the copies don't have to be merged. llvm-svn: 159272	2012-06-27 17:10:33 +00:00
Bill Wendling	e02a1f8cf2	Revamp how debugging information is emitted for debug info objects. It's not necessary for each DI class to have its own copy of `print' and `dump'. Instead, just give DIDescriptor those methods and have it call the appropriate debugging printing routine based on the type of the debug information. llvm-svn: 159237	2012-06-26 22:57:33 +00:00
Manman Ren	a09820414a	X86: add GATHER intrinsics (AVX2) in LLVM Support the following intrinsics: llvm.x86.avx2.gather.d.pd, llvm.x86.avx2.gather.q.pd llvm.x86.avx2.gather.d.pd.256, llvm.x86.avx2.gather.q.pd.256 llvm.x86.avx2.gather.d.ps, llvm.x86.avx2.gather.q.ps llvm.x86.avx2.gather.d.ps.256, llvm.x86.avx2.gather.q.ps.256 Modified Disassembler to handle VSIB addressing mode. llvm-svn: 159221	2012-06-26 19:47:59 +00:00
Jakob Stoklund Olesen	59a0d3243b	Allow targets to inject passes before the virtual register rewriter. Such passes can be used to tweak the register assignments in a target-dependent way, for example to avoid write-after-write dependencies. llvm-svn: 159209	2012-06-26 17:09:29 +00:00
Stepan Dyatkovskiy	e481e0daf4	IntegersSubsetMapping: implemented "diff" operation. The operation allows performing up to three operations at the same time: - LHS exclude RHS - LHS intersect RHS (LHS successors will be kept) - RHS exclude LHS The complexity is N+M, where N is the size of LHS and M is the size of RHS. llvm-svn: 159201	2012-06-26 11:57:43 +00:00
Stepan Dyatkovskiy	883850c4d2	IntegersSubsetMapping: removed exclude operation, it will be replaced with a more universal "diff" operation in the next commit. The changes were separated into two commits for better readability. llvm-svn: 159200	2012-06-26 11:41:47 +00:00
Andrew Trick	fb2ba3e1cb	Enable the new LoopInfo algorithm by default. The primary advantage is that loop optimizations will be applied in a stable order. This helps debugging and unit test creation. It is also a better overall implementation without pathologically bad performance on deep functions. On large functions (llvm-stress --size=200000 | opt -loops) Before: 0.1263s After: 0.0225s On deep functions (after tweaking llvm-stress, thanks Nadav): Before: 0.2281s After: 0.0227s See r158790 for more comments. The loop tree is now consistently generated in forward order, but loop passes are applied in reverse order over the program. If we have a loop optimization that prefers forward order, that can easily be achieved by adding a different type of LoopPassManager. llvm-svn: 159183	2012-06-26 04:11:38 +00:00
Owen Anderson	c272bab5a5	Define DAGOperand, an empty base class for RegisterClass and Operand. This allows one to write multiclasses that are polymorphic over both registers and non-register operands. llvm-svn: 159162	2012-06-25 21:25:16 +00:00
Nuno Lopes	490096c8fb	add CallSite/CallInst/InvokeInst::hasFnAttr() llvm-svn: 159144	2012-06-25 16:16:58 +00:00
Rafael Espindola	540c3d23df	If a constant or a function has linkonce_odr linkage and unnamed_addr, mark it hidden. Being linkonce_odr guarantees that it is available in every dso that needs it. Being a constant/function with unnamed_addr guarantees that the copies don't have to be merged. llvm-svn: 159136	2012-06-25 14:30:31 +00:00
Eli Bendersky	f0ad3606c7	The name (and comment describing) of llvm::GetFirstDebugLocInBasicBlock no longer represents what the function does. Therefore, the function is removed and its functionality is folded into the only place in the code-base where it was being used. llvm-svn: 159133	2012-06-25 10:13:14 +00:00
Chandler Carruth	00bef3fff1	Just remove generic support for C++11 alignas -- GCC is already advertising complete support w/o alignas implemented, and its implementation of alignas in the latest versions is so convoluted as to be unusable. llvm-svn: 159125	2012-06-25 05:20:13 +00:00
Hal Finkel	3099ce9489	Allow controlling vectorization of boolean values separately from other integer types. These are used as the result of comparisons, and often handled differently from larger integer types. llvm-svn: 159111	2012-06-24 13:28:01 +00:00
NAKAMURA Takumi	c707c253ad	llvm/Support/IntegersSubset.h: Add a copy constructor on IntegersSubset to appease msvc. msvc mis-infers ParentTy(RHS) to (const RangesCollectionTy &). llvm-svn: 159101	2012-06-24 03:48:53 +00:00
NAKAMURA Takumi	37d96f0862	llvm/Support/IntegersSubset.h: Fix whitespace. llvm-svn: 159100	2012-06-24 03:48:47 +00:00
Hal Finkel	4b06b1a0ee	Allow BBVectorize to fuse compare instructions. llvm-svn: 159088	2012-06-23 21:52:50 +00:00
Marshall Clow	78ade1dd08	Add relocation types for Hexagon processor; patch by Sidney Manning <sidneym@codeaurora.org> llvm-svn: 159081	2012-06-23 14:46:18 +00:00
Hans Wennborg	ac9fb36c31	Clean-up after r159077. Remove temporary GlobalVariable constructors now that Clang has been updated (r159078). llvm-svn: 159079	2012-06-23 12:14:23 +00:00
Hans Wennborg	cbe34b4cc9	Extend the IL for selecting TLS models (PR9788) This allows the user/front-end to specify a model that is better than what LLVM would choose by default. For example, a variable might be declared as @x = thread_local(initialexec) global i32 42 if it will not be used in a shared library that is dlopen'ed. If the specified model isn't supported by the target, or if LLVM can make a better choice, a different model may be used. llvm-svn: 159077	2012-06-23 11:37:03 +00:00
Stepan Dyatkovskiy	8e00efeace	Optimized usage of new SwitchInst case values (IntegersSubset type) in Local.cpp, Execution.cpp and BitcodeWriter.cpp. I got about 1% of compile-time improvement on my machines (Ubuntu 11.10 i386 and Ubuntu 12.04 x64). llvm-svn: 159076	2012-06-23 10:58:58 +00:00
Jim Grosbach	3a8a0fa8e6	TableGen: AsmMatcher support for better operand diagnostics. "Invalid operand" may be a completely correct diagnostic, but it's often insufficiently specific to really help identify and fix the problem in assembly source. Allow a target to specify a more-specific diagnostic kind for each AsmOperandClass derived definition and use that to provide more detailed diagnostics when an operand of that class resulted in a match failure. rdar://8987109 llvm-svn: 159050	2012-06-22 23:56:44 +00:00
Jakob Stoklund Olesen	a127fc780a	Remove ProcessImplicitDefs.h which was unused. The ProcessImplicitDefs class can be local to its implementation file. llvm-svn: 159041	2012-06-22 22:27:36 +00:00
Jakob Stoklund Olesen	4fa84ba8b9	Delete a boring statistic. llvm-svn: 159030	2012-06-22 20:40:15 +00:00
Jakob Stoklund Olesen	c61edda0ab	Store live intervals in an IndexedMap. It is both smaller and faster than DenseMap. llvm-svn: 159029	2012-06-22 20:37:52 +00:00
Hal Finkel	8db5547252	Revert r158679 - use case is unclear (and it increases the memory footprint). Original commit message: Allow up to 64 functional units per processor itinerary. This patch changes the type used to hold the FU bitset from unsigned to uint64_t. This will be needed for some upcoming PowerPC itineraries. llvm-svn: 159027	2012-06-22 20:27:13 +00:00
Evan Cheng	f5bd6c6510	EmitZerofill should take a 64-bit size or else it's chopping off large zero-filled global. rdar://11729134 llvm-svn: 159023	2012-06-22 20:14:46 +00:00
Jakob Stoklund Olesen	37e797fedc	Stop computing physreg live ranges. Everyone is using on-demand regunit ranges now. llvm-svn: 159018	2012-06-22 18:20:50 +00:00
Kaelyn Uhrain	e209570e8f	Remove a variable that is unused when assertions aren't enabled. llvm-svn: 159011	2012-06-22 17:18:15 +00:00
Jakob Stoklund Olesen	b1b3e4aa58	Remove LiveIntervals::trackingRegUnits(). With regunit liveness permanently enabled, this function would always return true. Also remove now obsolete code for checking physreg interference. llvm-svn: 159006	2012-06-22 16:46:44 +00:00
Dmitri Gribenko	1fed006a4d	Change comment into proper Doxygen member comment. llvm-svn: 159000	2012-06-22 16:00:48 +00:00
Stepan Dyatkovskiy	a6c8cc307b	Fixed r158979. Original message: Performance optimizations: - SwitchInst: case values stored separately from Operands List. It allows to make faster access to individual case value numbers or ranges. - Optimized IntItem, added APInt value caching. - Optimized IntegersSubsetGeneric: added optimizations for cases when subset is single number or when subset consists from single numbers only. llvm-svn: 158997	2012-06-22 14:53:30 +00:00
Rafael Espindola	ea59166190	Remove another duplicated variable. We only need one to tell us if the linker knows dwarf or not. llvm-svn: 158993	2012-06-22 13:32:49 +00:00
Rafael Espindola	d7bdaf5795	Fix a FIXME: DwarfRequiresRelocationForSectionOffset is the same as DwarfUsesRelocationsAcrossSections. llvm-svn: 158992	2012-06-22 13:24:07 +00:00
Duncan Sands	83884a1042	Revert commit 158979 (dyatkovskiy) since it is causing several buildbots to fail. Original commit message: Performance optimizations: - SwitchInst: case values stored separately from Operands List. It allows to make faster access to individual case value numbers or ranges. - Optimized IntItem, added APInt value caching. - Optimized IntegersSubsetGeneric: added optimizations for cases when subset is single number or when subset consists from single numbers only. On my machine these optimizations gave about 4-6% of compile-time improvement. llvm-svn: 158986	2012-06-22 10:35:06 +00:00
Stepan Dyatkovskiy	fcfa633bf8	Performance optimizations: - SwitchInst: case values stored separately from Operands List. It allows to make faster access to individual case value numbers or ranges. - Optimized IntItem, added APInt value caching. - Optimized IntegersSubsetGeneric: added optimizations for cases when subset is single number or when subset consists from single numbers only. On my machine these optimizations gave about 4-6% of compile-time improvement. llvm-svn: 158979	2012-06-22 07:35:13 +00:00
Andrew Trick	9c302673b2	Use "NoItineraries" for processors with no itineraries. This makes it explicit when ScoreboardHazardRecognizer will be used. "GenericItineraries" would only make sense if it contained real itinerary values and still required ScoreboardHazardRecognizer. llvm-svn: 158963	2012-06-22 03:58:51 +00:00
Nick Lewycky	33da33676f	Emit relocations for DW_AT_location entries on systems which need it. This is a recommit of r127757. Fixes PR9493. Patch by Paul Robinson! llvm-svn: 158957	2012-06-22 01:25:12 +00:00
Lang Hames	b8650f106a	Rename -allow-excess-fp-precision flag to -fuse-fp-ops, and switch from a boolean flag to an enum: { Fast, Standard, Strict } (default = Standard). This option controls the creation by optimizations of fused FP ops that store intermediate results in higher precision than IEEE allows (E.g. FMAs). The behavior of this option is intended to match the behaviour specified by a soon-to-be-introduced frontend flag: '-ffuse-fp-ops'. Fast mode - allows formation of fused FP ops whenever they're profitable. Standard mode - allow fusion only for 'blessed' FP ops. At present the only blessed op is the fmuladd intrinsic. In the future more blessed ops may be added. Strict mode - allow fusion only if/when it can be proven that the excess precision won't affect the result. Note: This option only controls formation of fused ops by the optimizers. Fused operations that are explicitly requested (e.g. FMA via the llvm.fma.* intrinsic) will always be honored, regardless of the value of this option. Internally TargetOptions::AllowExcessFPPrecision has been replaced by TargetOptions::AllowFPOpFusion. llvm-svn: 158956	2012-06-22 01:09:09 +00:00
Nuno Lopes	9792d68381	remove extractMallocCallFromBitCast, since it was tailor-made for its sole user. Update GlobalOpt accordingly. llvm-svn: 158952	2012-06-22 00:25:01 +00:00
Nuno Lopes	dc6085e52d	Add support for invoke to the MemoryBuiltin analysis. Update comments accordingly. Make instcombine remove useless invokes to C++'s 'new' allocation function (test attached). llvm-svn: 158937	2012-06-21 21:25:05 +00:00
Nuno Lopes	beee2f10e0	move some typedefs so that we don't pollute the llvm namespace. this should appease the GCC buildbots llvm-svn: 158924	2012-06-21 16:58:41 +00:00
Nuno Lopes	55fff83422	refactor the MemoryBuiltin analysis: - provide more extensive set of functions to detect library allocation functions (e.g., malloc, calloc, strdup, etc) - provide an API to compute the size and offset of an object pointed to. Move a few clients (GVN, AA, instcombine, ...) to the new API. This implementation is a lot more aggressive than each of the custom implementations being replaced. Patch reviewed by Nick Lewycky and Chandler Carruth, thanks. llvm-svn: 158919	2012-06-21 15:45:28 +00:00
Nadav Rotem	4e9012c2b1	Add a number of threshold arguments to the SRA pass. A patch by Tom Stellard with minor changes. llvm-svn: 158918	2012-06-21 13:44:31 +00:00
Jakob Stoklund Olesen	88ef957e74	Remove LiveIntervals::iterator. Live intervals for regunits and virtual registers are stored separately, and physreg live intervals are going away. To visit the live ranges of all virtual registers, use this pattern instead: for (unsigned i = 0, e = MRI->getNumVirtRegs(); i != e; ++i) { unsigned Reg = TargetRegisterInfo::index2VirtReg(i); if (MRI->reg_nodbg_empty(Reg)) continue; llvm-svn: 158879	2012-06-20 23:54:20 +00:00
Jakob Stoklund Olesen	1911a0203d	Remove the RenderMachineFunction HTML output pass. I don't think anyone has been using this functionality for a while, and it is getting in the way of refactoring now. llvm-svn: 158876	2012-06-20 23:47:58 +00:00
Andrew Trick	2ed08a7cc1	Restructure PopulateLoopsDFS::insertIntoLoop. As Nadav pointed out the first implementation was obscure. llvm-svn: 158862	2012-06-20 22:18:33 +00:00
Andrew Trick	dbb7ae54b8	Add "extern template" declarations now that we use explicit instantiation. This is supported by gcc and clang, but guarded by a macro for MSVC 2008. The extern template declaration is not necessary but generally good form. It can avoid extra instantiations of the template methods defined inline. The EXTERN_TEMPLATE_INSTANTIATION macro could probably be generalized to handle multiple template parameters if someone thinks it's worthwhile. llvm-svn: 158840	2012-06-20 20:17:20 +00:00
Jakob Stoklund Olesen	833308d785	Only update regunit live ranges that have been precomputed. Regunit live ranges are computed on demand, so when mi-sched calls handleMove, some regunits may not have live ranges yet. That makes updating them easier: Just skip the non-existing ranges. They will be computed correctly from the rescheduled machine code when they are needed. llvm-svn: 158831	2012-06-20 18:00:57 +00:00
Chandler Carruth	5c0997f066	Remove 'static' from inline functions defined in header files. There is a pretty staggering amount of this in LLVM's header files, this is not all of the instances I'm afraid. These include all of the functions that (in my build) are used by a non-static inline (or external) function. Specifically, these issues were caught by the new '-Winternal-linkage-in-inline' warning. I'll try to just clean up the remainder of the clearly redundant "static inline" cases on functions (not methods!) defined within headers if I can do so in a reliable way. There were even several cases of a missing 'inline' altogether, or my personal favorite "static bool inline". Go figure. ;] llvm-svn: 158800	2012-06-20 08:39:33 +00:00
Andrew Trick	ff2ed7b687	A new algorithm for computing LoopInfo. Temporarily disabled. -stable-loops enables a new algorithm for generating the Loop forest. It differs from the original algorithm in a few respects: - Not determined by use-list order. - Initially guarantees RPO order of block and subloops. - Linear in the number of CFG edges. - Nonrecursive. I didn't want to change the LoopInfo API yet, so the block lists are still inclusive. This seems strange to me, and it means that building LoopInfo is not strictly linear, but it may not be a problem in practice. At least the block lists start out in RPO order now. In the future we may add an attribute or wrapper analysis that allows other passes to assume RPO order. The primary motivation of this work was not to optimize LoopInfo, but to allow reproducing performance issues by decomposing the compilation stages. I'm often unable to do this with the current LoopInfo, because the loop tree order determines Loop pass order. Serializing the IR tends to invert the order, which reverses the optimization order. This makes it nearly impossible to debug interdependent loop optimizations such as LSR. I also believe this will provide more stable performance results across time. llvm-svn: 158790	2012-06-20 05:23:33 +00:00
Andrew Trick	cda51d430d	Move the implementation of LoopInfo into LoopInfoImpl.h. The implementation only needs inclusion from LoopInfo.cpp and MachineLoopInfo.cpp. Clients of the interface should only include the interface. This makes the interface readable and speeds up rebuilds after modifying the implementation. llvm-svn: 158787	2012-06-20 03:42:09 +00:00
Nick Kledzik	18497e9242	Add permissions(), map_file_pages(), and unmap_file_pages() to llvm::sys::fs and add unit test. Unix is implemented. Windows side needs to be implemented. llvm-svn: 158770	2012-06-20 00:28:54 +00:00
Chad Rosier	7369692790	Add an ensureMaxAlignment() function to MachineFrameInfo (analogous to ensureAlignment() in MachineFunction). Also, drop setMaxAlignment() in favor of this new function. This creates a main entry point to setting MaxAlignment, which will be helpful for future work. No functionality change intended. llvm-svn: 158758	2012-06-19 22:59:12 +00:00
Lang Hames	39fb1d08dc	Add DAG-combines for aggressive FMA formation. This patch adds DAG combines to form FMAs from pairs of FADD + FMUL or FSUB + FMUL. The combines are performed when: (a) Either AllowExcessFPPrecision option (-enable-excess-fp-precision for llc) OR UnsafeFPMath option (-enable-unsafe-fp-math) are set, and (b) TargetLoweringInfo::isFMAFasterThanMulAndAdd(VT) is true for the type of the FADD/FSUB, and (c) The FMUL only has one user (the FADD/FSUB). If your target has fast FMA instructions you can make use of these combines by overriding TargetLoweringInfo::isFMAFasterThanMulAndAdd(VT) to return true for types supported by your FMA instruction, and adding patterns to match ISD::FMA to your FMA instructions. llvm-svn: 158757	2012-06-19 22:51:23 +00:00
Chad Rosier	bb335c96f9	Typo. Patch by Cameron McInally <cameron.mcinally@nyu.edu>. llvm-svn: 158754	2012-06-19 22:28:18 +00:00
Rafael Espindola	ca3e0ee8b3	Move the support for using .init_array from ARM to the generic TargetLoweringObjectFileELF. Use this to support it on X86. Unlike ARM, on X86 it is not easy to find out if .init_array should be used or not, so the decision is made via TargetOptions and defaults to off. Add a command line option to llc that enables it. llvm-svn: 158692	2012-06-19 00:48:28 +00:00
Nuno Lopes	f9abcb7ba9	revert r158660, since Chris has some issues with this patch (namely using code to represent information only used by the compiler) Original commit msg: add the 'alloc' metadata node to represent the size of offset of buffers pointed to by pointers. This metadata can be attached to any instruction returning a pointer llvm-svn: 158688	2012-06-18 23:34:26 +00:00
David Blaikie	2417379c63	Don't copy a potentially-uninitialized variable. Based on review discussion of r158638 with Chandler Carruth, Tobias von Koch, and Duncan Sands and a -Wmaybe-uninitialized warning from GCC. llvm-svn: 158685	2012-06-18 22:31:28 +00:00
Hal Finkel	8eac009633	Allow up to 64 functional units per processor itinerary. This patch changes the type used to hold the FU bitset from unsigned to uint64_t. This will be needed for some upcoming PowerPC itineraries. llvm-svn: 158679	2012-06-18 21:08:18 +00:00
Marshall Clow	d3e2a76ca4	Added accessors for getting coff_relocation info llvm-svn: 158675	2012-06-18 19:47:16 +00:00
Nuno Lopes	b7c941bad9	add the 'alloc' metadata node to represent the size of offset of buffers pointed to by pointers. This metadata can be attached to any instruction returning a pointer llvm-svn: 158660	2012-06-18 16:04:04 +00:00
Benjamin Kramer	23a9c3e090	Bring the return value of SmallVector::insert in line with std::vector::insert. It always returns the iterator for the first inserted element, or the passed in iterator if the inserted range was empty. Flesh out the unit test more and fix all the cases it uncovered so far. llvm-svn: 158645	2012-06-17 12:46:13 +00:00
Chandler Carruth	b539b47793	Remove SmallMap, and the several files that were used to implement it. We have SmallDenseMap now that has more correct and predictable semantics, even though it is a more narrow abstraction. llvm-svn: 158644	2012-06-17 12:07:42 +00:00
Benjamin Kramer	371b9b0e99	SmallVector: return a valid iterator for the rare case of inserting an empty range into a SmallVector. Patch by Johannes Schaub! llvm-svn: 158643	2012-06-17 11:52:22 +00:00
Chandler Carruth	4de807a552	Add a unit test for 'swap', and fix a pile of bugs in SmallDenseMap::swap. First, make it parse cleanly. Yay for uninstantiated methods. Second, make the inline-buckets case work correctly. This is way trickier than it should be due to the uninitialized values in empty and tombstone buckets. Finally fix a few typos that caused construction/destruction mismatches in the counting unittest. llvm-svn: 158641	2012-06-17 11:28:13 +00:00
Chandler Carruth	a1be842f98	Add tests for *DenseMap for both key and value types' construction and destruction and fix a bug in SmallDenseMap they caught. This is kind of a poor-man's version of the testing that just adds the addresses to a set on construction and removes them on destruction. We check that double construction and double destruction don't occur. Amusingly enough, this is enough to catch a lot of SmallDenseMap issues because we spend a lot of time with fixed stable addresses in the inline buffer. The SmallDenseMap bug fix included makes grow() not double-destroy in some cases. It also fixes a FIXME there, the code was pretty crappy. We now don't have any wasted initialization, but we do move the entries in inline bucket array an extra time. It's probably a better tradeoff, and is much easier to get correct. llvm-svn: 158639	2012-06-17 10:33:51 +00:00
Chandler Carruth	20dd838ae5	Introduce a SmallDenseMap container that re-uses the existing DenseMap implementation. This type includes an inline bucket array which is used initially. Once it is exceeded, an array of 64 buckets is allocated on the heap. The bucket count grows from there as needed. Some highlights of this implementation: - The inline buffer is very carefully aligned, and so supports types with alignment constraints. - It works hard to avoid aliasing issues. - Supports types with non-trivial constructors, destructors, copy constructions, etc. It works reasonably hard to minimize copies and unnecessary initialization. The most common initialization is to set keys to the empty key, and so that should be fast if at all possible. This class has a performance / space trade-off. It tries to optimize for relatively small maps, and so packs the inline bucket array densely into the object. It will be marginally slower than a normal DenseMap in a few use patterns, so it isn't appropriate everywhere. The unit tests for DenseMap have been generalized a bit to support running over different map implementations in addition to different key/value types. They've then been automatically extended to cover the new container through the magic of GoogleTest's typed tests. All of this is still a bit rough though. I'm going to be cleaning up some aspects of the implementation, documenting things better, and adding tests which include non-trivial types. As soon as I'm comfortable with the correctness, I plan to switch existing users of SmallMap over to this class as it is already more correct w.r.t. construction and destruction of objects in the map. Thanks to Benjamin Kramer for all the reviews of this and the lead-up patches. That said, more review on this would really be appreciated. As I've noted a few times, I'm quite surprised how hard it is to get the semantics for a hashtable-based map container with a small buffer optimization correct. =] llvm-svn: 158638	2012-06-17 09:05:09 +00:00
Benjamin Kramer	b9f84bb0ce	Guard private fields that are unused in Release builds with #ifndef NDEBUG. llvm-svn: 158608	2012-06-16 21:48:13 +00:00
Hal Finkel	16ddd4b66b	Move the Metadata merging methods from GVN and make them public in MDNode. There are other passes, BBVectorize specifically, that also need some of this functionality. llvm-svn: 158605	2012-06-16 20:33:37 +00:00
Benjamin Kramer	34d6a9e57a	Merge the SmallBitVector and BitVector unit tests with gtest's typed test magic and bring SmallBitVector up to date. llvm-svn: 158600	2012-06-16 10:51:07 +00:00
Chandler Carruth	dea00d7c5a	Add support to the alignment support header for conjuring a character array of a suitable size and alignment for any of a number of different types to be stored into the character array. The mechanisms for producing an explicitly aligned type are fairly complex because this operation is poorly supported on all compilers. We've spent a fairly significant amount of time experimenting with different implementations inside of Google, and the one using explicitly expanded templates has been the most robust. Credit goes to Nick Lewycky for writing the first 20 versions or so of this logic we had inside of Google. I based this on the only one to actually survive. In case anyone is worried, yes we are both explicitly re-contributing and re-licensing it for LLVM. =] Once the issues with actually specifying the alignment are finished, it turns out that most compilers don't in turn align anything the way they are instructed. Testing of this logic against both Clang and GCC indicate that the alignment constraints are largely ignored by both compilers! I've come up with and used a work-around by wrapping each alignment-hinted type directly in a struct, and using that struct to align the character array through a union. This elaborate hackery is terrifying, but I've included testing that caught a terrifying number of bugs in every other technique I've tried. All of this in order to implement a poor C++98 programmers emulation of C++11 unrestricted unions in classes such as SmallDenseMap. llvm-svn: 158597	2012-06-16 08:52:57 +00:00
Chandler Carruth	144a2ac89d	Lift the NumElements and NumTombstones members into the super class rather than the base class. Add a pile of boilerplate to indirect around this. This is pretty ugly, but it allows the super class to change the representation of these values, which will be key for doing a SmallDenseMap. Suggestions on better method structuring / naming are welcome, but keep in mind that SmallDenseMap won't have an 'unsigned' member to expose a reference to... =/ llvm-svn: 158586	2012-06-16 01:18:07 +00:00
Chandler Carruth	d7291625cc	Factor DenseMap into a base class that implements the hashtable logic, and a derived class that provides the allocation and growth strategy. This is the first (and biggest) step toward building a SmallDenseMap that actually behaves exactly the same as DenseMap, and supports all the same types and interface points with the same semantics. llvm-svn: 158585	2012-06-16 01:05:01 +00:00
Marshall Clow	71757ef3ed	Adding accessors to COFFObjectFile so that clients can get at the (non-generic) bits llvm-svn: 158484	2012-06-15 01:08:25 +00:00
Rafael Espindola	def1b09be2	Implement the isSafeToDiscardIfUnused predicate and use it in globalopt and globaldce. Globaldce was already removing linkonce globals, but globalopt was not. llvm-svn: 158476	2012-06-14 22:48:13 +00:00
Stepan Dyatkovskiy	8af777330e	SmallMap, FlatArrayMap::copyFrom Replaced memcpy with std::copy, since the first one may work improperly with non POD data. llvm-svn: 158457	2012-06-14 16:59:43 +00:00
Chandler Carruth	72738f6341	Group the 'unsigned' members after the pointer to avoid 4 bytes of padding on x86-64. llvm-svn: 158421	2012-06-13 21:44:07 +00:00
Kay Tiong Khoo	f294921e24	*typo: Cyles changed to Cycles llvm-svn: 158404	2012-06-13 15:53:04 +00:00
Duncan Sands	318a89ddac	When linearizing a multiplication, return at once if we see a factor of zero, since then the entire expression must equal zero (similarly for other operations with an absorbing element). With this in place a bunch of reassociate code for handling constants is dead since it is all taken care of when linearizing. No intended functionality change. llvm-svn: 158398	2012-06-13 09:42:13 +00:00
Craig Topper	71dc02d659	Fix intrinsics for XOP frczss/sd instructions. These instructions only take one source register and zero the upper bits of the destination rather than preserving them. llvm-svn: 158396	2012-06-13 07:18:53 +00:00
Jakob Stoklund Olesen	1c66b87f7d	Eliminate struct TableGenBackend. TableGen backends are simply written as functions now. Patch by Sean Silva! llvm-svn: 158389	2012-06-13 05:15:49 +00:00
Andrew Trick	5b90645abb	sched: Avoid trivially redundant DAG edges. Take the one with higher latency. llvm-svn: 158379	2012-06-13 02:39:00 +00:00
David Blaikie	5452aa5f47	Remove use of GNU extension to resolve Clang warning. llvm-svn: 158364	2012-06-12 17:06:32 +00:00
Duncan Sands	d7aeefebd6	Now that Reassociate's LinearizeExprTree can look through arbitrary expression topologies, it is quite possible for a leaf node to have huge multiplicity, for example: x0 = x*x, x1 = x0*x0, x2 = x1*x1, ... rapidly gives a value which is x raised to a vast power (the multiplicity, or weight, of x). This patch fixes the computation of weights by correctly computing them no matter how big they are, rather than just overflowing and getting a wrong value. It turns out that the weight for a value never needs more bits to represent than the value itself, so it is enough to represent weights as APInts of the same bitwidth and do the right overflow-avoiding dance steps when computing weights. As a side-effect it reduces the number of multiplies needed in some cases of large powers. While there, in view of external uses (eg by the vectorizer) I made LinearizeExprTree static, pushing the rank computation out into users. This is progress towards fixing PR13021. llvm-svn: 158358	2012-06-12 14:33:56 +00:00
Argyrios Kyrtzidis	c6dc4d75fd	Satisfy C++ aliasing rules, per suggestion by Chandler. llvm-svn: 158346	2012-06-12 01:06:16 +00:00
Argyrios Kyrtzidis	8d19c86c9a	For llvm::sys::ThreadLocalImpl instead of malloc'ing the platform-specific thread local data, embed them in the class using a uint64_t and make sure we get compiler errors if there's a platform where this is not big enough. This makes ThreadLocal more safe for using it in conjunction with CrashRecoveryContext. Related to crash in rdar://11434201. llvm-svn: 158342	2012-06-12 00:21:31 +00:00
Andrew Trick	3e465fb225	misched: When querying RegisterPressureTracker, always save current and max pressure. llvm-svn: 158340	2012-06-11 23:42:23 +00:00
Jakob Stoklund Olesen	e6aed139f0	Write llvm-tblgen backends as functions instead of sub-classes. The TableGenBackend base class doesn't do much, and will be removed completely soon. Patch by Sean Silva! llvm-svn: 158311	2012-06-11 15:37:55 +00:00
Jakob Stoklund Olesen	f30fa58ebb	Fix a problem with the reverse bundle iterators. This showed up the first time rend() was called on a bundled instruction in the Mips backend. Also avoid dereferencing end() in bundle_iterator::operator++(). We still don't have a place to put unit tests for this stuff. llvm-svn: 158310	2012-06-11 15:11:12 +00:00
Craig Topper	7afe343be5	Add intrinsics for immediate form of XOP vprot instructions. Use i128mem instead of f128mem for integer XOP instructions. llvm-svn: 158291	2012-06-10 07:31:56 +00:00
Craig Topper	3352ba55b9	Replace XOP vpcom intrinsics with fewer intrinsics that take the immediate as an argument. llvm-svn: 158278	2012-06-09 16:46:13 +00:00
Benjamin Kramer	df97aa1628	Hashing: Remove outdated comment. Support for reserved hash values was removed in r151865. llvm-svn: 158276	2012-06-09 15:33:28 +00:00
Andrew Trick	fc8ce08be3	Register pressure: added getPressureAfterInstr. llvm-svn: 158256	2012-06-09 02:16:58 +00:00
Jakob Stoklund Olesen	c26fbbfba5	Sketch a LiveRegMatrix analysis pass. The LiveRegMatrix represents the live range of assigned virtual registers in a Live interval union per register unit. This is not fundamentally different from the interference tracking in RegAllocBase that both RABasic and RAGreedy use. The important differences are: - LiveRegMatrix tracks interference per register unit instead of per physical register. This makes interference checks cheaper and assignments slightly more expensive. For example, the ARM D7 register has 24 aliases, so we would check 24 physregs before assigning to one. With unit-based interference, we check 2 units before assigning to 2 units. - LiveRegMatrix caches regmask interference checks. That is currently duplicated functionality in RABasic and RAGreedy. - LiveRegMatrix is a pass which makes it possible to insert target-dependent passes between register allocation and rewriting. Such passes could tweak the register assignments with interference checking support from LiveRegMatrix. Eventually, RABasic and RAGreedy will be switched to LiveRegMatrix. llvm-svn: 158255	2012-06-09 02:13:10 +00:00
Dmitri Gribenko	dbeafa773a	Convert comments to proper Doxygen comments. llvm-svn: 158248	2012-06-09 00:01:45 +00:00
Andrew Trick	ce679ad89d	Removing strange "using" declarations from TargetInstrInfo. I can't imagine why these were added. Trial and error. llvm-svn: 158247	2012-06-08 23:56:26 +00:00
Jakob Stoklund Olesen	1224312f5b	Reintroduce VirtRegRewriter. OK, not really. We don't want to reintroduce the old rewriter hacks. This patch extracts virtual register rewriting as a separate pass that runs after the register allocator. This is possible now that CodeGen/Passes.cpp can configure the full optimizing register allocator pipeline. The rewriter pass uses register assignments in VirtRegMap to rewrite virtual registers to physical registers, and it inserts kill flags based on live intervals. These finalization steps are the same for the optimizing register allocators: RABasic, RAGreedy, and PBQP. llvm-svn: 158244	2012-06-08 23:44:45 +00:00
Andrew Trick	423fa6faee	TargetInstrInfo hooks implemented in codegen should be declared pure virtual. llvm-svn: 158233	2012-06-08 21:52:38 +00:00
Andrew Trick	8cf028752f	Sched itinerary fix: Avoid static initializers. This fixes an accidental dependence on static initialization order that I introduced yesterday. Thank you Lang!!! llvm-svn: 158215	2012-06-08 18:25:47 +00:00
Andrew Trick	a5d24ca453	Continue factoring computeOperandLatency. Use it for ARM hasHighOperandLatency. llvm-svn: 158164	2012-06-07 19:42:04 +00:00
Pete Cooper	f8d60d36c2	Add internal read flags to MachineInstrBuilder and hook them into the MachineOperand flag of the same name llvm-svn: 158137	2012-06-07 04:43:52 +00:00
Manman Ren	9c9641812c	Revert r157755. The commit is intended to fix rdar://11540023. It is implemented as part of peephole optimization. We can actually implement this in the SelectionDAG lowering phase. llvm-svn: 158122	2012-06-06 23:53:03 +00:00
Andrew Trick	05ff4667eb	Move RegisterClassInfo.h. Allow targets to access this API. It's required for RegisterPressure. llvm-svn: 158102	2012-06-06 20:29:31 +00:00
Andrew Trick	88517f608c	Move RegisterPressure.h. Make it a general utility for use by Targets. llvm-svn: 158097	2012-06-06 19:47:35 +00:00
Benjamin Kramer	009b1c1cf1	Round 2 of dead private variable removal. LLVM is now -Wunused-private-field clean except for - lib/MC/MCDisassembler/Disassembler.h. Not sure why it keeps all those unaccessible fields. - gtest. llvm-svn: 158096	2012-06-06 19:47:08 +00:00
Benjamin Kramer	628a39faa3	Remove unused private fields found by clang's new -Wunused-private-field. There are some that I didn't remove this round because they looked like obvious stubs. There are dead variables in gtest too, they should be fixed upstream. llvm-svn: 158090	2012-06-06 18:25:08 +00:00

16232 Commits