llvm-project

Commit Graph

Author	SHA1	Message	Date
Zachary Turner	0e327d0360	[llvm-pdbutil] Add back support for dumping file checksums. When dumping module source files, also dump checksums. llvm-svn: 305526	2017-06-15 23:12:41 +00:00
Zachary Turner	f8a2e04812	[llvm-pdbutil] Add back the ability to dump hashes and index offsets. This was regressed in a previous patch that re-wrote the dumper, and I'm incrementally adding back the pieces that are missing. llvm-svn: 305524	2017-06-15 23:04:42 +00:00
Zachary Turner	6305545527	Resubmit "[llvm-pdbutil] rewrite the "raw" output style." This resubmits commit c0c249e9f2ef83e1d1e5f166b50673d92f3579d7. It was broken due to some weird template issues, which have since been fixed. llvm-svn: 305517	2017-06-15 22:24:24 +00:00
Zachary Turner	da504b794c	Revert "[llvm-pdbutil] rewrite the "raw" output style." This reverts commit 83ea17ebf2106859a51fbc2a86031b44d33696ad. This is failing due to some strange template problems, so reverting until it can be straightened out. llvm-svn: 305505	2017-06-15 20:55:51 +00:00
Spyridoula Gravani	7a27a26db0	[DWARF] Removed dead code. The verifier functionality is provided by the DWARFVerifier class (as it should). llvm-svn: 305503	2017-06-15 20:40:08 +00:00
Zachary Turner	b560fdf3b8	[llvm-pdbutil] rewrite the "raw" output style. After some internal discussions, we agreed that the raw output style had outlived its usefulness. It was originally created before we had even thought of dumping to YAML, and it was intended to give us some insight into the internals of a PDB file. Now we have YAML mode which does almost exactly this but is more powerful in that it can round-trip back to a PDB, which the raw mode could not do. So the raw mode had become purely a maintenance burden. One option was to just delete it. However, its original goal was to be as readable as possible while staying close to the "metal" - i.e. presenting the output in a way that maps directly to the underlying file format. We don't actually need that last requirement anymore since it's covered by the yaml mode, so we could repurpose "raw" mode to actually just be as readable as possible. This patch implements about 80% of the functionality previously in raw mode, but in a completely different style that is more akin to what cvdump outputs. Records are very compressed, often times appearing on just one line. One nice thing about this is that it makes full record matching easier, because you can grep for indices, names, and leaf types on a single line often. See the tests for some examples of what the new output looks like. Note that this patch actually regresses the functionality of raw mode in a few areas, but only because the patch was already unreasonably large and going 100% would have been even worse. Specifically, this patch is missing: The ability to dump module debug subsections (checksums, lines, etc) The ability to dump section headers Aside from that everything is here. While goign through the tests fixing them all up, I found many duplicate tests. They've been deleted. In subsequent patches I will go through and re-add the missing functionality. Differential Revision: https://reviews.llvm.org/D34191 llvm-svn: 305495	2017-06-15 19:34:41 +00:00
Galina Kistanova	3c0505d30c	Specified ReportError as noreturn friendly to old compilers. llvm-svn: 305405	2017-06-14 17:32:53 +00:00
Zachary Turner	a8cfc29c9a	Resubmit "[codeview] Make obj2yaml/yaml2obj support .debug$S..." This was originally reverted because of some non-deterministic failures on certain buildbots. Luckily ASAN eventually caught this as a stack-use-after-scope, so the fix is included in this patch. llvm-svn: 305393	2017-06-14 15:59:27 +00:00
Zachary Turner	0085dce221	Revert "[codeview] Make obj2yaml/yaml2obj support .debug$S..." This is causing failures on linux bots with an invalid stream read. It doesn't repro in any configuration on Windows, so reverting until I have a chance to investigate on Linux. llvm-svn: 305371	2017-06-14 06:24:24 +00:00
Zachary Turner	a3da4467fa	[codeview] Make obj2yaml/yaml2obj support .debug$S/T sections. This allows us to use yaml2obj and obj2yaml to round-trip CodeView symbol and type information without having to manually specify the bytes of the section. This makes for much easier to maintain tests. See the tests under lld/COFF in this patch for example. Before they just said SectionData: <blob> whereas now we can use meaningful record descriptions. Note that it still supports the SectionData yaml field, which could be useful for initializing a section to invalid bytes for testing, for example. Differential Revision: https://reviews.llvm.org/D34127 llvm-svn: 305366	2017-06-14 05:31:00 +00:00
Spyridoula Gravani	e41823bb89	Added partial verification for .apple_names accelerator table in llvm-dwarfdump output. This patch adds code which verifies that each bucket in the .apple_names accelerator table is either empty or has a valid hash index. Differential Revision: https://reviews.llvm.org/D34177 llvm-svn: 305344	2017-06-14 00:17:55 +00:00
Galina Kistanova	41def9b72c	Reverted r305339 as MSVC is not happy with noreturn in lambda. llvm-svn: 305343	2017-06-13 23:57:51 +00:00
Galina Kistanova	680c7605a7	Specified LLVM_ATTRIBUTE_NORETURN for ReportError. llvm-svn: 305339	2017-06-13 23:39:42 +00:00
Reid Kleckner	8cbdd0c0f2	[PDB] Add a module descriptor for every object file Summary: Expose the module descriptor index and fill it in for section contributions. Reviewers: zturner Subscribers: llvm-commits, ruiu, hiraditya Differential Revision: https://reviews.llvm.org/D34126 llvm-svn: 305296	2017-06-13 15:49:13 +00:00
Zachary Turner	68ea80d0a7	Slightly better fix for dealing with no-id-stream PDBs. The last fix required the user to manually add the required feature. This caused an LLD test to fail because I failed to update LLD. In practice we can hide this logic so it can just be transparently added when we write the PDB. llvm-svn: 305236	2017-06-12 21:46:51 +00:00
Zachary Turner	990d0c8158	[llvm-pdbdump] Don't fail on PDBs with no ID stream. Older PDBs don't have this. Its presence is detected by using the various "feature" flags that come at the end of the PDB Stream. Detect this, and don't try to dump the ID stream if the features tells us it's not present. llvm-svn: 305235	2017-06-12 21:34:53 +00:00
Zachary Turner	d334cebac4	Fix a null pointer dereference in llvm-pdbutil pretty. Static data members were causing a problem because I mistakenly assumed all members would affect a class's layout and so the Layout member would be non-null. llvm-svn: 305229	2017-06-12 20:46:35 +00:00
Sylvestre Ledru	337804d86a	Same expressions on both sides of the return Summary: I guess we want PointerToMemberFunction & PointerToDataMember Fix coverity cid 1376038 Reviewers: zturner Reviewed By: zturner Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D34110 llvm-svn: 305219	2017-06-12 18:53:46 +00:00
David Blaikie	a91885a08c	dwarfdump: Handle relocs to zlib (.zdebug*) compressed sections llvm-svn: 305152	2017-06-10 19:32:50 +00:00
Galina Kistanova	038f9854ec	Added llvm_unreachable as ReportError cannot be specified as noreturn. llvm-svn: 305143	2017-06-10 07:50:14 +00:00
Zachary Turner	3226fe95bb	[pdb] Support CoffSymbolRVA debug subsection. llvm-svn: 305108	2017-06-09 20:46:52 +00:00
Zachary Turner	7e62cd17d6	Allow VarStreamArray to use stateful extractors. Previously extractors tried to be stateless with any additional context information needed in order to parse items being passed in via the extraction method. This led to quite cumbersome implementation challenges and awkwardness of use. This patch brings back support for stateful extractors, making the implementation and usage simpler. llvm-svn: 305093	2017-06-09 17:54:36 +00:00
Bob Haarman	fdf499bf2d	[codeview] use 32-bit integer for RelocOffset in DebugLinesSubsection Summary: RelocOffset is a 32-bit value, but we previously truncated it to 16 bits. Fixes PR33335. Reviewers: zturner, hiraditya! Reviewed By: zturner Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D33968 llvm-svn: 305043	2017-06-09 01:18:10 +00:00
Zachary Turner	28c22c83e3	[pdb] Don't crash on unknown debug subsections. More and more unknown debug subsection kinds are being discovered so we should make it possible to dump these and display the bytes. llvm-svn: 305041	2017-06-09 00:53:59 +00:00
Zachary Turner	deb391309c	[CodeView] Support remaining debug subsection types This adds support for Symbols, StringTable, and FrameData subsection types. Even though these subsections rarely if ever appear in a PDB file (they are usually in object files), there's no theoretical reason why they couldn't appear in a PDB. The real issue though is that in order to add support for dumping and writing them (which will be useful for object files), we need a way to test them. And since there is no support for reading and writing them to / from object files yet, making PDB support them is the best way to both add support for the underlying format and add support for tests at the same time. Later, when we go to add support for reading / writing them from object files, we'll need only minimal changes in the underlying read/write code. llvm-svn: 305037	2017-06-09 00:28:08 +00:00
Zachary Turner	1bf7762049	[llvm-pdbdump] Support native ordering of subsections in raw mode. This is the same change for the YAML Output style applied to the raw output style. Previously we would queue up all subsections until every one had been read, and then output them in a pre- determined order. This was because some subsections need to be read first in order to properly dump later subsections. This patch allows them to be dumped in the order they appear. Differential Revision: https://reviews.llvm.org/D34015 llvm-svn: 305034	2017-06-08 23:49:01 +00:00
Zachary Turner	15eb237fd3	[PDB] Don't crash on /debug:fastlink PDBs. Apparently support for /debug:fastlink PDBs isn't part of the DIA SDK (!), and it was causing llvm-pdbdump to crash because we weren't checking for a null pointer return value. This manifests when calling findChildren on the IDiaSymbol, and it returns E_NOTIMPL. llvm-svn: 304982	2017-06-08 16:00:40 +00:00
NAKAMURA Takumi	92c99cd6dc	Update libdeps to add BinaryFormat, introduced in r304864. llvm-svn: 304869	2017-06-07 04:48:49 +00:00
Zachary Turner	264b5d9e88	Move Object format code to lib/BinaryFormat. This creates a new library called BinaryFormat that has all of the headers from llvm/Support containing structure and layout definitions for various types of binary formats like dwarf, coff, elf, etc as well as the code for identifying a file from its magic. Differential Revision: https://reviews.llvm.org/D33843 llvm-svn: 304864	2017-06-07 03:48:56 +00:00
Zachary Turner	1bfb9f47af	Fix uninitialized read. llvm-svn: 304846	2017-06-06 23:54:23 +00:00
Adrian Prantl	318d1195f2	Introduce -brief command line option to llvm-dwarfdump This patch introduces a new command line option, called brief, to llvm-dwarfdump. When -brief is used, the attribute forms for the .debug_info section will not be emitted to output. Patch by Spyridoula Gravani! rdar://problem/21474365 Differential Revision: https://reviews.llvm.org/D33867 llvm-svn: 304844	2017-06-06 23:28:45 +00:00
Chandler Carruth	aaeada6c75	Fix another ordering constraint with windows.h and comment about a revers constraint that we got right (by chance). llvm-svn: 304792	2017-06-06 12:43:20 +00:00
Chandler Carruth	6bda14b313	Sort the remaining #include lines in include/... and lib/.... I did this a long time ago with a janky python script, but now clang-format has built-in support for this. I fed clang-format every line with a #include and let it re-sort things according to the precise LLVM rules for include ordering baked into clang-format these days. I've reverted a number of files where the results of sorting includes isn't healthy. Either places where we have legacy code relying on particular include ordering (where possible, I'll fix these separately) or where we have particular formatting around #include lines that I didn't want to disturb in this patch. This patch is entirely mechanical. If you get merge conflicts or anything, just ignore the changes in this patch and run clang-format over your #include lines in the files. Sorry for any noise here, but it is important to keep these things stable. I was seeing an increasing number of patches with irrelevant re-ordering of #include lines because clang-format was used. This patch at least isolates that churn, makes it easy to skip when resolving conflicts, and gets us to a clean baseline (again). llvm-svn: 304787	2017-06-06 11:49:48 +00:00
Wolfgang Pieb	77d3e938f8	[DWARF] Adding support for the DWARF v5 string offsets table (consumer/reader part only). Reviewers: dblaikie, aprantl Differential Revision: https://reviews.llvm.org/D32779 llvm-svn: 304759	2017-06-06 01:22:34 +00:00
Zachary Turner	88101dadcc	[CodeView] Fix endianness bug. We should be outputting in little endian, but we were writing in host endianness. llvm-svn: 304741	2017-06-05 22:12:23 +00:00
Zachary Turner	349c18f837	[CodeView] Handle Cross Module Imports and Exports. While it's not entirely clear why a compiler or linker might put this information into an object or PDB file, one has been spotted in the wild which was causing llvm-pdbdump to crash. This patch adds support for reading-writing these sections. Since I don't know how to get one of the native tools to generate this kind of debug info, the only test here is one in which we feed YAML into the tool to produce a PDB and then spit out YAML from the resulting PDB and make sure that it matches. llvm-svn: 304738	2017-06-05 21:40:33 +00:00
Zachary Turner	5b74ff33e7	[PDB] Fix use after free. Previously MappedBlockStream owned its own BumpPtrAllocator that it would allocate from when a read crossed a block boundary. This way it could still return the user a contiguous buffer of the requested size. However, It's not uncommon to open a stream, read some stuff, close it, and then save the information for later. After all, since the entire file is mapped into memory, the data should always be available as long as the file is open. Of course, the exception to this is when the data isn't in the file, but rather in some buffer that we temporarily allocated to present this contiguous view. And this buffer would get destroyed as soon as the strema was closed. The fix here is to force the user to specify the allocator, this way it can provide an allocator that has whatever lifetime it chooses. Differential Revision: https://reviews.llvm.org/D33858 llvm-svn: 304623	2017-06-03 00:33:35 +00:00
Zachary Turner	92dcdda623	[CodeView] Support CodeView subsections in any order. Previously we would expect certain subsections to appear in a certain order because some subsections would reference other subsections, but in practice we need to support arbitrary orderings since some object file and PDB file producers generate them this way. This also paves the way for supporting Yaml <-> Object File conversion of CodeView, since Object Files typically have quite a large number of subsections in their debug info. Differential Revision: https://reviews.llvm.org/D33807 llvm-svn: 304588	2017-06-02 19:49:14 +00:00
Zachary Turner	afb81a83a9	Fix 2 more -Wreorder warnings. llvm-svn: 304494	2017-06-01 23:24:50 +00:00
Zachary Turner	ebd3ae8371	[CodeView] Properly align symbol records on read/write. Object files have symbol records not aligned to any particular boundary (e.g. 1-byte aligned), while PDB files have symbol records padded to 4-byte aligned boundaries. Since they share the same reading / writing code, we have to provide an option to specify the alignment and propagate it up to the producer or consumer who knows what the alignment is supposed to be for the given container type. Added a test for this by modifying the existing PDB -> YAML -> PDB round-tripping code to round trip symbol records as well as types. Differential Revision: https://reviews.llvm.org/D33785 llvm-svn: 304484	2017-06-01 21:52:41 +00:00
Adrian Prantl	f4bc1f77b7	[DWARF] Introduce Dump Options This commit introduces a structure that holds all the flags that control the pretty printing of dwarf output. Patch by Spyridoula Gravani! Differential Revision: https://reviews.llvm.org/D33749 llvm-svn: 304446	2017-06-01 18:18:23 +00:00
Zachary Turner	d427383cb8	[CodeView] Move CodeView YAML code to ObjectYAML. This is the beginning of an effort to move the codeview yaml reader / writer into ObjectYAML so that it can be shared. Currently the only consumer / producer of CodeView YAML is llvm-pdbdump, but CodeView can exist outside of PDB files, and indeed is put into object files and passed to the linker to produce PDB files. Furthermore, there are subtle differences in the types of records that show up in object file CodeView vs PDB file CodeView, but they are otherwise 99% the same. By having this code in ObjectYAML, we can have llvm-pdbdump reuse this code, while teaching obj2yaml and yaml2obj to use this syntax for dealing with object files that can contain CodeView. This patch only adds support for CodeView type information to ObjectYAML. Subsequent patches will add support for CodeView symbol information. llvm-svn: 304248	2017-05-30 21:53:05 +00:00
Galina Kistanova	8c1e2f9108	Added missing break. llvm-svn: 304230	2017-05-30 19:02:49 +00:00
Zachary Turner	591312c5c1	[CodeView] Add more DebugSubsection implementations. This adds implementations for Symbols and FrameData, and renames the existing codeview::StringTable class to conform to the DebugSectionStringTable convention. llvm-svn: 304222	2017-05-30 17:13:33 +00:00
Zachary Turner	8c099fe06e	[CodeView] Rename ModuleDebugFragment -> DebugSubsection. This is more concise, and matches the terminology used in other parts of the codebase more closely. llvm-svn: 304218	2017-05-30 16:36:15 +00:00
George Rimar	a25d329b33	Recommit "[DWARF] - Make collectAddressRanges() return section index in addition to Low/High PC" With fix of uninitialized variable. Original commit message: This change is intended to use for LLD in D33183. Problem we have in LLD when building .gdb_index is that we need to know section which address range belongs to. Previously it was solved on LLD side by providing fake section addresses with use of llvm::LoadedObjectInfo interface. We assigned file offsets as addressed. Then after obtaining ranges lists, for each range we had to find section ID's. That not only was slow, but also complicated implementation and was the reason of incorrect behavior when sections share the same offsets, like D33176 shows. This patch makes DWARF parsers to return section index as well. That solves problem mentioned above. Differential revision: https://reviews.llvm.org/D33184 llvm-svn: 304078	2017-05-27 18:10:23 +00:00
George Rimar	1f9cab6b1c	Revert r304002 "[DWARF] - Make collectAddressRanges() return section index in addition to Low/High PC" Revert it again. Now another bot unhappy: http://lab.llvm.org:8011/builders/clang-s390x-linux/builds/8750 llvm-svn: 304011	2017-05-26 17:36:23 +00:00
George Rimar	bc223c63cc	[DWARF] - Make collectAddressRanges() return section index in addition to Low/High PC This change is intended to use for LLD in D33183. Problem we have in LLD when building .gdb_index is that we need to know section which address range belongs to. Previously it was solved on LLD side by providing fake section addresses with use of llvm::LoadedObjectInfo interface. We assigned file offsets as addressed. Then after obtaining ranges lists, for each range we had to find section ID's. That not only was slow, but also complicated implementation and was the reason of incorrect behavior when sections share the same offsets, like D33176 shows. This patch makes DWARF parsers to return section index as well. That solves problem mentioned above. Differential revision: https://reviews.llvm.org/D33184 llvm-svn: 304002	2017-05-26 16:26:18 +00:00
George Rimar	a8403a64ea	Revert "[DWARF] - Make collectAddressRanges() return section index in addition to Low/High PC" Broked BB again: TEST 'LLVM :: DebugInfo/X86/dbg-value-regmask-clobber.ll' FAILED ... LLVM ERROR: Section was outside of section table. llvm-svn: 303984	2017-05-26 13:20:09 +00:00
George Rimar	655b7b63f6	Recommit r303978 "[DWARF] - Make collectAddressRanges() return section index in addition to Low/High PC" With fix of test compilation. Initial commit message: This change is intended to use for LLD in D33183. Problem we have in LLD when building .gdb_index is that we need to know section which address range belongs to. Previously it was solved on LLD side by providing fake section addresses with use of llvm::LoadedObjectInfo interface. We assigned file offsets as addressed. Then after obtaining ranges lists, for each range we had to find section ID's. That not only was slow, but also complicated implementation and was the reason of incorrect behavior when sections share the same offsets, like D33176 shows. This patch makes DWARF parsers to return section index as well. That solves problem mentioned above. Differential revision: https://reviews.llvm.org/D33184 llvm-svn: 303983	2017-05-26 13:13:50 +00:00
George Rimar	7d5f12185a	Revert r303978 "[DWARF] - Make collectAddressRanges() return section index in addition to Low/High PC" It failed BB. llvm-svn: 303981	2017-05-26 12:53:41 +00:00
George Rimar	732f268aa0	[DWARF] - Make collectAddressRanges() return section index in addition to Low/High PC This change is intended to use for LLD in D33183. Problem we have in LLD when building .gdb_index is that we need to know section which address range belongs to. Previously it was solved on LLD side by providing fake section addresses with use of llvm::LoadedObjectInfo interface. We assigned file offsets as addressed. Then after obtaining ranges lists, for each range we had to find section ID's. That not only was slow, but also complicated implementation and was the reason of incorrect behavior when sections share the same offsets, like D33176 shows. This patch makes DWARF parsers to return section index as well. That solves problem mentioned above. Differential revision: https://reviews.llvm.org/D33184 llvm-svn: 303978	2017-05-26 12:46:41 +00:00
Zachary Turner	f2110283c6	Remove unused member. llvm-svn: 303942	2017-05-25 23:47:56 +00:00
Zachary Turner	fed467eefb	[CV Type Merging] Find nested type indices faster. Merging two type streams is one of the most time consuming parts of generating a PDB, and as such it needs to be as fast as possible. The visitor abstractions used for interoperating nicely with many different types of inputs and outputs have been used widely and help greatly for testability and implementing tools, but the abstractions build up and get in the way of performance. This patch removes all of the visitation stuff from the type stream merger, essentially re-inventing the leaf / member switch and loop, but at a very low level. This allows us many other optimizations, such as not actually deserializing any records (even member records which don't describe their own length), as the operation of "figure out how long this record is" is somewhat faster than "figure out how long this record and get all its fields out". Furthermore, whereas before we had to deserialize, re-write type indices, then re-serialize, now we don't have to do any of those 3 steps. We just find out where the type indices are and pull them directly out of the byte stream and re-write them. This is worth a 50-60% performance increase. On top of all other optimizations that have been applied this week, I now get the following numbers when linking lld.exe and lld.pdb MSVC: 25.67s Before This Patch: 18.59s After This Patch: 8.92s So this is a huge performance win. Differential Revision: https://reviews.llvm.org/D33564 llvm-svn: 303935	2017-05-25 23:36:16 +00:00
Zachary Turner	2897e0306e	[lld] Fix a bug where we continually re-follow type servers. Originally this was intended to be set up so that when linking a PDB which refers to a type server, it would only visit the PDB once, and on subsequent visitations it would just skip it since all the records had already been added. Due to some C++ scoping issues, this was not occurring and it was revisiting the type server every time, which caused every record to end up being thrown away on all subsequent visitations. This doesn't affect the performance of linking clang-cl generated object files because we don't use type servers, but when linking object files and libraries generated with /Zi via MSVC, this means only 1 object file has to be linked instead of N object files, so the speedup is quite large. llvm-svn: 303920	2017-05-25 21:16:03 +00:00
Zachary Turner	7f97c362a4	[CodeView Type Merging] Don't keep re-allocating temp serializer. Previously, every time we wanted to serialize a field list record, we would create a new copy of FieldListRecordBuilder, which would in turn create a temporary instance of TypeSerializer, which itself had a std::vector<> that was about 128K in size. So this 128K allocation was happening every time. We can re-use the same instance over and over, we just have to clear its internal hash table and seen records list between each run. This saves us from the constant re-allocations. This is worth an ~18.5% speed increase (3.75s -> 3.05s) in my tests. Differential Revision: https://reviews.llvm.org/D33506 llvm-svn: 303919	2017-05-25 21:15:37 +00:00
Bob Haarman	55256ada25	[pdb] pad source file name buffer at the end instead of the beginning Summary: DbiStreamBuilder calculated the offset of the source file names inside the file info substream as the size of the file info substream minus the size of the file names. Since the file info substream is padded to a multiple of 4 bytes, this caused the first file name to be aligned on a 4-byte boundary. By contrast, DbiModuleList would read the file names immediately after the file name offset table, without skipping to the next 4-byte boundary. This change makes it so that the file names are written to the location where DbiModuleList expects them, and puts any necessary padding for the file info substream after the file names instead of before it. Reviewers: amccarth, rnk, zturner Reviewed By: amccarth, zturner Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D33475 llvm-svn: 303917	2017-05-25 21:12:15 +00:00
Zachary Turner	c4e4b7e31e	Fix a bug in MappedBlockStream. It was using the number of blocks of the entire PDB file as the number of blocks of each stream that was created. This was only an issue in the readLongestContiguousChunk function, which was never called prior. This bug surfaced when I updated an algorithm to use this function and the algorithm broke. llvm-svn: 303916	2017-05-25 21:12:00 +00:00
Zachary Turner	dda25b128c	[CodeView Type Merging] Avoid record deserialization when possible. A profile shows the majority of time doing type merging is spent deserializing records from sequences of bytes into friendly C++ structures that we can easily access members of in order to find the type indices to re-write. Records are prefixed with their length, however, and most records have type indices that appear at fixed offsets in the record. For these records, we can save some cycles by just looking at the right place in the byte sequence and re-writing the value, then skipping the record in the type stream. This saves us from the costly deserialization of examining every field, including potentially null terminated strings which are the slowest, even though it was unnecessary to begin with. In addition, we apply another optimization. Previously, after deserializing a record and re-writing its type indices, we would unconditionally re-serialize it in order to compute the hash of the re-written record. This would result in an alloc and memcpy for every record. If no type indices were re-written, however, this was an unnecessary allocation. In this patch re-writing is made two phase. The first phase discovers the indices that need to be rewritten and their new values. This information is passed through to the de-duplication code, which only copies and re-writes type indices in the serialized byte sequence if at least one type index is different. Some records have type indices which only appear after variable length strings, or which have lists of type indices, or various other situations that can make it tricky to make this optimization. While I'm not giving up on optimizing these cases as well, for now we can get the easy cases out of the way and lay the groundwork for more complicated cases later. This patch yields another 50% speedup on top of the already large speedups submitted over the past 2 days. In two tests I have run, I went from 9 seconds to 3 seconds, and from 16 seconds to 8 seconds. Differential Revision: https://reviews.llvm.org/D33480 llvm-svn: 303914	2017-05-25 21:06:28 +00:00
Zachary Turner	bb64231d2d	Don't do a full scan of the type stream before processing records. LazyRandomTypeCollection is designed for random access, and in order to provide this it lazily indexes ranges of types. In the case of types from an object file, there is no partial index to build off of, so it has to index the full stream up front. However, merging types only requires sequential access, and when that is needed, this extra work is simply wasted. Changing the algorithm to work on sequential arrays of types rather than random access type collections eliminates this up front scan. llvm-svn: 303707	2017-05-24 00:26:27 +00:00
Zachary Turner	7daf62e743	[CodeView] Eliminate redundant hashes and allocations. When writing field list records, we would construct a temporary type serializer that shared a bump ptr allocator with the rest of the application, so anything allocated from here would live forever. Furthermore, this temporary serializer had all the properties of a full blown serializer including record hashing and de-duplication. These features are required when you're merging multiple type streams into each other, because different streams may contain identical records, but records from the same type stream will never collide with each other. So all of this hashing was unnecessary. To solve this, two fixes are made: 1) The temporary serializer keeps its own bump ptr allocator instead of sharing a global one. When it's finished, all of its memory is freed. 2) Instead of using the same temporary serializer for the life of an entire type stream, we use it only for the life of a single field list record and delete it when the field list record is completed. This way the hash table will not grow as other records from the same type stream get inserted. Further improvements could eliminate hashing entirely from this codepath. This reduces the link time by 85% in my test, from 1 minute to 9 seconds. llvm-svn: 303676	2017-05-23 18:56:23 +00:00
Reid Kleckner	36238b15d7	Speculative build fix for non-Windows llvm-svn: 303667	2017-05-23 18:28:13 +00:00
Reid Kleckner	ded38803c5	[PDB] Hash types up front when merging types instead of using StringMap Summary: First, StringMap uses llvm::HashString, which is only good for short identifiers and really bad for large blobs of binary data like type records. Moving to `DenseMap<StringRef, TypeIndex>` with some tricks for memory allocation fixes that. Unfortunately, that didn't buy very much performance. Profiling showed that we spend a long time during DenseMap growth rehashing existing entries. Also, in general, DenseMap is faster when the keys are small. This change takes that to the logical conclusion by introducing a small wrapper value type around a pointer to key data. The key data contains a precomputed hash, the original record data (pointer and size), and the type index, which is the "value" of our original map. This reduces the time to produce llvm-as.exe and llvm-as.pdb from ~15s on my machine to 3.5s, which is about a 4x improvement. Reviewers: zturner, inglorion, ruiu Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D33428 llvm-svn: 303665	2017-05-23 18:23:59 +00:00
Zachary Turner	bf35e6ab2a	Revert "Make TypeSerializer's StringMap use the same allocator." This reverts commit e34ccb7b57da25cc89ded913d8638a2906d1110a. This is causing failures on the ASAN bots. llvm-svn: 303640	2017-05-23 15:50:37 +00:00
David Blaikie	15d85fc537	libDebugInfo: Support symbolizing using DWP files llvm-svn: 303609	2017-05-23 06:48:53 +00:00
David Blaikie	37d1cff491	FIX: Remove debugging assert left in previous commit Sorry for the bot noise. llvm-svn: 303592	2017-05-23 00:31:24 +00:00
David Blaikie	f9803fb4bb	libDebugInfo: Avoid independently parsing the same .dwo file for two separate CUs residing there NFC, just an optimization. Will be building on this for DWP support shortly. llvm-svn: 303591	2017-05-23 00:30:42 +00:00
Zachary Turner	d4136e945e	Implement various flavors of type merging. Previous algotirhm assumed that types and ids are in a single unified stream. For inputs that come from object files, this is the case. But if the input is already a PDB, or is the result of a previous merge, then the types and ids will already have been split up, in which case we need an algorithm that can accept operate on independent streams of types and ids that refer across stream boundaries to each other. Differential Revision: https://reviews.llvm.org/D33417 llvm-svn: 303577	2017-05-22 21:07:43 +00:00
Zachary Turner	12f8c31c04	Make TypeSerializer's StringMap use the same allocator. llvm-svn: 303576	2017-05-22 21:07:14 +00:00
David Blaikie	d2f3a941e0	libDebugInfo/DWARF: Apply relocations for debug_addr addresses in object files llvm-symbolizer would fail to symbolize addresses in unlinked object files when handling .dwo file data because the addresses would not be relocated in the same way as the ranges in the skeleton CU in the object file. Fix that so object files can be symbolized the same as executables. llvm-svn: 303532	2017-05-22 07:02:47 +00:00
David Blaikie	8d039d40c5	llvm-symbolizer: Support multiple CUs in a single DWO file llvm-svn: 303482	2017-05-20 03:32:49 +00:00
Zachary Turner	526f4f2aa8	Resubmit "[CodeView] Provide a common interface for type collections." This was originally reverted because it was a breaking a bunch of bots and the breakage was not surfacing on Windows. After much head-scratching this was ultimately traced back to a bug in the lit test runner related to its pipe handling. Now that the bug in lit is fixed, Windows correctly reports these test failures, and as such I have finally (hopefully) fixed all of them in this patch. llvm-svn: 303446	2017-05-19 19:26:58 +00:00
Zachary Turner	1dfcf8d92c	Revert "[CodeView] Provide a common interface for type collections." This is a squash of ~5 reverts of, well, pretty much everything I did today. Something is seriously broken with lit on Windows right now, and as a result assertions that fire in tests are triggering failures. I've been breaking non-Windows bots all day which has seriously confused me because all my tests have been passing, and after running lit with -a to view the output even on successful runs, I find out that the tool is crashing and yet lit is still reporting it as a success! At this point I don't even know where to start, so rather than leave the tree broken for who knows how long, I will get this back to green, and then once lit is fixed on Windows, hopefully hopefully fix the remaining set of problems for real. llvm-svn: 303409	2017-05-19 05:57:45 +00:00
Zachary Turner	47fdc73771	Don't crash if someone tries to visit an empty type stream. llvm-svn: 303408	2017-05-19 05:18:09 +00:00
Zachary Turner	59ab6a3816	[CodeView] Reduce memory usage in TypeSerializer. We were using a BumpPtrAllocator to allocate stable storage for a record, then trying to insert that into a hash table. If a collision occurred, the bytes were never inserted and the allocation was unnecessary. At the cost of an extra hash computation, check first if it exists, and only if it does do we allocate and insert. llvm-svn: 303407	2017-05-19 04:56:48 +00:00
Zachary Turner	8f1d87a79a	Fix crasher in CodeView test. Apparently this was always broken, but previously we were more graceful about it and we would print "unknown udt" if we couldn't find the type index, whereas now we just segfault because we assume it's valid. But this exposed a real bug, which is that we weren't looking in the right place. So fix that, and also fix this crash at the same time. llvm-svn: 303397	2017-05-19 00:56:39 +00:00
Zachary Turner	7b62d7ccc0	Fix some build errors and warnings. llvm-svn: 303391	2017-05-18 23:12:42 +00:00
Zachary Turner	b32ec02b80	[CodeView] Raise the source to ID map out of the TypeStreamMerger. This map will be needed to rewrite symbol streams after re-writing the corresponding type streams. llvm-svn: 303390	2017-05-18 23:04:08 +00:00
Zachary Turner	8fb441ab9c	[llvm-pdbdump] Add the ability to merge PDBs. Merging PDBs is a feature that will be used heavily by the linker. The functionality already exists but does not have deep test coverage because it's not easily exposed through any tools. This patch aims to address that by adding the ability to merge PDBs via llvm-pdbdump. It takes arbitrarily many PDBs and outputs a single PDB. Using this new functionality, a test is added for merging type records. Future patches will add the ability to merge symbol records, module information, etc. llvm-svn: 303389	2017-05-18 23:03:41 +00:00
Zachary Turner	0c60f269fc	[CodeView] Provide a common interface for type collections. Right now we have multiple notions of things that represent collections of types. Most commonly used are TypeDatabase, which is supposed to keep mappings from TypeIndex to type name when reading a type stream, which happens when reading PDBs. And also TypeTableBuilder, which is used to build up a collection of types dynamically which we will later serialize (i.e. when writing PDBs). But often you just want to do some operation on a collection of types, and you may want to do the same operation on any kind of collection. For example, you might want to merge two TypeTableBuilders or you might want to merge two type streams that you loaded from various files. This dichotomy between reading and writing is responsible for a lot of the existing code duplication and overlapping responsibilities in the existing CodeView library classes. For example, after building up a TypeTableBuilder with a bunch of type records, if we want to dump it we have to re-invent a bunch of extra glue because our dumper takes a TypeDatabase or a CVTypeArray, which are both incompatible with TypeTableBuilder. This patch introduces an abstract base class called TypeCollection which is shared between the various type collection like things. Wherever we previously stored a TypeDatabase& in some common class, we now store a TypeCollection&. The advantage of this is that all the details of how the collection are implemented, such as lazy deserialization of partial type streams, is completely transparent and you can just treat any collection of types the same regardless of where it came from. Differential Revision: https://reviews.llvm.org/D33293 llvm-svn: 303388	2017-05-18 23:03:06 +00:00
Zachary Turner	5a83fb153f	Fix some minor issues in PDB parsing library. 1) Until now I'd never seen a valid PDB where the DBI stream and the PDB Stream disagreed on the "Age" field. Because of that, we had code to assert that they matched. Recently though I was given a PDB where they disagreed, so this assumption has proven to be incorrect. Remove this check. 2) We were walking the entire list of hash values for types up front and then throwing away the values. For large PDBs this was a significant slow down. Remove this. With this patch, I can dump the list of all compilands from a 1.5GB PDB file in just a few seconds. llvm-svn: 303351	2017-05-18 15:14:44 +00:00
George Rimar	47f84b1a3c	[DWARF] - Simplify RelocVisitor implementation. We do not need to store relocation width field. Patch removes relative code, that simplifies implementation. Differential revision: https://reviews.llvm.org/D33274 llvm-svn: 303335	2017-05-18 08:25:11 +00:00
George Rimar	f98b9ac5da	[lib/Object] - Minor API update for llvm::Decompressor. I revisited Decompressor API (issue with it was triggered during D32865 review) and found it is probably provides more then we really need. Issue was about next method's signature: Error decompress(SmallString<32> &Out); It is too strict. At first I wanted to change it to decompress(SmallVectorImpl<char> &Out), but then found it is still not flexible because sticks to SmallVector. During reviews was suggested to use templating to simplify code. Patch do that. Differential revision: https://reviews.llvm.org/D33200 llvm-svn: 303331	2017-05-18 08:00:01 +00:00
Bob Haarman	de33a63784	[llvm-pdbdump] in yaml2pdb, generate default output filename if none given Summary: llvm-pdbdump yaml2pdb used to fail with a misleading error message ("An I/O error occurred on the file system") if no output file was specified. This change adds an assert to PDBFileBuilder to check that an output file name is specified, and makes llvm-pdbdump generate an output file name based on the input file name if no output file name is explicitly specified. Reviewers: amccarth, zturner Reviewed By: zturner Subscribers: fhahn, llvm-commits Differential Revision: https://reviews.llvm.org/D33296 llvm-svn: 303299	2017-05-17 20:46:48 +00:00
Zachary Turner	1d795c451e	[CodeView] Simplify the use of visiting type records & streams. There is often a lot of boilerplate code required to visit a type record or type stream. The #1 use case is that you have a sequence of bytes that represent one or more records, and you want to deserialize each one, switch on it, and call a callback with the deserialized record that the user can examine. Currently this requires at least 6 lines of code: codeview::TypeVisitorCallbackPipeline Pipeline; Pipeline.addCallbackToPipeline(Deserializer); Pipeline.addCallbackToPipeline(MyCallbacks); codeview::CVTypeVisitor Visitor(Pipeline); consumeError(Visitor.visitTypeRecord(Record)); With this patch, it becomes one line of code: consumeError(codeview::visitTypeRecord(Record, MyCallbacks)); This is done by having the deserialization happen internally inside of the visitTypeRecord function. Since this is occasionally not desirable, the function provides a 3rd parameter that can be used to change this behavior. Hopefully this can significantly reduce the barrier to entry to using the visitation infrastructure. Differential Revision: https://reviews.llvm.org/D33245 llvm-svn: 303271	2017-05-17 16:39:06 +00:00
George Rimar	fed9f09f48	[DWARF] - Cleanup relocations proccessing. RelocAddrMap was a pair of <width, address>, where width is relocation size (4/8/x, x < 8), and width field was never used in code. Relocations proccessing loop had checks for width field. Does not look like DWARF parser should do that. There is probably no much sense to validate relocations during proccessing them in parser. Patch removes relocation's width relative code from DWARFContext. Differential revision: https://reviews.llvm.org/D33194 llvm-svn: 303251	2017-05-17 12:10:51 +00:00
George Rimar	41e656768d	[DWARF] - Add RelocAddrEntry for cleanup. NFCi. Was mentioned as possible cleanup during review of D33184. llvm-svn: 303171	2017-05-16 14:05:45 +00:00
George Rimar	4671f2e08c	[DWARF] - Use DWARFAddressRange struct instead of uint64_t pair for DWARFAddressRangesVector. Recommit of r303159 "[DWARF] - Use DWARFAddressRange struct instead of uint64_t pair for DWARFAddressRangesVector" All places were shitched to use DWARFAddressRange now. Suggested during review of D33184. llvm-svn: 303163	2017-05-16 12:30:59 +00:00
George Rimar	3824cca7b3	Revert r303159 "[DWARF] - Use DWARFAddressRange struct instead of uint64_t pair for DWARFAddressRangesVector." Something went wrong, it broke BB. http://green.lab.llvm.org/green//job/clang-stage1-cmake-RA-incremental_build/38477/consoleFull#-200034420049ba4694-19c4-4d7e-bec5-911270d8a58c llvm-svn: 303162	2017-05-16 12:05:03 +00:00
George Rimar	8680b6ee9c	[DWARF] - Use DWARFAddressRange struct instead of uint64_t pair for DWARFAddressRangesVector. Suggested during review of D33184. llvm-svn: 303159	2017-05-16 11:54:19 +00:00
George Rimar	958b01aa69	[DWARF] - Speedup handling of relocations in DWARFContextInMemory. I am working on a speedup of building .gdb_index in LLD and noticed that relocations that are proccessed in DWARFContextInMemory often uses the same symbol in a row. This patch introduces caching to reduce the relocations proccessing time. For benchmark, I took debug LLC binary objects configured with -ggnu-pubnames and linked it using LLD. Link time without --gdb-index is about 4,45s. Link time with --gdb-index: a) Without patch: 19,16s b) With patch: 15,52s That means time spent on --gdb-index in this configuration is 19,16s - 4,45s = 14,71s (without patch) vs 15,52s - 4,45s = 11,07s (with patch). Differential revision: https://reviews.llvm.org/D31136 llvm-svn: 303051	2017-05-15 11:45:28 +00:00
Zachary Turner	dd3a739d52	[CodeView] Add a random access type visitor. This adds a visitor that is capable of accessing type records randomly and caching intermediate results that it learns about during partial linear scans. This yields amortized O(1) access to a type stream even though type streams cannot normally be indexed. Differential Revision: https://reviews.llvm.org/D33009 llvm-svn: 302936	2017-05-12 19:18:12 +00:00
Wolfgang Pieb	15fa44698c	[DWARF] Fix a parsing issue with type unit headers. Reviewers: dblaikie Differential Revision: https://reviews.llvm.org/D32987 llvm-svn: 302574	2017-05-09 19:38:38 +00:00
Aaron Ballman	f22f885b66	Removing a file that is not necessary (and was causing link diagnostics with MSVC 2015); NFC. llvm-svn: 302531	2017-05-09 14:22:48 +00:00
Diana Picus	e8da53f4e0	Revert "[Dwarf] Disable reference verification for now (PR32972)" This reverts commit r302520 because it break the unit tests. llvm-svn: 302524	2017-05-09 13:05:43 +00:00
Renato Golin	94d6c8fb36	[Dwarf] Disable reference verification for now (PR32972) There is no other explanation about why this only started happening now, even though it crashes on old code (supposedly reachable from here). The only common factor between the failing bots is that they use GCC (4.9 and 5.3) to compile Clang, while the others use Clang 3.8, but the failure is while building the tests, as an assertion, on Clang. Commenting it out for now in hope the bots will go back green, but we should keep looking for the real cause, and update bugzilla. llvm-svn: 302520	2017-05-09 12:36:50 +00:00
Greg Clayton	58a2e0d90b	Add const to "DWARFDie &Die" in a few functions as they can't change the DWARFDie. llvm-svn: 302471	2017-05-08 21:29:17 +00:00
Eugene Zemtsov	3b52dbd934	Fix typo llvm-svn: 302470	2017-05-08 21:20:53 +00:00
Greg Clayton	5404f114d3	Fix typo "veify" to "verify". llvm-svn: 302466	2017-05-08 20:53:00 +00:00
Zachary Turner	1dacb24222	[CodeView] Add support for random access type visitors. Previously type visitation was done strictly sequentially, and TypeIndexes were computed by incrementing the TypeIndex of the last visited record. This works fine for situations like dumping, but not when you want to visit types in random order. For example, in a debug session someone might lookup a symbol by name, find that it has TypeIndex 10,000 and then want to go straight to TypeIndex 10,000. In order to make this work, the visitation framework needs a mode where it can plumb TypeIndices through the callback pipeline. This patch adds such a mode. In doing so, it is necessary to provide an alternative implementation of TypeDatabase that supports random access, so that is done as well. Nothing actually uses these random access capabilities yet, but this will be done in subsequent patches. Differential Revision: https://reviews.llvm.org/D32928 llvm-svn: 302454	2017-05-08 18:38:43 +00:00
Zachary Turner	8c74673388	[CodeView] Reserve TypeDatabase records up front. Most of the time we know exactly how many type records we have in a list, and we want to use the visitor to deserialize them into actual records in a database. Previously we were just using push_back() every time without reserving the space up front in the vector. This is obviously terrible from a performance standpoint, and it's not uncommon to have PDB files with half a million type records, where the performance degredation was quite noticeable. llvm-svn: 302302	2017-05-05 22:02:37 +00:00
George Rimar	2122ff64c6	[llvm-dwarfdump] - Print an error message if section decompression failed. llvm-dwarfdump currently prints no message if decompression fails for some reason. I noticed that during work on one of LLD patches where LLD produced an broken output. It was a bit confusing to see no output for section dumped and no any error message at all. Patch adds error message for such cases. Differential revision: https://reviews.llvm.org/D32865 llvm-svn: 302221	2017-05-05 10:52:39 +00:00
Zachary Turner	bedc85fb4b	[pdb] Don't verify TPI hash values up front. Verifying the hash values as we are currently doing results in iterating every type record before the user even tries to access the first one, and the API user has no control over, or ability to hook into this process. As a result, when the user wants to iterate over types to print them or index them, this results in a second iteration over the same list of types. When there's upwards of 1,000,000 type records, this is obviously quite undesirable. This patch raises the verification outside of TpiStream , and llvm-pdbdump hooks a hash verification visitor into the normal dumping process. So we still verify the hash records, but we can do it while not requiring a second iteration over the type stream. Differential Revision: https://reviews.llvm.org/D32873 llvm-svn: 302206	2017-05-04 23:53:54 +00:00
Zachary Turner	1eb9a0297c	[PDB] Don't build the entire source file list up front. I tried to run llvm-pdbdump on a very large (~1.5GB) PDB to try and identify show-stopping performance problems. This patch addresses the first such problem. When loading the DBI stream, before anyone has even tried to access a single record, we build an in memory map of every source file for every module. In the particular PDB I was using, this was over 85 million files. Specifically, the complexity is O(mn) where m is the number of modules and n is the average number of source files (including headers) per module. The whole reason for doing this was so that we could have constant time access to any module and any of its source file lists. However, we can still get O(1) access to the source file list for a given module with a simple O(m) precomputation, and access to the list of modules is already O(1) anyway. So this patches reduces the O(mn) up-front precomputation to an O(m) one, where n is ~6,500 and n*m is about 85 million in my pathological test case. Differential Revision: https://reviews.llvm.org/D32870 llvm-svn: 302205	2017-05-04 23:53:29 +00:00
Greg Clayton	48ff66a280	Don't return an invalid line table if the DW_AT_stmt_list value is not in the .debug_line section. llvm-svn: 302180	2017-05-04 18:29:44 +00:00
Paul Robinson	ae2e6f37f3	clang-format and restyle DWARFFormValue before working on it. NFC llvm-svn: 302086	2017-05-03 21:53:21 +00:00
Zachary Turner	4f145b2a59	Remove unused private field. llvm-svn: 302069	2017-05-03 19:42:06 +00:00
Greg Clayton	c5b2d561e8	Break verification down into smaller functions to keep code clean. Adrian requested that we break things down to make things clean in the DWARFVerifier. This patch breaks everything down into nice individual functions and cleans up the code quite a bit and prepares us for the next round of verifiers. Differential Revision: https://reviews.llvm.org/D32812 llvm-svn: 302062	2017-05-03 18:25:46 +00:00
Davide Italiano	2e23ce4cad	[CodeView] Remove constructor initialization of a removed field. I should've staged this with my last commit. llvm-svn: 302059	2017-05-03 18:02:46 +00:00
Zachary Turner	cf468d86f3	[CodeView] Use actual strings for dealing with checksums and lines. The raw CodeView format references strings by "offsets", but it's confusing what table the offset refers to. In the case of line number information, it's an offset into a buffer of records, and an indirection is required to get another offset into a different table to find the final string. And in the case of checksum information, there is no indirection, and the offset refers directly to the location of the string in another buffer. This would be less confusing if we always just referred to the strings by their value, and have the library be smart enough to correctly resolve the offsets on its own from the right location. This patch makes that possible. When either reading or writing, all the user deals with are strings, and the library does the appropriate translations behind the scenes. llvm-svn: 302053	2017-05-03 17:11:40 +00:00
Zachary Turner	2d5c2cd3ce	[llvm-readobj] Update readobj to re-use parsing code. llvm-readobj hand rolls some CodeView parsing code for string tables, so this patch updates it to re-use some of the newly introduced parsing code in LLVMDebugInfoCodeView. Differential Revision: https://reviews.llvm.org/D32772 llvm-svn: 302052	2017-05-03 17:11:11 +00:00
Greg Clayton	b8c162b53c	Create DWARFVerifier.cpp and .h and move all DWARF verification code over into it. Adrian requested we create a DWARFVerifier.cpp file to contain all of the DWARF verification stuff. This change simply moves the functionality over into DWARFVerifier.h and DWARFVerifier.cpp, renames the DWARFVerifier methods to start with lower case, and switches DWARFContext.cpp over to using the new functionality. Differential Revision: https://reviews.llvm.org/D32809 llvm-svn: 302044	2017-05-03 16:02:29 +00:00
Zachary Turner	c504ae3cef	Resubmit r301986 and r301987 "Add codeview::StringTable" This was reverted due to a "missing" file, but in reality what happened was that I renamed a file, and then due to a merge conflict both the old file and the new file got added to the repository. This led to an unused cpp file being in the repo and not referenced by any CMakeLists.txt but #including a .h file that wasn't in the repo. In an even more unfortunate coincidence, CMake didn't report the unused cpp file because it was in a subdirectory of the folder with the CMakeLists.txt, and not in the same directory as any CMakeLists.txt. The presence of the unused file was then breaking certain tools that determine file lists by globbing rather than by what's specified in CMakeLists.txt In any case, the fix is to just remove the unused file from the patch set. llvm-svn: 302042	2017-05-03 15:58:37 +00:00
Greg Clayton	8df55b43e1	Verify that no compile units share the same line table in "llvm-dwarfdump --verify" Check to make sure no compile units have the same DW_AT_stmt_list values. Report a verification error if they do. Differential Revision: https://reviews.llvm.org/D32771 llvm-svn: 302039	2017-05-03 15:45:31 +00:00
Daniel Jasper	dff096f217	Revert r301986 (and subsequent r301987). The patch is failing to add StringTableStreamBuilder.h, but that isn't even discovered because the corresponding StringTableStreamBuilder.cpp isn't added to any CMakeLists.txt file and thus never built. I think this patch is just incomplete. llvm-svn: 302002	2017-05-03 07:29:25 +00:00
Zachary Turner	59e83892e0	Fix use after free in BinaryStream library. This was reported by the ASAN bot, and it turned out to be a fairly fundamental problem with the design of VarStreamArray and the way it passes context information to the extractor. The fix was cumbersome, and I'm not entirely pleased with it, so I plan to revisit this design in the future when I'm not pressed to get the bots green again. For now, this fixes the issue by storing the context information by value instead of by reference, and introduces some impossibly-confusing template magic to make things "work". llvm-svn: 301999	2017-05-03 05:34:00 +00:00
Zachary Turner	67736594f7	Fix type conversion error. llvm-svn: 301987	2017-05-02 23:41:51 +00:00
Zachary Turner	7dba20bd2b	Make codeview::StringTable. Previously we had knowledge of how to serialize and deserialize a string table inside of DebugInfo/PDB, but the string table that it serializes contains a piece that is actually considered CodeView and can appear outside of a PDB. We already have logic in llvm-readobj and MCCodeView to read and write this format, so it doesn't make sense to duplicate the logic in DebugInfoPDB as well. This patch makes codeview::StringTable (for writing) and codeview::StringTableRef (for reading), updates DebugInfoPDB to use these classes for its own writing, and updates llvm-readobj to additionally use StringTableRef for reading. It's a bit more difficult to get MCCodeView to use this for writing, but it's a logical next step. llvm-svn: 301986	2017-05-02 23:36:17 +00:00
Greg Clayton	6707046f90	Add line table verification to lldb-dwarfdump --verify This patch verifies the .debug_line: - verify all addresses in a line table sequence have ascending addresses - verify that all line table file indexes are valid Unit tests added for both cases. Differential Revision: https://reviews.llvm.org/D32765 llvm-svn: 301984	2017-05-02 22:48:52 +00:00
Paul Robinson	2bc3873fe6	[DWARFv5] Parse new line-table header format. The directory and file tables now have form-based content descriptors. Parse these and extract the per-directory/file records based on the descriptors. For now we support only DW_FORM_string (inline) for the path names; follow-up work will add support for indirect forms (i.e., DW_FORM_strp, strx<N>, and line_strp). Differential Revision: http://reviews.llvm.org/D32713 llvm-svn: 301978	2017-05-02 21:40:47 +00:00
Greg Clayton	c7695a8e45	Verify that all references point to actual DIEs in "llvm-dwarfdump --verify" LTO and other fancy linking previously led to DWARF that contained invalid references. We already validate that CU relative references fall into the CU, and the DW_FORM_ref_addr references fall inside the .debug_info section, but we didn't validate that the references pointed to correct DIE offsets. This new verification will ensure that all references refer to actual DIEs and not an offset in between. This caught a bug in DWARFUnit::getDIEForOffset() where if you gave it any offset, it would match the DIE that mathes the offset _or_ the next DIE. This has been fixed. Differential Revision: https://reviews.llvm.org/D32722 llvm-svn: 301971	2017-05-02 20:28:33 +00:00
Zachary Turner	e204a6c9a3	Rename pdb::StringTable -> pdb::PDBStringTable. With the forthcoming codeview::StringTable which a pdb::StringTable would hold an instance of as one member, this ambiguity becomes confusing. Rename to PDBStringTable to avoid this. llvm-svn: 301948	2017-05-02 18:00:13 +00:00
Paul Robinson	ba1c91564b	Make DWARFDebugLine use StringRef for directory/file tables. NFC Differential Revision: http://reviews.llvm.org/D32728 llvm-svn: 301940	2017-05-02 17:37:32 +00:00
Zachary Turner	edef14510e	[PDB/CodeView] Read/write codeview inlinee line information. Previously we wrote line information and file checksum information, but we did not write information about inlinee lines and functions. This patch adds support for that. llvm-svn: 301936	2017-05-02 16:56:09 +00:00
Paul Robinson	9d4eb6922e	Stylistic makeover of DWARFDebugLine before working on it. NFC Rename parameters and locals to CamelCase, doxygenize the header, and run clang-format on the whole thing. llvm-svn: 301883	2017-05-01 23:27:55 +00:00
Zachary Turner	8a2ebfb1cd	[CodeView] Write CodeView line information. Differential Revision: https://reviews.llvm.org/D32716 llvm-svn: 301882	2017-05-01 23:27:42 +00:00
Greg Clayton	48432cfbeb	Adds initial llvm-dwarfdump --verify support with unit tests. lldb-dwarfdump gets a new "--verify" option that will verify a single file's DWARF debug info and will print out any errors that it finds. It will return an non-zero exit status if verification fails, and a zero exit status if verification succeeds. Adding the --quiet option will suppress any output the STDOUT or STDERR. The first part of the verify does the following: - verifies that all CU relative references (DW_FORM_ref1, DW_FORM_ref2, DW_FORM_ref4, DW_FORM_ref8, DW_FORM_ref_udata) have valid CU offsets - verifies that all DW_FORM_ref_addr references have valid .debug_info offsets - verifies that all DW_AT_ranges attributes have valid .debug_ranges offsets - verifies that all DW_AT_stmt_list attributes have valid .debug_line offsets - verifies that all DW_FORM_strp attributes have valid .debug_str offsets Unit tests were added for each of the above cases. Differential Revision: https://reviews.llvm.org/D32707 llvm-svn: 301844	2017-05-01 22:07:02 +00:00
Zachary Turner	7cc13e557c	[PDB/CodeView] Rename some classes. In preparation for introducing writing capabilities for each of these classes, I would like to adopt a Foo / FooRef naming convention, where Foo indicates that the class can manipulate and serialize Foos, and FooRef indicates that it is an immutable view of an existing Foo. In other words, Foo is a writer and FooRef is a reader. This patch names some existing readers to conform to the FooRef convention, while offering no functional change. llvm-svn: 301810	2017-05-01 16:46:39 +00:00
Zachary Turner	5b6e4e0aed	[llvm-pdbdump] Abstract some of the YAML/Raw printing code. There is a lot of duplicate code for printing line info between YAML and the raw output printer. This introduces a base class that can be shared between the two, and makes some minor cleanups in the process. llvm-svn: 301728	2017-04-29 01:13:21 +00:00
Zachary Turner	05bd9f3713	[llvm-readobj] Use LLVMDebugInfoCodeView to parse line tables. The llvm-readobj parsing code currently exists in our CodeView library, so we use that to parse instead of re-writing the logic in the tool. llvm-svn: 301718	2017-04-28 23:41:36 +00:00
George Rimar	96a3de2729	[DWARF] - Fix mistype in dump output of pub* tables. NFC. There was a garbage character in output introduced by myself in r290040 "[DWARF] - Introduce DWARFDebugPubTable class for dumping pub* sections." llvm-svn: 301631	2017-04-28 08:54:10 +00:00
Zachary Turner	c37cb0c6a5	[CodeView] Isolate Debug Info Fragments into standalone classes. Previously parsing of these were all grouped together into a single master class that could parse any type of debug info fragment. With writing forthcoming, the complexity of each individual fragment is enough to warrant them having their own classes so that reading and writing of each fragment type can be grouped together, but isolated from the code for reading and writing other fragment types. In doing so, I found a place where parsing code was duplicated for the FileChecksums fragment, across llvm-readobj and the CodeView library, and one of the implementations had a bug. Now that the codepaths are merged, the bug is resolved. Differential Revision: https://reviews.llvm.org/D32547 llvm-svn: 301557	2017-04-27 16:12:16 +00:00
Zachary Turner	e509447418	[Support] Make BinaryStreamArray extractors stateless. Instead, we now pass a context memeber through the extraction process. llvm-svn: 301556	2017-04-27 16:11:47 +00:00
Zachary Turner	67c5601404	Rename some PDB classes. We have a lot of very similarly named classes related to dealing with module debug info. This patch has NFC, it just renames some classes to be more descriptive (albeit slightly more to type). The mapping from old to new class names is as follows: Old \| New ModInfo \| DbiModuleDescriptor ModuleSubstream \| ModuleDebugFragment ModStream \| ModuleDebugStream With the corresponding Builder classes renamed accordingly. Differential Revision: https://reviews.llvm.org/D32506 llvm-svn: 301555	2017-04-27 16:11:19 +00:00
George Rimar	e6ef4488e1	[llvm-dwarfdump] - Change format for .gdb_index dump. It is useful to output size of ranges when address ranges section of .gdb_index is dumped. It helps to compare outputs produced by different linkers, for example. In that case address ranges can look very different, when they are the same at fact. Difference comes from different low address because of different address of .text. Differential revision: https://reviews.llvm.org/D32492 llvm-svn: 301527	2017-04-27 10:00:13 +00:00
Zachary Turner	da307b64dd	[llvm-pdbdump] Allow sorting / filtering by immediate padding llvm-svn: 301358	2017-04-25 20:22:29 +00:00
Zachary Turner	ee3b9c2558	[llvm-pdbdump] Dump File / Line Info to YAML. We were already parsing and dumping this to the human readable format, but not to the YAML format. This does so, in preparation for reading it in and reconstructing the line information from YAML. llvm-svn: 301357	2017-04-25 20:22:02 +00:00
Zachary Turner	1690164cac	[llvm-pdbdump] Re-write the record layout code to be more resilient. This reworks the way virtual bases are handled, and also the way padding is detected across multiple levels of aggregates, producing a much more accurate result. llvm-svn: 301203	2017-04-24 17:47:24 +00:00
George Rimar	ca53211beb	[DWARF] - Take relocations in account when extracting ranges from .debug_ranges I found this when investigated "Bug 32319 - .gdb_index is broken/incomplete" for LLD. When we have object file with .debug_ranges section it may be filled with zeroes. Relocations are exist in file to relocate this zeroes into real values later, but until that a pair of zeroes is treated as terminator. And DWARF parser thinks there is no ranges at all when I am trying to collect address ranges for building .gdb_index. Solution implemented in this patch is to take relocations in account when parsing ranges. Differential revision: https://reviews.llvm.org/D32228 llvm-svn: 301170	2017-04-24 10:19:45 +00:00
George Rimar	f8a9642526	[DWARF] - Refactoring: localize handling of relocations in a single place. This is splitted from D32228, currently DWARF parsers code has few places that applied relocations values manually. These places has similar duplicated code. Patch introduces separate method that can be used to obtain relocated value. That helps to reduce code and simplifies things. Differential revision: https://reviews.llvm.org/D32284 llvm-svn: 300956	2017-04-21 09:12:18 +00:00
Dehao Chen	db569bae55	Code style change as suggested in https://reviews.llvm.org/D32177 (NFC) llvm-svn: 300753	2017-04-19 20:52:21 +00:00
Dehao Chen	a364f09f18	Using address range map to speedup finding inline stack for address. Summary: In the current implementation, to find inline stack for an address incurs expensive linear search in 2 places: * linear search for the top-level DIE * recursive linear traverse the DIE tree to find the path to the leaf DIE In this patch, a map is built from address to its corresponding leaf DIE. The inline stack is built by traversing from the leaf DIE up to the root DIE. This speeds up batch symbolization by ~10X without noticible memory overhead. Reviewers: dblaikie Reviewed By: dblaikie Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D32177 llvm-svn: 300742	2017-04-19 20:09:38 +00:00
Dehao Chen	e0b77b24d9	Revert r300697 which causes buildbot failure. llvm-svn: 300708	2017-04-19 15:28:58 +00:00
Dehao Chen	74f3e0d426	Using address range map to speedup finding inline stack for address. Summary: In the current implementation, to find inline stack for an address incurs expensive linear search in 2 places: * linear search for the top-level DIE * recursive linear traverse the DIE tree to find the path to the leaf DIE In this patch, a map is built from address to its corresponding leaf DIE. The inline stack is built by traversing from the leaf DIE up to the root DIE. This speeds up batch symbolization by ~10X without noticible memory overhead. Reviewers: dblaikie Reviewed By: dblaikie Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D32177 llvm-svn: 300697	2017-04-19 14:50:57 +00:00
Dehao Chen	ef700d550e	Add GNU_discriminator support for inline callsites in llvm-symbolizer. Summary: LLVM symbolize cannot recognize GNU_discriminator for inline callsites. This patch adds support for it. Reviewers: dblaikie Reviewed By: dblaikie Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D32134 llvm-svn: 300486	2017-04-17 20:10:39 +00:00
Zachary Turner	4dc4f01a86	[llvm-pdbdump] Recursively dump class layout. llvm-svn: 300258	2017-04-13 21:11:00 +00:00
George Rimar	d4998b0344	[DWARF] - Simplify (use dyn_cast instead of isa + cast). This addresses post commit review comments for r300039. llvm-svn: 300188	2017-04-13 09:52:50 +00:00
Zachary Turner	75999dff93	Fix initialization order of class members. llvm-svn: 300137	2017-04-12 23:27:43 +00:00
Zachary Turner	9e7dda3c6d	[llvm-pdbdump] Minor prepatory refactor of Class Def Dumper. In a followup patch I intend to introduce an additional dumping mode which dumps a graphical representation of a class's layout. In preparation for this, the text-based layout printer needs to be split out from the graphical layout printer, and both need to be able to use the same code for printing the intro and outro of a class's definition (e.g. base class list, etc). This patch does so, and in the process introduces a skeleton definition for the graphical printer, while currently making the graphical printer just print nothing. NFC llvm-svn: 300134	2017-04-12 23:18:51 +00:00
Zachary Turner	c883a8c6dc	[llvm-pdbdump] More advanced class definition dumping. Previously the dumping of class definitions was very primitive, and it made it hard to do more than the most trivial of output formats when dumping. As such, we would only dump one line for each field, and then dump non-layout items like nested types and enums. With this patch, we do a complete analysis of the object hierarchy including aggregate types, bases, virtual bases, vftable analysis, etc. The only immediately visible effects of this are that a) we can now dump a line for the vfptr where before we would treat that as padding, and b) we now don't treat virtual bases that come at the end of a class as padding since we have a more detailed analysis of the class's storage usage. In subsequent patches, we should be able to use this analysis to display a complete graphical view of a class's layout including recursing arbitrarily deep into an object's base class / aggregate member hierarchy. llvm-svn: 300133	2017-04-12 23:18:21 +00:00
Krasimir Georgiev	4ed589d8d6	[DWARF] Fix compiler warnings in DWARFContext.cpp, NFCi llvm-svn: 300051	2017-04-12 11:33:26 +00:00
George Rimar	702dac6d35	[DWARF] - Refactoring of DWARFContextInMemory implementation. This change is basically relative to D31136, where I initially wanted to implement some relocations handling optimization which shows it can give significant boost. Though even without any caching algorithm looks code can have some cleanup at first. Refactoring separates out code for taking symbol address, used in relocations computation. Differential revision: https://reviews.llvm.org/D31747 llvm-svn: 300039	2017-04-12 08:59:15 +00:00
Reid Kleckner	6e545ffc4e	[PDB] Emit index/offset pairs for TPI and IPI streams Summary: This lets PDB readers lookup type record data by type index in O(log n) time. It also enables makes `cvdump -t` work on PDBs produced by LLD. cvdump will not dump a PDB that doesn't have an index-to-offset table. The table is sorted by type index, and has an entry every 8KB. Looking up a type record by index is a binary search of this table, followed by a scan of at most 8KB. Reviewers: ruiu, zturner, inglorion Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D31636 llvm-svn: 299958	2017-04-11 16:26:15 +00:00
Vassil Vassilev	e1f12fadc0	Remove unused functions. Remove static qualifier from functions in header files. NFC. llvm-svn: 299947	2017-04-11 14:55:32 +00:00
Adrian McCarthy	08eb343cce	Improves pretty printing of variable types in llvm-pdbdump * Adds support for pointers to arrays, which was missing * Adds some tests * Improves consistency of const and volatile qualifiers * Eliminates non-composable special case code for arrays and function by using a more general recursive approach * Has a hack for getting the calling convention into the right spot for pointer-to-functions Given the rapid changes happenning in llvm-pdbdump, this may be difficult to merge. Differential Revision: https://reviews.llvm.org/D31832 llvm-svn: 299848	2017-04-10 16:43:09 +00:00
Zachary Turner	1b1a70f172	General usability improvements to generic PDB library. 1. Added some asserts to make sure concrete symbol types don't get constructed with RawSymbols that have an incompatible SymTag enum value. 2. Added new forwarding macros that auto-define an Id/Sym method pair whenever there is a method that returns a SymIndexId. Previously we would just provide one method that returned only the SymIndexId and it was up to the caller to use the Session object to get a pointer to the symbol. Now we automatically get both the method that returns the Id, as well as a method that returns the pointer directly with just one macro. 3. Added some methods for dumping straight to stdout that can be used from inside the debugger for diagnostics during a debug session. 4. Added a clone() method and a cast<T>() method to PDBSymbol that can shorten some usage patterns. llvm-svn: 299831	2017-04-10 06:14:09 +00:00
Reid Kleckner	13fc411e39	[PDB] Save one type record copy Summary: The TypeTableBuilder provides stable storage for type records. We don't need to copy all of the bytes into a flat vector before adding it to the TpiStreamBuilder. This makes addTypeRecord take an ArrayRef<uint8_t> and a hash code to go with it, which seems like a simplification. Reviewers: ruiu, zturner, inglorion Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D31634 llvm-svn: 299406	2017-04-04 00:56:34 +00:00
Reid Kleckner	c4b5d794f1	[codeview] Cope with unsorted streams in type merging Summary: MASM can produce type streams that are not topologically sorted. It can even produce type streams with circular references, but those are not common in practice. Reviewers: inglorion, ruiu Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D31629 llvm-svn: 299403	2017-04-03 23:58:15 +00:00
Reid Kleckner	1c3b5087b7	[codeview] Add support for label type records MASM can produce these type records. llvm-svn: 299388	2017-04-03 21:25:20 +00:00
Reid Kleckner	acd9a6f09d	[codeview] Fix buggy BeginIndexMapSize assertion This assert is just trying to test that processing each record adds exactly one entry to the index map. The assert logic was wrong when the first record in the type stream was a field list. I've simplified the code by moving the LF_FIELDLIST-specific logic into the callback for that record type. llvm-svn: 299035	2017-03-29 22:51:22 +00:00
Adrian McCarthy	4d93d66ddd	Re-land: "Make NativeExeSymbol a concrete subclass of NativeRawSymbol [PDB]" This should work on all platforms now that r299006 has landed. Tested locally on Windows and Linux. This moves exe symbol-specific method implementations out of NativeRawSymbol into a concrete subclass. Also adds implementations for hasCTypes and hasPrivateSymbols and a simple test to ensure the native reader can access the summary information for the executable from the PDB. Original Differential Revision: https://reviews.llvm.org/D31059 llvm-svn: 299019	2017-03-29 19:27:08 +00:00
Reid Kleckner	5d57752c81	[PDB] Split item and type records when merging type streams Summary: MSVC does this when producing a PDB. Reviewers: ruiu Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D31316 llvm-svn: 298717	2017-03-24 17:26:38 +00:00
Reid Kleckner	a5d187b0ff	[PDB] Use two DBs when dumping the IPI stream Summary: When dumping these records from an object file section, we should use only one type database. However, when dumping from a PDB, we should use two: one for the type stream and one for the IPI stream. Certain type records that normally live in the .debug$T object file section get moved over to the IPI stream of the PDB file and they get new indices. So far, I've noticed that the MSVC linker always moves these records into IPI: - LF_FUNC_ID - LF_MFUNC_ID - LF_STRING_ID - LF_SUBSTR_LIST - LF_BUILDINFO - LF_UDT_MOD_SRC_LINE These records have index fields that can point into TPI or IPI. In particular, LF_SUBSTR_LIST and LF_BUILDINFO point to LF_STRING_ID records to describe compilation command lines. I've modified the dumper to have an optional pointer to the item DB, and to do type name lookup of these fields in that DB. See printItemIndex. The result is that our pdbdump-headers.test is more faithful to the PDB contents and the output is less confusing. Reviewers: ruiu Subscribers: amccarth, zturner, llvm-commits Differential Revision: https://reviews.llvm.org/D31309 llvm-svn: 298649	2017-03-23 21:36:25 +00:00
Adrian McCarthy	3c0328e011	Somehow this still breaks because of ANSI color codes in test output on Linux. Reverting until I can figure out the root cause. Revert "Re-land: Make NativeExeSymbol a concrete subclass of NativeRawSymbol [PDB]" This reverts commit f461a70cc376f0f91c8b4917be79479cc86330a5. llvm-svn: 298626	2017-03-23 17:18:50 +00:00
Adrian McCarthy	997a15c3c3	Re-land: Make NativeExeSymbol a concrete subclass of NativeRawSymbol [PDB] The new test should pass on all platforms now that llvm-pdbdump has the `-color-output` option. This moves exe symbol-specific method implementations out of NativeRawSymbol into a concrete subclass. Also adds implementations for hasCTypes and hasPrivateSymbols and a simple test to ensure the native reader can access the summary information for the executable from the PDB. Original Differential Revision: https://reviews.llvm.org/D31059 llvm-svn: 298623	2017-03-23 16:45:20 +00:00
Reid Kleckner	c573acd9e9	[codeview] Move type index remapping logic to type merger Summary: This removes the 'remapTypeIndices' method on every TypeRecord class. My original idea was that this would be the beginning of some kind of generic entry point that would enumerate all of the TypeIndices inside of a TypeRecord, so that we could write generic graph algorithms for them without duplicating the knowledge of which fields are type index fields everywhere. This never happened, and nothing else uses this method. I need to change the API to deal with merging into IPI streams, so let's move it into the file that uses it first. Reviewers: zturner, ruiu Reviewed By: zturner, ruiu Subscribers: mgorny, llvm-commits Differential Revision: https://reviews.llvm.org/D31267 llvm-svn: 298564	2017-03-23 00:14:23 +00:00
Reid Kleckner	45928018c5	[codeview] Use separate records for LF_SUBSTR_LIST and LF_ARGLIST They are structurally the same, but now we need to distinguish them because one record lives in the IPI stream and the other lives in TPI. llvm-svn: 298474	2017-03-22 01:37:38 +00:00
Zachary Turner	2d9c082033	Revert "Make NativeExeSymbol a concrete subclass of NativeRawSymbol [PDB]" For some reason this is causing ANSI color codes to be printed even when run through FileCheck. llvm-svn: 298026	2017-03-17 00:46:42 +00:00
Zachary Turner	2ed2aa75bf	[pdb] Fix an uninitialized read, and add a test for it. This was originally reported in pr32249, uncovered by PTVS-Studio. There was no code coverage for this path because it was difficult to construct odd-case PDB files that were not generated by cl. Now that we can write construct minimal PDB files from YAML, it's easy to construct fragments that generate whatever we want. In this patch I add a test that creates 2 type records. One with a unique name, and one without. I verify that we can go from PDB to Yaml with no errors. In a future patch I'd like to add something like llvm-pdbdump raw -lookup-type that will just dump one record and nothing else, which should make it a bit cleaner to find this kind of thing. llvm-svn: 298017	2017-03-17 00:15:55 +00:00
Zachary Turner	42cb87f401	[PDB] It is not an error getting the "Invalid" Annotation opcode. The linker can insert invalid opcodes to indicate padding bytes, and we should not fail in this case. llvm-svn: 298016	2017-03-17 00:15:27 +00:00
Adrian McCarthy	21b54cf632	Make NativeExeSymbol a concrete subclass of NativeRawSymbol [PDB] This moves exe symbol-specific method implementations out of NativeRawSymbol into a concrete subclass. Also adds implementations for hasCTypes and hasPrivateSymbols and a simple test to ensure the native reader can access the summary information for the executable from the PDB. Differential Revision: https://reviews.llvm.org/D31059 llvm-svn: 298005	2017-03-16 22:28:39 +00:00
Zachary Turner	c9500616d8	Silence -Wcovered-switch-default warning. llvm-svn: 297990	2017-03-16 20:45:11 +00:00
Zachary Turner	05d5e6136f	[PDB] Add support for parsing Flags from PDB Stream. This was discovered when running `llvm-pdbdump diff` against two files, the second of which was generated by running the first one through pdb2yaml and then yaml2pdb. The second one was missing some bytes from the PDB Stream, and tracking this down showed that at the end of the PDB Stream were some additional bytes that we were ignoring. Looking back to the reference code, these seem to specify some additional flags that indicate whether the PDB supports various optional features. This patch adds support for reading, writing, and round-tripping these flags through YAML and the raw dumper, and updates the tests accordingly. llvm-svn: 297984	2017-03-16 20:19:11 +00:00
Zachary Turner	02278ce09f	[llvm-pdbdump] Add support for diffing the PDB Stream. In doing so I discovered that we completely ignore some bytes of the PDB Stream after we "finish" loading it. These bytes seem to specify some additional information about what kind of data is present in the PDB. A subsequent patch will add code to read in those fields and store their values. llvm-svn: 297983	2017-03-16 20:18:41 +00:00
Zachary Turner	f1220084f6	[llvm-pdbdump] Add support for diffing the String Table. llvm-svn: 297901	2017-03-15 22:19:30 +00:00
Zachary Turner	ea4e60754e	[pdb] Write the module info and symbol record streams. Previously we did not have support for writing detailed module information for each module, as well as the symbol records. This patch adds support for this, and in doing so enables the ability to construct minimal PDBs from just a few lines of YAML. A test is added to illustrate this functionality. llvm-svn: 297900	2017-03-15 22:18:53 +00:00
Adrian McCarthy	ad6d60a46b	NFC: Corrects comments that were supposed to go in with earlier commit. llvm-svn: 297887	2017-03-15 20:29:06 +00:00
Adrian McCarthy	65d2688842	Introduce NativeEnumModules and NativeCompilandSymbol Together, these allow lldb-pdbdump to list all the modules from a PDB using a native reader (rather than DIA). Note that I'll probably be specializing NativeRawSymbol in a subsequent patch. Differential Revision: https://reviews.llvm.org/D30956 llvm-svn: 297883	2017-03-15 20:17:58 +00:00
David Blaikie	1914c82d6c	Fix llvm-symbolizer to navigate both DW_AT_abstract_origin and DW_AT_specification in a single chain In a recent refactoring (r291959) this regressed to only following one or the other, not both, in a single chain. llvm-svn: 297676	2017-03-13 21:46:37 +00:00
Zachary Turner	407dec59a4	[llvm-pdbdump] Add support for dumping symbols from Yaml -> PDB. Previously we could round-trip type records from PDB -> Yaml -> PDB, but for symbols we could only go from PDB -> Yaml. This completes the round-tripping for symbols as well. llvm-svn: 297625	2017-03-13 14:57:45 +00:00
Paul Robinson	f96e21ad6d	[DWARFv5] Update definitions to match published spec. Some late additions to DWARF v5 were not in Dwarf.def; also one form was redefined. Add the new cases to relevant switches in different parts of LLVM. Replace DW_FORM_ref_sup with DW_FORM_ref_sup[4,8]. I did not add support for DW_FORM_strx3/addrx3 other that defining the constants. We don't have any infrastructure to support these. Differential Revision: http://reviews.llvm.org/D30664 llvm-svn: 297085	2017-03-06 22:20:03 +00:00
Zachary Turner	d9dc2829ea	[Support] Move Stream library from MSF -> Support. After several smaller patches to get most of the core improvements finished up, this patch is a straight move and header fixup of the source. Differential Revision: https://reviews.llvm.org/D30266 llvm-svn: 296810	2017-03-02 20:52:51 +00:00
Paul Robinson	8932d64891	[DWARF] Print leading zeros in type signature llvm-svn: 296663	2017-03-01 19:43:29 +00:00
Eugene Zelenko	28db7e65e5	[DebugInfo] Fix some Include What You Use warnings; other minor fixes (NFC). llvm-svn: 296559	2017-03-01 01:14:23 +00:00
Paul Robinson	cddd60445e	[DWARFv5] Emit new unit header format. Requesting DWARF v5 will now get you the new compile-unit and type-unit headers. llvm-dwarfdump will also recognize them. Differential Revision: http://reviews.llvm.org/D30206 llvm-svn: 296514	2017-02-28 20:24:55 +00:00
Zachary Turner	52c0077df0	Fix -Wcovered-switch-default warning. llvm-svn: 296501	2017-02-28 18:35:40 +00:00
Zachary Turner	d0b44fa788	[PDB] Add BinaryStreamError. This migrates the stream code away from MSFError to using its own custom Error class. llvm-svn: 296494	2017-02-28 17:49:34 +00:00
Zachary Turner	695ed56ba5	[PDB] Make streams carry their own endianness. Before the endianness was specified on each call to read or write of the StreamReader / StreamWriter, but in practice it's extremely rare for streams to have data encoded in multiple different endiannesses, so we should optimize for the 99% use case. This makes the code cleaner and more general, but otherwise has NFC. llvm-svn: 296415	2017-02-28 00:04:07 +00:00
Eugene Zelenko	e94042cafe	[DebugInfo] Fix some Clang-tidy modernize and Include What You Use warnings; other minor fixes (NFC). llvm-svn: 296413	2017-02-27 23:43:14 +00:00
Zachary Turner	4c7458b00d	Remove some code accidentally left in. llvm-svn: 296407	2017-02-27 22:57:32 +00:00
Zachary Turner	120faca41b	[PDB] Partial resubmit of r296215, which improved PDB Stream Library. This was reverted because it was breaking some builds, and because of incorrect error code usage. Since the CL was large and contained many different things, I'm resubmitting it in pieces. This portion is NFC, and consists of: 1) Renaming classes to follow a consistent naming convention. 2) Fixing the const-ness of the interface methods. 3) Adding detailed doxygen comments. 4) Fixing a few instances of passing `const BinaryStream& X`. These are now passed as `BinaryStreamRef X`. llvm-svn: 296394	2017-02-27 22:11:43 +00:00
NAKAMURA Takumi	05a75e40da	Revert r296215, "[PDB] General improvements to Stream library." and followings. r296215, "[PDB] General improvements to Stream library." r296217, "Disable BinaryStreamTest.StreamReaderObject temporarily." r296220, "Re-enable BinaryStreamTest.StreamReaderObject." r296244, "[PDB] Disable some tests that are breaking bots." r296249, "Add static_cast to silence -Wc++11-narrowing." std::errc::no_buffer_space should be used for OS-oriented errors for socket transmission. (Seek discussions around llvm/xray.) I could substitute s/no_buffer_space/others/g, but I revert whole them ATM. Could we define and use LLVM errors there? llvm-svn: 296258	2017-02-25 17:04:23 +00:00
Victor Leschuk	96d9981ec6	[DebugInfo] Skip implicit_const attributes when dumping .debug_info. NFC. When dumping .debug_info section we loop through all attributes mentioned in .debug_abbrev section and dump values using DWARFFormValue::extractValue(). We need to skip implicit_const attributes here as their values are not really located in .debug_info but directly in .debug_abbrev. This patch fixes triggered assert() in DWARFFormValue::extractValue() caused by trying to access implicit_const values from .debug_info. llvm-svn: 296253	2017-02-25 13:15:57 +00:00
Zachary Turner	af299ea5d4	[PDB] General improvements to Stream library. This adds various new functionality and cleanup surrounding the use of the Stream library. Major changes include: * Renaming of all classes for more consistency / meaningfulness * Addition of some new methods for reading multiple values at once. * Full suite of unit tests for reader / writer functionality. * Full set of doxygen comments for all classes. * Streams now store their own endianness. * Fixed some bugs in a few of the classes that were discovered by the unit tests. llvm-svn: 296215	2017-02-25 00:44:30 +00:00
Zachary Turner	d2684b7969	[PDB] Rename Stream related source files. This is part of a larger effort to get the Stream code moved up to Support. I don't want to do it in one large patch, in part because the changes are so big that it will treat everything as file deletions and add, losing history in the process. Aside from that though, it's just a good idea in general to make small changes. So this change only changes the names of the Stream related source files, and applies necessary source fix ups. llvm-svn: 296211	2017-02-25 00:33:34 +00:00
Adrian McCarthy	649b8e0c45	Implement some methods for NativeRawSymbol This allows the ability to call IPDBSession::getGlobalScope with a NativeSession and to then query it for some basic fields from the PDB's InfoStream. Note that the symbols now have non-const references back to the Session so that NativeRawSymbol can access the PDBFile through the Session. Differential Revision: https://reviews.llvm.org/D30314 llvm-svn: 296049	2017-02-24 00:10:47 +00:00
Zachary Turner	181fe17b6f	Don't assume little endian in StreamReader / StreamWriter. In an effort to generalize this so it can be used by more than just PDB code, we shouldn't assume little endian. llvm-svn: 295525	2017-02-18 01:35:33 +00:00
Zachary Turner	7b327d051b	[pdb] Add the ability to resolve TypeServer PDBs. Some PDBs or object files can contain references to other PDBs where the real type information lives. When this happens, all type indices in the original PDB are meaningless because their records are not there. With this patch we add the ability to pull type info from those secondary PDBs. Differential Revision: https://reviews.llvm.org/D29973 llvm-svn: 295382	2017-02-16 23:35:45 +00:00
Eric Christopher	e4b10f5d37	Add an additional set of braces to deal with subobject initialization. llvm-svn: 294674	2017-02-10 00:02:09 +00:00
Adrian McCarthy	d6e091dcc5	Fix build break from r294633. llvm-svn: 294642	2017-02-09 22:49:35 +00:00
Adrian McCarthy	0beb3323c5	Introduce NativeRawSymbol for PDB reading. This is a stub for a new concrete implementation of IPDBRawSymbol. Nothing uses this uses this implementation yet. My plan is to locally switch lldb-pdbdump from the DIA reader to the Native one and flesh out the implementations of these method stubs in the order they're needed. llvm-svn: 294633	2017-02-09 21:51:19 +00:00
Eugene Zelenko	44d951226e	[MC] Fix some Clang-tidy modernize and Include What You Use warnings in SubtargetFeature; other minor fixes (NFC). Same changes in files affected by reduced SubtargetFeature.h dependencies. llvm-svn: 294548	2017-02-09 01:09:54 +00:00
David Blaikie	efc4eba816	Get function start line number from DWARF info DWARF info contains info about the line number at which a function starts (DW_AT_decl_line). This patch creates a function to look up the start line number for a function, and returns it in DILineInfo when looking up debug info for a particular address. Patch by Simon Que! Reviewed By: dblaikie Differential Revision: https://reviews.llvm.org/D27962 llvm-svn: 294231	2017-02-06 20:19:02 +00:00
Zachary Turner	5ce0f4a9de	Properly parse the TypeServer2 record. llvm-svn: 294046	2017-02-03 21:22:27 +00:00
Rui Ueyama	a9b29615fb	Re-submit r293820: Return Error instead of bool from mergeTypeStreams(). llvm-svn: 293847	2017-02-02 00:47:10 +00:00
Rui Ueyama	7d07a1652d	Revert r293820: Return Error instead of bool from mergeTypeStreams(). It broke buildbots. llvm-svn: 293824	2017-02-01 22:28:43 +00:00
Rui Ueyama	00d4f49717	Return Error instead of bool from mergeTypeStreams(). Previously, mergeTypeStreams returns only true or false, so it was impossible to know the reason if it failed. This patch changes the function signature so that it returns an Error object. Differential Revision: https://reviews.llvm.org/D29362 llvm-svn: 293820	2017-02-01 22:09:34 +00:00
Zachary Turner	d50c01308e	[pdb] Add a new command for analyzing hash collisions. This introduces the `analyze` subcommand. For now there is only one option, to analyze hash collisions in the type streams. In the future, however, we could add many more things here, such as performing size analyses, compacting, and statistics about the type of records etc. llvm-svn: 293795	2017-02-01 18:30:22 +00:00
David Blaikie	0012dd5db1	Add a verbose/human readable mode to llvm-symbolizer to investigate discriminators and other line table/backtrace features Patch by Simon Que! Differential Revision: https://reviews.llvm.org/D29094 llvm-svn: 293697	2017-01-31 22:19:38 +00:00
Matthias Braun	8c209aa877	Cleanup dump() functions. We had various variants of defining dump() functions in LLVM. Normalize them (this should just consistently implement the things discussed in http://lists.llvm.org/pipermail/cfe-dev/2014-January/034323.html For reference: - Public headers should just declare the dump() method but not use LLVM_DUMP_METHOD or #if !defined(NDEBUG) \|\| defined(LLVM_ENABLE_DUMP) - The definition of a dump method should look like this: #if !defined(NDEBUG) \|\| defined(LLVM_ENABLE_DUMP) LLVM_DUMP_METHOD void MyClass::dump() { // print stuff to dbgs()... } #endif llvm-svn: 293359	2017-01-28 02:02:38 +00:00
Adrian McCarthy	8f713190e7	NFC: Rename PDB_ReaderType::Raw to Native for consistency with the NativeSession rename. llvm-svn: 293235	2017-01-27 00:01:55 +00:00
Adrian McCarthy	6b6b8c4fb9	NFC: Rename (PDB) RawSession to NativeSession This eliminates one overload on the term Raw. Differential Revision: https://reviews.llvm.org/D29098 llvm-svn: 293104	2017-01-25 22:38:55 +00:00
Zachary Turner	29da5db7a0	[pdb] Correctly parse the hash adjusters table from TPI stream. This is not a list of pairs, it is a hash table data structure. We now correctly parse this out and dump it from llvm-pdbdump. We still need to understand the conditions that lead to a type getting an entry in the hash adjuster table. That will be done in a followup investigation / patch. Differential Revision: https://reviews.llvm.org/D29090 llvm-svn: 293090	2017-01-25 21:17:40 +00:00
Zachary Turner	760ad4da60	[pdb] Write the Named Stream mapping to Yaml and binary. Differential Revision: https://reviews.llvm.org/D28919 llvm-svn: 292665	2017-01-20 22:42:09 +00:00
Zachary Turner	60667ca0b2	[pdb] Merge NamedStreamMapBuilder and NamedStreamMap. While the builder pattern has proven useful for certain other larger types, in this case it was hampering the ability to use the data structure, as for runtime access we need a map that we can efficiently read from and write to. So the two are merged into a single data structure that can efficiently be read to, written from, deserialized from bytes, and serialized to bytes. llvm-svn: 292664	2017-01-20 22:41:40 +00:00
Zachary Turner	f04d6e8d52	[PDB] Rename some files to be more intuitive. llvm-svn: 292663	2017-01-20 22:41:15 +00:00
Chris Bieneman	2e752db47a	[DWARF] [ObjectYAML] Adding APIs for unittesting Summary: This patch adds some new APIs to enable using the YAML DWARF representation in unit tests. The most basic new API is DWARFYAML::EmitDebugSections which converts a YAML string into a series of owned MemoryBuffer objects stored in a StringMap. The string map can then be used to construct a DWARFContext for parsing in place of an ObjectFile. Reviewers: dblaikie, clayborg Subscribers: mgorny, fhahn, jgosnell, aprantl, llvm-commits Differential Revision: https://reviews.llvm.org/D28828 llvm-svn: 292634	2017-01-20 19:03:14 +00:00
Zachary Turner	a332fa38e9	Fix a few more build errors. llvm-svn: 292538	2017-01-19 23:44:14 +00:00
Zachary Turner	d54deaee6c	Fix incorrectly formed assert statement. llvm-svn: 292537	2017-01-19 23:41:11 +00:00
Zachary Turner	11036a909f	[pdb] Add HashTable data structure. This was being parsed / serialized ad-hoc inside the code for a specific PDB stream. But this data structure is used in multiple ways / places within the PDB format. To be able to re-use it we need to raise this code out and make it more generic. In doing so, a number of bugs are fixed in the original implementation, and support is added for growing the hash table and deleting items from the hash table, which had either been omitted or incorrect implemented in the initial version. Differential Revision: https://reviews.llvm.org/D28715 llvm-svn: 292535	2017-01-19 23:31:24 +00:00
Rui Ueyama	dcd32937dc	PDB: Add a class to create the /names stream contents. This patch adds a new class NameHashTableBuilder which creates /names streams. This patch contains a test to confirm that a stream created by NameHashTableBuilder can be read by NameHashTable reader class. Differential Revision: https://reviews.llvm.org/D28707 llvm-svn: 292040	2017-01-15 00:36:02 +00:00
Greg Clayton	c109bbea57	Add a variant of DWARFDie::find() and DWARFDie::findRecursively() that takes a llvm::ArrayRef<dwarf::Attribute>. This allows us efficiently look for more than one attribute, something that is quite common in DWARF consumption. Differential Revision: https://reviews.llvm.org/D28704 llvm-svn: 291967	2017-01-13 22:32:12 +00:00
Greg Clayton	97d22187d0	Cleanup how DWARFDie attributes are accessed and decoded. Removed all DWARFDie::getAttributeValueAs() calls. Renamed: Optional<DWARFFormValue> DWARFDie::getAttributeValue(dwarf::Attribute); To: Optional<DWARFFormValue> DWARFDie::find(dwarf::Attribute); Added: Optional<DWARFFormValue> DWARFDie::findRecursively(dwarf::Attribute); All decoding of Optional<DWARFFormValue> values are now done using the dwarf::to() functions from DWARFFormValue.h: Old code: auto DeclLine = DWARFDie.getAttributeValueAsSignedConstant(DW_AT_decl_line).getValueOr(0); New code: auto DeclLine = toUnsigned(DWARFDie.find(DW_AT_decl_line), 0); This composition helps us since we can now easily do: auto DeclLine = toUnsigned(DWARFDie.findRecursively(DW_AT_decl_line), 0); This allows us to easily find attribute values in the current DIE only (the first new code above) or in any DW_AT_abstract_origin or DW_AT_specification Dies using the line above. Note that the code line length is shorter and more concise. Differential Revision: https://reviews.llvm.org/D28581 llvm-svn: 291959	2017-01-13 21:08:18 +00:00
Benjamin Kramer	061f4a5fe6	Apply clang-tidy's performance-unnecessary-value-param to LLVM. With some minor manual fixes for using function_ref instead of std::function. No functional change intended. llvm-svn: 291904	2017-01-13 14:39:03 +00:00
Greg Clayton	0e62ee7d60	Add the ability to iterate across all attributes in a DIE. Differential Revision: https://reviews.llvm.org/D28386 llvm-svn: 291861	2017-01-13 00:13:42 +00:00
Zachary Turner	629cb7d8cc	[CodeView] Finish decoupling TypeDatabase from TypeDumper. Previously the type dumper itself was passed around to a lot of different places and manipulated in ways that were more appropriate on the type database. For example, the entire TypeDumper was passed into the symbol dumper, when all the symbol dumper wanted to do was lookup the name of a TypeIndex so it could print it. That's what the TypeDatabase is for -- mapping type indices to names. Another example is how if the user runs llvm-pdbdump with the option to dump symbols but not types, we still have to visit all types so that we can print minimal information about the type of a symbol, but just without dumping full symbol records. The way we did this before is by hacking it up so that we run everything through the type dumper with a null printer, so that the output goes to /dev/null. But really, we don't need to dump anything, all we want to do is build the type database. Since TypeDatabaseVisitor now exists independently of TypeDumper, we can do this. We just build a custom visitor callback pipeline that includes a database visitor but not a dumper. All the hackery around printers etc goes away. After this patch, we could probably even delete the entire CVTypeDumper class since really all it is at this point is a thin wrapper that hides the details of how to build a useful visitation pipeline. It's not a priority though, so CVTypeDumper remains for now. After this patch we will be able to easily plug in a different style of type dumper by only implementing the proper visitation methods to dump one-line output and then sticking it on the pipeline. Differential Revision: https://reviews.llvm.org/D28524 llvm-svn: 291724	2017-01-11 23:24:22 +00:00
Greg Clayton	d1efea89c9	Remove all variants of DWARFDie::getAttributeValueAs...() that had parameters that specified default values. Now we only support returning Optional<> values and have changed all clients over to use Optional::getValueOr(). Differential Revision: https://reviews.llvm.org/D28569 llvm-svn: 291686	2017-01-11 17:43:37 +00:00
George Rimar	4bf308317d	[lib/Object] - Introduce Decompressor class. Decompressor intention is to reduce duplication of code. Currently LLD has own implementation of decompressor for compressed debug sections. This class helps to avoid it and share the code. LLD patch for reusing it is D28106 Differential revision: https://reviews.llvm.org/D28105 llvm-svn: 291675	2017-01-11 15:26:41 +00:00
Zachary Turner	a9054ddd9c	[CodeView/PDB] Rename a bunch of files. We were starting to get some name clashes between llvm-pdbdump and the common CodeView framework, so I took this opportunity to rename a bunch of files to more accurately describe their usage. This also helps in llvm-pdbdump to distinguish between different files and whether they are used for pretty dump mode or raw dump mode. llvm-svn: 291627	2017-01-11 00:35:43 +00:00
Zachary Turner	c640b76db5	[CodeView] Add TypeDatabase class. This creates a centralized class in which to store type records. It stores types as an array of entries, which matches the notion of a type stream being a topologically sorted DAG. Logic to build up such a database was already being used in CVTypeDumper, so CVTypeDumper is now updated to to read from a TypeDatabase which is filled out by an earlier visitor in the pipeline. Differential Revision: https://reviews.llvm.org/D28486 llvm-svn: 291626	2017-01-11 00:35:08 +00:00
Victor Leschuk	cbddae74f5	DebugInfo: support for DW_FORM_implicit_const Support for DW_FORM_implicit_const DWARFv5 feature. When this form is used attribute value goes to .debug_abbrev section (as SLEB). As this form would break any debug tool which doesn't support DWARFv5 it is guarded by dwarf version check. Attempt to use this form with dwarf version <= 4 is considered a fatal error. Differential Revision: https://reviews.llvm.org/D28456 llvm-svn: 291599	2017-01-10 21:18:26 +00:00
Greg Clayton	93e4fe8aad	Add iterator support to DWARFDie to allow child DIE iteration. Differential Revision: https://reviews.llvm.org/D28303 llvm-svn: 291194	2017-01-05 23:47:37 +00:00
Michal Gorny	89b6f16b3e	[cmake] Add LLVM_ENABLE_DIA_SDK option, and expose it in LLVMConfig Add an explicit LLVM_ENABLE_DIA_SDK option to control building support for DIA SDK-based debugging. Control its value to match whether DIA SDK support was found and expose it in LLVMConfig (alike LLVM_ENABLE_ZLIB). Its value is needed for LLDB to determine whether to run tests requiring DIA support. Currently it is obtained from llvm/Config/config.h; however, this file is not available for standalone builds. Following this change, LLDB will be modified to use the value from LLVMConfig. Differential Revision: https://reviews.llvm.org/D26255 llvm-svn: 290818	2017-01-02 18:19:35 +00:00
Chris Bieneman	e0e451d927	[ObjectYAML] Support for DWARF debug_info section This patch adds support for YAML<->DWARF for debug_info sections. This re-lands r290147, reverted in 290148, re-landed in r290204 after fixing the issue that caused bots to fail (thank you UBSan!), and reverted again in r290209 due to failures on big endian systems. After adding support for preserving endianness, this should be good now. llvm-svn: 290386	2016-12-22 22:44:27 +00:00
Greg Clayton	78a07bfa66	Add the ability for DWARFDie objects to get the parent DWARFDie. In order for the llvm DWARF parser to be used in LLDB we will need to be able to get the parent of a DIE. This patch adds that functionality by changing the DWARFDebugInfoEntry class to store a depth field instead of a sibling index. Using a depth field allows us to easily calculate the sibling and the parent without increasing the size of DWARFDebugInfoEntry. I tested llvm-dsymutil on a debug version of clang where this fully parses DWARF in over 1200 .o files to verify there was no serious regression in performance. Added a full suite of unit tests to test this functionality. Differential Revision: https://reviews.llvm.org/D27995 llvm-svn: 290274	2016-12-21 21:37:06 +00:00
Chris Bieneman	abecaa2f8c	Revert "[ObjectYAML] Support for DWARF debug_info section" This reverts commit r290204. Still breaking bots... In a meeting now, so I can't fix it immediately. Bot URL: http://lab.llvm.org:8011/builders/clang-s390x-linux/builds/2415 llvm-svn: 290209	2016-12-20 22:36:42 +00:00
Chris Bieneman	ffc4aef542	[ObjectYAML] Support for DWARF debug_info section This patch adds support for YAML<->DWARF for debug_info sections. This re-lands r290147, after fixing the issue that caused bots to fail (thank you UBSan!). llvm-svn: 290204	2016-12-20 21:35:31 +00:00
Chris Bieneman	891cbcc093	Revert "[ObjectYAML] Support for DWARF debug_info section" This reverts commit r290147. This commit is breaking a bot (http://lab.llvm.org:8011/builders/clang-atom-d525-fedora-rel/builds/621). I don't have time to investigate at the moment, so I'll revert for now. llvm-svn: 290148	2016-12-20 00:42:06 +00:00
Chris Bieneman	b5b0b23a25	[ObjectYAML] Support for DWARF debug_info section This patch adds support for YAML<->DWARF for debug_info sections. llvm-svn: 290147	2016-12-20 00:26:24 +00:00
Greg Clayton	2520c9ebee	Make a function to correctly extract the DW_AT_high_pc given the low pc value. DWARF 4 and later supports encoding the PC as an address or as as offset from the low PC. Clients using DWARFDie should be insulated from how to extract the high PC value. This function takes care of extracting the form value and looking for the correct form. Differential Revision: https://reviews.llvm.org/D27885 llvm-svn: 290131	2016-12-19 20:36:41 +00:00
David Majnemer	b7477b540e	[PDB] Don't use the long type Long is not the same size across a number of the platforms we support. Use unsigned int here instead, it is more appropriate because overflow/wrap-around is possible and, in this case, expected. llvm-svn: 290068	2016-12-18 20:10:50 +00:00
David Majnemer	1d3dcb0602	[PDB] Don't reimplement CRC32 We already have a CRC32 implementation which is compatible with the PDB hash, reuse it. llvm-svn: 290054	2016-12-18 00:41:15 +00:00
David Majnemer	9bca03bf81	[PDB] Validate superblock addresses - Validate the address of the block map. - Validate the address of the free block map. llvm-svn: 290053	2016-12-18 00:41:10 +00:00
George Rimar	e71e33fe93	[DWARF] - Introduce DWARFDebugPubTable class for dumping pub* sections. Patch implements parser of pubnames/pubtypes tables instead of static function used before. It is now should be possible to reuse it in LLD or other projects and clean up the duplication code. Differential revision: https://reviews.llvm.org/D27851 llvm-svn: 290040	2016-12-17 09:10:32 +00:00
Zachary Turner	10005d915e	Delete unused file. llvm-svn: 290021	2016-12-17 00:58:19 +00:00
Zachary Turner	46225b193f	Resubmit "[CodeView] Hook CodeViewRecordIO for reading/writing symbols." The original patch was broken due to some undefined behavior as well as warnings that were triggering -Werror. llvm-svn: 290000	2016-12-16 22:48:14 +00:00
Zachary Turner	d0fffd1d14	Revert "[CodeView] Hook CodeViewRecordIO for reading/writing symbols." This reverts commit r289978, which is failing due to some rebase/merge issues. llvm-svn: 289981	2016-12-16 19:25:23 +00:00
Zachary Turner	a4e7dfbc16	[CodeView] Hook CodeViewRecordIO for reading/writing symbols. This is the 3rd of 3 patches to get reading and writing of CodeView symbol and type records to use a single codepath. Differential Revision: https://reviews.llvm.org/D26427 llvm-svn: 289978	2016-12-16 19:20:35 +00:00
David Blaikie	7d4a5599da	Revert "dwarfdump: Support/process relocations on a CU's abbrev_off" Reverting because this breaks lld's gdb_index support - it's probably double counting the abbrev relocation offset. This reverts commit r289954. llvm-svn: 289961	2016-12-16 17:10:17 +00:00
David Blaikie	e9fda9f201	dwarfdump: Support/process relocations on a CU's abbrev_off Input can be produced by ld -r, for example (a normal LLVM workflow never hits this - LLVM only ever produces a single abbrev table in an object (shared by multiple CUs), so the reloc's always 0, and when it's linked together the relocation's resolved so it doesn't need to be handled) llvm-svn: 289954	2016-12-16 16:31:10 +00:00
Greg Clayton	52fe1f68c8	Add the ability to get attribute values as Optional<T> When getting attributes it is sometimes nicer to use Optional<T> some of the time instead of magic values. I tried to cut over to only using the Optional values but it made many of the call sites very messy, so it makes sense the leave in the calls that can return a default value. Otherwise code that looks like this: uint64_t CallColumn = Die.getAttributeValueAsAddress(DW_AT_call_line, 0); Has to be turned into: uint64_t CallColumn = 0; if (auto CallColumnValue = Die.getAttributeValueAsAddress(DW_AT_call_line)) CallColumn = *CallColumnValue; The first snippet of code looks much better. But in cases where you want an offset that may or may not be there, the following code looks better: if (auto StmtOffset = Die.getAttributeValueAsSectionOffset(DW_AT_stmt_list)) { // Use StmtOffset } Differential Revision: https://reviews.llvm.org/D27772 llvm-svn: 289731	2016-12-14 22:38:08 +00:00
Eric Christopher	ba1024cfb8	This change does two things: Adds a "Discriminator" field to struct DILineInfo, which defaults to 0. Fills out the "Discriminator" field in DILineInfo in DWARFDebugLine::LineTable::getFileLineInfoForAddress(). in order to have a slightly nicer interface in getFileLineInfoForAddress. Patch by Simon Que! Differential Revision: https://reviews.llvm.org/D27649 llvm-svn: 289683	2016-12-14 18:29:39 +00:00
Greg Clayton	1cbf3fa94a	Switch functions that returned bool and filled in a DWARFFormValue arg with ones that return Optional<DWARFFormValue> Differential Revision: https://reviews.llvm.org/D27737 llvm-svn: 289611	2016-12-13 23:20:56 +00:00
Greg Clayton	c8c1032c0c	Make a DWARFDIE class that can help avoid using the wrong DWARFUnit when extracting attributes Many places pass around a DWARFDebugInfoEntryMinimal and a DWARFUnit. It is easy to get things wrong by using the wrong DWARFUnit with a DWARFDebugInfoEntryMinimal. This patch creates a DWARFDie class that contains the DWARFUnit and DWARFDebugInfoEntryMinimal objects so that they can't get out of sync. All attribute extraction has been moved out of DWARFDebugInfoEntryMinimal and into DWARFDie. DWARFDebugInfoEntryMinimal was also renamed to DWARFDebugInfoEntry. DWARFDie objects are temporary objects that are used by clients and contain 2 pointers that you always need to have anyway. Keeping them grouped will avoid errors and simplify many of the attribute extracting APIs by not having to pass in a DWARFUnit. Differential Revision: https://reviews.llvm.org/D27634 llvm-svn: 289565	2016-12-13 18:25:19 +00:00
Greg Clayton	3462a420d1	Make a DWARF generator so we can unit test DWARF APIs with gtest. The only tests we have for the DWARF parser are the tests that use llvm-dwarfdump and expect output from textual dumps. More DWARF parser modification are coming in the next few weeks and I wanted to add tests that can verify that we can encode and decode all form types, as well as test some other basic DWARF APIs where we ask DIE objects for their children and siblings. DwarfGenerator.cpp was added in the lib/CodeGen directory. This file contains the code necessary to easily create DWARF for tests: dwarfgen::Generator DG; Triple Triple("x86_64--"); bool success = DG.init(Triple, Version); if (!success) return; dwarfgen::CompileUnit &CU = DG.addCompileUnit(); dwarfgen::DIE CUDie = CU.getUnitDIE(); CUDie.addAttribute(DW_AT_name, DW_FORM_strp, "/tmp/main.c"); CUDie.addAttribute(DW_AT_language, DW_FORM_data2, DW_LANG_C); dwarfgen::DIE SubprogramDie = CUDie.addChild(DW_TAG_subprogram); SubprogramDie.addAttribute(DW_AT_name, DW_FORM_strp, "main"); SubprogramDie.addAttribute(DW_AT_low_pc, DW_FORM_addr, 0x1000U); SubprogramDie.addAttribute(DW_AT_high_pc, DW_FORM_addr, 0x2000U); dwarfgen::DIE IntDie = CUDie.addChild(DW_TAG_base_type); IntDie.addAttribute(DW_AT_name, DW_FORM_strp, "int"); IntDie.addAttribute(DW_AT_encoding, DW_FORM_data1, DW_ATE_signed); IntDie.addAttribute(DW_AT_byte_size, DW_FORM_data1, 4); dwarfgen::DIE ArgcDie = SubprogramDie.addChild(DW_TAG_formal_parameter); ArgcDie.addAttribute(DW_AT_name, DW_FORM_strp, "argc"); // ArgcDie.addAttribute(DW_AT_type, DW_FORM_ref4, IntDie); ArgcDie.addAttribute(DW_AT_type, DW_FORM_ref_addr, IntDie); StringRef FileBytes = DG.generate(); MemoryBufferRef FileBuffer(FileBytes, "dwarf"); auto Obj = object::ObjectFile::createObjectFile(FileBuffer); EXPECT_TRUE((bool)Obj); DWARFContextInMemory DwarfContext(*Obj.get()); This code is backed by the AsmPrinter code that emits DWARF for the actual compiler. While adding unit tests it was discovered that DIEValue that used DIEEntry as their values had bugs where DW_FORM_ref1, DW_FORM_ref2, DW_FORM_ref8, and DW_FORM_ref_udata forms were not supported. These are all now supported. Added support for DW_FORM_string so we can emit inlined C strings. Centralized the code to unique abbreviations into a new DIEAbbrevSet class and made both the dwarfgen::Generator and the llvm::DwarfFile classes use the new class. Fixed comments in the llvm::DIE class so that the Offset is known to be the compile/type unit offset. DIEInteger now supports more DW_FORM values. There are also unit tests that cover: Encoding and decoding all form types and values Encoding and decoding all reference types (DW_FORM_ref1, DW_FORM_ref2, DW_FORM_ref4, DW_FORM_ref8, DW_FORM_ref_udata, DW_FORM_ref_addr) including cross compile unit references with that go forward one compile unit and backward on compile unit. Differential Revision: https://reviews.llvm.org/D27326 llvm-svn: 289010	2016-12-08 01:03:48 +00:00
Bob Haarman	a5b4358956	[pdb] handle missing pdb streams more gracefully Summary: The code we use to read PDBs assumed that streams we ask it to read exist, and would read memory outside a vector and crash if this wasn't the case. This would, for example, cause llvm-pdbdump to crash on PDBs generated by lld. This patch handles such cases more gracefully: the PDB reading code in LLVM now reports errors when asked to get a stream that is not present, and llvm-pdbdump will report missing streams and continue processing streams that are present. Reviewers: ruiu, zturner Subscribers: thakis, amccarth Differential Revision: https://reviews.llvm.org/D27325 llvm-svn: 288722	2016-12-05 22:44:00 +00:00
Eugene Zelenko	570e39a25c	[DebugInfo] Fix some Clang-tidy modernize-use-default and Include What You Use warnings; other minor fixes (NFC). Per Zachary Turner and Mehdi Amini suggestion to make only post-commit reviews. llvm-svn: 287838	2016-11-23 23:16:32 +00:00
Rui Ueyama	2b4ba04d57	Remove PDBFileBuilder::build() and related functions. PDBFileBuilder supports two different ways to create files. One is PDBFileBuilder::commit. That function takes a filename and write a result to the file. The other is PDBFileBuilder::build. That returns a new PDBFile object. This patch removes the latter because no one is using it and in a real life situation we are very unlikely to need it. Even if you need it, it'd be easy to write a new PDB to a memory buffer and read it back. Removing PDBFileBuilder::build enables us to remove other classes build transitively. Differential Revision: https://reviews.llvm.org/D26987 llvm-svn: 287697	2016-11-22 20:32:22 +00:00
Rui Ueyama	fb1e6d22a3	Align Modi and FileInfo substreams on 32-byte offsets. This is required by DbiStream, but DbiStreamBuilder didn't align these substreams, so the output of DbiSTreamBuilder couldn't be read by DbiStream. Test will be added to LLD. llvm-svn: 287067	2016-11-16 00:59:27 +00:00
Rui Ueyama	507013180e	Fix Modi and File count if there are more than 65535 modules/files. These numbers are intended to be capped at 65535, but `std::max<uint16_t>(UINT16_MAX, N)` always returns N for any N because the expression is the same as `std::max((uint16_t)UINT16_MAX, (uint16_t)N)`. llvm-svn: 287060	2016-11-16 00:38:33 +00:00
Greg Clayton	6f6e4dbd5d	Improve DWARF parsing speed by improving DWARFAbbreviationDeclaration This patch gets a DWARF parsing speed improvement by having DWARFAbbreviationDeclaration instances know if they have a fixed byte size. If an abbreviation has a fixed byte size that can be calculated given a DWARFUnit, then parsing a DIE becomes two steps: parse ULEB128 abbrev code, and then add constant size to the offset. This patch also adds a fixed byte size to each DWARFAbbreviationDeclaration::AttributeSpec so that attributes can quickly skip their values if needed without the need to lookup the fixed for size. Notable improvements: - DWARFAbbreviationDeclaration::findAttributeIndex() now returns an Optional<uint32_t> instead of a uint32_t and we no longer have to look for the magic -1U return value - Optional<uint32_t> DWARFAbbreviationDeclaration::findAttributeIndex(dwarf::Attribute attr) const; - DWARFAbbreviationDeclaration now has a getAttributeValue() function that extracts an attribute value given a DIE offset that takes advantage of the DWARFAbbreviationDeclaration::AttributeSpec::ByteSize - bool DWARFAbbreviationDeclaration::getAttributeValue(const uint32_t DIEOffset, const dwarf::Attribute Attr, const DWARFUnit &U, DWARFFormValue &FormValue) const; - A DWARFAbbreviationDeclaration instance can return a fixed byte size for itself so DWARF parsing is faster: - Optional<size_t> DWARFAbbreviationDeclaration::getFixedAttributesByteSize(const DWARFUnit &U) const; - Any functions that used to take a "const DWARFUnit *U" that would crash if U was NULL now take a "const DWARFUnit &U" and are only called with a valid DWARFUnit Differential Revision: https://reviews.llvm.org/D26567 llvm-svn: 286924	2016-11-15 01:23:06 +00:00
Rui Ueyama	a8a68a993e	Remove extra semicolon. llvm-svn: 286688	2016-11-12 00:23:32 +00:00
Rui Ueyama	f7c9c3234c	Define DbiStreamBuilder::addSectionContribs. This patch defines a new function to add a SectionContribs stream to a PDB file. Unlike SectionMap, SectionContribs contains a list of input sections as opposed to output sections. Note that this patch needs improving because currently we do not set Module field in SectionContribs entries. In a follow-up patch, I'll add Modules and then fix it after that. Differential Revision: https://reviews.llvm.org/D26210 llvm-svn: 286677	2016-11-11 23:41:13 +00:00
Greg Clayton	04c19286a1	Fixed issues found by Paul Robinson with my patch for: https://reviews.llvm.org/D26526 - Fixed DW_FORM_strp to be correctly sized and extracted for DWARF64 - Added some missing strp variants as well - Fixed comment typo llvm-svn: 286603	2016-11-11 17:38:14 +00:00
Greg Clayton	82f12b149f	Clean up DWARFFormValue by reducing duplicated code and removing DWARFFormValue::getFixedFormSizes() In preparation for a follow on patch that improves DWARF parsing speed, clean up DWARFFormValue so that we have can get the fixed byte size of a form value given a DWARFUnit or given the version, address byte size and dwarf32/64. This patch cleans up code so that everyone is using one of the new DWARFFormValue functions: static Optional<uint8_t> DWARFFormValue::getFixedByteSize(dwarf::Form Form, const DWARFUnit *U = nullptr); static Optional<uint8_t> DWARFFormValue::getFixedByteSize(dwarf::Form Form, uint16_t Version, uint8_t AddrSize, bool Dwarf32); This patch changes DWARFFormValue::skipValue() to rely on the output of DWARFFormValue::getFixedByteSize(...) instead of duplicating the code in each function. This will reduce the number of changes we need to make to DWARF to fewer places in DWARFFormValue when we add support for new form. This patch also starts to support DWARF64 so that we can get correct byte sizes for forms that vary according the DWARF 32/64. To reduce the code duplication a new FormSizeHelper pure virtual class was created that can be created as a FormSizeHelperDWARFUnit when you have a DWARFUnit, or FormSizeHelperManual where you manually specify the DWARF version, address byte size and DWARF32/DWARF64. There is now a single implementation of a function that gets the fixed byte size (instead of two where one took a DWARFUnit and one took the DWARF version, address byte size and DWARFFormat enum) and one function to skip the form values. https://reviews.llvm.org/D26526 llvm-svn: 286597	2016-11-11 16:21:37 +00:00
Zachary Turner	44728f4014	Fix some size_t / uint32_t ambiguity errors. llvm-svn: 286305	2016-11-08 22:30:11 +00:00
Zachary Turner	4efa0a4201	[CodeView] Hook up CodeViewRecordIO to type serialization path. Previously support had been added for using CodeViewRecordIO to read (deserialize) CodeView type records. This patch adds support for writing those same records. With this patch, reading and writing of CodeView type records finally uses a single codepath. Differential Revision: https://reviews.llvm.org/D26253 llvm-svn: 286304	2016-11-08 22:24:53 +00:00
Justin Bogner	f9fb2abb01	PDB: Fix some APIs to avoid use-after-frees The buffer is already owned by the PDBFile for all of these APIs, so don't pass it in separately. llvm-svn: 285953	2016-11-03 18:28:04 +00:00
Zachary Turner	7251ede7c5	Add CodeViewRecordIO for reading and writing. Using a pattern similar to that of YamlIO, this allows us to have a single codepath for translating codeview records to and from serialized byte streams. The current patch only hooks this up to the reading of CodeView type records. A subsequent patch will hook it up for writing of CodeView type records, and then a third patch will hook up the reading and writing of CodeView symbols. Differential Revision: https://reviews.llvm.org/D26040 llvm-svn: 285836	2016-11-02 17:05:19 +00:00
Rui Ueyama	ddc79225c3	Define DbiStreamBuilder::addSectionMap. This change enables LLD to construct a Section Map stream in a PDB file. I do not understand all these fields in the Section Map yet, but it seems like a copy of a COFF section header in another format. With this patch, DbiStreamBuilder can emit a Section Map which llvm-pdbdump can dump. Differential Revision: https://reviews.llvm.org/D26112 llvm-svn: 285606	2016-10-31 17:38:56 +00:00
Greg Clayton	cddab279f6	Modify DWARFFormValue to remember the DWARFUnit that it was decoded with. Modifying DWARFFormValue to remember the DWARFUnit that it was encoded with can simplify the usage of instances of this class. Previously users would have to try and pass in the same DWARFUnit that was used to decode the form value and there was a possibility that a different DWARFUnit might be supplied to the functions that extract values (strings, CU relative references, addresses) and cause problems. This fixes this potential issue by storing the DWARFUnit inside the DWARFFormValue so that this mistake can't be made. Instances of DWARFFormValue are not stored permanently and are used as temporary values, so the increase in size of an instance of DWARFFormValue isn't a big deal. This makes decoding form values more bullet proof and is a change that will be used by future modifications. https://reviews.llvm.org/D26052 llvm-svn: 285594	2016-10-31 16:46:02 +00:00
Rui Ueyama	77be2403f6	Define calculateDbgStreamSize for consistency. llvm-svn: 285487	2016-10-29 00:56:44 +00:00
Adrian Prantl	c4fbbcf9ed	Import/update constants from the DWARF 5 public review draft document. https://reviews.llvm.org/D26051 llvm-svn: 285421	2016-10-28 17:59:50 +00:00
Greg Clayton	6c273763a3	Switch all DWARF variables for tags, attributes and forms over to use the llvm::dwarf enumerations instead of using raw uint16_t values. This allows easier debugging as users can see the values of the enumerations in the variables view that will show the enumeration string instead of just a number. https://reviews.llvm.org/D26013 llvm-svn: 285309	2016-10-27 16:32:04 +00:00
Bob Haarman	26a87bd030	[codeview] support emitting indirect virtual base class information Summary: Fixes PR28281. MSVC lists indirect virtual base classes in the field list of a class, using LF_IVBCLASS records. This change makes LLVM emit such records when processing DW_TAG_inheritance tags with the DIFlagVirtual and (newly introduced) DIFlagIndirect tags. Reviewers: rnk, ruiu, zturner Differential Revision: https://reviews.llvm.org/D25578 llvm-svn: 285130	2016-10-25 22:11:52 +00:00
Bob Haarman	653baa2aaa	[pdb] added support for dumping globals stream Summary: This adds support for dumping the globals stream from PDB files using llvm-pdbdump, similar to the support we have for the publics stream. Reviewers: ruiu, zturner Subscribers: beanz, mgorny, modocache Differential Revision: https://reviews.llvm.org/D25801 llvm-svn: 284861	2016-10-21 19:43:19 +00:00
Zachary Turner	4d49eb9fa0	[CodeView] Refactor serialization to use StreamInterface. This was all using ArrayRef<>s before which presents a problem when you want to serialize to or deserialize from an actual PDB stream. An ArrayRef<> is really just a special case of what can be handled with StreamInterface though (e.g. by using a ByteStream), so changing this to use StreamInterface allows us to plug in a PDB stream and get all the record serialization and deserialization for free on a MappedBlockStream. Subsequent patches will try to remove TypeTableBuilder and TypeRecordBuilder in favor of class that operate on Streams as well, which should allow us to completely merge the reading and writing codepaths for both types and symbols. Differential Revision: https://reviews.llvm.org/D25831 llvm-svn: 284762	2016-10-20 18:31:19 +00:00
Reid Kleckner	990504e625	Remove LLVM_NOEXCEPT and replace it with noexcept Now that we have dropped MSVC 2013, all supported compilers support noexcept and we can drop this portability macro. llvm-svn: 284672	2016-10-19 23:52:38 +00:00
Zachary Turner	383803230b	[pdb] Improve error messages when DIA is not found. llvm-svn: 284610	2016-10-19 16:42:20 +00:00
David Blaikie	69494a9805	dwarfdump: add space missing from the type unit header description llvm-svn: 284540	2016-10-18 21:18:43 +00:00
David Blaikie	e4c3915a5a	dwarfdump: Include the name in the unit description, even in non-summarized mode (accidentally removed this from my previous change when I was rejecting some clang-format formatting... ) llvm-svn: 284539	2016-10-18 21:16:45 +00:00
David Blaikie	50cc27ecb9	dwarfdump: -summarize-types: print a short summary (unqualified type name, hash, length) of type units rather than dumping contents This is just a quick utility handy for getting rough summaries of types in a given object or dwo file. I've been using it to investigate the amount of type info redundancy across a project build, for example. llvm-svn: 284537	2016-10-18 21:09:48 +00:00
Reid Kleckner	edfc9dcf42	Truncate long names in type records In the MS ABI, the frontend is supposed to MD5 such pathologically long names. LLVM should still defend itself from long names, though. Fixes part of PR29098. llvm-svn: 284136	2016-10-13 17:33:22 +00:00
Reid Kleckner	fb58be862c	Update _MSC_VER equality checks for msdiaNNN.dll Use inequality instead of equality to defend against minor version increases in _MSC_VER. An _MSC_VER value of 1901 should still use msdia140.dll, as described in this blog post: https://blogs.msdn.microsoft.com/vcblog/2016/10/05/visual-c-compiler-version/ llvm-svn: 284058	2016-10-12 21:51:14 +00:00
Reid Kleckner	5d0bc63d91	Avoid braced initialization for default member initializers for MSVC 2013 llvm-svn: 283928	2016-10-11 20:02:57 +00:00
Rui Ueyama	f9904043ca	Re-submit r283823: Define DbiStreamBuilder::addDbgStream to add stream. The previous commit was failing because we filled empty slots of the debug stream index with kInvalidStreamIndex. It should've been 0. llvm-svn: 283925	2016-10-11 19:43:12 +00:00
Rui Ueyama	8af4988f35	Revert r283824 and r283823: Define DbiStreamBuilder::addDbgStream to add stream. This reverts commit r283824 and r283823 to fix buildbots. llvm-svn: 283828	2016-10-11 00:15:50 +00:00
Rui Ueyama	914eef6a64	Fix a bug in DbiStreamBuilder::addDbgStream. This feature will be tested in LLD unit tests. llvm-svn: 283824	2016-10-10 23:44:04 +00:00
Rui Ueyama	70edd9e41d	Define DbiStreamBuilder::addDbgStream to add stream. Previously, there is no way to create a stream other than pre-defined special stream such as DBI or IPI. This patch adds a new method, addDbgStream, to add a debug stream to a PDB file. Differential Revision: https://reviews.llvm.org/D25356 llvm-svn: 283823	2016-10-10 23:35:36 +00:00
Zachary Turner	3b14764ce5	[pdb] Dump Module Symbols to Yaml. This is the first step towards round-tripping symbol information, and thusly being able to write symbol information to a PDB. This patch writes the symbol information for each compiland to the Yaml when running in pdb2yaml mode. There's still some loose ends, such as what to do about relocations (necessary in order to print linkage names), how to print enums with friendly names, and how to give the dumper access to the StringTable, but this is a good first start. llvm-svn: 283641	2016-10-08 01:12:01 +00:00
Zachary Turner	0d8407447d	Refactor Symbol visitor code. Type visitor code had already been refactored previously to decouple the visitor and the visitor callback interface. This was necessary for having the flexibility to visit in different ways (for example, dumping to yaml, reading from yaml, dumping to ScopedPrinter, etc). This patch merely implements the same visitation pattern for symbol records that has already been implemented for type records. llvm-svn: 283609	2016-10-07 21:34:46 +00:00
Mehdi Amini	149f6eaed9	Re-commit "Use StringRef in Support/Darf APIs (NFC)" This reverts commit r283285 and re-commit r283275 with a fix for format("%s", Str); where Str is a StringRef. llvm-svn: 283298	2016-10-05 05:59:29 +00:00
Mehdi Amini	2bcac0fac4	Revert "Re-commit "Use StringRef in Support/Darf APIs (NFC)"" One test seems randomly broken: DebugInfo/X86/gnu-public-names.ll llvm-svn: 283285	2016-10-05 01:04:02 +00:00
Mehdi Amini	32b297a42f	Re-commit "Use StringRef in Support/Darf APIs (NFC)" This reverts commit r283278 and re-commit r283275 with the update to fix the build on the LLDB side. llvm-svn: 283281	2016-10-05 00:37:18 +00:00
Mehdi Amini	78b04ae7ac	Revert "Use StringRef in Support/Darf APIs (NFC)" This reverts commit r283275, it broke LLDB Android debug server. llvm-svn: 283278	2016-10-05 00:21:14 +00:00
Mehdi Amini	e0327be584	Use StringRef in Support/Darf APIs (NFC) llvm-svn: 283275	2016-10-04 23:55:40 +00:00
Rui Ueyama	5d6714e593	Do not pass a superblock to PDBFileBuilder. When we create a PDB file using PDBFileBuilder, the information in the superblock, such as the size of the resulting file, is not available. Previously, PDBFileBuilder::initialize took a superblock assuming that all the members of the struct are correct. That is useful when you want to restore the exact information from a YAML file, but that's probably the only use case in which that is useful. When we are creating a PDB file on the fly, we have to backfill the members. This patch redefines PDBFileBuilder::initialize to take only a block size. Now all the other members are left as default values, so that they'll be updated when commit() is called. Differential Revision: https://reviews.llvm.org/D25108 llvm-svn: 282944	2016-09-30 20:52:12 +00:00
Rui Ueyama	fc22cef98e	Pass a filename instead of a msf::WritableStream to PDBFileBuilder::commit. WritableStream needs the exact file size to open a file, but until we fix the final layout of a PDB file, we don't know the size of the file. This patch changes the parameter type of PDBFileBuilder::commit to solve that chiecken-and-egg problem. Now the function opens a file after fixing the layout, so it can create a file with the exact size. Differential Revision: https://reviews.llvm.org/D25107 llvm-svn: 282940	2016-09-30 20:34:44 +00:00
George Rimar	4f82df52ae	Revert r282238 "Revert r282235 "[llvm-dwarfdump] - Teach dwarfdump to dump gdb-index section."" Build bot issues (http://lab.llvm.org:8011/builders/clang-x64-ninja-win7/builds/15856/steps/ninja%20check%201/logs/FAIL%3A%20LLVM%3A%3Adwarfdump-dump-gdbindex.test) should be fixed in that version. Issue was that MSVS does not support "%zu". Though it works fine on MSCS 2015, Bot looks running MSVS 2013 that does not like it. MSDN also says that "z" prefix is not supported: https://msdn.microsoft.com/en-us/library/tcxf1dw6.aspx I had to use PRId64 instead. Original commit message: [llvm-dwarfdump] - Teach dwarfdump to dump gdb-index section. gold linker's --gdb-index option currently is able to create the .gdb_index section that allows GDB to locate and read the .dwo files as it needs them, this helps reduce the total size of the object files processed by the linker. More info about that: https://gcc.gnu.org/wiki/DebugFission https://sourceware.org/gdb/onlinedocs/gdb/Index-Section-Format.html Patch teaches dwarfdump tool to dump this section. Differential revision: https://reviews.llvm.org/D21503 llvm-svn: 282239	2016-09-23 11:01:53 +00:00
George Rimar	a348527186	Revert r282235 "[llvm-dwarfdump] - Teach dwarfdump to dump gdb-index section." It broke BB: http://lab.llvm.org:8011/builders/clang-x64-ninja-win7/builds/15856 llvm-svn: 282238	2016-09-23 10:12:56 +00:00
George Rimar	a77bcf5e42	[llvm-dwarfdump] - Teach dwarfdump to dump gdb-index section. gold linker's --gdb-index option currently is able to create the .gdb_index section that allows GDB to locate and read the .dwo files as it needs them, this helps reduce the total size of the object files processed by the linker. More info about that: https://gcc.gnu.org/wiki/DebugFission https://sourceware.org/gdb/onlinedocs/gdb/Index-Section-Format.html Patch teaches dwarfdump tool to dump this section. Differential revision: https://reviews.llvm.org/D21503 llvm-svn: 282235	2016-09-23 09:09:26 +00:00
Zachary Turner	de9ba15511	[pdb] Write the IPI stream. The IPI stream is structurally identical to the TPI stream, but it contains different record types. So we just re-use the TPI writing code. llvm-svn: 281638	2016-09-15 18:22:31 +00:00
Zachary Turner	a6cbfb53c2	[pdb] Fix the TPI stream size computation. We were inadvertently adding the size of the hash value stream to the size of the TPI stream, even though the hash value stream is an entirely separate stream. llvm-svn: 281636	2016-09-15 18:22:21 +00:00
Zachary Turner	c67b00c695	[pdb] Get rid of Data and RawData in CVType. The `CVType` had two redundant fields which were confusing and error-prone to fill out. By treating member records as a distinct type from leaf records, we are able to simplify this quite a bit. Reviewed By: rnk Differential Revision: https://reviews.llvm.org/D24432 llvm-svn: 281556	2016-09-14 23:00:16 +00:00
Zachary Turner	620961deb9	[pdb] Write TPI hash values to the TPI stream. This completes being able to write all the interesting values of a PDB TPI stream. Reviewed By: rnk Differential Revision: https://reviews.llvm.org/D24370 llvm-svn: 281555	2016-09-14 23:00:02 +00:00
Zachary Turner	36efbfa6d8	[pdb] Print out some more info when dumping a raw stream. We have various command line options that print the type of a stream, the size of a stream, etc but nowhere that it can all be viewed together. Since a previous patch introduced the ability to dump the bytes of a stream, this seems like a good place to present a full view of the stream's properties including its size, what kind of data it represents, and the blocks it occupies. So I added the ability to print that information to the -stream-data command line option. llvm-svn: 281077	2016-09-09 19:00:49 +00:00
Zachary Turner	9ba31a5efe	[pdb] Pass CVRecord's through the visitor as non-const references. This simplifies a lot of code, and will actually be necessary for an upcoming patch to serialize TPI record hash values. The idea before was that visitors should be examining records, not modifying them. But this is no longer true with a visitor that constructs a CVRecord from Yaml. To handle this until now, we were doing some fixups on CVRecord objects at a higher level, but the code is really awkward, and it makes sense to just have the visitor write the bytes into the CVRecord. In doing so I uncovered a few bugs related to `Data` and `RawData` and fixed those. Reviewed By: rnk Differential Revision: https://reviews.llvm.org/D24362 llvm-svn: 281067	2016-09-09 18:03:39 +00:00
Zachary Turner	c6d54da891	[pdb] Write PDB TPI Stream from Yaml. This writes the full sequence of type records described in Yaml to the TPI stream of the PDB file. Reviewed By: rnk Differential Revision: https://reviews.llvm.org/D24316 llvm-svn: 281063	2016-09-09 17:46:17 +00:00
Reid Kleckner	fa28396f97	[codeview] Use the correct max CV record length of 0xFF00 Previously we were splitting our records at 0xFFFF bytes, which the Microsoft tools don't like. Should fix failure on the new Windows self-host buildbot. This length appears in microsoft-pdb/PDB/dbi/dbiimpl.h llvm-svn: 280522	2016-09-02 18:43:27 +00:00
Reid Kleckner	d1882f2188	Fix the ASan fuse-lld.cc test after LLD r280012 With that change, images built with 'lld-link /debug' always have a debug directory. If no PDB filename was passed on the command line, then the filename in the executable is empty. PDB information would never work anyway if the PDB file name is empty, so go ahead and try DWARF in that case. llvm-svn: 280410	2016-09-01 20:28:59 +00:00
Zachary Turner	5c7c2307a8	[codeview] Properly propagate the TypeLeafKind through the pipeline. llvm-svn: 280388	2016-09-01 18:08:19 +00:00
Zachary Turner	77807637ff	[codeview] Have visitTypeBegin return the record type. Previously we were assuming that any visitation of types would necessarily be against a type we had binary data for. Reasonable assumption when were just reading PDBs and dumping them, but once we start writing PDBs from Yaml this breaks down, because we have no binary data yet, only Yaml, and from that we need to read the record kind and perform the switch based on that. So this patch does that. Instead of having the visitor switch on the kind that is already in the CVType record, we change the visitTypeBegin() method to return the Kind, and switch on the returned value. This way, the default implementation can still return the value from the CVType, but the implementation which visits Yaml records and serializes binary PDB type records can use the field in the Yaml as the source of the switch. llvm-svn: 280307	2016-08-31 23:14:31 +00:00
Zachary Turner	2f951ce9c9	[codeview] Add TypeVisitorCallbackPipeline. We were kind of hacking this together before by embedding the ability to forward requests into the TypeDeserializer. When we want to start adding more different kinds of visitor callback interfaces though, this doesn't scale well and is very inflexible. So introduce the notion of a pipeline, which itself implements the TypeVisitorCallbacks interface, but which contains an internal list of other callbacks to invoke in sequence. Also update the existing uses of CVTypeVisitor to use this new pipeline class for deserializing records before visiting them with another visitor. llvm-svn: 280293	2016-08-31 21:42:26 +00:00
Reid Kleckner	9dac47319d	[codeview] Emit vtable shape information The shape of the vtable is passed down as the size of the __vtbl_ptr_type. This special pointer type appears both as the pointee type of the vptr type, and by itself in every dynamic class. For classes with multiple vtables, only the shape of the primary vftable is included, as the shape of all secondary vftables will be the same as in the base class. Fixes PR28150 llvm-svn: 280254	2016-08-31 15:59:30 +00:00
Zachary Turner	f6884a1aac	Remove unused translation unit. llvm-svn: 279561	2016-08-23 20:08:02 +00:00
Eugene Zelenko	61a72d8850	[LLVM] Fix some Clang-tidy modernize-use-using and Include What You Use warnings Differential revision: https://reviews.llvm.org/D23675 llvm-svn: 279102	2016-08-18 17:56:27 +00:00
Vedant Kumar	c948d182e1	Fix -Wpessimizing-move error, NFC llvm-svn: 279095	2016-08-18 17:39:53 +00:00
Zachary Turner	ac5763eca4	Resubmit "Write the TPI stream from a PDB to Yaml." The original patch was breaking some buildbots due to an incorrect ordering of function definitions which caused some compilers to recognize a definition but others to not. llvm-svn: 279089	2016-08-18 16:49:29 +00:00
Justin Bogner	39eec466a2	Revert "Write the TPI stream from a PDB to Yaml." This is hitting a "use of undeclared identifier 'skipPadding' error locally and on some bots. This reverts r278869. llvm-svn: 278871	2016-08-16 23:37:10 +00:00
Zachary Turner	8321ba5437	Write the TPI stream from a PDB to Yaml. Reviewed By: ruiu, rnk Differential Revision: https://reviews.llvm.org/D23226 llvm-svn: 278869	2016-08-16 23:28:54 +00:00
Saleem Abdulrasool	015280211b	CodeView: extract the OMF Directory Header The DebugDirectory contains a pointer to the CodeView info structure which is a derivative of the OMF debug directory. The structure has evolved a bit over time, and PDB 2.0 used a slightly different definition from PDB 7.0. Both of these are specific to CodeView and not COFF. Reflect this by moving the structure definitions into the DebugInfo/CodeView headers. Define a generic DebugInfo union type that can be used to pass around a reference to the DebugInfo irrespective of the versioning. NFC. llvm-svn: 278075	2016-08-09 00:25:12 +00:00
Justin Bogner	272cbacc25	CodeView: Remove an unused variable It was breaking the -Werror build. llvm-svn: 277878	2016-08-05 21:57:10 +00:00
Zachary Turner	5e35eaac83	Fix non portable include path. llvm-svn: 277876	2016-08-05 21:50:02 +00:00
Zachary Turner	5e3e4bb26b	[CodeView] Decouple record deserialization from visitor dispatch. Until now, our use case for the visitor has been to take a stream of bytes representing a type stream, deserialize the records in sequence, and do something with them, where "something" is determined by how the user implements a particular set of callbacks on an abstract class. For actually writing PDBs, however, we want to do the reverse. We have some kind of description of the list of records in their in-memory format, and we want to process each one. Perhaps by serializing them to a byte stream, or perhaps by converting them from one description format (Yaml) to another (in-memory representation). This was difficult in the current model because deserialization and invoking the callbacks were tightly coupled. With this patch we change this so that TypeDeserializer is itself an implementation of the particular set of callbacks. This decouples deserialization from the iteration over a list of records and invocation of the callbacks. TypeDeserializer is initialized with another implementation of the callback interface, so that upon deserialization it can pass the deserialized record through to the next set of callbacks. In a sense this is like an implementation of the Decorator design pattern, where the Deserializer is a decorator. This will be useful for writing Pdbs from yaml, where we have a description of the type records in Yaml format. In this case, the visitor implementation would have each visitation callback method implemented in such a way as to extract the proper set of fields from the Yaml, and it could maintain state that builds up a list of these records. Finally at the end we can pass this information through to another set of callbacks which serializes them into a byte stream. Reviewed By: majnemer, ruiu, rnk Differential Revision: https://reviews.llvm.org/D23177 llvm-svn: 277871	2016-08-05 21:45:34 +00:00
Zachary Turner	660230eba4	[CodeView] Use llvm::Error instead of std::error_code. This eliminates the remnants of std::error_code from the DebugInfo libraries. llvm-svn: 277758	2016-08-04 19:39:55 +00:00
Rui Ueyama	d1d8c8312a	pdbdump: Fix crash bug. pdbdump calls DbiStreamBuilder::commit through PDBFileBuilder::commit without calling DbiStreamBuilder::finalize. Because `finalize` initializes `Header` member, `Header` remained nullptr which caused a crash bug. Differential Revision: https://reviews.llvm.org/D23143 llvm-svn: 277681	2016-08-03 23:43:23 +00:00
Zachary Turner	8cf51c340d	[msf] Make FPM reader use MappedBlockStream. MappedBlockSTream can work with any sequence of block data where the ordering is specified by a list of block numbers. So rather than manually stitch them together in the case of the FPM, reuse this functionality so that we can treat the FPM as if it were contiguous. Reviewed By: ruiu Differential Revision: https://reviews.llvm.org/D23066 llvm-svn: 277609	2016-08-03 16:53:21 +00:00
Rui Ueyama	4ee7f3c9aa	PDB: Mark extended file pages as free by default. BitVector::extend initializes extended bits as true by default. That is not desirable because new pages should be initially free. Differential Revision: https://reviews.llvm.org/D23048 llvm-svn: 277529	2016-08-02 21:56:37 +00:00
Zachary Turner	d3c7b8e303	[msf] Teach LLVM to parse a split Fpm. The FPM is split at regular intervals across the MSF file, as the MS code suggests. It turns out that the value of the interval is precisely the block size. If the block size is 4096, then there are two Fpm pages every 4096 blocks. So here we teach the PDBFile class to parse a split FPM, and also add more options when dumping the FPM to display some additional information such as orphaned pages (pages which the FPM says are allocated, but which nothing appears to use), use after free pages (pages which the FPM says are not allocated, but which are referenced by a stream), and multiple use pages (pages which the FPM says are allocated but are used more than once). Reviewed By: ruiu Differential Revision: https://reviews.llvm.org/D23022 llvm-svn: 277388	2016-08-01 21:19:45 +00:00
Rui Ueyama	7a5cdc6225	pdbdump: Dump Free Page Map contents. Differential Revision: https://reviews.llvm.org/D22974 llvm-svn: 277216	2016-07-29 21:38:00 +00:00
Zachary Turner	a3225b0451	[msf] Resubmit "Rename Msf -> MSF". Previously this change was submitted from a Windows machine, so changes made to the case of filenames and directory names did not survive the commit, and as a result the CMake source file names and the on-disk file names did not match on case-sensitive file systems. I'm resubmitting this patch from a Linux system, which hopefully allows the case changes to make it through unfettered. llvm-svn: 277213	2016-07-29 20:56:36 +00:00
Zachary Turner	334aec4dd2	Revert "[msf] Rename Msf to MSF." This reverts commit 4d1557ffac41e079bcb1abbcf04f512474dcd6fe. llvm-svn: 277194	2016-07-29 18:38:47 +00:00
Zachary Turner	a010f5cef0	[msf] Rename Msf to MSF. In a previous patch, it was suggested to use all caps instead of rolling caps for initialisms, so this patch changes everything to do this. llvm-svn: 277190	2016-07-29 18:24:26 +00:00
Zachary Turner	9f73c20228	[pdb] Fix an ambiguity when writing size_t on x64 platforms. llvm-svn: 277025	2016-07-28 19:29:52 +00:00
Zachary Turner	e98137c47f	[pdb] Fix some warnings that break -Werror builds. llvm-svn: 277021	2016-07-28 19:18:02 +00:00
Zachary Turner	d66889cbae	[pdb] Refactor library to more clearly separate reading/writing Reviewed By: amccarth, ruiu Differential Revision: https://reviews.llvm.org/D22693 llvm-svn: 277019	2016-07-28 19:12:28 +00:00
Zachary Turner	199f48a5f0	Get rid of IMsfStreamData class. This was a pure virtual base class whose purpose was to abstract away the notion of how you retrieve the layout of a discontiguous stream of blocks in an Msf file. This led to too many layers of abstraction making it difficult to figure out what was going on and extend things. Ultimately, a stream's layout is decided by its length and the array of block numbers that it lives on. So rather than have an abstract base class which can return this in any number of ways, it's more straightforward to simply store them as fields of a trivial struct, and also to give a more appropriate name. This patch does that. It renames IMsfStreamData to MsfStreamLayout, and deletes the 2 concrete implementations, DirectoryStreamData and IndexedStreamData. MsfStreamLayout is a trivial struct with the necessary data. llvm-svn: 277018	2016-07-28 19:11:09 +00:00
Vassil Vassilev	fe68d81709	[modules] Add missing includes. llvm-svn: 276970	2016-07-28 10:26:33 +00:00
Zachary Turner	e4a4f33daf	Make PDBFile store an msf::Layout. Previously it was storing all the fields of an msf::Layout as separate members. This is a trivial cleanup to make it store an msf::Layout directly. This makes the code more readable since it becomes clear which fields of PDBFile are actually the msf specific layout information in a sea of other bookkeeping fields. llvm-svn: 276460	2016-07-22 19:56:33 +00:00
Zachary Turner	e109dc63f9	[pdb] Have builders share a single BumpPtrAllocator. This makes it easier to have the writable and readable PDB interfaces share code since the read/write and write-only interfaces now share a single allocator, you don't have to worry about a builder building a read only interface and then having the read-only interface's data become corrupt when the builder goes out of scope. Now the allocator is specified explicitly to all constructors, so all interfaces can share a single allocator that is scoped appropriately. llvm-svn: 276459	2016-07-22 19:56:26 +00:00
Zachary Turner	bac69d33d0	[msf] Create LLVMDebugInfoMsf This provides a better layering of responsibilities among different aspects of PDB writing code. Some of the MSF related code was contained in CodeView, and some was in PDB prior to this. Further, we were often saying PDB when we meant MSF, and the two are actually independent of each other since in theory you can have other types of data besides PDB data in an MSF. So, this patch separates the MSF specific code into its own library, with no dependencies on anything else, and DebugInfoCodeView and DebugInfoPDB take dependencies on DebugInfoMsf. llvm-svn: 276458	2016-07-22 19:56:05 +00:00
Zachary Turner	b383d628df	[pdb] Move file layout header structs to RawTypes.h This facilitates code reuse between the builder classes and the "frozen" read only versions of the classes used for parsing existing PDB files. llvm-svn: 276427	2016-07-22 15:46:46 +00:00
Zachary Turner	d218c26124	[pdb] Round-trip module & file info to/from YAML. This implements support for writing compiland and compiland source file info to a binary PDB. This is tested by adding support for dumping these fields from an existing PDB to yaml, reading them back in, and dumping them again and verifying the values are as expected. llvm-svn: 276426	2016-07-22 15:46:37 +00:00
Pete Cooper	b2ba776aed	Avoid dsymutil calls to getFileNameByIndex. This change adds a hasFileAtIndex method. getChildDeclContext can first call this method, and if it returns true it knows it can then lookup the resolved path cache for the given file index. If we hit that cache then we don't even have to call getFileNameByIndex. Running dsymutil against the swift executable built from github gives a 20% performance improvement without any change in the binary. Differential Revision: https://reviews.llvm.org/D22655 Reviewed by friss. llvm-svn: 276380	2016-07-22 01:41:32 +00:00
Zachary Turner	b927e02e1b	[pdb] Teach MsfBuilder and other classes about the Free Page Map. Block 1 and 2 of an MSF file are bit vectors that represent the list of blocks allocated and free in the file. We had been using these blocks to write stream data and other data, so we mark them as the free page map now. We don't yet serialize these pages to the disk, but at least we make a note of what it is, and avoid writing random data to them. Doing this also necessitated cleaning up some of the tests to be more general and hardcode fewer values, which is nice. llvm-svn: 275629	2016-07-15 22:17:19 +00:00
Zachary Turner	5e534c7fb3	[pdb] Round trip the NameMap data structure to YAML. llvm-svn: 275628	2016-07-15 22:17:08 +00:00
Zachary Turner	faa554b2fd	[pdb] Use MsfBuilder to handle the writing PDBs. Previously we would read a PDB, then write some of it back out, but write the directory, super block, and other pertinent metadata back out unchanged. This generates incorrect PDBs since the amount of data written was not always the same as the amount of data read. This patch changes things to use the newly introduced `MsfBuilder` class to write out a correct and accurate set of Msf metadata for the data actually written, which opens up the door for adding and removing type records, symbol records, and other types of data to an existing PDB. llvm-svn: 275627	2016-07-15 22:16:56 +00:00
Saleem Abdulrasool	ea6a4fe841	DebugInfo: reorder some initializers Fix a few initialization ordering warnings from gcc from `-Wreorder`. NFC. llvm-svn: 275615	2016-07-15 21:10:31 +00:00
Zachary Turner	f52a899f4a	[pdb] Introduce MsfBuilder for laying out PDB files. Reviewed by: ruiu Differential Revision: https://reviews.llvm.org/D22308 llvm-svn: 275611	2016-07-15 20:43:38 +00:00
Rui Ueyama	dbdfe62c3f	Dump enum unique names. llvm-svn: 275152	2016-07-12 03:33:48 +00:00
Rui Ueyama	ef5ec2da4a	Re-enable TPI hash verification for enum records. We didn't read unique names correctly. As a result, we computed hashes on (non-)unique names instead of unique names. llvm-svn: 275150	2016-07-12 03:25:03 +00:00
Zachary Turner	dbeaea7b35	Refactor the PDB writing to use a builder approach llvm-svn: 275110	2016-07-11 21:45:26 +00:00
Benjamin Kramer	4d09892e9a	Give helper classes/functions internal linkage. NFC. llvm-svn: 275014	2016-07-10 11:28:51 +00:00
David Majnemer	1b79e9a5b9	[pdb] Sanity check the stream map Some abstractions in LLVM "know" that they are reading in-bounds, FixedStreamArray, and provide a simple result. This breaks down if the stream map is bogus. llvm-svn: 275010	2016-07-10 05:32:05 +00:00
David Majnemer	6211b1f1f9	[llvm-pdbdump] Propagate errors a little more consistently PDBFile::getBlockData didn't really return any indication that it failed. It merely returned an empty buffer. llvm-svn: 275009	2016-07-10 03:34:47 +00:00
David Majnemer	7abd269aa9	[CodeView] Emit an appropriate symbol kind for globals We emitted debug info for globals/functions as if they all had external linkage. Instead, emit local symbol records when appropriate. llvm-svn: 274676	2016-07-06 21:07:47 +00:00
Zachary Turner	8848a7a6b2	[pdb] Round trip the PDB stream between YAML and binary PDB. This gets writing of the PDB stream working. llvm-svn: 274647	2016-07-06 18:05:57 +00:00
Zachary Turner	fbabf2d040	Disable hash verification of enums. llvm-svn: 274639	2016-07-06 17:25:12 +00:00
Reid Kleckner	dafc5d75ea	Prune RelocVisitor.h include to avoid including COFF.h from MCJIT.h This helps to mitigate the conflict between COFF.h and winnt.h, which is PR28399. llvm-svn: 274637	2016-07-06 16:56:42 +00:00
Reid Kleckner	6e96a4c64a	[pdb] Check the display name for <unnamed-tag>, not the linkage name This issue was encountered on libcmt.pdb, which has a type record that looks like this: Struct (0x1094) { TypeLeafKind: LF_STRUCTURE (0x1505) MemberCount: 3 Properties [ (0x200) HasUniqueName (0x200) ] FieldList: <field list> (0x1093) DerivedFrom: 0x0 VShape: 0x0 SizeOf: 4 Name: <unnamed-tag> LinkageName: .?AU<unnamed-tag>@@ } The checks for startswith/endswith "<unnamed-tag>" should look at the display name, not the linkage name. llvm-svn: 274376	2016-07-01 18:43:29 +00:00
Reid Kleckner	64b16171df	[pdb] Avoid reporting an error when the module symbol stream is empty llvm-svn: 274309	2016-07-01 00:37:49 +00:00
Reid Kleckner	7aa95a9fca	[PDB] Indicate which type record failed hash validation llvm-svn: 274308	2016-07-01 00:37:25 +00:00
Zachary Turner	ab58ae8730	[pdb] Re-add code to write PDB files. Somehow all the functionality to write PDB files got removed, probably accidentally when uploading the patch perhaps the wrong one got uploaded. This re-adds all the code, as well as the corresponding test. llvm-svn: 274248	2016-06-30 17:43:00 +00:00
David Majnemer	f15064871a	[CodeView] Healthy paranoia around strings Make sure strings don't get too big for a record, truncate them if need-be. llvm-svn: 273710	2016-06-24 19:34:41 +00:00
Kevin Enderby	931cb65df2	Thread Expected<...> up from libObject’s getSymbolAddress() for symbols to allow a good error message to be produced. This is nearly the last libObject interface that used ErrorOr and the last one that appears in llvm/include/llvm/Object/MachO.h . For Mach-O objects this is just a clean up because it’s version of getSymbolAddress() can’t return an error. I will leave it to the experts on COFF and ELF to actually add meaning full error messages in their tests if they wish. And also leave it to these experts to change the last two ErrorOr interfaces in llvm/include/llvm/Object/ObjectFile.h for createCOFFObjectFile() and createELFObjectFile() if they wish. Since there are no test cases for COFF and ELF error cases with respect to getSymbolAddress() in the test suite this is no functional change (NFC). llvm-svn: 273701	2016-06-24 18:24:42 +00:00
Reid Kleckner	33848faa5e	[codeview] Use one byte for S_FRAMECOOKIE CookieKind and add flags byte We bailed out while printing codeview for an MSVC compiled SemaExprCXX.cpp that used this record. The MS reference headers look incorrect here, which is probably why we had this bug. They use a 32-bit enum as the field type, but the actual record appears to use one byte for the cookie kind followed by a flags byte. llvm-svn: 273691	2016-06-24 17:23:49 +00:00
Reid Kleckner	5aba52ff21	[pdb] Treat a stream size of ~0U as 0 My PDBs always have this size for stream 11. Not sure why. llvm-svn: 273504	2016-06-22 22:42:24 +00:00
Reid Kleckner	ac460619d2	[codeview] Fix the alignment padding that we add to list records Tweak the big-types.ll test case to catch this bug. We just need an enumerator name that doesn't have a length that is a multiple of 4. llvm-svn: 273477	2016-06-22 20:59:17 +00:00
Reid Kleckner	5b335b864b	[codeview] Add support for splitting field list records over 64KB The basic structure is that once a list record goes over 64K, the last subrecord of the list is an LF_INDEX record that refers to the next record. Because the type record graph must be toplogically sorted, this means we have to emit them in reverse order. We build the type record in order of declaration, so this means that if we don't want extra copies, we need to detect when we were about to split a record, and leave space for a continuation subrecord that will point to the eventual split top-level record. Also adds dumping support for these records. Next we should make sure that large method overload lists work properly. llvm-svn: 273294	2016-06-21 18:33:01 +00:00
Rui Ueyama	1abbb31bd4	[codeview] Add an extra check for TPI hash values. This patch adds a function that corresponds to `fUDTAnon` and use that to compute TPI hash values as the reference does. llvm-svn: 273139	2016-06-20 07:31:29 +00:00
Reid Kleckner	604105bb90	[codeview] Add DIFlags for pointer to member representations Summary: This seems like the least intrusive way to pass this information through. Fixes PR28151 Reviewers: majnemer, aprantl, dblaikie Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D21444 llvm-svn: 273053	2016-06-17 21:31:33 +00:00
Reid Kleckner	11582c59d7	[pdb] Don't error on missing FPO streams 64-bit PDBs never have FPO data. They have xdata instead. Also improve error recovery of stream summary dumping while I'm here. llvm-svn: 273046	2016-06-17 20:38:01 +00:00
Rui Ueyama	74c4341dde	[codeview] Use hashBufferV8 to verify all type records. Differential Revision: http://reviews.llvm.org/D21393 llvm-svn: 272930	2016-06-16 18:39:17 +00:00
Zachary Turner	01ee3dae04	Resubmit "[pdb] Change type visitor pattern to be dynamic." There was a regression introduced during type stream merging when visiting a field list record. This has been fixed in this patch. llvm-svn: 272929	2016-06-16 18:22:27 +00:00
Zachary Turner	73b0b2f555	Revert "[pdb] Change type visitor pattern to be dynamic." This reverts commit fb0dd311e1ad945827b8ffd5354f4810e2be1579. This breaks some llvm-readobj tests. llvm-svn: 272927	2016-06-16 18:09:04 +00:00
Zachary Turner	1f6372c429	[pdb] Change type visitor pattern to be dynamic. This allows better catching of compiler errors since we can use the override keyword to verify that methods are actually overridden. Also in this patch I've changed from storing a boolean Error code everywhere to returning an llvm::Error, to propagate richer error information up the call stack. Reviewed By: ruiu, rnk Differential Revision: http://reviews.llvm.org/D21410 llvm-svn: 272926	2016-06-16 18:00:28 +00:00
Rui Ueyama	43ed08efa3	[codeview] Pass CVRecord to visitTypeBegin callback. Both parameters to visitTypeBegin are actually members of CVRecord, so we can just pass CVRecord instead of destructuring it. Differential Revision: http://reviews.llvm.org/D21435 llvm-svn: 272899	2016-06-16 14:47:23 +00:00
Rui Ueyama	b9095ae7ee	[codeview] Remove unused parameter. Differential Revision: http://reviews.llvm.org/D21433 llvm-svn: 272898	2016-06-16 14:41:22 +00:00
Rui Ueyama	5c7248c959	Implement pdb::hashBufferV8 hash function. llvm-svn: 272894	2016-06-16 13:48:16 +00:00
Rui Ueyama	9caea82d3e	Remove redundant namespace specifiers. llvm-svn: 272889	2016-06-16 13:17:59 +00:00
Rui Ueyama	8b0ae136e2	[codeview] Use CVTypeVisitor instead of a hand-written switch-cases. Differential Revision: http://reviews.llvm.org/D21418 llvm-svn: 272888	2016-06-16 13:14:42 +00:00
Rui Ueyama	5dbea9db10	[Codeview] Add a class for LF_UDT_MOD_SRC_LINE. Differential Revision: http://reviews.llvm.org/D21406 llvm-svn: 272843	2016-06-15 21:25:29 +00:00
Reid Kleckner	b82f08fa3d	Axe some trailing whitespace from my last commit llvm-svn: 272830	2016-06-15 20:32:42 +00:00
Reid Kleckner	828c4f64e2	[codeview] Move deserialization methods out of line They aren't performance critical and don't need to be inline. llvm-svn: 272829	2016-06-15 20:30:34 +00:00
Rui Ueyama	41974f1e4d	[pdbdump] Verify LF_{CLASS,ENUM,INTERFACE,STRUCTURE,UNION} records. Differential Revision: http://reviews.llvm.org/D21361 llvm-svn: 272815	2016-06-15 18:26:59 +00:00
Rui Ueyama	9f3e96115c	[pdbdump] Verify TPI hash for LF_ENUM type records. llvm-svn: 272728	2016-06-14 22:25:07 +00:00
Zachary Turner	1dc9fd3c4a	Resubmit "[pdb] Actually write a PDB to disk from YAML."" Reviewed By: ruiu Differential Revision: http://reviews.llvm.org/D21220 llvm-svn: 272708	2016-06-14 20:48:36 +00:00
Zachary Turner	07c229c9e7	Revert "[pdb] Actually write a PDB to disk from YAML." This reverts commit 879139e1c6577b09df52de56a6bab856a19ed185. This was committed accidentally when I blindly typed git svn dcommit instead of the command to generate a patch. llvm-svn: 272693	2016-06-14 18:51:35 +00:00
Zachary Turner	fe5bc02492	[pdb] Actually write a PDB to disk from YAML. llvm-svn: 272692	2016-06-14 18:49:36 +00:00
Zachary Turner	97609bb2fd	[pdb] Fix issues with pdb writing. This fixes an alignment issue by forcing all cached allocations to be 8 byte aligned, and also fixes an issue arising on big endian systems by writing ulittle32_t's instead of uint32_t's in the test. llvm-svn: 272437	2016-06-10 21:47:26 +00:00
Zachary Turner	b84faa8baa	Make PDBFile take a StreamInterface instead of a MemBuffer. This is the next step towards being able to write PDBs. MemoryBuffer is immutable, and StreamInterface is our replacement which can be any combination of read-only, read-write, or write-only depending on the particular implementation. The one place where we were creating a PDBFile (in RawSession) is updated to subclass ByteStream with a simple adapter that holds a MemoryBuffer, and initializes the superclass with the buffer's array, so that all the functionality of ByteStream works transparently. llvm-svn: 272370	2016-06-10 05:10:19 +00:00
Zachary Turner	5acb4ac6d7	Add support for writing through StreamInterface. This adds method and tests for writing to a PDB stream. With this, even a PDB stream which is discontiguous can be treated as a sequential stream of bytes for the purposes of writing. Reviewed By: ruiu Differential Revision: http://reviews.llvm.org/D21157 llvm-svn: 272369	2016-06-10 05:09:12 +00:00
Rui Ueyama	c41cd6dcf7	[pdbdump] Verify part of TPI hash streams. TPI hash table contains a parallel array for the type records. For each type record R, a hash value is calculated by `H(R) % NumBuckets` where H is a hash function, and the result is stored to a bucket element. H is TPI1::hashPrec function in microsoft-pdb repository. Our hash function does not support all type record types yet. Currently it supports only records for line number. I'll extend it in a follow up patch. The aim of verify the hash table is not only detect corrupted files. It ensures that our understanding of how the hash values are calculated is correct. llvm-svn: 272229	2016-06-09 00:10:19 +00:00
Rui Ueyama	f05f360deb	Function names should start with lowercase letters. llvm-svn: 272225	2016-06-08 23:15:09 +00:00
Rui Ueyama	170988f21f	[PDB] Move PDB functions to a separate file. We are going to use the hash functions from TPI streams. Differential Revision: http://reviews.llvm.org/D21142 llvm-svn: 272223	2016-06-08 23:11:14 +00:00
Benjamin Kramer	c321e53402	Apply most suggestions of clang-tidy's performance-unnecessary-value-param Avoids unnecessary copies. All changes audited & pass tests with asan. No functional change intended. llvm-svn: 272190	2016-06-08 19:09:22 +00:00
Zachary Turner	a1657a9e64	[pdb] Handle stream index errors better. Reviewed By: ruiu Differential Revision: http://reviews.llvm.org/D21128 llvm-svn: 272172	2016-06-08 17:26:39 +00:00
Rui Ueyama	ced0853b46	Remove a patch .rej file. llvm-svn: 272171	2016-06-08 16:54:31 +00:00
Zachary Turner	d2b2bfed94	[pdb] Try to fix use after free. llvm-svn: 272078	2016-06-08 00:25:08 +00:00
Rui Ueyama	f14a74c102	[pdbdump] Print out # of hash buckets. In the reference code, the field name is `cHashBuckets`. llvm-svn: 272075	2016-06-07 23:53:43 +00:00
Rui Ueyama	d833917f98	[pdbdump] Print out TPI hash key size. llvm-svn: 272073	2016-06-07 23:44:27 +00:00
Zachary Turner	e6fee88ce1	[pdb] Convert StringRefs to ArrayRef<uint8_t>s. llvm-svn: 272058	2016-06-07 20:38:37 +00:00
Zachary Turner	5839503f08	[pdb] Fix a potential overflow and remove unnecessary comments. llvm-svn: 272043	2016-06-07 18:42:39 +00:00
Zachary Turner	d8447990b0	[pdb] Use MappedBlockStream to parse the PDB directory. In order to efficiently write PDBs, we need to be able to make a StreamWriter class similar to a StreamReader, which can transparently deal with writing to discontiguous streams, and we need to use this for all writing, similar to how we use StreamReader for all reading. Most discontiguous streams are the typical numbered streams that appear in a PDB file and are described by the directory, but the exception to this, that until now has been parsed by hand, is the directory itself. MappedBlockStream works by querying the directory to find out which blocks a stream occupies and various other things, so naturally the same logic could not possibly work to describe the blocks that the directory itself resided on. To solve this, I've introduced an abstraction IPDBStreamData, which allows the client to query for the list of blocks occupied by the stream, as well as the stream length. I provide two implementations of this: one which queries the directory (for indexed streams), and one which queries the super block (for the directory stream). This has the side benefit of vastly simplifying the code to parse the directory. Whereas before a mini state machine was rolled by hand, now we simply use FixedStreamArray to read out the stream sizes, then build a vector of FixedStreamArrays for the stream map, all in just a few lines of code. Reviewed By: ruiu Differential Revision: http://reviews.llvm.org/D21046 llvm-svn: 271982	2016-06-07 05:28:55 +00:00
Rui Ueyama	4a1ebae537	Add comments. llvm-svn: 271967	2016-06-07 00:59:04 +00:00
Reid Kleckner	4ece163c92	Try one more time to pacify -Wpessimizing-move, MSVC, libstdc++4.7, and the world without a named variable llvm-svn: 271964	2016-06-06 23:46:14 +00:00
Reid Kleckner	52a155fca3	Attempt to work around lack of std::map::emplace in libstdc++4.7 llvm-svn: 271958	2016-06-06 23:28:03 +00:00
Rui Ueyama	ba0aab94cc	[pdbdump] Verify the size of TPI hash records. llvm-svn: 271954	2016-06-06 23:19:23 +00:00
Rui Ueyama	ef2b488482	[pdbdump] Print out New FPO stream contents. The data strucutre in the new FPO stream is described in the PE/COFF spec. There is one record per function if frame pointer is omitted. Differential Revision: http://reviews.llvm.org/D20999 llvm-svn: 271926	2016-06-06 18:39:21 +00:00
David Majnemer	36b7b08d4f	[DebugInfo, PDB] Use sparse bitfields for the name map The name map might not be densely packed on disk. Using a sparse map will save memory in such situations. llvm-svn: 271811	2016-06-04 22:47:39 +00:00
David Majnemer	862a8ae812	[CodeView] Fix a busted assert in TypeTableBuilder::writeClass It was checking for Union when it should have checked for Interface. llvm-svn: 271792	2016-06-04 15:40:31 +00:00
David Majnemer	067e3d0cc5	[TypeStreamMerger] visitUnknownMember was supposed to be visitUnknownType llvm-svn: 271790	2016-06-04 15:40:27 +00:00
Rui Ueyama	fd97bf1f76	pdbdump: print out TPI hashes. Differential Revision: http://reviews.llvm.org/D20945 llvm-svn: 271736	2016-06-03 20:48:51 +00:00
Reid Kleckner	ab1dfaae06	Fix non-Windows build when inserting a move only type into a map llvm-svn: 271727	2016-06-03 20:29:51 +00:00
Reid Kleckner	f27f3f8491	[Symbolize] Check if the PE file has a PDB and emit an error if we can't load it Summary: Previously we would try to load PDBs for every PE executable we tried to symbolize. If that failed, we would fall back to DWARF. If there wasn't any DWARF, we'd print mostly useless symbol information using the export table. With this change, we only try to load PDBs for executables that claim to have them. If that fails, we can now print an error rather than falling back silently. This should make it a lot easier to diagnose and fix common symbolization issues, such as not having DIA or not having a PDB. Reviewers: zturner, eugenis Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D20982 llvm-svn: 271725	2016-06-03 20:25:09 +00:00
Reid Kleckner	a8d5740757	[codeview] Add basic record type translation This only translates data members for now. Translating overloaded methods is complicated, so I stopped short of doing that. Reviewers: aaboud Differential Revision: http://reviews.llvm.org/D20924 llvm-svn: 271680	2016-06-03 15:58:20 +00:00
Zachary Turner	3df1bfaaec	[pdb] Print out file names instead of file offsets. When printing line information and file checksums, we were printing the file offset field from the struct header. This teaches llvm-pdbdump how to turn those numbers into the filename. In the case of file checksums, this is done by looking in the global string table. In the case of line contributions, this is done by indexing into the file names buffer of the DBI stream. Why they use a different technique I don't know. llvm-svn: 271630	2016-06-03 05:52:57 +00:00
Zachary Turner	d0563f29f9	[pdb] Dump file checksums from pdb codeview line info. llvm-svn: 271622	2016-06-03 04:01:48 +00:00
Zachary Turner	a96cce64a5	[codeview] Dump line number and column information. To facilitate this, a couple of changes had to be made: 1. `ModuleSubstream` got moved from `DebugInfo/PDB` to `DebugInfo/CodeView`, and various codeview related types are defined there. It turns out `DebugInfo/CodeView/Line.h` already defines many of these structures, but this is really old code that is not endian aware, doesn't interact well with `StreamInterface` and not very helpful for getting stuff out of a PDB. Eventually we should migrate the old readobj `COFFDumper` code to these new structures, or at least merge their functionality somehow. 2. A `ModuleSubstream` visitor is introduced. Depending on where your module substream array comes from, different subsets of record types can be expected. We are already hand parsing these substream arrays in many places especially in `COFFDumper.cpp`. In the future we can migrate these paths to the visitor as well, which should reduce a lot of code in `COFFDumper.cpp`. Differential Revision: http://reviews.llvm.org/D20936 Reviewed By: ruiu, majnemer llvm-svn: 271621	2016-06-03 03:25:59 +00:00
Rui Ueyama	0350bf0966	Add comments. llvm-svn: 271597	2016-06-02 21:13:47 +00:00
Zachary Turner	7eb6d358af	[llvm-pdbdump] Dump CodeView line information. This first pass only splits apart the records and dumps the line info kinds and binary data. Subsequent patches will parse out the binary data into more useful information and dump it in detail. llvm-svn: 271576	2016-06-02 20:11:22 +00:00
Zachary Turner	f4e9c9ac08	[codeview] Fix a nasty use after free. StreamRef was designed to be a thin wrapper over an abstract stream interface that could itself be treated the same as any other stream interface. For this reason, it inherited publicly from StreamInterface, and stored a StreamInterface* internally. But StreamRef was also designed to be lightweight and easily copyable, similar to ArrayRef. This led to two misuses of the classes. 1) When creating a StreamRef A from another StreamRef B, it was possible to end up with A storing a pointer to B, even when B was a temporary object, leading to use after free. 2) The above situation could be repeated ad nauseum, so that A stores a pointer to B, which itself stores a pointer to another StreamRef C, and so on and so on, creating an unnecessarily level of nesting depth. This patch removes the public inheritance relationship between StreamRef and StreamInterface, making it so that we can never accidentally convert a StreamRef to a StreamInterface. llvm-svn: 271570	2016-06-02 19:51:48 +00:00
David Majnemer	b68f32f0cf	[CodeView] Use None instead of Void if there is no subprogram llvm-svn: 271566	2016-06-02 18:51:24 +00:00
Rui Ueyama	90db78816b	pdbdump: print out COFF section headers. Unlike other sections that can grow to any size, the COFF section header stream has maximum length because each record is fixed size and the COFF file format limits the maximum number of sections. So I decided to not create a specific stream class for it. Instead, I added a member function to DbiStream class which returns a vector of COFF headers. Differential Revision: http://reviews.llvm.org/D20717 llvm-svn: 271557	2016-06-02 18:20:20 +00:00
Zachary Turner	93839cb4ac	[pdb] Parse and dump section map and section contribs Differential Revision: http://reviews.llvm.org/D20876 Reviewed By: rnk, ruiu llvm-svn: 271488	2016-06-02 05:07:49 +00:00
David Majnemer	a7c29321be	[PDB] Make ModStream::symbols report errors llvm-svn: 271417	2016-06-01 18:13:04 +00:00
Zachary Turner	90b8b8db2e	[pdb] Add unit tests for PDB MappedBlockStream and zero copy Differential Revision: http://reviews.llvm.org/D20837 Reviewed By: ruiu llvm-svn: 271346	2016-05-31 22:41:52 +00:00
Kevin Enderby	9acb109930	Change llvm-objdump, llvm-nm and llvm-size when reporting an object file error when the object is from a slice of a Mach-O Universal Binary use something like "foo.o (for architecture i386)" as part of the error message when expected. Also fixed places in these tools that were ignoring object file errors from MachOUniversalBinary::getAsObjectFile() when the code moved on to see if the slice was an archive. To do this MachOUniversalBinary::getAsObjectFile() and MachOUniversalBinary::getObjectForArch() were changed from returning ErrorOr<...> to Expected<...> then that was threaded up to its users. Converting these interfaces to Expected<> from ErrorOr<> does involve touching a number of places. To contain the changes for now the use of errorToErrorCode() is still used in two places yet to be fully converted. llvm-svn: 271332	2016-05-31 20:35:34 +00:00
Reid Kleckner	fbdbe9e22b	[codeview] Improve readability of type record assembly Adds the method MCStreamer::EmitBinaryData, which is usually an alias for EmitBytes. In the MCAsmStreamer case, it is overridden to emit hex dump output like this: .byte 0x0e, 0x00, 0x08, 0x10 .byte 0x03, 0x00, 0x00, 0x00 .byte 0x00, 0x00, 0x00, 0x00 .byte 0x00, 0x10, 0x00, 0x00 Also, when verbose asm comments are enabled, this patch prints the dump output for each comment before its record, like this: # ArgList (0x1000) { # TypeLeafKind: LF_ARGLIST (0x1201) # NumArgs: 0 # Arguments [ # ] # } .byte 0x06, 0x00, 0x01, 0x12 .byte 0x00, 0x00, 0x00, 0x00 This should make debugging easier and testing more convenient. Reviewers: aaboud Subscribers: majnemer, zturner, amccarth, aaboud, llvm-commits Differential Revision: http://reviews.llvm.org/D20711 llvm-svn: 271313	2016-05-31 18:45:36 +00:00
Reid Kleckner	3b3f490f9c	[codeview] Add a CVTypeDumper::dump(ArrayRef<uint8_t>) overload This is a convenient wrapper when the type record is already laid out as bytes in memory. llvm-svn: 271309	2016-05-31 18:15:23 +00:00
David Majnemer	ba1439229a	Make sure we don't add an empty string to the stringmap llvm-svn: 271172	2016-05-29 06:18:06 +00:00
David Majnemer	c6cb2ec36e	[SymbolDumper] Validate the string table offset before using it llvm-svn: 271145	2016-05-28 20:04:46 +00:00
David Majnemer	b343310b4f	[SymbolDumper] Validate the string table offset before using it llvm-svn: 271142	2016-05-28 19:45:56 +00:00
David Majnemer	328b6d3903	Tighten some of the name map checks further llvm-svn: 271130	2016-05-28 18:03:37 +00:00
David Majnemer	869631f987	Bounds check the number of bitmap blocks in the name map llvm-svn: 271105	2016-05-28 05:59:25 +00:00
David Majnemer	7e950b261a	Make sure the directory contains info for all streams llvm-svn: 271103	2016-05-28 05:59:19 +00:00
Zachary Turner	0d43c1c339	[pdb] Finish conversion to zero copy pdb access. This converts remaining uses of ByteStream, which was still left in the symbol stream and type stream, to using the new StreamInterface zero-copy classes. RecordIterator is finally deleted, so this is the only way left now. Additionally, more error checking is added when iterating the various streams. With this, the transition to zero copy pdb access is complete. llvm-svn: 271101	2016-05-28 05:21:57 +00:00
David Majnemer	74b1fb00f7	Don't discard errors llvm-svn: 271056	2016-05-27 22:07:50 +00:00
Zachary Turner	7dd42598be	[pdb] Fix size check when reading stream bytes. We were accidentally bounds checking the read against the output ArrayRef instead of against the size of the read. llvm-svn: 271040	2016-05-27 20:17:33 +00:00
David Majnemer	6c13db402f	Make sure data is available before dereferencing it llvm-svn: 271032	2016-05-27 18:50:02 +00:00
Zachary Turner	1de49c9ffd	Resubmit "[pdb] Allow zero-copy read support for symbol streams."" Due to differences in template instantiation rules, it is not portable to static_assert(false) inside of an invalid specialization of a template. Instead I just =delete the method so that it can't be used, and leave a comment that it must be explicitly specialized. llvm-svn: 271027	2016-05-27 18:47:20 +00:00
Chad Rosier	6c247c8cc8	Revert "[pdb] Allow zero-copy read support for symbol streams." This reverts commit r271024 due to error: static_assert failed "You must either provide a specialization of VarStreamArrayExtractor or a custom extractor" llvm-svn: 271026	2016-05-27 18:31:02 +00:00
Zachary Turner	3a9a23ae62	[pdb] Allow zero-copy read support for symbol streams. This reduces the amount of memory used by llvm-pdbdump by roughly 1/3 of the size of the PDB file. Differential Revision: http://reviews.llvm.org/D20724 Reviewed By: ruiu llvm-svn: 271025	2016-05-27 18:20:20 +00:00
David Majnemer	836937ed79	Make sure these error codes are marked as checked llvm-svn: 271013	2016-05-27 16:16:56 +00:00
David Majnemer	9efba74778	Make sure there are enough blocks for the stream llvm-svn: 271012	2016-05-27 16:16:48 +00:00
David Majnemer	5d842ea68e	Make sure the directory block array fits in the file llvm-svn: 271011	2016-05-27 16:16:45 +00:00
David Majnemer	878cadb663	Validate the blocksize before using it The blocksize could be zero on disk causing later checks to divide by zero. llvm-svn: 271008	2016-05-27 15:57:38 +00:00
Benjamin Kramer	82de7d323d	Apply clang-tidy's misc-move-constructor-init throughout LLVM. No functionality change intended, maybe a tiny performance improvement. llvm-svn: 270997	2016-05-27 14:27:24 +00:00
Zachary Turner	b393d95359	[codeview] Remove StreamReader copying method. Since we want to move toward zero-copy access to stream data, we want to remove all instances of copying operations. So get rid of some of those here. Differential Revision: http://reviews.llvm.org/D20720 Reviewed By: ruiu llvm-svn: 270960	2016-05-27 03:51:53 +00:00
Zachary Turner	8dbe3629a0	[codeview,pdb] Try really hard to conserve memory when reading. PDBs can be extremely large. We're already mapping the entire PDB into the process's address space, but to make matters worse the blocks of the PDB are not arranged contiguously. So, when we have something like an array or a string embedded into the stream, we have to make a copy. Since it's convenient to use traditional data structures to iterate and manipulate these records, we need the memory to be contiguous. As a result of this, we were using roughly twice as much memory as the file size of the PDB, because every stream was copied out and re-stitched together contiguously. This patch addresses this by improving the MappedBlockStream to allocate from a BumpPtrAllocator only when a read requires a discontiguous read. Furthermore, it introduces some data structures backed by a stream which can iterate over both fixed and variable length records of a PDB. Since everything is backed by a stream and not a buffer, we can read almost everything from the PDB with zero copies. Differential Revision: http://reviews.llvm.org/D20654 Reviewed By: ruiu llvm-svn: 270951	2016-05-27 01:54:44 +00:00
Zachary Turner	d5d37dcf83	[codeview] Move StreamInterface and StreamReader to libcodeview. We have need to reuse this functionality, including making additional generic stream types that are smarter about how and when they copy memory versus referencing the original memory. So all of these structures belong in the common library rather than being pdb specific. llvm-svn: 270751	2016-05-25 20:37:03 +00:00
Zachary Turner	d3076ab36f	[llvm-pdbdump] Decipher the remaining PDB streams. We know at least know the meaning of every stream of the PDB file. Yay! llvm-svn: 270669	2016-05-25 05:49:48 +00:00
Zachary Turner	c9972c64f5	[llvm-pdbdump] Dump the IPI stream and all records. llvm-svn: 270661	2016-05-25 04:35:22 +00:00
Rui Ueyama	b12b158f20	pdbdump: fix bug in name hash table. name_ids() did not return all IDs but only the first NameCount items. The number of non-zero entries in IDs vector is NameCount, but it does not mean that all non-zero entries are at the beginning of IDs vector. Differential Revision: http://reviews.llvm.org/D20611 llvm-svn: 270656	2016-05-25 04:07:17 +00:00
Zachary Turner	c59261ca37	[llvm-pdbdump] Stream 0 isn't actually the MSF superblock. Oddly enough, I realized we don't actually know what stream 0 is (if anything). llvm-svn: 270655	2016-05-25 03:53:16 +00:00
Zachary Turner	85ed80b9e6	[llvm-pdbdump] Dump stream summary list. Try to figure out what each stream is, and dump its name. This gives us a better picture of what streams we still don't understand. llvm-svn: 270653	2016-05-25 03:43:17 +00:00
Zachary Turner	172d59c105	[codeview] Add support for new types and symbols. This patch adds support for: S_EXPORT LF_BITFIELD With this patch, I have run through a couple of gigabytes of PDB files and cannot find a type or symbol that we do not understand. llvm-svn: 270637	2016-05-25 00:12:48 +00:00
Zachary Turner	9f054d424f	[codeview] Add support for S_EXPORT symbol. llvm-svn: 270636	2016-05-25 00:12:40 +00:00
Zachary Turner	4caa1bf0bd	[codeview] Add support for new type records. This adds support for parsing and dumping the following symbol types: S_LPROCREF S_ENVBLOCK S_COMPILE2 S_REGISTER S_COFFGROUP S_SECTION S_THUNK32 S_TRAMPOLINE As of this patch, the test PDB files no longer have any unknown symbol types. llvm-svn: 270628	2016-05-24 22:58:46 +00:00
Zachary Turner	96e60f7573	[llvm-pdbdump] Rework command line options. When dumping huge PDB files, too many of the options were grouped together so you would get neverending spew of output. This patch introduces more granular display options so you can only dump the fields you actually care about. llvm-svn: 270607	2016-05-24 20:31:48 +00:00
Peter Collingbourne	4718f8b5f1	Add FIXMEs to all derived classes of std::error_category. This helps make clear that we're moving away from std::error_code. Differential Revision: http://reviews.llvm.org/D20592 llvm-svn: 270604	2016-05-24 20:13:46 +00:00
Zachary Turner	9e33e6f89b	[codeview, pdb] Dump symbol records in publics stream Differential Revision: http://reviews.llvm.org/D20580 Reviewed By: ruiu llvm-svn: 270597	2016-05-24 18:55:14 +00:00
Zachary Turner	00d847b19e	Fix build errors llvm-svn: 270587	2016-05-24 17:44:29 +00:00
Zachary Turner	cac29ae038	Dump symbol record details in llvm-pdbdump This makes use of the newly introduced `CVSymbolVisitor` to dump details of each type of symbol record in the symbol streams. Future patches will bring this visitor based dumping to the publics stream, as well as creating a `SymbolDumpDelegate` to print more information about relocations etc. Differential Revision: http://reviews.llvm.org/D20545 Reviewed By: ruiu llvm-svn: 270585	2016-05-24 17:30:25 +00:00
George Rimar	401e4e570e	Recommit r270547 ([llvm-dwarfdump] - Teach dwarfdump to decompress debug sections in zlib style.) Fix was: 1) Had to regenerate dwarfdump-test-zlib.elf-x86-64, dwarfdump-test-zlib-gnu.elf-x86-64 (because llvm-symbolizer-zlib.test uses that inputs for its purposes and failed). 2) Updated llvm-symbolizer-zlib.test (updated used call function address to match new files + added one more check for newly created dwarfdump-test-zlib-gnu.elf-x86-64 binary input). 3) Updated comment in dwarfdump-test-zlib.cc. Original commit message: [llvm-dwarfdump] - Teach dwarfdump to decompress debug sections in zlib style. Before this llvm-dwarfdump only recognized zlib-gnu compression style of headers, this patch adds support for zlib style. It looks reasonable to support both styles for dumping, even if we are not going to suport generating of deprecated gnu one. Differential revision: http://reviews.llvm.org/D20470 llvm-svn: 270557	2016-05-24 12:48:46 +00:00
George Rimar	f059dd4f76	Revert r270543 ("Recommit r270540") Failed build bot in another test. I am sorry for noise. http://lab.llvm.org:8080/green/job/clang-stage1-cmake-RA-incremental_check/23679/testReport/junit/LLVM/DebugInfo/llvm_symbolizer_zlib_test/ llvm-svn: 270547	2016-05-24 11:03:10 +00:00
George Rimar	e9b2e19109	Recommit r270540 fix: forgot to commit the updated dwarfdump-test-zlib.elf-x86-64 Original commit message: [llvm-dwarfdump] - Teach dwarfdump to decompress debug sections in zlib style. Before this llvm-dwarfdump only recognized zlib-gnu compression style of headers, this patch adds support for zlib style. It looks reasonable to support both styles for dumping, even if we are not going to suport generating of deprecated gnu one. Differential revision: http://reviews.llvm.org/D20470 llvm-svn: 270543	2016-05-24 10:46:43 +00:00
George Rimar	6a6185fd78	Revert r270540 "[llvm-dwarfdump] - Teach dwarfdump to decompress debug sections in zlib style." it broked bot: http://lab.llvm.org:8011/builders/clang-s390x-linux/builds/5036 llvm-svn: 270541	2016-05-24 09:44:44 +00:00
George Rimar	6bcbf4c572	[llvm-dwarfdump] - Teach dwarfdump to decompress debug sections in zlib style. Before this llvm-dwarfdump only recognized zlib-gnu compression style of headers, this patch adds support for zlib style. It looks reasonable to support both styles for dumping, even if we are not going to suport generating of deprecated gnu one. Differential revision: http://reviews.llvm.org/D20470 llvm-svn: 270540	2016-05-24 09:28:36 +00:00
Zachary Turner	3e78e2d43f	Remove unused variable. llvm-svn: 270516	2016-05-24 00:06:04 +00:00
Zachary Turner	aaad57440d	Make a symbol visitor and use it to dump CV symbols. Differential Revision: http://reviews.llvm.org/D20534 Reviewed By: rnk llvm-svn: 270511	2016-05-23 23:41:13 +00:00
Rui Ueyama	2a58779198	Fix struct member names and simplify. NFC. llvm-svn: 270289	2016-05-20 22:59:05 +00:00
Rui Ueyama	0fcd82605e	pdbdump: print out symbol names referred by publics stream. DBI stream contains a stream number of the symbol record stream. Symbol record streams is an array of length-type-value members. Each member represents one symbol. Publics stream contains offsets to the symbol record stream. This patch is to print out all symbols that are referenced by the publics stream. Note that even with this patch, llvm-pdbdump cannot dump all the information in a publics stream since it contains more information than symbol names. I'll improve it in followup patches. Differential Revision: http://reviews.llvm.org/D20480 llvm-svn: 270262	2016-05-20 19:55:17 +00:00
Reid Kleckner	e1587bce96	Fix -Wmicrosoft-enum-value warning llvm-svn: 270110	2016-05-19 20:20:22 +00:00
Rui Ueyama	0376b1a2d7	pdbdump: Rename NumberOfSymbols -> SymbolRecordStreamIndex. Differential Revision: http://reviews.llvm.org/D20441 llvm-svn: 270088	2016-05-19 18:05:58 +00:00
Rui Ueyama	350b29862f	pdbdump: Print out section offsets in the publics stream. llvm-svn: 269955	2016-05-18 16:24:16 +00:00
Daniel Sanders	016e6c4354	Try again to fix pdbdump-headers.test on big-endian hosts after r269861. r269898 fixed the problem with HashBuckets but the same issue occurred with AddressMap and ThunkMap too. llvm-svn: 269913	2016-05-18 12:36:25 +00:00
Daniel Sanders	c819d903e1	Attempt to fix pdbdump-headers.test on big-endian hosts after r269861. llvm-svn: 269898	2016-05-18 09:59:14 +00:00
Rui Ueyama	8dc18c5f45	pdbdump: Print out more strcutures. I don't yet fully understand the meaning of these data strcutures, but at least it seems that their sizes and types are correct. With this change, we can read publics streams till end. Differential Revision: http://reviews.llvm.org/D20343 llvm-svn: 269861	2016-05-17 23:07:48 +00:00
Reid Kleckner	fcc5550544	[codeview] Test serialization of all known type records This just checks that we emit all type records once, and then after merging the type stream with no other type streams, we still emit every kind of type record. We could test the dumper output more closely, but that would make the test very brittle. Currently we're just getting coverage. llvm-svn: 269778	2016-05-17 16:20:35 +00:00
Benjamin Kramer	a65b610bd2	Move helper classes into anonymous namespaces. NFC. llvm-svn: 269591	2016-05-15 15:18:11 +00:00
Reid Kleckner	0b269748a6	[codeview] Add type stream merging prototype Summary: This code is intended to be used as part of LLD's PDB writing. Until that exists, this is exposed via llvm-readobj for testing purposes. Type stream merging uses the following algorithm: - Begin with a new empty stream, and a new empty hash table that maps from type record contents to new type index. - For each new type stream, maintain a map from source type index to destination type index. - For each record, copy it and rewrite its type indices to be valid in the destination type stream. - If the new type record is not already present in the destination stream hash table, append it to the destination type stream, assign it the next type index, and update the two hash tables. - If the type record already exists in the destination stream, discard it and update the type index map to forward the source type index to the existing destination type index. Reviewers: zturner, ruiu Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D20122 llvm-svn: 269521	2016-05-14 00:02:53 +00:00
Rui Ueyama	1f6b6e2c53	pdbdump: Print "Publics" stream. Publics stream seems to contain information as to public symbols. It actually contains a serialized hash table along with fixed-sized headers. This patch is not complete. It scans only till the end of the stream and dump the header information. I'll write code to de-serialize the hash table later. Reviewers: zturner Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D20256 llvm-svn: 269484	2016-05-13 21:21:53 +00:00
Reid Kleckner	4525fbe22a	[codeview] Align class and print names of types Summary: This way we can get rid of one of the fields in the .def file. Reviewers: llvm-commits Subscribers: zturner Differential Revision: http://reviews.llvm.org/D20251 llvm-svn: 269461	2016-05-13 19:37:07 +00:00
Reid Kleckner	bab3fab806	[codeview] Dump the type index on the first line of each record This will make it easier to write FileCheck tests. llvm-svn: 269444	2016-05-13 17:48:24 +00:00
Reid Kleckner	ce5196e728	[codeview] Try to handle errors better in record iterator llvm-svn: 269381	2016-05-12 23:26:23 +00:00
Zachary Turner	123a52735d	Get rid of CVLeafTypes.def and combine with TypeRecords.def This merges the functionality of the macros in `CVLeafTypes.def` and the macros in `TypeRecords.def` into a single set of macros. Differential Revision: http://reviews.llvm.org/D20190 Reviewed By: rnk, amccarth llvm-svn: 269316	2016-05-12 17:45:51 +00:00
Zachary Turner	38cc8b3f21	Make CodeView record serialization more generic. This introduces a variadic template and some helper macros to safely and correctly deserialize many types of common record fields while maintaining error checking. Differential Revision: http://reviews.llvm.org/D20183 Reviewed By: rnk, amccarth llvm-svn: 269315	2016-05-12 17:45:44 +00:00
Zachary Turner	3f61c1ab5e	Fix build breakage in DebugInfoCodeview llvm-svn: 269217	2016-05-11 17:54:20 +00:00
Zachary Turner	ae3882a19a	Refactor CodeView type records to use common code. Differential Revision: http://reviews.llvm.org/D20138 Reviewed By: rnk llvm-svn: 269216	2016-05-11 17:47:35 +00:00
Eugene Zelenko	417d4c508b	Fix some Clang-tidy modernize-deprecated-headers and Include What You Use warnings; other minor fixes. Differential revision: http://reviews.llvm.org/D20042 llvm-svn: 268989	2016-05-09 23:11:38 +00:00
Zachary Turner	06c2b4be25	[pdb] Parse the module info stream for each module. Differential Revision: http://reviews.llvm.org/D20026 Reviewed By: rnk llvm-svn: 268942	2016-05-09 17:45:21 +00:00
Zachary Turner	9073ed6e5a	Make TypeIterator generic so it can iterate symbols too. Reviewed By: amccarth Differential Revision: http://reviews.llvm.org/D20038 llvm-svn: 268941	2016-05-09 17:44:58 +00:00
Zachary Turner	5d105a977e	Drop error when trying to fallback from PDB to DWARF. llvm-svn: 268813	2016-05-06 22:29:34 +00:00
Zachary Turner	5a1b5ef9eb	Make llvm-pdbdump print CV type records This reuses the CVTypeDumper from libcodeview to dump full information about type records within a PDB file. Differential Revision: http://reviews.llvm.org/D20022 Reviewed By: rnk llvm-svn: 268808	2016-05-06 22:15:42 +00:00
Zachary Turner	2b37017c38	Add missing include. llvm-svn: 268792	2016-05-06 20:59:35 +00:00
Zachary Turner	819e77d196	Port DebugInfoPDB over to using llvm::Error. Differential Revision: http://reviews.llvm.org/D19940 Reviewed By: rnk llvm-svn: 268791	2016-05-06 20:51:57 +00:00
Reid Kleckner	745f3cbcfc	[codeview] Improve some comments This FIXME was already fixed, and these LF_* enum names were inconsistent. llvm-svn: 268683	2016-05-05 20:58:46 +00:00
Reid Kleckner	338034759a	Fix CVTypeDumperImpl formatting after class rename llvm-svn: 268678	2016-05-05 20:31:16 +00:00
Reid Kleckner	4a14bcac41	[codeview] Move dumper into lib/DebugInfo/CodeView So that we can call it from llvm-pdbdump. llvm-svn: 268580	2016-05-05 00:34:33 +00:00
Zachary Turner	ec28fc3499	Move pdb code into pdb namespace. llvm-svn: 268544	2016-05-04 20:32:13 +00:00
Reid Kleckner	7960de99db	[codeview] Add a type visitor to help abstract away type stream handling Summary: Port the dumper in llvm-readobj over to it. I'm planning to use this visitor to power type stream merging. While we're at it, try to switch from StringRef to ArrayRef<uint8_t> in some places. Reviewers: zturner, amccarth Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D19899 llvm-svn: 268535	2016-05-04 19:39:28 +00:00
Zachary Turner	ce48c4d975	Remove unused variable. llvm-svn: 268455	2016-05-03 22:26:46 +00:00
Zachary Turner	2d02ceefdc	Move CodeViewTypeStream to DebugInfo/CodeView Ability to parse codeview type streams is also needed by DebugInfoPDB for parsing PDBs, so moving this into a library gives us this option. Since DebugInfoPDB had already hand rolled some code to do this, that code is now convereted over to using this common abstraction. Differential Revision: http://reviews.llvm.org/D19887 Reviewed By: dblaikie, amccarth llvm-svn: 268454	2016-05-03 22:18:17 +00:00
Zachary Turner	66635f0235	Change operation_not_supported to not_supported. Apparently operation_not_supported is... not supported everywhere. llvm-svn: 268348	2016-05-03 00:53:16 +00:00
Zachary Turner	f5c59654f7	Parse the TPI (type information) stream of PDB files. This parses the TPI stream (stream 2) from the PDB file. This stream contains some header information followed by a series of codeview records. There is some additional complexity here in that alongside this stream of codeview records is a serialized hash table in order to efficiently query the types. We parse the necessary bookkeeping information to allow us to reconstruct the hash table, but we do not actually construct it yet as there are still a few things that need to be understood first. Differential Revision: http://reviews.llvm.org/D19840 Reviewed By: ruiu, rnk llvm-svn: 268343	2016-05-03 00:28:21 +00:00
Zachary Turner	d6192f482f	[llvm-pdbdump] Fix read past EOF when file is too small. llvm-svn: 268316	2016-05-02 22:16:57 +00:00
Kevin Enderby	7bd8d99497	Thread Expected<...> up from libObject’s getType() for symbols to allow llvm-objdump to produce a good error message. Produce another specific error message for a malformed Mach-O file when a symbol’s section index is more than the number of sections. The existing test case in test/Object/macho-invalid.test for macho-invalid-section-index-getSectionRawName now reports the error with the message indicating that a symbol at a specific index has a bad section index and that bad section index value. Again converting interfaces to Expected<> from ErrorOr<> does involve touching a number of places. Where the existing code reported the error with a string message or an error code it was converted to do the same. Also there some were bugs in the existing code that did not deal with the old ErrorOr<> return values. So now with Expected<> since they must be checked and the error handled, I added a TODO and a comment: "// TODO: Actually report errors helpfully" and a call something like consumeError(NameOrErr.takeError()) so the buggy code will not crash since needed to deal with the Error. llvm-svn: 268298	2016-05-02 20:28:12 +00:00
Zachary Turner	a801dc17d9	Fix build breakage due to implicit conversion. llvm-svn: 268277	2016-05-02 18:36:58 +00:00
Zachary Turner	b56d904433	PDB - Instead of hardcoding stream numbers, use an enum. llvm-svn: 268270	2016-05-02 18:09:21 +00:00
Zachary Turner	0eace0bae5	Parse PDB Name Hash Table PDB has a lot of similar data structures. We already have code for parsing a Name Map, but PDB seems to have a different but very similar structure that is a hash table. This is the beginning of code needed in order to parse the name hash table, but it is not yet complete. It parses the basic metadata of the hash table, the bucket array, and the names buffer, but doesn't use any of these fields yet as the data structure requires a non-trivial amount of work to understand. llvm-svn: 268268	2016-05-02 18:09:14 +00:00
Zachary Turner	9213ba5304	Fix crash in PDB when loading corrupt file. There are probably hundreds of crashers we can find by fuzzing more. For now we do the simplest possible validation of the block size. Later, more complicated validations can verify that other fields of the super block such as directory size, number of blocks, agree with the size of the file etc. llvm-svn: 268084	2016-04-29 18:09:19 +00:00
Zachary Turner	2f09b5091c	Put PDB parsing code into a pdb namespace. llvm-svn: 268072	2016-04-29 17:28:47 +00:00
Zachary Turner	6ba65deeb9	Refactor the PDB Stream reading interface. The motivation for this change is that PDB has the notion of streams and substreams. Substreams often consist of variable length structures that are convenient to be able to treat as guaranteed, contiguous byte arrays, whereas the streams they are contained in are not necessarily so, as a single stream could be spread across many discontiguous blocks. So, when processing data from a substream, we want to be able to assume that we have a contiguous byte array so that we can cast pointers to variable length arrays and such. This leads to the question of how to be able to read the same data structure from either a stream or a substream using the same interface, which is where this patch comes in. We separate out the stream's read state from the underlying representation, and introduce a `StreamReader` class. Then we change the name of `PDBStream` to `MappedBlockStream`, and introduce a second kind of stream called a `ByteStream` which is simply a sequence of contiguous bytes. Finally, we update all of the std::vectors in `PDBDbiStream` to use `ByteStream` instead as a proof of concept. llvm-svn: 268071	2016-04-29 17:22:58 +00:00
David Majnemer	ca9ac4721d	[llvm-pdbdump] Try to appease the ASan bot We didn't check that the file was large enough to hold a super block. llvm-svn: 267965	2016-04-29 01:00:17 +00:00
David Majnemer	1573b242ae	[llvm-pdbdump] Restore error messages, handle bad block sizes We lost the ability to report errors, bring it back. Also, correctly validate the block size. llvm-svn: 267955	2016-04-28 23:47:27 +00:00
David Majnemer	5baa2bc2e1	[llvm-pdbdump] Correctly read data larger than a block A bug was introduced when the code was refactored which resulted in a bad memory access. This fixes PR27565. llvm-svn: 267953	2016-04-28 23:24:23 +00:00
Dehao Chen	1b54fce319	Read discriminators correctly from object file. Summary: This is the follow-up patch for http://reviews.llvm.org/D19436 * Update the discriminator reading algorithm to match the assignment algorithm. * Add test to cover the new algorithm. Reviewers: dnovillo, echristo, dblaikie Subscribers: danielcdh, dblaikie, echristo, llvm-commits, joker.eph Differential Revision: http://reviews.llvm.org/D19522 llvm-svn: 267945	2016-04-28 22:09:37 +00:00
Amaury Sechet	5575d079a5	Fix warning in PDB code. NFC llvm-svn: 267938	2016-04-28 20:39:39 +00:00
Zachary Turner	897067e3f1	Add parentheses to silence -Wparentheses warnings. llvm-svn: 267934	2016-04-28 20:26:30 +00:00
Zachary Turner	84c3a8ba3d	Read the rest of the DBI substreams, and parse source info. We now read out the rest of the substreams from the DBI streams. One of these substreams, the FileInfo substream, contains information about which source files contribute to each module (aka compiland). This patch additionally parses out the file information from that substream, and dumps it in llvm-pdbdump. Differential Revision: http://reviews.llvm.org/D19634 Reviewed by: ruiu llvm-svn: 267928	2016-04-28 20:05:18 +00:00
Zachary Turner	1822af542f	Parse module information from DBI stream. This gets more data out of the DBI strema of the PDB. In particular it extracts the metadata for the list of modules (compilands) that this PDB contains info about, and adds support for dumping these fields to llvm-pdbdump. Differential Revision: http://reviews.llvm.org/D19570 Reviewed By: ruiu llvm-svn: 267818	2016-04-27 23:41:42 +00:00
Reid Kleckner	0336cc05e7	[PDB] Fix function names for private symbols in PDBs Summary: llvm-symbolizer wants to get linkage names of functions for historical reasons. Linkage names are only recorded in the PDB for public symbols, and the linkage name is apparently stored separately in some "public symbol" record. We had a workaround in PDBContext which would look for such symbols when the user requested linkage names. However, when given an address that was truly in a private function and public funciton, we would accidentally find nearby public symbols and return those function names. The fix is to look for both function symbols and public symbols and only prefer the public symbol name if the addresses of the symbols agree. Fixes PR27492 Reviewers: zturner Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D19571 llvm-svn: 267732	2016-04-27 16:10:29 +00:00
Zachary Turner	c3c4e15697	Remove more unused variables. llvm-svn: 267598	2016-04-26 20:32:35 +00:00
Zachary Turner	7756127077	[llvm-pdbdump] Fix version reading on big endian systems. llvm-svn: 267595	2016-04-26 19:48:18 +00:00
Zachary Turner	ff788aa0ee	Fix warnings and -Werror build on clang. llvm-svn: 267589	2016-04-26 19:24:10 +00:00
Zachary Turner	53a65ba5c9	Parse and dump PDB DBI Stream Header Information The DBI stream contains a lot of bookkeeping information for other streams. In particular it contains information about section contributions and linked modules. This patch is a first attempt at parsing some of the information out of the DBI stream. It currently only parses and dumps the headers of the DBI stream, so none of the module data or section contribution data is pulled out. This is just a proof of concept that we understand the basic properties of the DBI stream's metadata, and followup patches will try to extract more detailed information out. Differential Revision: http://reviews.llvm.org/D19500 Reviewed By: majnemer, ruiu llvm-svn: 267585	2016-04-26 18:42:34 +00:00
Zachary Turner	ce36c1f2ec	Fix build broken due to order of initialization problem. llvm-svn: 267571	2016-04-26 16:57:53 +00:00
Zachary Turner	f34e01624a	Refactor some more PDB reading code into DebugInfoPDB. Differential Revision: http://reviews.llvm.org/D19445 Reviewed By: David Majnemer llvm-svn: 267564	2016-04-26 16:20:00 +00:00
Zachary Turner	0a43efea95	Resubmit "Refactor raw pdb dumper into library" This fixes a number of endianness issues as well as an ODR violation that hopefully causes everything to be happy. llvm-svn: 267431	2016-04-25 17:38:08 +00:00
David Blaikie	e438cff475	llvm-symbolizer: Avoid infinite recursion walking dwos where the dwo contains a dwo_name attribute The dwo_name was added to dwo files to improve diagnostics in dwp, but it confuses tools that attempt to load any dwo named by a dwo_name, even ones inside dwos. Avoid this by keeping track of whether a unit is already a dwo unit, and if so, not loading further dwos. llvm-svn: 267241	2016-04-22 22:50:56 +00:00
David Blaikie	9a4f3cb275	llvm-symbolizer: prefer .dwo contents over fission-gmlt-like-data when .dwo file is present Rather than relying on the gmlt-like data emitted into the .o/executable which only contains the simple name of any inlined functions, use the .dwo file if present. Test symbolication with/without a .dwo, and the old test that was testing behavior when no gmlt-like data was present. (I haven't included a test of non-gmlt-like data + no .dwo (that would be akin to symbolication with no debug info) but we could add one for completeness) The test was simplified a bit to be a little clearer (unoptimized, force inline, using a function call as the inlined entity) and regenerated with ToT clang. For the no-gmlt-like-data case, I modified Clang back to its old behavior temporarily & the .dwo file is identical so it is shared between the two executables. llvm-svn: 267227	2016-04-22 21:32:59 +00:00
Daniel Sanders	d41718e8af	Revert r267049, r26706[16789], r267071 - Refactor raw pdb dumper into library r267049 broke multiple buildbots (e.g. clang-cmake-mips, and clang-x86_64-linux-selfhost-modules) which the follow-ups have not yet resolved and this is preventing subsequent committers from being notified about additional failures on the affected buildbots. llvm-svn: 267148	2016-04-22 12:04:42 +00:00
Reid Kleckner	5037674ae2	Fix PDB warnings and test llvm-svn: 267071	2016-04-21 22:37:55 +00:00
Amaury Sechet	d46e58d38e	Remove dead code. NFC llvm-svn: 267069	2016-04-21 22:17:39 +00:00
Zachary Turner	d01a7a7894	Fix -Wreturn-type warning with HAVE_DIA_SDK is false. llvm-svn: 267068	2016-04-21 22:16:19 +00:00
Zachary Turner	b2fe61bd8c	Fix for case sensitive filename failure. llvm-svn: 267066	2016-04-21 22:08:27 +00:00
Amaury Sechet	7e16ce5a84	Remove various warnings. NFC llvm-svn: 267061	2016-04-21 21:36:11 +00:00
Zachary Turner	a12b3d4626	Refactor raw pdb dumper into library PDB parsing code was hand-rolled into llvm-pdbdump. This patch moves the parsing of this code into DebugInfoPDB and makes the dumper use this. This is achieved by implementing the skeleton of RawPdbSession, the non-DIA counterpart to the existing PDB read interface. None of the type / source file / etc information is accessible yet, so this implementation is not yet close to achieving parity with the DIA counterpart, but the RawSession class simply holds a reference to a PDBFile class which handles parsing the file format. Additionally a PDBStream class is introduced which allows accessing the bytes of a particular stream in a PDB file. Differential Revision: http://reviews.llvm.org/D19343 Reviewed By: majnemer llvm-svn: 267049	2016-04-21 20:58:35 +00:00
Kevin Enderby	81e8b7d949	Thread Expected<...> up from libObject’s getName() for symbols to allow llvm-objdump to produce a good error message. Produce another specific error message for a malformed Mach-O file when a symbol’s string index is past the end of the string table. The existing test case in test/Object/macho-invalid.test for macho-invalid-symbol-name-past-eof now reports the error with the message indicating that a symbol at a specific index has a bad sting index and that bad string index value. Again converting interfaces to Expected<> from ErrorOr<> does involve touching a number of places. Where the existing code reported the error with a string message or an error code it was converted to do the same. There is some code for this that could be factored into a routine but I would like to leave that for the code owners post-commit to do as they want for handling an llvm::Error. An example of how this could be done is shown in the diff in lib/ExecutionEngine/RuntimeDyld/RuntimeDyldImpl.h which had a Check() routine already for std::error_code so I added one like it for llvm::Error . Also there some were bugs in the existing code that did not deal with the old ErrorOr<> return values. So now with Expected<> since they must be checked and the error handled, I added a TODO and a comment: “// TODO: Actually report errors helpfully” and a call something like consumeError(NameOrErr.takeError()) so the buggy code will not crash since needed to deal with the Error. Note there fixes needed to lld that goes along with this that I will commit right after this. So expect lld not to built after this commit and before the next one. llvm-svn: 266919	2016-04-20 21:24:34 +00:00
Zachary Turner	23ee87bda0	[llvm-pdbdump] Print a better error message when PDB loading fails. Differential Revision: http://reviews.llvm.org/D19234 llvm-svn: 266772	2016-04-19 17:36:58 +00:00
Nico Weber	179b383d74	Unbreak building llvm-pdbdump on Windows after r266595. llvm-svn: 266612	2016-04-18 13:31:31 +00:00
Mehdi Amini	b550cb1750	[NFC] Header cleanup Removed some unused headers, replaced some headers with forward class declarations. Found using simple scripts like this one: clear && ack --cpp -l '#include "llvm/ADT/IndexedMap.h"' \| xargs grep -L 'IndexedMap[<]' \| xargs grep -n --color=auto 'IndexedMap' Patch by Eugene Kosov <claprix@yandex.ru> Differential Revision: http://reviews.llvm.org/D19219 From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 266595	2016-04-18 09:17:29 +00:00
Richard Smith	2db6f2e508	Update and fix LLVM_ENABLE_MODULES: 1) We need to add this flag prior to adding any other, in case the user has specified a -fmodule-cache-path= flag in their custom CXXFLAGS. Such a flag causes -Werror builds to fail, and thus all config checks fail, until we add the corresponding -fmodules flag. The modules selfhost bot does this, for instance. 2) Delete module maps that were putting .cpp files into modules. 3) Enable -fmodules-local-submodule-visibility, to get proper module visibility rules applied across submodules of the same module. Disable -fmodules for C builds, since that flag is not available there. llvm-svn: 266502	2016-04-16 00:48:58 +00:00
Kevin Enderby	3fcdf6ae2a	Thread Expected<...> up from createMachOObjectFile() to allow llvm-objdump to produce a real error message Produce the first specific error message for a malformed Mach-O file describing the problem instead of the generic message for object_error::parse_failed of "Invalid data was encountered while parsing the file”. Many more good error messages will follow after this first one. This is built on Lang Hames’ great work of adding the ’Error' class for structured error handling and threading Error through MachOObjectFile construction. And making createMachOObjectFile return Expected<...> . So to to get the error to the llvm-obdump tool, I changed the stack of these methods to also return Expected<...> : object::ObjectFile::createObjectFile() object::SymbolicFile::createSymbolicFile() object::createBinary() Then finally in ParseInputMachO() in MachODump.cpp the error can be reported and the specific error message can be printed in llvm-objdump and can be seen in the existing test case for the existing malformed binary but with the updated error message. Converting these interfaces to Expected<> from ErrorOr<> does involve touching a number of places. To contain the changes for now use of errorToErrorCode() and errorOrToExpected() are used where the callers are yet to be converted. Also there some were bugs in the existing code that did not deal with the old ErrorOr<> return values. So now with Expected<> since they must be checked and the error handled, I added a TODO and a comment: “// TODO: Actually report errors helpfully” and a call something like consumeError(ObjOrErr.takeError()) so the buggy code will not crash since needed to deal with the Error. Note there is one fix also needed to lld/COFF/InputFiles.cpp that goes along with this that I will commit right after this. So expect lld not to built after this commit and before the next one. llvm-svn: 265606	2016-04-06 22:14:09 +00:00
Nico Weber	73853ab4f8	Make DIASession work if msdia*.dll isn't registered. This fixes various symbolization test failures for me when I build with a hermetic VS2015 without having run the 2015 installer. http://reviews.llvm.org/D18707 llvm-svn: 265193	2016-04-01 22:21:51 +00:00
Eugene Zelenko	35623fb7d5	Fix Clang-tidy modernize-deprecated-headers warnings in some files; other minor fixes. Differential revision: http://reviews.llvm.org/D18469 llvm-svn: 264598	2016-03-28 17:40:08 +00:00
Kevin Enderby	5afbc1cda7	Fix a crash in running llvm-objdump -t with an invalid Mach-O file already in the test suite. While this is not really an interesting tool and option to run on a Mach-O file to show the symbol table in a generic libObject format it shouldn’t crash. The reason for the crash was in MachOObjectFile::getSymbolType() when it was calling MachOObjectFile::getSymbolSection() without checking its return value for the error case. What makes this fix require a fair bit of diffs is that the method getSymbolType() is in the class ObjectFile defined without an ErrorOr<> so I needed to add that all the sub classes. And all of the uses needed to be updated and the return value needed to be checked for the error case. The MachOObjectFile version of getSymbolType() “can” get an error in trying to come up with the libObject’s internal SymbolRef::Type when the Mach-O symbol symbol type is an N_SECT type because the code is trying to select from the SymbolRef::ST_Data or SymbolRef::ST_Function values for the SymbolRef::Type. And it needs the Mach-O section to use isData() and isBSS to determine if it will return SymbolRef::ST_Data. One other possible fix I considered is to simply return SymbolRef::ST_Other when MachOObjectFile::getSymbolSection() returned an error. But since in the past when I did such changes that “ate an error in the libObject code” I was asked instead to push the error out of the libObject code I chose not to implement the fix this way. As currently written both the COFF and ELF versions of getSymbolType() can’t get an error. But if isReservedSectionNumber() wanted to check for the two known negative values rather than allowing all negative values or the code wanted to add the same check as in getSymbolAddress() to use getSection() and check for the error then these versions of getSymbolType() could return errors. At the end of the day the error printed now is the generic “Invalid data was encountered while parsing the file” for object_error::parse_failed. In the future when we thread Lang’s new TypedError for recoverable error handling though libObject this will improve. And where the added // Diagnostic(… comment is, it would be changed to produce and error message like “bad section index (42) for symbol at index 8” for this case. llvm-svn: 264187	2016-03-23 20:27:00 +00:00
Simon Atanasyan	f69c7e5382	[DebugInfo] Dump CIE augmentation data as a list of hex bytes CIE augmentation data might contain non-printable characters. The patch prints the data as a list of hex bytes. Differential Revision: http://reviews.llvm.org/D17759 llvm-svn: 262361	2016-03-01 18:38:05 +00:00
Zachary Turner	43ec3af952	[DebugInfoPDB] Add source / line number accessors for PDB. This patch adds a variety of different methods to query source and line number information from PDB files. llvm-svn: 261239	2016-02-18 18:47:29 +00:00
Zachary Turner	2a9ac0d2c5	[DebugInfoPDB] A few cleanups on PDB Variant class. Also implements the PDBSymbolCompilandEnv::getValue() method, which until now had been unimplemented specifically because variant did not support string values. llvm-svn: 261173	2016-02-17 22:46:33 +00:00
Zachary Turner	0cdb055f23	[DebugInfoPDB] Raise getSymIndexId() up to PDBSymbol Every symbol, no matter what it's tag is, supports the method getSymIndexId(). However, this was being forwarded on every concrete symbol type, so if someone had a PDBSymbol that they didn't know what type it was (or simply didn't have an instance of the concrete symbol type), they would not be able to get its index id. This patch moves the method up to PDBSymbol, so that no matter what type of object you have, you can always get its id. llvm-svn: 261153	2016-02-17 21:13:34 +00:00
Zachary Turner	da292bd4bd	[DebugInfoPDB] Teach Variant to support string types. The IDiaSymbol::getValue() method returns a variant. Until now, I had never encountered a string value, so the Variant wrapper did not support VT_BSTR. Now we have need to support string values, so this patch just adds support for one extra type to Variant. llvm-svn: 261152	2016-02-17 21:13:15 +00:00
Dmitry Polukhin	f4124a72c1	[DebugInfo] Eliminate compilation warning about used variable LSDA The waring was: lib/DebugInfo/DWARF/DWARFDebugFrame.cpp:643:20: warning: variable ‘LSDA’ set but not used llvm-svn: 259877	2016-02-05 09:24:34 +00:00
Igor Laevsky	ff291b5833	[DebugInfo] Support zero-length CIE in the _eh_frame parser MCJIT emits zero-length CIE at the end of the _eh_frame section. This change ensures that parser inside DebugInfo will not crash and correctly record such cases. We are now recording DW_EH_PE_omit as a default value for FDE and LSDA encodings. Also Offset != EndAugmentationOffset assertion check will only happen if augmentation string had 'z' letter in it. Differential Revision: http://reviews.llvm.org/D16588 llvm-svn: 258931	2016-01-27 14:05:35 +00:00
Chris Bieneman	e49730d4ba	Remove autoconf support Summary: This patch is provided in preparation for removing autoconf on 1/26. The proposal to remove autoconf on 1/26 was discussed on the llvm-dev thread here: http://lists.llvm.org/pipermail/llvm-dev/2016-January/093875.html "I felt a great disturbance in the [build system], as if millions of [makefiles] suddenly cried out in terror and were suddenly silenced. I fear something [amazing] has happened." - Obi Wan Kenobi Reviewers: chandlerc, grosbach, bob.wilson, tstellarAMD, echristo, whitequark Subscribers: chfast, simoncook, emaste, jholewinski, tberghammer, jfb, danalbert, srhines, arsenm, dschuff, jyknight, dsanders, joker.eph, llvm-commits Differential Revision: http://reviews.llvm.org/D16471 llvm-svn: 258861	2016-01-26 21:29:08 +00:00
Igor Laevsky	03a670c0ec	Re-submit r256008 "Improve DWARFDebugFrame::parse to also handle __eh_frame." Originally this change was causing failures on windows buildbots. But those problems were fixed in r258806. llvm-svn: 258811	2016-01-26 15:09:42 +00:00
Igor Laevsky	0e1605a3b4	[DebugInfo] Fix DWARFDebugFrame instruction operand ordering We can't rely on the evalution order of function arguments. Differential Revision: http://reviews.llvm.org/D16509 llvm-svn: 258806	2016-01-26 13:31:11 +00:00
Reid Kleckner	734a50c280	Fix instance of -Wcovered-switch-default llvm-svn: 257665	2016-01-13 20:39:22 +00:00
Reid Kleckner	72e2ba7abb	[readobj] Expand CodeView dumping functionality This rewrites and expands the existing codeview dumping functionality in llvm-readobj using techniques similar to those in lib/Object. This defines a number of new records and enums useful for reading memory mapped codeview sections in COFF objects. The dumper is intended as a testing tool for LLVM as it grows more codeview output capabilities. Reviewers: majnemer Differential Revision: http://reviews.llvm.org/D16104 llvm-svn: 257658	2016-01-13 19:32:35 +00:00
Mike Aizatsky	6129dacdbb	fixing type. llvm-svn: 257238	2016-01-09 00:31:56 +00:00
NAKAMURA Takumi	0929fcfbcf	llvm/lib/DebugInfo/Symbolize/DIPrinter.cpp: Fix build in -m32. 1L is incompatible to int64_t. llvm-svn: 257237	2016-01-09 00:28:50 +00:00
Mike Aizatsky	17dbc2831e	[llvm-symbolizer] -print-source-context-lines option to print source code around the line. Differential Revision: http://reviews.llvm.org/D15909 llvm-svn: 257236	2016-01-09 00:14:35 +00:00
Dimitry Andric	227b928abc	Fix several accidental DOS line endings in source files Summary: There are a number of files in the tree which have been accidentally checked in with DOS line endings. Convert these to native line endings. There are also a few files which have DOS line endings on purpose, and I have set the svn:eol-style property to 'CRLF' on those. Reviewers: joerg, aaron.ballman Subscribers: aaron.ballman, sanjoy, dsanders, llvm-commits Differential Revision: http://reviews.llvm.org/D15848 llvm-svn: 256707	2016-01-03 17:22:03 +00:00
Dave Bartolomeo	a779f5a401	Remove unused constants from TypeTableBuilder.cpp. llvm-svn: 256389	2015-12-24 19:15:56 +00:00
Bill Seurer	8771bbfbe2	Fix case of path name llvm-svn: 256388	2015-12-24 18:54:35 +00:00
Dave Bartolomeo	dd38b1bf12	Fix CodeView library name and non-CMake builds llvm-svn: 256387	2015-12-24 18:51:35 +00:00
Dave Bartolomeo	89ba802b92	LLVM CodeView library Summary: This diff is the initial implementation of the LLVM CodeView library. There is much more work to be done, namely a CodeView dumper and tests. This patch should help others make progress on the LLVM->CodeView debug info emission while I continue with the implementation of the dumper and tests. This library implements support for emitting debug info in the CodeView format. This phase of the implementation only includes support for CodeView type records. Clients that need to emit type records will use a class derived from TypeTableBuilder. TypeTableBuilder provides member functions for writing each kind of type record; each of these functions eventually calls the writeRecord virtual function to emit the actual bits of the record. Derived classes override writeRecord to implement the folding of duplicate records and the actual emission to the appropriate destination. LLVMCodeView provides MemoryTypeTableBuilder, which creates the table in memory. In the future, other classes derived from TypeTableBuilder will write to other destinations, such as the type stream in a PDB. The rest of the types in LLVMCodeView define the actual CodeView type records and all of the supporting enums and other types used in the type records. The TypeIndex class is of particular interest, because it is used by clients as a handle to a type in the type table. The library provides a relatively low-level interface based on the actual on-disk format of CodeView. For example, type records refer to other type records by TypeIndex, rather than by an actual pointer to the referent record. This allows clients to emit type records one at a time, rather than having to keep the entire transitive closure of type records in memory until everything has been emitted. At some point, having a higher-level interface layered on top of this one may be useful for debuggers and other tools that want a more holistic view of the debug info. The lower-level interface should be sufficient for compilers and linkers to do the debug info manipulation that they need to do efficiently. Reviewers: rnk, majnemer Subscribers: silvas, rnk, jevinskie, llvm-commits Differential Revision: http://reviews.llvm.org/D14961 llvm-svn: 256385	2015-12-24 18:12:38 +00:00
Alexey Samsonov	1eaae4c3b1	[Symbolize] Improve the ownership of parsed objects. This code changes the way Symbolize handles parsed binaries: now parsed OwningBinary<Binary> is not broken into (binary, memory buffer) pair, and is just stored as-is in a cache. ObjectFile components of Mach-O universal binaries are also stored explicitly in a separate cache. Additionally, this change: * simplifies the code that parses/caches binaries: it's now done in a single place, not three different functions. * makes flush() method behave as expected, and actually clear the cached parsed binaries and objects. * fixes a dangling pointer issue described in http://reviews.llvm.org/D15638 llvm-svn: 256041	2015-12-18 22:02:14 +00:00
Pete Cooper	98052537f0	Revert "Improve DWARFDebugFrame::parse to also handle __eh_frame." This reverts commit r256008. Its breaking multiple buildbots, although works for me locally. llvm-svn: 256013	2015-12-18 19:45:38 +00:00
Pete Cooper	6c97f4c7d7	Improve DWARFDebugFrame::parse to also handle __eh_frame. LLVM MC has single methods which can handle the output of EH frame and DWARF CIE's and FDE's. This code improves DWARFDebugFrame::parse to do the same for parsing. This also allows llvm-objdump to support the --dwarf=frames option which objdump supports. This option dumps the .eh_frame section using the new code in DWARFDebugFrame::parse. http://reviews.llvm.org/D15535 Reviewed by Rafael Espindola. llvm-svn: 256008	2015-12-18 18:51:08 +00:00
Dave Bartolomeo	ea039c121b	Test commit llvm-svn: 255926	2015-12-17 20:54:16 +00:00
David Blaikie	ad07b5d65e	[llvm-dwp] Retrieve the DWOID from the CU for the cu_index entry llvm-svn: 254731	2015-12-04 17:20:04 +00:00
David Blaikie	725c4f71d1	dwarfdump: Correctly indentify the indicies for DWP records The indicies are one-based, not zero-based, per the spec. llvm-svn: 254626	2015-12-03 18:41:59 +00:00
David Blaikie	20f52662d4	[llvm-dwp] Don't rely on implicit move assignment operator (MSVC won't synthesize one) llvm-svn: 254492	2015-12-02 07:09:26 +00:00
David Blaikie	b073cb9be2	[llvm-dwp] Emit a rather fictional debug_cu_index This is very rudimentary support for debug_cu_index, but it is enough to allow llvm-dwarfdump to find the offsets for contributions and correctly dump debug_info. It will need to actually find the real signature of the unit and build the real hash table with the right number of buckets, as per the DWP specification. It will also need to be expanded to cover the tu_index as well. llvm-svn: 254489	2015-12-02 06:21:34 +00:00
Craig Topper	66059c9f4d	Replace dyn_cast with isa in places that weren't using the returned value for more than a boolean check. NFC. llvm-svn: 253441	2015-11-18 07:07:59 +00:00
David Blaikie	4689ef5943	Fix null dereference committed in r253277 llvm-svn: 253393	2015-11-17 22:39:26 +00:00
David Blaikie	35c2eebfe4	dwarfdump: support indexed string dumping in dwp based on the STR_OFFSETS component of the index llvm-svn: 253392	2015-11-17 22:39:23 +00:00
David Blaikie	c4e2bed738	dwarfdump: Reference the appropriate line table segment when dumping dwp files Also improves .dwo type unit dumping which didn't handle this either. llvm-svn: 253377	2015-11-17 21:08:05 +00:00
David Blaikie	cdec7ee565	Fix indentation llvm-svn: 253278	2015-11-17 00:41:02 +00:00
David Blaikie	82641be467	dwarfdump: Use the index to find the right abbrev offset in DWP files llvm-svn: 253277	2015-11-17 00:39:55 +00:00
David Blaikie	8e8dd57e0b	dwarfdump: Add support for dumping the table contents of DWP indexes This is a recommit of 252842 which was reverted in 252859. The issue was using %s format specifier for a StringRef - used Format's left_justify(StringRef, int) instead. It'd be nice to have __attribute__((format(..))) on llvm::format, but apparently it's only implemented for c-style variadics, not C++ variadic templates. Perhaps we could fix that & conditionalize the attribute on such... llvm-svn: 253065	2015-11-13 19:18:49 +00:00
Reid Kleckner	c038e2db4d	[Symbolizer] Don't use PE symbol tables to override PDB symbols Summary: PE files are stripped by default, and only contain the names of exported symbols. The actual reason that we bother to do this override by default is actually due to a quirk of the way -gline-tables-only is implemented, so I phrased the check as "if we are symbolizing from dwarf, do the symtab override". This fixes lots of Windows ASan tests that I broke in r250582. Reviewers: samsonov Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D14594 llvm-svn: 253051	2015-11-13 17:00:36 +00:00
Amjad Aboud	e59cc3e540	dwarfdump: Added macro support to llvm-dwarfdump tool. Added "macro" option to "-debug-dump" flag, which trigger parsing and dumping of the ".debug_macinfo" section. Differential Revision: http://reviews.llvm.org/D14294 llvm-svn: 252866	2015-11-12 09:38:54 +00:00
David Blaikie	6400fc146e	Mostly revert 252842 due to failures on some buildbots. I imagine there's some UB in here somewhere, though Valgrind doesn't seem to have picked it up (not sure if I have a working asan build right now to test there). GDB bot seems to be crashing: http://lab.llvm.org:8011/builders/clang-x86_64-ubuntu-gdb-75/builds/26267/steps/check-all/logs/FAIL%3A%20LLVM%3A%3Adwarfdump-dwp.test Hexagon ELF bot is, presumably, just getting different output: http://lab.llvm.org:8011/builders/clang-hexagon-elf/builds/32927/steps/check-all/logs/FAIL%3A%20LLVM%3A%3Adwarfdump-dwp.test llvm-svn: 252859	2015-11-12 06:33:14 +00:00
David Blaikie	ee293c0aac	dwarfdump: Add error checking to fix the buildbots/correctness llvm-svn: 252845	2015-11-12 01:57:33 +00:00
David Blaikie	6e9c4f7f0d	dwarfdump: Add some error handling for DWP index sections of the wrong size llvm-svn: 252843	2015-11-12 01:41:59 +00:00
David Blaikie	5b9bf49c6f	dwarfdump: Dump the contents of DWP indexes llvm-svn: 252842	2015-11-12 01:41:52 +00:00
Hemant Kulkarni	bdce12a01b	[Symbolizer]: Add -pretty-print option Differential Revision: http://reviews.llvm.org/D13671 llvm-svn: 252798	2015-11-11 20:41:43 +00:00
David Blaikie	51c402838c	dwarfdump: DWP type unit index dumping skeleton llvm-svn: 252786	2015-11-11 19:40:49 +00:00
David Blaikie	0b44dcc44a	Format my previous commit llvm-svn: 252782	2015-11-11 19:30:47 +00:00
David Blaikie	65a8efe441	dwarfdump: First piece of support for DWP dumping Just a tiny piece of index dumping - the header in this instance. llvm-svn: 252781	2015-11-11 19:28:21 +00:00
Colin LeMahieu	da6cafffc0	Reverting r252760 llvm-svn: 252770	2015-11-11 18:11:06 +00:00
Hemant Kulkarni	c6638c7561	[Symbolizer]: Add -pretty-print option Differential Revision: http://reviews.llvm.org/D13671 llvm-svn: 252760	2015-11-11 17:47:54 +00:00
Alexey Samsonov	5365a01dc7	[LLVMSymbolize] Reduce indentation by using helper function. NFC. llvm-svn: 252022	2015-11-04 00:30:26 +00:00
Alexey Samsonov	884adda0fb	[LLVMSymbolize] Properly propagate object parsing errors from the library. llvm-svn: 252021	2015-11-04 00:30:24 +00:00
Alexey Samsonov	d6aa820262	[LLVMSymbolize] Factor out the logic for printing structs from DIContext. NFC. Introduce DIPrinter which takes care of rendering DILineInfo and friends. This allows LLVMSymbolizer class to return a structured data instead of plain std::strings. llvm-svn: 251989	2015-11-03 22:20:52 +00:00
Alexey Samsonov	6881249895	[LLVMSymbolize] Move demangling away from printing routines. NFC. Make printDILineInfo and friends responsible for just rendering the contents of the structures, demangling should actually be performed earlier, when we have the information about the originating SymbolizableModule at hand. llvm-svn: 251981	2015-11-03 21:36:13 +00:00
Alexey Samsonov	46c1ce6ff5	Let the users of LLVMSymbolizer decide whether they want to symbolize inlined frames. Introduce LLVMSymbolizer::symbolizeInlinedCode() instead of switching on PrintInlining option passed to the constructor. This will be needed once we retrun structured data (instead of std::string) from LLVMSymbolizer and move printing logic out. llvm-svn: 251675	2015-10-30 00:40:20 +00:00
Alexey Samsonov	e46bd74147	[LLVMSymbolize] Simplify SymbolizableObjectFile::symbolizeInlinedCode(). NFC. llvm-svn: 251672	2015-10-30 00:02:55 +00:00
Alexey Samsonov	76f7ecb83a	[LLVMSymbolize] Move printing the description of a global into a separate function. NFC. llvm-svn: 251669	2015-10-29 23:49:19 +00:00
Alexey Samsonov	8df3a07aa8	[LLVMSymbolize] Move ModuleInfo into a separate class (SymbolizableModule). Summary: This is mostly NFC. It is a first step in cleaning up LLVMSymbolize library. It removes "ModuleInfo" class which bundles together ObjectFile and its debug info context in favor of: * abstract SymbolizableModule in public headers; * SymbolizableObjectFile subclass in implementation. Additionally, SymbolizableObjectFile is now created via factory, so we can properly detect object parsing error at this stage instead of keeping the broken half-parsed object. As a next step, we would be able to propagate the error all the way back to the library user. Further improvements might include: * factoring out the logic of finding appropriate file with debug info for a given object file, and caching all parsed object files into a separate class [A]. * factoring out DILineInfo rendering [B]. This would make what is now a heavyweight "LLVMSymbolizer" a relatively straightforward class, that calls into [A] to turn filepath into a SymbolizableModule, delegates actual symbolization to concrete SymbolizableModule implementation, and lets [C] render the result. Reviewers: dblaikie, echristo, rafael Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D14099 llvm-svn: 251662	2015-10-29 22:21:37 +00:00
Alexey Samsonov	0fb6451ade	[LLVMSymbolize] Don't use LLVMSymbolizer::Options in ModuleInfo. NFC. LLVMSymbolizer::Options is mostly used in LLVMSymbolizer class anyway. Let's keep their usage restricted to that class, especially given that it's worth to move ModuleInfo to a different header, independent from the symbolizer class. llvm-svn: 251363	2015-10-26 22:34:56 +00:00
Alexey Samsonov	ff8a80b477	Fix build failure on GCC 4.7 (old libstdc++ doesn't have std::map::emplace). llvm-svn: 251347	2015-10-26 21:20:37 +00:00
David Blaikie	efbb29153e	Remove use of std::map<>::emplace which is not supported on some older versions of libstdc++ llvm-svn: 251346	2015-10-26 21:10:36 +00:00
Alexey Samsonov	f3ecfd3af4	[LLVMSymbolize] Use symbol table only if function linkage name was requested. Now it's enough to just specify -functions=short without additionally providing -use-symbol-table=false. llvm-svn: 251339	2015-10-26 20:12:29 +00:00
Alexey Samsonov	1d3f3271ac	Fix build error by fully qualifying llvm::make_unique. llvm-svn: 251338	2015-10-26 20:12:27 +00:00
Alexey Samsonov	7a952e53f9	[LLVMSymbolize] Use std::unique_ptr more extensively to clarify ownership. llvm-svn: 251336	2015-10-26 19:41:23 +00:00
Alexey Samsonov	57f8837ada	Move parts of llvm-symbolizer tool into LLVMSymbolize library. Summary: See http://lists.llvm.org/pipermail/llvm-dev/2015-October/091624.html Reviewers: echristo Subscribers: llvm-commits, aizatsky Differential Revision: http://reviews.llvm.org/D13998 llvm-svn: 251316	2015-10-26 17:56:12 +00:00
Matt Arsenault	45cbfa5949	Use numeric_limits instead of LLONG_MAX This is a build fix for configurations where LLONG_MAX is not defined in system headers. llvm-svn: 250946	2015-10-21 21:10:12 +00:00
Reid Kleckner	e94fef7b3d	[llvm-symbolizer] Make --relative-address work with DWARF contexts Summary: Previously the relative address flag only affected PDB debug info. Now both DIContext implementations always expect to be passed virtual addresses. llvm-symbolizer is now responsible for adding ImageBase to module offsets when --relative-offset is passed. Reviewers: zturner Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D12883 llvm-svn: 249784	2015-10-09 00:15:01 +00:00
Craig Topper	0013be16ff	Use makeArrayRef or None to avoid unnecessarily mentioning the ArrayRef type extra times. NFC llvm-svn: 248140	2015-09-21 05:32:41 +00:00
Frederic Riss	7bb12261a3	[dwarfdump] Do not apply relocations in mach-o files if there is no LoadedObjectInfo. Not only do we not need to do anything to read correct values from the object files, but the current logic actually wrongly applies twice the section base address when there is no LoadedObjectInfo passed to the DWARFContext creation (as the added test shows). Simply do not apply any relocations on the mach-o debug info if there is no load offset to apply. llvm-svn: 245807	2015-08-23 04:44:21 +00:00
Benjamin Kramer	df005cbe19	Fix some comment typos. llvm-svn: 244402	2015-08-08 18:27:36 +00:00
Rafael Espindola	8bab889b0f	Convert getSymbolSection to return an ErrorOr. This function can actually fail since the symbol contains an index to the section and that can be invalid. llvm-svn: 244375	2015-08-07 23:27:14 +00:00
Frederic Riss	f6402bfe78	[dwarfdump] Ignore scattered relocations for mach-o. When encountering a scattered relocation, the code would assert trying to access an unexisting section. I couldn't find a way to expose the result of the processing of a scattered reloc, and I'm really unsure what the right thing to do is. This patch just skips them during the processing in DwarfContext and adds a mach-o file to the tests that exposed the asserting behavior. (This is a new failure that is being exposed by Rafael's recent work on the libObject interfaces. I think the wrong behavior has always happened, but now it's asserting) llvm-svn: 243778	2015-07-31 20:22:50 +00:00
Lang Hames	2e88f4fc5f	[RuntimeDyld] Make LoadedObjectInfo::getLoadedSectionAddress take a SectionRef rather than a string section name. llvm-svn: 243456	2015-07-28 17:52:11 +00:00
Rafael Espindola	ed067c45d4	Return ErrorOr from getSymbolAddress. It can fail trying to get the section on ELF and COFF. This makes sure the error is handled. llvm-svn: 241366	2015-07-03 18:19:00 +00:00
Rafael Espindola	41bb43252b	Don't return error_code from a function that doesn't fail. llvm-svn: 241042	2015-06-30 04:08:37 +00:00
Rafael Espindola	99c041b72f	Don't return error_code from a function that doesn't fail. llvm-svn: 241033	2015-06-30 01:53:01 +00:00
Rafael Espindola	96d071cd0c	Don't return error_code from function that never fails. llvm-svn: 241021	2015-06-29 23:29:12 +00:00
Alexander Kornienko	f00654e31b	Revert r240137 (Fixed/added namespace ending comments using clang-tidy. NFC) Apparently, the style needs to be agreed upon first. llvm-svn: 240390	2015-06-23 09:49:53 +00:00
Dan Liew	7b62aec0d3	Try to fix generation of LLVMExports.cmake under Visual Studio. If LLVMDebugInfoPDB links against the DIA SDK then the exports file would contain an INTERFACE_LINK_LIBRARIES property that contained an absolute path to ``diaguids.lib`` which used a native windows path (interpreted as escape sequences when LLVMExports.cmake is imported causing ``find_package(LLVM)`` to fail) rather than the correct CMake style path. llvm-svn: 240181	2015-06-19 21:50:27 +00:00
Rafael Espindola	63a88ce5d4	Make getRelocationSection MachO only. There are 3 types of relocations on MachO * Scattered * Section based * Symbol based On ELF and COFF relocations are symbol based. We were in the strange situation that we abstracted over two of them. This makes section based relocations MachO only. llvm-svn: 240149	2015-06-19 17:54:28 +00:00
Alexander Kornienko	70bc5f1398	Fixed/added namespace ending comments using clang-tidy. NFC The patch is generated using this command: tools/clang/tools/extra/clang-tidy/tool/run-clang-tidy.py -fix \ -checks=-,llvm-namespace-comment -header-filter='llvm/.\|clang/.*' \ llvm/lib/ Thanks to Eugene Kosov for the original patch! llvm-svn: 240137	2015-06-19 15:57:42 +00:00
Keno Fischer	c2c6018cce	[DWARF] Fix a bug in line info handling This fixes a bug in the line info handling in the dwarf code, based on a problem I when implementing RelocVisitor support for MachO. Since addr+size will give the first address past the end of the function, we need to back up one line table entry. Fix this by looking up the end_addr-1, which is the last address in the range. Note that this also removes a duplicate output from the llvm-rtdyld line table dump. The relevant line is the end_sequence one in the line table and has an offset of the first address part the end of the range and hence should not be included. Also factor out the common functionality into a separate function. This comes up on MachO much more than on ELF, since MachO doesn't store the symbol size separately, hence making said situation always occur. Differential Revision: http://reviews.llvm.org/D9925 llvm-svn: 238699	2015-05-31 23:37:04 +00:00
Benjamin Kramer	f5e2fc474d	Replace push_back(Constructor(foo)) with emplace_back(foo) for non-trivial types If the type isn't trivially moveable emplace can skip a potentially expensive move. It also saves a couple of characters. Call sites were found with the ASTMatcher + some semi-automated cleanup. memberCallExpr( argumentCountIs(1), callee(methodDecl(hasName("push_back"))), on(hasType(recordDecl(has(namedDecl(hasName("emplace_back")))))), hasArgument(0, bindTemporaryExpr( hasType(recordDecl(hasNonTrivialDestructor())), has(constructExpr()))), unless(isInTemplateInstantiation())) No functional change intended. llvm-svn: 238602	2015-05-29 19:43:39 +00:00
Ed Maste	6d0bee5fc2	DebugInfo: .debug_line DWARF64 support This adds support for the 64-bit DWARF format, but is still limited to less than 4GB of debug data by the DataExtractor class. Some versions of the GNU MIPS toolchain generate 64-Bit DWARF even though it isn't actually necessary. Differential Revision: http://reviews.llvm.org/D1988 llvm-svn: 238434	2015-05-28 15:38:17 +00:00
Benjamin Kramer	68a29562f9	Refactor: Simplify boolean conditional return statements in llvm/lib/DebugInfo/DWARF Use clang-tidy to simplify boolean conditional return statements. Patch by Richard Thomson <legalize@xmission.com>! Differential Revision: http://reviews.llvm.org/D9972 llvm-svn: 238132	2015-05-25 13:28:03 +00:00
Keno Fischer	c780e8ebcc	Make it easier to use DwarfContext with MCJIT Summary: This supersedes http://reviews.llvm.org/D4010, hopefully properly dealing with the JIT case and also adds an actual test case. DwarfContext was basically already usable for the JIT (and back when we were overwriting ELF files it actually worked out of the box by accident), but in order to resolve relocations correctly it needs to know the load address of the section. Rather than trying to get this out of the ObjectFile or requiring the user to create a new ObjectFile just to get some debug info, this adds the capability to pass in that info directly. As part of this I separated out part of the LoadedObjectInfo struct from RuntimeDyld, since it is now required at a higher layer. Reviewers: lhames, echristo Reviewed By: echristo Subscribers: vtjnash, friss, rafael, llvm-commits Differential Revision: http://reviews.llvm.org/D6961 llvm-svn: 237961	2015-05-21 21:24:32 +00:00
Alexey Samsonov	7a18c06128	[DWARF parser] Make DWARF parser more robust against missing compile/type units. DWARF standard claims that each compilation/type unit header in .debug_info/.debug_types section must be followed by corresponding compile/type unit DIE, possibly with its children. Two situations are possible: * compile/type unit DIE is missing because DWARF producer failed to emit it. * DWARF parser failed to parse unit DIE correctly, for instance if it contains some unsupported attributes (see r237721, for instance). In either of these cases, the library, and the tools that use it (llvm-dwarfdump, llvm-symbolizer) should not crash. Insert appropriate checks to protect against this. llvm-svn: 237733	2015-05-19 21:54:32 +00:00
Alexey Samsonov	bf19a578e6	[DWARF parser] Add basic support for DWZ DWARF multifile extensions. This change implements basic support for DWARF alternate sections proposal: http://www.dwarfstd.org/ShowIssue.php?issue=120604.1&type=open LLVM tools now understand new forms: DW_FORM_GNU_ref_alt and DW_FORM_GNU_strp_alt, which are used as references to .debug_info and .debug_str sections respectively, stored in a separate file, and possibly shared between different executables / shared objects. llvm-dwarfdump and llvm-symbolizer don't yet know how to access this alternate debug file (usually pointed by .gnu_debugaltlink section), but they can at lease properly parse and dump regular files, which refer to it. This change should fix crashes of llvm-dwarfdump and llvm-symbolizer on files produced by running "dwz" tool. Such files are already installed on some modern Linux distributions. llvm-svn: 237721	2015-05-19 20:29:28 +00:00
Jim Grosbach	4c98cf77d9	MC: MCCodeGenInfo naming update. NFC. s/InitMCCodeGenInfo/initMCCodeGenInfo/ llvm-svn: 237471	2015-05-15 19:13:31 +00:00
Keith Walker	ea9483f847	[DWARF] Add CIE header fields address_size and segment_size when generating dwarf-4 The DWARF-4 specification added 2 new fields in the CIE header called address_size and segment_size. Create these 2 new fields when generating dwarf-4 CIE entries, print out the new fields when dumping the CIE and update tests Differential Revision: http://reviews.llvm.org/D9558 llvm-svn: 237145	2015-05-12 15:25:08 +00:00
Zachary Turner	c007aa41b6	A few fixes for llvm-symbolizer on Windows. Specifically, this patch correctly respects the -demangle option, and additionally adds a hidden --relative-address option allows input addresses to be relative to the module load address instead of absolute addresses into the image. llvm-svn: 236653	2015-05-06 22:26:30 +00:00
Zachary Turner	6799af41fe	Fix build. llvm-svn: 236343	2015-05-01 20:33:10 +00:00
Zachary Turner	e5cb269352	[llvm-pdbdump] Support dynamic load address and external symbols. This patch adds the --load-address command line option to llvm-pdbdump, which dumps all addresses assuming the module has loaded at the specified address. Additionally, this patch adds an option to llvm-pdbdump to support dumping of public symbols (i.e. symbols with external linkage). llvm-svn: 236342	2015-05-01 20:24:26 +00:00
Pete Cooper	5b39524313	Add missing library dependency in libPDB. PDB uses COFFObjectFile::getPE32Header which lives in libObject. Make sure that LLVMBuild.txt reflects this dependency. llvm-svn: 235920	2015-04-27 21:23:12 +00:00
Zachary Turner	20dbd0d0de	Make llvm-symbolizer work on Windows. Differential Revision: http://reviews.llvm.org/D9234 Reviewed By: Alexey Samsonov llvm-svn: 235900	2015-04-27 17:19:51 +00:00
Zachary Turner	6489d7b949	Move DIContext.h to common DebugInfo location. This will enable us to create a PDBContext so as to expose some amount of debug info functionality through a common interace. Differential Revision: http://reviews.llvm.org/D9205 Reviewed by: Alexey Samsonov llvm-svn: 235612	2015-04-23 17:37:47 +00:00
Zachary Turner	4b08354b0e	[PDB] Support executables and source/line info. Previously DebugInfoPDB could only load data for a PDB given a path to the PDB. It could not open an EXE and find the matching PDB and verify it matched, etc. This patch adds support for that so that we can simply load debug information for a PDB directly. Additionally, this patch extends DebugInfoPDB to support getting source and line information for symbols. llvm-svn: 235237	2015-04-17 22:40:36 +00:00
Alexander Kornienko	f817c1cb9a	Use 'override/final' instead of 'virtual' for overridden methods The patch is generated using clang-tidy misc-use-override check. This command was used: tools/clang/tools/extra/clang-tidy/tool/run-clang-tidy.py \ -checks='-*,misc-use-override' -header-filter='llvm\|clang' \ -j=32 -fix -format http://reviews.llvm.org/D8925 llvm-svn: 234679	2015-04-11 02:11:45 +00:00
Chris Bieneman	6a1b54acc7	Raising minimum required CMake version to 2.8.12.2. This commit is in reference to the llvm-dev thread: http://lists.cs.uiuc.edu/pipermail/llvmdev/2015-March/083672.html llvm-svn: 233008	2015-03-23 20:03:57 +00:00
Benjamin Kramer	16132e6faa	Purge unused includes throughout libSupport. NFC. llvm-svn: 232976	2015-03-23 18:07:13 +00:00
Frederic Riss	77f850e336	DWARFFormValue: Add getAsSignedConstant method. The implementation accepts explicitely signed forms (DW_FORM_sdata), but also unsigned forms as long as they fit in an int64_t. llvm-svn: 231299	2015-03-04 22:07:41 +00:00
Zachary Turner	653236596a	[llvm-pdbdump] Display full enum definitions. This will now display enum definitions both at the global scope as well as nested inside of classes. Additionally, it will no longer display enums at the global scope if the enum is nested. Instead, it will omit the definition of the enum globally and instead emit it in the corresponding class definition. llvm-svn: 231215	2015-03-04 06:09:53 +00:00
Zachary Turner	7797c726b9	[llvm-pdbdump] Many minor fixes and improvements A short list of some of the improvements: 1) Now supports -all command line argument, which implies many other command line arguments to simplify usage. 2) Now supports -no-compiler-generated command line argument to exclude compiler generated types. 3) Prints base class list. 4) -class-definitions implies -types. 5) Proper display of bitfields. 6) Can now distinguish between struct/class/interface/union. And a few other minor tweaks. llvm-svn: 230933	2015-03-02 04:39:56 +00:00
Zachary Turner	b52d08d9dd	[llvm-pdbdump] Clean up method signatures. llvm-svn: 230889	2015-03-01 06:51:29 +00:00
Zachary Turner	ccf0415973	[llvm-pdbdump] Better error handling. Previously it was impossible to distinguish between "There is no PDB implementation for this platform" and "I tried to load the PDB, but couldn't find the file", making it hard to figure out if you built llvm-pdbdump incorrectly or if you just mistyped a file name. This patch adds proper error handling so that we can know exactly what went wrong. llvm-svn: 230868	2015-02-28 20:23:18 +00:00
Benjamin Kramer	4f6ac16292	Replace std::copy with a back inserter with vector append where feasible All of the cases were just appending from random access iterators to a vector. Using insert/append can grow the vector to the perfect size directly and moves the growing out of the loop. No intended functionalty change. llvm-svn: 230845	2015-02-28 10:11:12 +00:00
Zachary Turner	d270d22f35	[llvm-pdbdump] Fix dumping of function pointers and basic types. Function pointers were not correctly handled by the dumper, and they would print as "* name". They now print as "int (__cdecl *name)(int arg1, int arg2)" as they should. Also, doubles were being printed as floats. This fixes that bug as well, and adds tests for all builtin types. as well as a test for function pointers. llvm-svn: 230703	2015-02-26 23:49:23 +00:00
Frederic Riss	de3743453f	[dwarfdump] Fix frame info register number dump. llvm-svn: 230559	2015-02-25 22:30:09 +00:00
Frederic Riss	ac10b0d61d	Try to appease buildbots. It seems ArrayRefs to multi-dimensional arrays confuse some compilers. llvm-svn: 230554	2015-02-25 22:07:43 +00:00
Frederic Riss	c0dd7243ee	[dwarfdump] Make debug_frame dump actually useful. This adds support for pretty-printing instruction operands. The new output looks like: 00000000 00000010 ffffffff CIE Version: 1 Augmentation: Code alignment factor: 1 Data alignment factor: -4 Return address column: 8 DW_CFA_def_cfa: reg4 +4 DW_CFA_offset: reg8 -4 DW_CFA_nop: DW_CFA_nop: 00000014 00000010 00000000 FDE cie=00000000 pc=00000000...00000022 DW_CFA_advance_loc: 3 DW_CFA_def_cfa_offset: +12 DW_CFA_nop: llvm-svn: 230551	2015-02-25 21:30:22 +00:00
Frederic Riss	2fe0e54fd6	[dwarfdump] Don't print meaningless pointer. CIE pointers were never filled in before, and printing the pointer is totally pointless anyway. llvm-svn: 230550	2015-02-25 21:30:19 +00:00
Frederic Riss	056ad058bb	DWARFDebugFrame: Move some code around. NFC. Move the FrameEntry::dumpInstructions down in the file at some place where it can see the declarations of FDE and CIE. llvm-svn: 230549	2015-02-25 21:30:16 +00:00
Frederic Riss	41bb2c6d4f	DWARFDebugFrame: Add some trivial accessors. NFC. To be used for dumping. llvm-svn: 230548	2015-02-25 21:30:13 +00:00
Frederic Riss	baf195f7eb	DWARFDebugFrame: Actually collect CIEs associated with FDEs. This is the first commit in a small series aiming at making debug_frame dump more useful (right now it prints a list of opeartions without their operands). llvm-svn: 230547	2015-02-25 21:30:09 +00:00
Tobias Grosser	2ca0ae2a24	Revert "Raising minimum required CMake version to 2.8.12.2." This reverts commit r230062. Debian stable (wheezy) ships still with cmake 2.8.9. The commit broke my LLVM/Polly buildbot, to my knowledge our only Linux+cmake buildbot. llvm-svn: 230343	2015-02-24 16:39:46 +00:00
Chad Rosier	1df9124289	Revert "Revert "Raising minimum required CMake version to 2.8.12.2."" This reverts commit r230240, which was an accidental commit. llvm-svn: 230246	2015-02-23 19:34:04 +00:00
Chad Rosier	7c3310694c	Revert "Raising minimum required CMake version to 2.8.12.2." This reverts commit 247aed4710e8befde76da42b27313661dea7cf66. llvm-svn: 230240	2015-02-23 19:15:08 +00:00
Zachary Turner	bc42da0326	[llvm-pdbdump] Very minor code cleanup. This just removes some dead enums as well as some debug flushes of stdout. llvm-svn: 230204	2015-02-23 05:59:14 +00:00
Zachary Turner	29c69105fb	[llvm-pdbdump] Add an option to dump full class definitions. This adds the --class-definitions flag. If specified, when dumping types, instead of "class Foo" you will see the full class definition, with member functions, constructors, access specifiers. NOTE: Using this option can be very slow, as generating a full class definition requires accessing many different parts of the PDB. llvm-svn: 230203	2015-02-23 05:58:34 +00:00
Zachary Turner	9a818ad193	[llvm-pdbdump] Rewrite dumper using visitor pattern. This increases the flexibility of how to dump different symbol types -- necessary for context-sensitive formatting of symbol types -- and also improves the modularity by allowing the dumping to be implemented in the actual dumper, as opposed to in the PDB library. llvm-svn: 230184	2015-02-22 22:03:38 +00:00
Zachary Turner	fc4ecedb75	[llvm-pdbdump] Simplify options and output. This removes a wealth of options, and instead now only provides three options. -symbols, -types, and -compilands. This greatly simplifies use of the tool, and makes it easier to understand what you're going to see when you run the tool. llvm-svn: 230182	2015-02-22 21:45:38 +00:00
Chris Bieneman	e88a396f86	Raising minimum required CMake version to 2.8.12.2. llvm-svn: 230062	2015-02-20 21:28:18 +00:00
Zachary Turner	c0acf6837b	llvm-pdbdump: Add flags controlling the type of values to dump. llvm-svn: 229330	2015-02-15 20:27:53 +00:00
Zachary Turner	26ebe3fbcd	llvm-pdbdump: Only dump whitelisted global symbols. Dumping the global scope contains a lot of very uninteresting things and is generally polluted with a lot of random junk. Furthermore, it dumps values unsorted, making it hard to read. This patch dumps known interesting types only, and as a side effect sorts the list by symbol type. llvm-svn: 229232	2015-02-14 03:54:28 +00:00
Zachary Turner	52c9f881de	llvm-pdbdump: Re-order header files according to LLVM style guide. llvm-svn: 229231	2015-02-14 03:53:56 +00:00
Zachary Turner	6a582f9fc8	Fix -Wunused-variable warning. llvm-svn: 229130	2015-02-13 18:11:49 +00:00
Zachary Turner	04b966d9dc	llvm-pdbdump: Improve printing of functions and signatures. This correctly prints the function pointers, and also prints function signatures for symbols as opposed to just types. So actual functions in your program will now be printed with full name and signature, as opposed to just name as before. llvm-svn: 229129	2015-02-13 17:57:09 +00:00
Chandler Carruth	71f308adb7	Re-sort #include lines using my handy dandy ./utils/sort_includes.py script. This is in preparation for changes to lots of include lines. llvm-svn: 229088	2015-02-13 09:09:03 +00:00
Zachary Turner	a952c49c20	llvm-pdbdump: Add more comprehensive dumping of symbol types. In particular this patch adds the ability to dump complete function signature information including argument types as correctly formatted strings. A side effect of this is that almost all symbol and meta types are now formatted. llvm-svn: 229076	2015-02-13 07:40:03 +00:00
Zachary Turner	2a5c0a27b6	Improve llvm-pdbdump output display. This patch adds a number of improvements to llvm-pdbdump. 1) Dumping of the entire global scope, and not only those symbols that live in individual compilands. 2) Prepend class name to member functions and data 3) Improved display of bitfields. 4) Support for dumping more kinds of data symbols. llvm-svn: 229012	2015-02-13 01:23:51 +00:00
Zachary Turner	c074de041b	Add concrete type overloads to PDBSymbol::findChildren(). Frequently you only want to iterate over children of a specific type (e.g. functions). Previously you would get back a generic interface that allowed iteration over the base symbol type, which you would have to dyn_cast<> each one of. With this patch, we allow the user to specify the concrete type as a template parameter, and it will return an iterator which returns instances of the concrete type directly. llvm-svn: 228960	2015-02-12 21:09:24 +00:00
Peter Collingbourne	d20eff0ea6	Fix build for CMake < 2.8.12. llvm-svn: 228810	2015-02-11 05:58:57 +00:00
Zachary Turner	3bd47cee78	Use ADDITIONAL_HEADER_DIRS in all LLVM CMake projects. This allows IDEs to recognize the entire set of header files for each of the core LLVM projects. Differential Revision: http://reviews.llvm.org/D7526 Reviewed By: Chris Bieneman llvm-svn: 228798	2015-02-11 03:28:02 +00:00
Andrew Kaylor	7ad134a746	Temporary workaround to fix MSVC 2012 build problems llvm-svn: 228788	2015-02-11 02:16:34 +00:00
Zachary Turner	df3cc51f06	Fix some warnings due to -Wcovered-switch-default. llvm-svn: 228773	2015-02-11 00:13:39 +00:00
Zachary Turner	be6d1e49b0	Convert std::make_unique<> to llvm::make_unique<>. llvm-svn: 228768	2015-02-10 23:46:48 +00:00
Zachary Turner	a5549178f1	Rewrite llvm-pdbdump in terms of LLVMDebugInfoPDB. This makes llvm-pdbdump available on all platforms, although it will currently fail to create a dumper if there is no PDB reader implementation for the current platform. It implements dumping of compilands and children, which is less information than was previously available, but it has to be rewritten from scratch using the new set of interfaces, so the rest of the functionality will be added back in subsequent commits. llvm-svn: 228755	2015-02-10 22:43:25 +00:00
Zachary Turner	cffff26b68	Provide DIA implementation of DebugInfoPDB. This implements DebugInfoPDB when the DIA SDK is present on the system. Specifically, this means that the following conditions are met: 1) You are building on Windows. 2) You are building with MSVC. 3) Visual Studio did not corrupt the installation of DIA due to a known issue with side-by-side installations of VS2012 and VS2013. If all of these conditions are true, you will be able to pass a value of PDB_Reader::DIA to PDB::createPdbReader(). There are no tests for this yet, as any test will be in the form of a lit test which tests the llvm-pdbdump.exe, which still needs to be rewritten in terms of this library. llvm-svn: 228747	2015-02-10 21:17:52 +00:00
David Blaikie	e4698b9a6c	Fix -Wuninitialized build by referencing the relevant ctor parameter instead of the base class member variable. llvm-svn: 228554	2015-02-08 23:15:37 +00:00
Zachary Turner	98571ed996	Make PDBSymbol's IPDBSymbol reference const. llvm-svn: 228553	2015-02-08 22:53:53 +00:00
Zachary Turner	bae16b3f53	DebugInfoPDB: Make the symbol base case hold an IPDBSession ref. Dumping a symbol often requires access to data that isn't inside the symbol hierarchy, but which is only accessible through the top-level session. This patch is a pure interface change to give symbols a reference to the session. llvm-svn: 228542	2015-02-08 20:58:09 +00:00
Zachary Turner	92545a03df	Removed unused function mistakenly left in, triggering -Werror. llvm-svn: 228517	2015-02-08 00:41:31 +00:00
Zachary Turner	21473f7bb6	Some cleanup for libpdb. This patch implements a few of the optional suggestions from the initial patch comitting libpdb. In particular, it implements a virtual function out of line for each of the concrete classes. A few other minor cleanups exist as well, such as using override instead of virtual, etc. llvm-svn: 228516	2015-02-08 00:29:29 +00:00
Zachary Turner	94269a4db3	Resubmit unittests for DebugInfoPDB. These were originally submitted as part of r228428, but this part caused a build breakage in LLVMConfig. The library portion was resubmitted independently since it was not causing breakage. There were two reasons this was causing the build to fail. The first is that there were no Makefiles added for the PDB tests. And the second is that the DebugInfoPDB library was only being built by CMake behind an "if (MSVC)" check. This is wrong since this the library hides platform specific details, and it was causing LLVM-Config to not find the library when trying to build unittests. llvm-svn: 228482	2015-02-07 01:47:14 +00:00
Zachary Turner	8022f24fe8	Try to fix Makefile build for LLVMDebugInfoPDB. llvm-svn: 228437	2015-02-06 20:42:03 +00:00
Zachary Turner	0e9e66330a	Resubmit "Create lib/DebugInfo/PDB" (r228428) This change resubmits the patch that broke the build, this time without unittests. The unittests will be submitted separately after the problem has been addressed: --Original Commit Message-- Create lib/DebugInfo/PDB. This patch creates a platform-independent interface to a PDB reader. There is currently no implementation of this interface, which will be provided in future patches. This defines the basic object model which any implementation must conform to. Reviewed by: David Blaikie Differential Revision: http://reviews.llvm.org/D7356 llvm-svn: 228435	2015-02-06 20:30:52 +00:00
Zachary Turner	8c89a82c88	Revert "Create lib/DebugInfo/PDB." This reverts commit 21028, as it is causing failures in LLVMConfig. llvm-svn: 228431	2015-02-06 20:00:18 +00:00
Zachary Turner	079ba9224f	Create lib/DebugInfo/PDB. This patch creates a platform-independent interface to a PDB reader. There is currently no implementation of this interface, which will be provided in future patches. This defines the basic object model which any implementation must conform to. Reviewed by: David Blaikie Differential Revision: http://reviews.llvm.org/D7356 llvm-svn: 228428	2015-02-06 19:44:09 +00:00
Zachary Turner	82af9438d0	Move DebugInfo to DebugInfo/DWARF. In preparation for adding PDB support to LLVM, this moves the DWARF parsing code to its own subdirectory under DebugInfo, and renames LLVMDebugInfo to LLVMDebugInfoDWARF. This is purely a mechanical / build system change. Differential Revision: http://reviews.llvm.org/D7269 Reviewed by: Eric Christopher llvm-svn: 227586	2015-01-30 18:07:45 +00:00
Chandler Carruth	d9903888d9	[cleanup] Re-sort all the #include lines in LLVM using utils/sort_includes.py. I clearly haven't done this in a while, so more changed than usual. This even uncovered a missing include from the InstrProf library that I've added. No functionality changed here, just mechanical cleanup of the include order. llvm-svn: 225974	2015-01-14 11:23:27 +00:00
Adrian Prantl	0c36a75499	Implement a very basic colored syntax highlighting for llvm-dwarfdump. The color scheme is the same as the one used by the colorize dwarfdump script on Darwin. A new --color option can be used to forcibly turn color on or off. http://reviews.llvm.org/D6852 llvm-svn: 225269	2015-01-06 16:50:25 +00:00
Frederic Riss	b88adbdeb0	[DebugInfo] Move all DWARF headers to the public include directory. dsymutil needs access to DWARF specific inforamtion, the small DIContext wrapper isn't sufficient. Other DWARF consumers might want to use it too (I'm looking at you lldb). Differential Revision: http://reviews.llvm.org/D6694 llvm-svn: 224594	2014-12-19 18:26:33 +00:00
Michael Ilseman	addddc441f	Silence more static analyzer warnings. Add in definedness checks for shift operators, null checks when pointers are assumed by the code to be non-null, and explicit unreachables. llvm-svn: 224255	2014-12-15 18:48:43 +00:00
Frederic Riss	77a0743216	Make DWARFAcceleratorTable::dump() const. As dump() methods should be. To allow that, do not store the DWARFFormValue objects used for the dump in the header data. Per Alexey's suggestion! llvm-svn: 222436	2014-11-20 16:21:11 +00:00
Frederic Riss	7c41c64db7	Add missing copyright headers. llvm-svn: 222435	2014-11-20 16:21:06 +00:00

... 12 13 14 15 16 ...

1625 Commits