llvm-project

Commit Graph

Author	SHA1	Message	Date
James Henderson	9e09a54c69	[DebugInfo] Use Cursor to detect errors in debug line prologue parser Previously, the debug line parser would keep attempting to read data even if it had run out of data to read. This meant errors in parsing would often end up being reported as something else, such as an unknown version or malformed directory/filename table. This patch fixes the issues by using the Cursor API to capture errors. Reviewed by: labath Differential Revision: https://reviews.llvm.org/D83043	2020-07-03 11:52:06 +01:00
James Henderson	9782c922cb	[DebugInfo] Print line table extended opcode bytes if parsing fails Previously, if there was an error whilst parsing the operands of an extended opcode, the operands would be treated as zero and printed. This could potentially be slightly confusing. This patch changes the behaviour to print the raw bytes instead. Reviewed by: ikudrin Differential Revision: https://reviews.llvm.org/D81570	2020-06-23 10:04:02 +01:00
James Henderson	b21794a91c	[DebugInfo] Unify Cursor usage for all debug line opcodes This is a natural extension of the previous changes to use the Cursor class independently in the standard and extended opcode paths, and in turn allows delaying error handling until the entire line has been printed in verbose mode, removing interleaved output in some cases. Reviewed by: MaskRay, JDevlieghere Differential Revision: https://reviews.llvm.org/D81562	2020-06-17 09:19:24 +01:00
Pavel Labath	9ed452f370	[llvm/DWARFDebugLine] Remove spurious full stop from warning messages Other warnings messages don't have a trailing full stop.	2020-06-11 13:14:21 +02:00
Pavel Labath	fccaa89e23	[llvm/DWARFDebugLine] Fix a typo in one warning message	2020-06-11 13:04:52 +02:00
Pavel Labath	6f55b5a101	[DWARFDebugLine] Use truncating data extractors for prologue parsing Summary: This makes the code easier to reason about, as it will behave the same way regardless of whether there is any more data coming after the presumed end of the prologue. Reviewers: jhenderson, dblaikie, probinson, ikudrin Subscribers: hiraditya, MaskRay, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D77557	2020-06-10 16:12:53 +02:00
James Henderson	e3547ade68	[DebugInfo] Improve new line printing in debug line verbose output The new line printing for debug line verbose output was inconsistent. For new rows in the matrix, a blank line followed, whilst the DW_LNS_copy opcode actually resulted in two blank lines. There was also potential inconsistency in the blank lines at the end of the table. This patch mostly resolves these issues - no blank lines appear in the output except for a single line after the prologue and at table end to separate it from any subsquent table, plus some instances after error messages. Also add a unit test for verbose output to test the fine details of new line placement and other aspects of verbose output. Reviewed by: dblaikie Differential Revision: https://reviews.llvm.org/D81102	2020-06-09 14:27:16 +01:00
James Henderson	dbd26fe0b6	[DebugInfo] Print non-verbose output at some point as verbose output Verbose and non-verbose parsing of .debug_line produced their output at different points in the program. The most obvious impact of this was that error messages were produced at different times, but it also potentially reduced what clients could do by customising the stream or warning/error handlers. This change makes the two variants consistent by printing non-verbose output inline, the same as verbose output. Testing of the error messages has been modified to check the messages always appear in the same location to illustrate the behaviour. Reviewed by: JDevlieghere, dblaikie, MaskRay, labath Differential Revision: https://reviews.llvm.org/D80989	2020-06-09 14:24:53 +01:00
James Henderson	6378167409	[DebugInfo] Remove unnecessary flushes and add additional testing The flushes previously existed to help ensure consistent error message output when stdout and stderr were passed to the same location. This is no longer necessary as errs() is now tied to outs(). Reviewed by: dblaikie, MaskRay, JDevlieghere, labath Differential Revision: https://reviews.llvm.org/D80803	2020-06-09 13:50:58 +01:00
James Henderson	5777570d24	[DebugInfo] Check for errors when reading data for extended opcode Previously, if an extended opcode was truncated, it would manifest as an "unexpected line op length error" which wasn't quite accurate. This change checks for errors any time data is read whilst parsing an extended opcode, and reports any errors detected. Reviewed by: MaskRay, labath, aprantl Differential Revision: https://reviews.llvm.org/D80797	2020-06-09 09:56:37 +01:00
Fangrui Song	9be3567df2	[llvm-dwarfdump] Add a table header for -debug-line -verbose output Like non-verbose output, so that it is easy to recognize the `Line,Column,File,ISA,Discriminator` column values. Reviewed By: JDevlieghere, jhenderson Differential Revision: https://reviews.llvm.org/D80874	2020-06-04 08:56:17 -07:00
Igor Kudrin	da913259c7	[DebugInfo] Report the format of line tables [7/10] Differential Revision: https://reviews.llvm.org/D80523	2020-06-02 17:55:31 +07:00
James Henderson	e8bcf4ef07	[DebugInfo] Add use of truncating data extractor to debug line parsing This will ensure that nothing can ever start parsing data from a future sequence and part-read data will be returned as 0 instead. Reviewed by: aprantl, labath Differential Revision: https://reviews.llvm.org/D80796	2020-06-01 12:33:21 +01:00
James Henderson	7bcde99f77	[llvm-dwarfdump][test] Use verbose output to check expected opcodes The debug_line_invalid.test test case was previously using the interpreted line table dumping to identify which opcodes have been parsed. This change moves to looking for the expected opcodes explicitly. This is probably a little clearer and also allows for testing some cases that wouldn't be easily identifiable from the interpreted table. Reviewed by: MaskRay Differential Revision: https://reviews.llvm.org/D80795	2020-06-01 11:48:02 +01:00
Pavel Labath	c475856d05	[DWARFDebugLine] Check for errors when parsing v2 file/dir lists Summary: Without this we could silently accept an invalid prologue because the default DataExtractor behavior is to return an empty string when reaching the end of file. And empty string is also used to terminate these lists. This makes the parsing code slightly more complicated, but this complexity will go away once the parser starts working with truncating data extractors. The reason I am doing it this way is because without this, the truncation would regress the quality of error messages (right now, we produce bad error messages only near EOF, but truncation would make everything behave as if it was near EOF). Reviewers: dblaikie, probinson, jhenderson Subscribers: hiraditya, MaskRay, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D77555	2020-04-21 16:55:36 +02:00
Pavel Labath	d978656fd0	[DWARFDebugLine] Use new DWARFDataExtractor::getInitialLength Summary: The error messages change somewhat, but I believe the overall informational value remains unchanged. Reviewers: jhenderson, dblaikie, ikudrin Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D75116	2020-03-02 11:14:29 +01:00
Pavel Labath	ced45978a2	Recommit "[DWARFDebugLine] Avoid dumping prologue members we did not parse" The patch was reverted in `69da40033` because of test failures on windows. The problem was the unpredictable order of some of the error messages, which I've tried to strenghten in that patch. It turns out this is not possible to do in verbose mode because there the data is being writted as it is being parsed. No amount of flushing (as I've done in the non-verbose mode) will help that. Indeed, even without any buffering the warning messages can end in the middle of a line in non-verbose mode. In this patch, I have reverted the changes which tested the relative position of the warning message, except for the messages about unsupported initial length, which are the ones I really wanted to test, and which do come out reasonably. The original commit message was: This patch if motivated by D74560, specifically the subthread about what to print upon encountering reserved initial length values. If the debug_line prologue has an unsupported version, we skip parsing the rest of the data. If we encounter an reserved initial length field, we don't even parse the version. However, we still print out all members (with value 0) in the dump function. This patch introduces early exits in the Prologue::dump function so that we print only the fields that were parsed successfully. In case of an unsupported version, we skip printing all subsequent prologue fields -- because we don't even know if this version has those fields. In case of a reserved unit length, we don't print anything -- if the very first field of the prologue is invalid, it's hard to say if we even have a prologue to begin with. Note that the user will still be able to see the invalid/reserved initial length value in the error message. I've modified (reordered) debug_line_invalid.test to show that the error message comes straight after the debug_line offset. I've also added some flush() calls to the dumping code to ensure this is the case in all situations (without that, the warnings could get out of sync if the output was not a terminal -- I guess this is why std::iostreams have the tie() function). Reviewers: jhenderson, ikudrin, dblaikie Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D75043	2020-02-26 16:42:25 +01:00
Pavel Labath	69da400331	Revert "[DWARFDebugLine] Avoid dumping prologue members we did not parse" The changed test started failing on the windows bots. Reverting while I investigate. This reverts commit `deb116ee0a`.	2020-02-25 17:58:50 +01:00
Pavel Labath	deb116ee0a	[DWARFDebugLine] Avoid dumping prologue members we did not parse Summary: This patch if motivated by D74560, specifically the subthread about what to print upon encountering reserved initial length values. If the debug_line prologue has an unsupported version, we skip parsing the rest of the data. If we encounter an reserved initial length field, we don't even parse the version. However, we still print out all members (with value 0) in the dump function. This patch introduces early exits in the Prologue::dump function so that we print only the fields that were parsed successfully. In case of an unsupported version, we skip printing all subsequent prologue fields -- because we don't even know if this version has those fields. In case of a reserved unit length, we don't print anything -- if the very first field of the prologue is invalid, it's hard to say if we even have a prologue to begin with. Note that the user will still be able to see the invalid/reserved initial length value in the error message. I've modified (reordered) debug_line_invalid.test to show that the error message comes straight after the debug_line offset. I've also added some flush() calls to the dumping code to ensure this is the case in all situations (without that, the warnings could get out of sync if the output was not a terminal -- I guess this is why std::iostreams have the tie() function). Reviewers: jhenderson, ikudrin, dblaikie Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D75043	2020-02-25 16:29:02 +01:00
James Henderson	bf4d8f2952	[DebugInfo] Add checks for v2 directory and file name table terminators The DWARFv2-4 specification for the line table header states that the include directories and file name tables both end with a single null byte. Prior to this change, the parser did not detect if this byte was missing, because it also stopped reading the tables once it reached the prologue end, as claimed by the header_length field. This change adds a check that the terminator has been seen at the end of each table. Reviewed by: dblaikie, MaskRay Differential Revision: https://reviews.llvm.org/D74413	2020-02-12 14:49:22 +00:00
James Henderson	23cf0a30b1	[DebugInfo] Add check for zero debug line opcode_base The number of standard opcodes is defined to be opcode_base - 1, so a value of 0 for the opcode_base caused a crash as an attempt was made to reserve many entries in a vector. This change fixes the crash, by issuing a warning and skipping reading of standard opcode lengths in the event of an opcode_base of 0. Reviewed by: dblaikie Differential Revision: https://reviews.llvm.org/D74309	2020-02-12 14:49:22 +00:00
James Henderson	1da62b51a5	[DebugInfo] Print version in error message in decimal Also remove some test duplication and add a test case that shows the maximum version is rejected (this also shows that the value in the error message is actually in decimal, and not just missing an 0x prefix). Reviewed by: dblaikie Differential Revision: https://reviews.llvm.org/D74403	2020-02-12 14:49:22 +00:00
James Henderson	720be5c016	[DebugInfo][test][NFC] Remove case numbers This reduces the noise caused by adding cases earlier in the sequence. Reviewed by: dblaikie Differential Revision: https://reviews.llvm.org/D74402	2020-02-12 11:05:32 +00:00
James Henderson	47f7df8946	[test][DebugInfo][NFC] Fix line endings	2020-02-11 16:11:40 +00:00
James Henderson	1dc62d0358	[DebugInfo][test] Replace pre-canned binary test The DebugInfo/dwarfdump-invalid-line-table test used a pre-canned binary generated by a fuzzer to demonstrate a bug fix. Unfortunately, the binary is rigid and requires hand-editing if we change behaviour, such as rejecting certain properties within it (as I plan on doing in another change). Rather than hand-edit the binary, I have replaced it with two tests. The first tests the high-level code path from the debug line parser that produces the same error as this test previously did, and the second is a set of unit test cases that comprehensively cover the FormValue::skipValue method, which in turn covers the area that the original bug fix touched. Reviewed by: MaskRay, dblaikie Differential Revision: https://reviews.llvm.org/D74202	2020-02-10 13:54:40 +00:00
James Henderson	7116e431c0	[DebugInfo] Make most debug line prologue errors non-fatal to parsing Many of the debug line prologue errors are not inherently fatal. In most cases, we can make reasonable assumptions and carry on. This patch does exactly that. In the case of length problems, the approach of "assume stated length is correct" is taken which means the offset might need adjusting. This is a relanding of `b94191fe`, fixing an LLD test and the LLDB build. Reviewed by: dblaikie, labath Differential Revision: https://reviews.llvm.org/D72158	2020-01-29 10:23:41 +00:00
James Henderson	5c05165984	Revert "[DebugInfo] Make most debug line prologue errors non-fatal to parsing" This reverts commit `b94191fecd`. The change broke both an LLD test and the LLDB build.	2020-01-28 11:49:30 +00:00
James Henderson	b94191fecd	[DebugInfo] Make most debug line prologue errors non-fatal to parsing Many of the debug line prologue errors are not inherently fatal. In most cases, we can make reasonable assumptions and carry on. This patch does exactly that. In the case of length problems, the approach of "the claimed length is correct" is taken to be consistent with other instances such as the SectionParser, which ignores the read length. Reviewed by: dblaikie Differential Revision: https://reviews.llvm.org/D72158	2020-01-28 11:29:50 +00:00
James Henderson	c963b5fbd6	[test][llvm-dwarfdump] Add extra test case for invalid MD5 form A subsequent patch will change how an invalid file name table is handled to allow parsing to continue. This patch adds a test case that will demonstrate a difference in behaviour with that change between invalid file tables where the error is before the end of the stated prologue length and where the error occurs after the stated length. Reviewed by: dblaikie Differential Revision: https://reviews.llvm.org/D72157	2020-01-27 15:33:34 +00:00
James Henderson	f1be770ff6	[DebugInfo] Make incorrect debug line extended opcode length non-fatal It is possible to try to keep parsing a debug line program even when the length of an extended opcode does not match what is expected for that opcode. This patch changes what was previously a fatal error to be non-fatal. The parser now continues by assuming the the claimed length is correct, even if it means moving the offset backwards. Reviewed by: dblaikie Differential Revision: https://reviews.llvm.org/D72155	2020-01-27 15:32:41 +00:00
James Henderson	6e3ca962fa	[DebugInfo] Improve error message text Unlike most of our errors in the debug line parser, the "no end of sequence" message was missing any reference to which line table it refererred to. This change adds the offset to this message. Reviewed by: dblaikie Differential Revision: https://reviews.llvm.org/D72443	2020-01-10 14:59:58 +00:00
James Henderson	418cd8216b	[DebugInfo] Remove redundant checks for past-the-end of prologue The V5 directory and filename tables had checks in to make sure we hadn't read past the end of the line table prologue. Since previous changes to the data extractor class ensure we never read past the end, these checks are now redundant, so this patch removes them. There is still a check to show that the whole prologue remains within the prologue length. Reviewed By: JDevlieghere Differential Revision: https://reviews.llvm.org/D71768	2020-01-03 12:35:32 +00:00
James Henderson	7b4badf6f9	[test][llvm-dwarfdump] Use --implicit-check-not to simplify test checks This removes the need to duplicate the LASTONLY check pattern and the last part of the NONFATAL pattern in the modified test. Reviewed By: MaskRay, JDevlieghere Differential Revision: https://reviews.llvm.org/D71757	2020-01-03 11:22:20 +00:00
James Henderson	f6f0cb4fd1	[test][llvm-dwarfdump] Normalise contents and checks for line tables The line tables in debug_line_malformed.s had contents that varied more than was necessary for the testing, making it harder to follow what was important. This patch normalises them so that they all share more-or-less the same body. Additionally, it makes the testing for what was printed more consistent, to show that the right parts of the line table prologue and body are/are not parsed and printed. Reviewed By: JDevlieghere Differential Revision: https://reviews.llvm.org/D71755	2020-01-03 11:19:00 +00:00
James Henderson	91cbcbd9e5	[test][llvm-dwarfdump] Add missing checks for table dumping Some of the tables in debug_line_malformed.s were not being checked in the NONFATAL checks in debug_line_invalid.test (only the warnings coming from them were being checked). This made the test harder to follow. Additionally, a later change will change the way the errors are handled such that more of the line table will be printed. That will require checks for these tables (or something equivalent) so that the difference in behaviour can be observed. This patch adds checks for the three tables that were missing checks. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D71753	2020-01-03 11:06:31 +00:00
James Henderson	19a71e358e	[test][llvm-dwarfdump][NFC] Remove unused check Reviewed by: JDevlieghere Differential Revision: https://reviews.llvm.org/D71756	2020-01-02 17:05:37 +00:00
James Henderson	0ae9e952f7	[test][llvm-dwarfdump][NFC] Improve test readability This patch adds and improves comments in the debug_line_invalid.test and its associated input file so that it is easier to follow. It uses '##' to make comments stand out from lit and FileCheck commands. It also reflows some commands so that the lines are not so long and are easier to read and fixes some copy/paste errors. Reviewed by: JDevlieghere Differential Revision: https://reviews.llvm.org/D71752	2020-01-02 16:59:09 +00:00
Jonas Devlieghere	0e7ba06e82	[DWARF] Add more error handling to debug line parser. This patch exnteds the error handling in the debug line parser to get rid of the existing MD5 assertion. I want to reuse the debug line parser from LLVM in LLDB where we cannot crash on invalid input. Differential revision: https://reviews.llvm.org/D64544 llvm-svn: 366762	2019-07-22 23:23:34 +00:00
James Henderson	a3acf99e59	[DWARF] Rework debug line parsing to use llvm::Error and callbacks Reviewed by: dblaikie, JDevlieghere, espindola Differential Revision: https://reviews.llvm.org/D44560 Summary: The .debug_line parser previously reported errors by printing to stderr and return false. This is not particularly helpful for clients of the library code, as it prevents them from handling the errors in a manner based on the calling context. This change switches to using llvm::Error and callbacks to indicate what problems were detected during parsing, and has updated clients to handle the errors in a location-specific manner. In general, this means that they continue to do the same thing to external users. Below, I have outlined what the known behaviour changes are, relating to this change. There are two levels of "errors" in the new error mechanism, to broadly distinguish between different fail states of the parser, since not every failure will prevent parsing of the unit, or of subsequent unit. Malformed table errors that prevent reading the remainder of the table (reported by returning them) and other minor issues representing problems with parsing that do not prevent attempting to continue reading the table (reported by calling a specified callback funciton). The only example of this currently is when the last sequence of a unit is unterminated. However, I think it would be good to change the handling of unrecognised opcodes to report as minor issues as well, rather than just printing to the stream if --verbose is used (this would be a subsequent change however). I have substantially extended the DwarfGenerator to be able to handle custom-crafted .debug_line sections, allowing for comprehensive unit-testing of the parser code. For now, I am just adding unit tests to cover the basic error reporting, and positive cases, and do not currently intend to test every part of the parser, although the framework should be sufficient to do so at a later point. Known behaviour changes: - The dump function in DWARFContext now does not attempt to read subsequent tables when searching for a specific offset, if the unit length field of a table before the specified offset is a reserved value. - getOrParseLineTable now returns a useful Error if an invalid offset is encountered, rather than simply a nullptr. - The parse functions no longer use `WithColor::warning` directly to report errors, allowing LLD to call its own warning function. - The existing parse error messages have been updated to not specifically include "warning" in their message, allowing consumers to determine what severity the problem is. - If the line table version field appears to have a value less than 2, an informative error is returned, instead of just false. - If the line table unit length field uses a reserved value, an informative error is returned, instead of just false. - Dumping of .debug_line.dwo sections is now implemented the same as regular .debug_line sections. - Verbose dumping of .debug_line[.dwo] sections now prints the prologue, if there is a prologue error, just like non-verbose dumping. As a helper for the generator code, I have re-added emitInt64 to the AsmPrinter code. This previously existed, but was removed way back in r100296, presumably because it was dead at the time. This change also requires a change to LLD, which will be committed separately. llvm-svn: 331971	2018-05-10 10:51:33 +00:00

39 Commits