llvm-project

Commit Graph

Author	SHA1	Message	Date
Matt Morehouse	f3c2e0bcee	[libFuzzer] Enable entropic by default. Entropic has performed at least on par with vanilla scheduling on Clusterfuzz, and has shown a slight coverage improvement on FuzzBench: https://www.fuzzbench.com/reports/2020-08-31/index.html Reviewed By: Dor1s Differential Revision: https://reviews.llvm.org/D87476	2020-09-16 10:44:34 -07:00
Dokyung Song	1bb1eac6b1	[libFuzzer] Add a command-line option for tracing mutation of corpus inputs in the dot graph format. This patch adds a new command-line option -mutation_graph_file=FILE for debugging purposes, which traces how corpus inputs evolve during a fuzzing run. For each new input that is added to the corpus, a new vertex corresponding to the added input, as well as a new edge that connects its base input to itself are written to the given file. Each vertex is labeled with the filename of the input, and each edge is labeled with the mutation sequence that led to the input w.r.t. its base input. The format of the mutation graph file is the dot file format. Once prepended and appended with "graph {" and "}", respectively, the graph becomes a valid dot file and can be visualized. Differential Revision: https://reviews.llvm.org/D86560	2020-09-09 03:28:53 +00:00
Dokyung Song	5cda4dc7b4	[libFuzzer] Scale energy assigned to each input based on input execution time. This patch scales the energy computed by the Entropic schedule based on the execution time of each input. The input execution time is compared with the average execution time of inputs in the corpus, and, based on the amount by which they differ, the energy is scaled from 0.1x (for inputs executing slow) to 3x (for inputs executing fast). Note that the exact scaling criteria and formula is borrowed from AFL. On FuzzBench, this gives a sizeable throughput increase, which in turn leads to more coverage on several benchmarks. For details, see the following report. https://storage.googleapis.com/fuzzer-test-suite-public/exectime-report/index.html Differential Revision: https://reviews.llvm.org/D86092	2020-09-03 20:38:20 +00:00
Dokyung Song	b53243e194	[libFuzzer] Evenly select inputs to cross over with from the corpus regardless of the input's coverage. This patch adds an option "cross_over_uniform_dist", which, if 1, considers all inputs in the corpus for the crossover input selection. More specifically, this patch uses a uniform distribution of all inputs in the corpus for the CrossOver input selection. Note that input selection for mutation is still fully determined by the scheduling policy (i.e., vanilla or Entropic); the uniform distribution only applies to the secondary input selection, only for the crossover mutation of the base input chosen by the scheduling policy. This way the corpus inputs that have useful fragments in them, even though they are deprioritized by the scheduling policy, have chances of getting mixed with other inputs that are prioritized and selected as base input for mutation. Differential Revision: https://reviews.llvm.org/D86954	2020-09-03 19:47:00 +00:00
Dokyung Song	62673c430d	[libFuzzer] Add an option to keep initial seed inputs around. This patch adds an option "keep_seed" to keep all initial seed inputs in the corpus. Previously, only the initial seed inputs that find new coverage were added to the corpus, and all the other initial inputs were discarded. We observed in some circumstances that useful initial seed inputs are discarded as they find no new coverage, even though they contain useful fragments in them (e.g., SQLITE3 FuzzBench benchmark). This newly added option provides a way to keep seed inputs in the corpus for those circumstances. With this patch, and with -keep_seed=1, all initial seed inputs are kept in the corpus regardless of whether they find new coverage or not. Further, these seed inputs are not replaced with smaller inputs even if -reduce_inputs=1. Differential Revision: https://reviews.llvm.org/D86577	2020-09-03 15:54:39 +00:00
Matt Morehouse	711b980654	[fuzzer] Create user provided fuzzer writeable directories when requested if they dont exist Currently, libFuzzer will exit with an error message if a non-existent directory is provided for any of the appropriate arguments. For cases where libFuzzer is used in a specialized embedded environment, it would be much easier to have libFuzzer create the directories for the user. This patch accommodates for this scenario by allowing the user to provide the argument `-create_missing_dirs=1` which makes libFuzzer attempt to create the `artifact_prefix`, `exact_artifact_path`, `features_dir` and/or corpus directory if they don't already exist rather than throw an error and exit. Split off from D84808 as requested [here](https://reviews.llvm.org/D84808#2208546). Reviewed By: morehouse Differential Revision: https://reviews.llvm.org/D86733	2020-09-03 08:31:59 -07:00
Matt Morehouse	10670bdf54	Revert "[fuzzer] Create user provided fuzzer writeable directories when requested if they dont exist" This reverts commit `cb8912799d`, since the test fails on Windows.	2020-09-01 12:05:46 -07:00
Matt Morehouse	cb8912799d	[fuzzer] Create user provided fuzzer writeable directories when requested if they dont exist Currently, libFuzzer will exit with an error message if a non-existent directory is provided for any of the appropriate arguments. For cases where libFuzzer is used in a specialized embedded environment, it would be much easier to have libFuzzer create the directories for the user. This patch accommodates for this scenario by allowing the user to provide the argument `-create_missing_dirs=1` which makes libFuzzer attempt to create the `artifact_prefix`, `exact_artifact_path`, `features_dir` and/or corpus directory if they don't already exist rather than throw an error and exit. Split off from D84808 as requested [here](https://reviews.llvm.org/D84808#2208546). Reviewed By: morehouse Differential Revision: https://reviews.llvm.org/D86733	2020-09-01 11:50:47 -07:00
Matt Morehouse	2392ff093a	[libFuzzer] Error and exit if user supplied fuzzer writeable directories don't exist Currently, libFuzzer will exit with an error message if a non-existent corpus directory is provided. However, if a user provides a non-existent directory for the `artifact_prefix`, `exact_artifact_path`, or `features_dir`, libFuzzer will continue execution but silently fail to write artifacts/features. To improve the user experience, this PR adds validation for the existence of all user supplied directories before executing the main fuzzing loop. If they don't exist, libFuzzer will exit with an error message. Patch By: dgg5503 Reviewed By: morehouse Differential Revision: https://reviews.llvm.org/D84808	2020-08-26 09:27:07 -07:00
Ilya Leoshkevich	9df7ee34e1	[libFuzzer] Fix minimizing timeouts When one tries to minimize timeouts using -minimize_crash=1, minimization immediately fails. The following sequence of events is responsible for this: [parent] SIGALRM occurs [parent] read() returns -EINTR (or -ERESTARTSYS according to strace) [parent] fgets() returns NULL [parent] ExecuteCommand() closes child's stdout and returns [child ] SIGALRM occurs [child ] AlarmCallback() attempts to write "ALARM: ..." to stdout [child ] Dies with SIGPIPE without calling DumpCurrentUnit() [parent] Does not see -exact_artifact_path and exits When minimizing, the timer in parent is not necessary, so fix by not setting it in this case. Reviewed By: morehouse Differential Revision: https://reviews.llvm.org/D85359	2020-08-11 22:16:12 +02:00
Matt Morehouse	34ddf0b2b0	Replace fuzzer::FuzzerDriver's INTERFACE marking with new LLVMRunFuzzerDriver. This adds a new extern "C" function that serves the same purpose. This removes the need for external users to depend on internal headers in order to use this feature. It also standardizes the interface in a way that other fuzzing engines will be able to match. Patch By: IanPudney Reviewed By: kcc Differential Revision: https://reviews.llvm.org/D84561	2020-07-27 18:38:04 +00:00
Vitaly Buka	c12f111846	Corrected __libfuzzer_is_present symbol inclusion for MSVC x86 32-bit The incorrect symbol will cause linking failures for 32-bit targets: clang_rt.fuzzer-i386.lib(FuzzerDriver.obj) : error LNK2001: unresolved external symbol __libfuzzer_is_present Verified no longer fails to link with this change for 32-bit and still succeeds for 64-bit MSVC. Reviewed By: vitalybuka Differential Revision: https://reviews.llvm.org/D83594	2020-07-17 18:29:47 -07:00
Dokyung Song	226866e115	[libFuzzer] Separate platform related macros out from FuzzerDefs.h into FuzzerPlatform.h, and adjust includes in other files. Summary: This patch separates platform related macros in lib/fuzzer/FuzzerDefs.h into lib/fuzzer/FuzzerPlatform.h, and use FuzzerPlatform.h where necessary. This separation helps when compiling libFuzzer's interceptor module (under review); an unnecessary include of standard headers (such as string.h) may produce conflicts/ambiguation with the interceptor's declarations/definitions of library functions, which complicates interceptor implementation. Reviewers: morehouse, hctim Reviewed By: morehouse Subscribers: krytarowski, #sanitizers Tags: #sanitizers Differential Revision: https://reviews.llvm.org/D83805	2020-07-14 21:31:56 +00:00
Matt Morehouse	e2e38fca64	Entropic: Boosting LibFuzzer Performance Summary: This is collaboration between Marcel Boehme @ Monash, Australia and Valentin Manès plus Sang Kil Cha @ KAIST, South Korea. We have made a few modifications to boost LibFuzzer performance by changing how weights are assigned to the seeds in the corpus. Essentially, seeds that reveal more "information" about globally rare features are assigned a higher weight. Our results on the Fuzzer Test Suite seem quite promising. In terms of bug finding, our Entropic patch usually finds the same errors much faster and in more runs. In terms of coverage, our version Entropic achieves the same coverage in less than half the time for the majority of subjects. For the lack of space, we shared more detailed performance results directly with @kcc. We'll publish the preprint with all the technical details as soon as it is accepted. Happy to share if you drop us an email. There should be plenty of opportunities to optimise further. For instance, while Entropic achieves the same coverage in less than half the time, Entropic has a much lower #execs per second. We ran the perf-tool and found a few performance bottlenecks. Thanks for open-sourcing LibFuzzer (and the entire LLVM Compiler Infrastructure)! This has been such a tremendous help to my research. Patch By: Marcel Boehme Reviewers: kcc, metzman, morehouse, Dor1s, vitalybuka Reviewed By: kcc Subscribers: dgg5503, Valentin, llvm-commits, kcc Tags: #llvm Differential Revision: https://reviews.llvm.org/D73776	2020-05-19 10:28:57 -07:00
Yuanfang Chen	4caeb62e51	[Fuzzer] Rename ExecuteCommandWithPopen to ExecuteCommand. Non-Fuchsia targets will keep using the popen/pclose implementation. On Fuchsia, the two-argument version of `ExecuteCommand` is a simple wrapper of the one-argument version. (Hopefully) Fix D73329 build on Fuchsia.	2020-02-12 16:03:55 -08:00
Yuanfang Chen	4f3c3bbbf8	Reland "[NFC][libFuzzer] Prefix TempPath with string showing the work it is doing." With fix (somehow one hunk is missed).	2020-02-10 18:23:39 -08:00
Yuanfang Chen	b1c7623982	Revert "[NFC][libFuzzer] Prefix TempPath with string showing the work it is doing." This reverts commit `8a29cb4421`. fuzzer-linux bot has failure because of this.	2020-02-10 18:06:17 -08:00
Yuanfang Chen	8a29cb4421	[NFC][libFuzzer] Prefix TempPath with string showing the work it is doing.	2020-02-10 16:45:10 -08:00
Yuanfang Chen	85515c7fd5	[libFuzzer] communicate through pipe to subprocess for MinimizeCrashInput For CleanseCrashInput, discards stdout output anyway since it is not used. These changes are to defend against aggressive PID recycle on windows to reduce the chance of contention on files. Using pipe instead of file also workaround the problem that when the process is spawned by llvm-lit, the aborted process keeps a handle to the output file such that the output file can not be removed. This will cause random test failures. https://devblogs.microsoft.com/oldnewthing/20110107-00/?p=11803 Reviewers: kcc, vitalybuka Reviewed By: vitalybuka Differential Revision: https://reviews.llvm.org/D73329	2020-02-10 16:45:10 -08:00
Max Moroz	15f1d5d144	[libFuzzer] Add INFO output when LLVMFuzzerCustomMutator is found. Summary: there is an ongoing work on interchangeable custom mutators (https://github.com/google/clusterfuzz/pull/1333/files#r367706283) and having some sort of signalling from libFuzzer that it has loaded a custom mutator would be helpful. The initial idea was to make the mutator to print something, but given the anticipated variety of different mutators, it does not seem possible to make all of them print the same message to signal their execution. Reviewers: kcc, metzman Reviewed By: metzman Subscribers: #sanitizers, llvm-commits Tags: #sanitizers, #llvm Differential Revision: https://reviews.llvm.org/D73136	2020-01-22 12:56:16 -08:00
Jonathan Metzman	23bee0b0cf	[fuzzer] Add basic support for emscripten. Summary: Add basic support for emscripten. This enables libFuzzer to build (using build.sh) for emscripten and fuzz a target compiled with -fsanitize-coverage=inline-8bit-counters. Basic fuzzing and bug finding work with this commit. RSS limit and timeouts will not work because they depend on system functions that are not implemented/widely supported in emscripten. Reviewers: kcc, vitalybuka, hctim Reviewed By: hctim Subscribers: #sanitizers, llvm-commits Tags: #sanitizers, #llvm Differential Revision: https://reviews.llvm.org/D71285	2019-12-12 08:56:47 -08:00
Matt Morehouse	1c8e05110c	[libFuzzer] Remove lazy counters. Summary: Lazy counters haven't improved performance for large fuzz targets. Reviewers: kcc Reviewed By: kcc Subscribers: llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D67476 llvm-svn: 373403	2019-10-01 22:49:06 +00:00
Kostya Serebryany	db88fc56b9	[libFuzzer] implement a better queue for the fork mode. Add an internal flag -stop_file to allow graceful shutdown of fuzzing. Enhance the logging in the fork mode llvm-svn: 363470	2019-06-14 22:56:50 +00:00
Max Moroz	0784e01a98	[libFuzzer] Disable len_control by default if LLVMFuzzerCustomMutator is used. Summary: Some custom mutators may not perform well when size restriction is enforced by len_control. Because of that, it's safer to disable len_control by default in such cases, but still allow users to enable it manually. Bug example: https://bugs.chromium.org/p/chromium/issues/detail?id=919530. Tested manually with LPM-based and regular fuzz targets. Reviewers: kcc, vitalybuka, metzman Reviewed By: kcc, metzman Subscribers: delcypher, #sanitizers, llvm-commits Tags: #llvm, #sanitizers Differential Revision: https://reviews.llvm.org/D63334 llvm-svn: 363443	2019-06-14 19:34:11 +00:00
Kostya Serebryany	b7cc3d9953	[libFuzzer] automatically collect the data flow trace (DFT) in the fork mode if -collect_data_flow= is given llvm-svn: 361448	2019-05-23 00:22:46 +00:00
Kostya Serebryany	4c7353c53b	[libFuzzer] code refactoring; NFC llvm-svn: 360400	2019-05-10 01:34:26 +00:00
Kostya Serebryany	da96d92175	[libFuzzer] small refactoring in the driver; dummy implementation of collect_data_flow; attempt to fix the windows bot llvm-svn: 360399	2019-05-10 00:59:32 +00:00
Jonathan Metzman	f3ee97731e	[libFuzzer] Replace -seed_corpus to better support fork mode on Win Summary: Pass seed corpus list in a file to get around argument length limits on Windows. This limit was preventing many uses of fork mode on Windows. Reviewers: kcc, morehouse Reviewed By: kcc Subscribers: #sanitizers, llvm-commits Tags: #sanitizers, #llvm Differential Revision: https://reviews.llvm.org/D60980 llvm-svn: 359610	2019-04-30 20:56:18 +00:00
Kostya Serebryany	4614cc3dfd	[libFuzzer] add -features_dir= flag to dump unique input features on disk llvm-svn: 358317	2019-04-13 00:20:31 +00:00
Kostya Serebryany	9982ee5472	[libFuzzer] fork mode: add -ignore_crashes flag, honor the max_total_time flag, print the number of ooms/timeouts/crashes, fix a typo llvm-svn: 354175	2019-02-15 21:51:15 +00:00
Kostya Serebryany	8c2791469d	[libFuzzer] when doing the merge, keep track of the covered edges, not just features llvm-svn: 354087	2019-02-15 00:08:16 +00:00
Kostya Serebryany	60c5ded269	[libFuzzer] add threads to the fork mode: now you can pass -fork=N to run N concurrent workers. Fork mode is still work-in-progress. llvm-svn: 353997	2019-02-14 00:25:43 +00:00
Kostya Serebryany	5c08e811de	[libFuzzer] move the implementation of the fork mode into a separate file llvm-svn: 353891	2019-02-12 22:48:55 +00:00
Kostya Serebryany	2b9a8f37a4	[libFuzzer] make the fork mode less verbose llvm-svn: 353794	2019-02-12 03:12:40 +00:00
Kostya Serebryany	cdbb9dc962	[libFuzzer] teach the fork mode to ignore OOMs and timeouts llvm-svn: 353792	2019-02-12 02:18:53 +00:00
Kostya Serebryany	63f48717b5	[libFuzzer] extend the -fork=1 functionality. Still not fully usable, but good enough for the first unit test llvm-svn: 353775	2019-02-12 00:12:33 +00:00
Kostya Serebryany	0fda9dcbe9	[libFuzzer] more refactoring; change some of the exit codes (timeout, OOM, interrupt) so that the parent process can distinguish those llvm-svn: 353584	2019-02-09 00:16:21 +00:00
Kostya Serebryany	114cfafe05	[libFuzzer] refactor the merging code, NFC llvm-svn: 353576	2019-02-08 22:59:03 +00:00
Kostya Serebryany	b1e8b8149b	[libFuzzer] remove two unused experimental flags llvm-svn: 353573	2019-02-08 22:02:37 +00:00
Kostya Serebryany	f762a11544	[libFuzzer] introduce an experimental mode -fork=1, where fuzzing happens in a subprocess (still running multiple inputs per process), thus making the fuzzing more resilient to timeouts and OOMs. This is just a skeleton of the code, and some associated refactoring, not a fully working feature yet. llvm-svn: 353570	2019-02-08 21:27:23 +00:00
Kostya Serebryany	0719b3527f	[libFuzzer] refactor the way we choose the element to cross-over with, NFC (expected1); add a flag -seed_inputs= to pass extra seed inputs as file paths, not dirs llvm-svn: 353494	2019-02-08 01:20:54 +00:00
Kostya Serebryany	23482e1d07	[libFuzzer] set libFuzzer's own SEGV handler even if one is already present, but call that handler from ours (unless we are unprotecting lazy counters). Call ProtectLazyCounters later, so that it runs after the initialization code in the target. llvm-svn: 352713	2019-01-31 01:40:14 +00:00
Kostya Serebryany	8da9479e40	[libFuzzer] experimental performance optimization -lazy_counters, off by default. Posix-only for now, tested on Linux llvm-svn: 352700	2019-01-31 00:09:43 +00:00
Kostya Serebryany	5c0751ec3e	[libFuzzer] remove stale code, NFC llvm-svn: 352604	2019-01-30 06:21:20 +00:00
Kostya Serebryany	2891b257c2	[libFuzzer] remove stale code llvm-svn: 352571	2019-01-29 23:53:28 +00:00
Chandler Carruth	2946cd7010	Update the file headers across all of the LLVM projects in the monorepo to reflect the new license. We understand that people may be surprised that we're moving the header entirely to discuss the new license. We checked this carefully with the Foundation's lawyer and we believe this is the correct approach. Essentially, all code in the project is now made available by the LLVM project under our new license, so you will see that the license headers include that license only. Some of our contributors have contributed code under our old license, and accordingly, we have retained a copy of our old license notice in the top-level files in each project and repository. llvm-svn: 351636	2019-01-19 08:50:56 +00:00
Jonathan Metzman	b795c31dff	[libFuzzer][MSVC] Make attribute-use compatible with MSVC Summary: Replace attributes with macros that use equivalent declspecs for MSVC. Reviewers: vitalybuka Reviewed By: vitalybuka Differential Revision: https://reviews.llvm.org/D56512 llvm-svn: 351456	2019-01-17 16:36:05 +00:00
Jonathan Metzman	9e14cccf6f	[libFuzzer] Remove unstable edge handling Summary: Remove code for handling unstable edges from libFuzzer since it has not been found useful. Differential Revision: https://reviews.llvm.org/D56730 llvm-svn: 351262	2019-01-15 22:12:51 +00:00
Max Moroz	8c95b48ba2	[libFuzzer] Remove mutation stats and weighted mutation selection. Summary: This was an experimental feature. After evaluating it with: 1) https://github.com/google/fuzzer-test-suite/tree/master/engine-comparison 2) enabling on real world fuzz targets running at ClusterFuzz and OSS-Fuzz The following conclusions were made: 1) With fuzz targets that have reached a code coverage plateau, the feature does not improve libFuzzer's ability to discover new coverage and may actually negatively impact it. 2) With fuzz targets that have not yet reached a code coverage plateau, the feature might speed up new units discovery in some cases, but it is quite rare and hard to confirm with a high level of confidence. Revert of https://reviews.llvm.org/D48054 and https://reviews.llvm.org/D49621. Reviewers: metzman, morehouse Reviewed By: metzman, morehouse Subscribers: delcypher, #sanitizers, llvm-commits, kcc Differential Revision: https://reviews.llvm.org/D51455 llvm-svn: 340976	2018-08-29 21:53:15 +00:00
Max Moroz	5a9baa330c	[libFuzzer] Initial implementation of weighted mutation leveraging during runtime. Summary: Added functions that calculate stats while fuzz targets are running and give mutations weight based on how much new coverage they provide, and choose better performing mutations more often. Patch by Kodé Williams (@kodewilliams). Reviewers: Dor1s, metzman, morehouse Reviewed By: Dor1s, morehouse Subscribers: delcypher, kcc, llvm-commits, #sanitizers Differential Revision: https://reviews.llvm.org/D49621 llvm-svn: 338776	2018-08-02 22:30:03 +00:00
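
Commit `1bb1eac6b1` above notes that the file written by `-mutation_graph_file=FILE` becomes a valid dot file once it is prepended with "graph {" and appended with "}". A minimal sketch of that wrapping step follows. The helper names `wrap_mutation_graph` and `convert` are hypothetical, and the sketch treats the file contents as opaque text, making no assumption about the exact vertex and edge lines libFuzzer emits.

```python
from pathlib import Path


def wrap_mutation_graph(raw: str) -> str:
    """Wrap raw mutation-graph text in "graph {" ... "}".

    Hypothetical helper: the body is passed through unchanged apart from
    trimming trailing newlines, so no libFuzzer output format is assumed.
    """
    body = raw.rstrip("\n")
    return "graph {\n" + body + "\n}\n"


def convert(src: str, dst: str) -> None:
    """Read the raw trace from src and write the wrapped dot file to dst."""
    Path(dst).write_text(wrap_mutation_graph(Path(src).read_text()))
```

Assuming Graphviz is installed, the wrapped file could then be rendered with something like `dot -Tsvg out.dot -o out.svg`.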


87 Commits