mirrors/zig - "Borealis" Git by INX: Hosted by INX "Xenon".

mirror of https://codeberg.org/ziglang/zig.git synced 2025-12-07 22:34:28 +00:00

Author	SHA1	Message	Date
Kendall Condon	93775de45f	rework fuzz testing to be smith based -- On the standard library side: The `input: []const u8` parameter of functions passed to `testing.fuzz` has changed to `smith: testing.Smith`. `Smith` is used to generate values from libfuzzer or input bytes generated by libfuzzer. `Smith` contains the following base methods: * `value` as a generic method for generating any type. * `eos` for generating end-of-stream markers. Provides the additional guarantee `true` will eventually be provided. * `bytes` for filling a byte array. * `slice` for filling part of a buffer and providing the length. `Smith.Weight` is used for giving value ranges a higher probability of being selected. By default, every value has a weight of zero (i.e. they will not be selected). Weights can only apply to values that fit within a u64. The above functions have corresponding ones that accept weights. Additionally, the following functions are provided: * `baselineWeights` which provides a set of weights containing every possible value of a type. * `eosSimpleWeighted` for unique weights for `true` and `false`. * `valueRangeAtMost` and `valueRangeLessThan` for weighting only a range of values. -- On the libfuzzer and abi side: --- Uids These are u32s which are used to classify requested values. This solves the problem of a mutation causing a new value to be requested and shifting all future values; for example: 1. An initial input contains the values 1, 2, 3 which are interpreted as a, b, and c respectively by the test. 2. The 1 is mutated to a 4 which causes the test to request an extra value interpreted as d. The input is now 4, 2, 3, 5 (new value) which the test interprets as a, d, b, c; however, b and c no longer correspond to their original values. Uids contain a hash component and type component. The hash component is currently determined in `Smith` by taking a hash of the calling `@returnAddress()` or via an argument in the corresponding `WithHash` functions. 
The type component is used extensively in libfuzzer with its hashmaps. --- Mutations At the start of a cycle (a run), a random number of values to mutate is selected with fewer being exponentially more likely. The indexes of the values are selected from a selected uid with a logarithmic bias to uids with more values. Mutations may change a single value, several consecutive values in a uid, or several consecutive values in the uid-independent order they were requested. They may generate random values, mutate from previous ones, or copy from other values in the same uid from the same input or spliced from another. For integers, mutations from previous ones currently only generate random values. For bytes, mutations from previous ones mix new random data and previous bytes with a set number of mutations. --- Passive Minimization A different approach has been taken for minimizing inputs: instead of trying a fixed set of mutations when a fresh input is found, the input is instead simply added to the corpus and removed when it is no longer valuable. The quality of an input is measured based on how many unique pcs it hit and how many values it needed from the fuzzer. It is tracked which inputs hold the best qualities for each pc for hitting the minimum and maximum unique pcs while needing the least values. Once all of an input's qualities have been superseded for the pcs it hit, it is removed from the corpus. -- Comparison to byte-based smith A byte-based smith would be much less efficient and more complex than this solution. It would be unable to solve the shifting problem that Uids do. It is unable to provide values from the fuzzer past end-of-stream. Even with feedback, it would be unable to act on dynamic weights which have proven essential with the updated tests (e.g. to constrain values to a range). -- Test updates All the standard library tests have been updated to use the new smith interface. 
For `Deque`, an ad hoc allocator was written to improve performance and remove reliance on heap allocation. `TokenSmith` has been added to aid in testing Ast and help inform decisions on the smith interface.	2025-11-23 14:58:22 -05:00
Ali Cheraghi	dec1163fbb	all: replace all `@Type` usages Co-authored-by: Matthew Lugg <mlugg@mlugg.co.uk>	2025-11-22 22:42:38 +00:00
Benjamin Jurk	4b5351bc0d	update deprecated ArrayListUnmanaged usage (#25958)	2025-11-20 14:46:23 -08:00
Matthew Lugg	010dcd6a9b	fuzzer: account for runtime address slide This is relevant to PIEs, which are notably enabled by default on macOS. The build system needs to only see virtual addresses, that is, those which do not have the slide applied; but the fuzzer itself naturally sees relocated addresses (i.e. with the slide applied). We just need to subtract the slide when we communicate addresses to the build system.	2025-11-20 10:42:20 +00:00
Andrew Kelley	aadd8d4a3e	std: back out the StackTrace byval changes Let's keep passing this thing by pointer	2025-10-29 06:20:50 -07:00
Andrew Kelley	10b1eef2d3	std: fix compilation errors on Windows	2025-10-29 06:20:50 -07:00
Andrew Kelley	47aa5a70a5	std: updating to std.Io interface got the build runner compiling	2025-10-29 06:20:48 -07:00
Andrew Kelley	d801a71d29	add std.testing.io	2025-10-29 06:20:48 -07:00
mlugg	d0b92a8022	std.Build: do not expect server protocol for tests using immature backends For instance, when running a Zig test using the self-hosted aarch64 backend, this logic was previously expecting `std.zig.Server` to be used, but the default test runner intentionally does not do this because the backend is too immature to handle it. On 'master', this is causing sporadic failures; on this branch, they became consistent failures.	2025-10-18 09:28:42 +01:00
mlugg	e4456d03f3	std.Build.Step.Run: many enhancements This is a major refactor to `Step.Run` which adds new functionality, primarily to the execution of Zig tests. * All tests are run, even if a test crashes. This happens through the same mechanism as timeouts where the test process is repeatedly respawned as needed. * The build status output is more precise. For each unit test, it differentiates pass, skip, fail, crash, and timeout. Memory leaks are reported separately, as they do not indicate a test's "status", but are rather an additional property (a test with leaks may still pass!). * The number of memory leaks is tracked and reported, both per-test and for a whole `Run` step. * Reporting is made clearer when a step is failed solely due to error logs (`std.log.err`) where every unit test passed.	2025-10-18 09:28:41 +01:00
mlugg	7e7d7875b9	std.Build: implement unit test timeouts For now, there is a flag to `zig build` called `--test-timeout-ms` which accepts a value in milliseconds. If the execution time of any individual unit test exceeds that number of milliseconds, the test is terminated and marked as timed out. In the future, we may want to increase the granularity of this feature by allowing timeouts to be specified per-step or even per-test. However, a global option is actually very useful. In particular, it can be used in CI scripts to ensure that no individual unit test exceeds some reasonable limit (e.g. 60 seconds) without having to assign limits to every individual test step in the build script. Also, individual unit test durations are now shown in the time report web interface -- this was fairly trivial to add since we're timing tests (to check for timeouts) anyway. This commit makes progress on #19821, but does not close it, because that proposal includes a more sophisticated mechanism for setting timeouts. Co-Authored-By: David Rubin <david@vortan.dev>	2025-10-18 09:28:39 +01:00
mlugg	51d08f4b9b	fix compile errors and minor bugs	2025-09-30 13:44:54 +01:00
Loris Cro	9bb0b43ea3	implement review suggestions	2025-09-25 18:20:19 +02:00
Loris Cro	0feacc2b81	fuzzing: implement limited fuzzing Adds the limit option to `--fuzz=[limit]`. The limit expresses the maximum number of iterations that each fuzz test will perform before exiting. The limit argument also supports 'K', 'M', and 'G' suffixes (e.g. '10K'). Does not imply `--web-ui` (like unlimited fuzzing does) and prints a fuzzing report at the end. Closes #22900 but does not implement the time-based limit, as after internal discussions we concluded it would be problematic to both implement and use correctly.	2025-09-24 12:46:48 +02:00
Kendall Condon	e66b269333	greatly improve capabilities of the fuzzer This PR significantly improves the capabilities of the fuzzer. The changes made to the fuzzer to accomplish this feat mostly include tracking memory reads from .rodata to determine fresh inputs, new mutations (especially the ones that insert const values from .rodata reads and __sanitizer_conv_const_cmp), and minimizing found inputs. Additionally, the number of runs per second has been greatly increased due to generating smaller inputs and avoiding clearing the 8-bit pc counters. An additional feature added is that the length of the input file is now stored and the old input file is rerun upon start. Other changes made to the fuzzer include more logical initialization, using one shared file `in` for inputs, creating corpus files with proper sizes, and using hexadecimal-numbered corpus files for simplicity. Furthermore, I added several new fuzz tests to gauge the fuzzer's efficiency. I also tried to add a test for zstandard decompression, which crashed within 60,000 runs (less than a second). Bug fixes include: * Fixed a race condition when multiple fuzzer processes needed to use the same coverage file. * Web interface stats now update even when unique runs is not changing. * Fixed tokenizer.testPropertiesUpheld to allow stray carriage returns since they are valid whitespace.	2025-09-18 18:56:10 -04:00
Jacob Young	5060ab99c9	aarch64: add new from scratch self-hosted backend	2025-07-22 19:43:47 -07:00
Andrew Kelley	c40fb96ca3	std.Io.Writer: fix writeSliceSwap tried to be too clever, wrote bad code	2025-07-19 22:12:37 -07:00
Andrew Kelley	93378e2e7b	std.zig: finish updating to new I/O API	2025-07-19 19:57:37 -07:00
Andrew Kelley	0e37ff0d59	std.fmt: breaking API changes added adapter to AnyWriter and GenericWriter to help bridge the gap between old and new API make std.testing.expectFmt work at compile-time std.fmt no longer has a dependency on std.unicode. Formatted printing was never properly unicode-aware. Now it no longer pretends to be. Breakage/deprecations: * std.fs.File.reader -> std.fs.File.deprecatedReader * std.fs.File.writer -> std.fs.File.deprecatedWriter * std.io.GenericReader -> std.io.Reader * std.io.GenericWriter -> std.io.Writer * std.io.AnyReader -> std.io.Reader * std.io.AnyWriter -> std.io.Writer * std.fmt.format -> std.fmt.deprecatedFormat * std.fmt.fmtSliceEscapeLower -> std.ascii.hexEscape * std.fmt.fmtSliceEscapeUpper -> std.ascii.hexEscape * std.fmt.fmtSliceHexLower -> {x} * std.fmt.fmtSliceHexUpper -> {X} * std.fmt.fmtIntSizeDec -> {B} * std.fmt.fmtIntSizeBin -> {Bi} * std.fmt.fmtDuration -> {D} * std.fmt.fmtDurationSigned -> {D} * {} -> {f} when there is a format method * format method signature - anytype -> std.io.Writer - inferred error set -> error{WriteFailed} - options -> (deleted) std.fmt.Formatted - now takes context type explicitly - no fmt string	2025-07-07 22:43:51 -07:00
Andrew Kelley	0b3f0124dc	std.io: move getStdIn, getStdOut, getStdErr functions to fs.File preparing to rearrange std.io namespace into an interface how to upgrade: std.io.getStdIn() -> std.fs.File.stdin() std.io.getStdOut() -> std.fs.File.stdout() std.io.getStdErr() -> std.fs.File.stderr()	2025-07-07 22:43:51 -07:00
Bingwu Zhang	ff06de4c89	riscv64: enable test summary printing	2025-06-28 06:47:09 +08:00
Bingwu Zhang	5db395f20b	compiler: test runner: fix tests never failing on crippled architectures	2025-06-27 19:12:16 +08:00
Alex Rønne Petersen	999777e73a	compiler: Scaffold stage2_powerpc backend. Nothing interesting here; literally just the bare minimum so I can work on this on and off in a branch without worrying about merge conflicts in the non-backend code.	2025-05-20 10:23:16 +02:00
Linus Groh	79460d4a3e	Remove uses of deprecated callconv aliases	2025-03-05 03:01:43 +00:00
Ali Cheraghi	181a89e728	build: add spirv to test matrix Signed-off-by: Ali Cheraghi <alichraghi@proton.me>	2025-02-24 19:12:38 +01:00
Andrew Kelley	d789f1e5cf	fuzzer: write inputs to shared memory before running breaking change to the fuzz testing API; it now passes a type-safe context parameter to the fuzz function. libfuzzer is reworked to select inputs from the entire corpus. I tested that it's roughly as good as it was before in that it can find the panics in the simple examples, as well as achieve decent coverage on the tokenizer fuzz test. however I think the next step here will be figuring out why so many points of interest are missing from the tokenizer in both Debug and ReleaseSafe modes. does not quite close #20803 yet since there are some more important things to be done, such as opening the previous corpus, continuing fuzzing after finding bugs, storing the length of the inputs, etc.	2025-02-11 13:39:20 -08:00
mlugg	f83bb94ca6	test_runner: replace ugly hack with `@FieldType`	2024-11-11 09:05:51 +00:00
mlugg	d11bbde5f9	compiler: remove anonymous struct types, unify all tuples This commit reworks how anonymous struct literals and tuples work. Previously, an untyped anonymous struct literal (e.g. `const x = .{ .a = 123 }`) was given an "anonymous struct type", which is a special kind of struct which coerces using structural equivalence. This mechanism was a holdover from before we used RLS / result types as the primary mechanism of type inference. This commit changes the language so that the type assigned here is a "normal" struct type. It uses a form of equivalence based on the AST node and the type's structure, much like a reified (`@Type`) type. Additionally, tuples have been simplified. The distinction between "simple" and "complex" tuple types is eliminated. All tuples, even those explicitly declared using `struct { ... }` syntax, use structural equivalence, and do not undergo staged type resolution. Tuples are very restricted: they cannot have non-`auto` layouts, cannot have aligned fields, and cannot have default values with the exception of `comptime` fields. Tuples currently do not have optimized layout, but this can be changed in the future. This change simplifies the language, and fixes some problematic coercions through pointers which led to unintuitive behavior. Resolves: #16865	2024-10-31 20:42:53 +00:00
Linus Groh	8588964972	Replace deprecated default initializations with decl literals	2024-09-12 16:01:23 +01:00
Andrew Kelley	9bc731b30a	fuzzing: better std.testing.allocator lifetime management	2024-09-11 13:41:29 -07:00
Andrew Kelley	2b76221a46	libfuzzer: use a function pointer instead of extern solves the problem presented in the previous commit message	2024-09-11 13:41:29 -07:00
Andrew Kelley	892ce7ef52	rework fuzzing API The previous API used `std.testing.fuzzInput(.{})` however that has the problem that users call it multiple times incorrectly, and there might be work happening to obtain the corpus which should not be included in coverage analysis, and which must not slow down iteration speed. This commit restructures it so that the main loop lives in libfuzzer and directly calls the "test one" function. In this commit I was a little too aggressive because I made the test runner export `fuzzer_one` for this purpose. This was motivated by performance, but it causes "exported symbol collision: fuzzer_one" to occur when more than one fuzz test is provided. There are three ways to solve this: 1. libfuzzer needs to be passed a function pointer instead. Possible performance downside. 2. build runner needs to build a different process per fuzz test. Potentially wasteful and unclear how to isolate them. 3. test runner needs to perform a relocation at runtime to point the function call to the relevant unit test. Portability issues and dubious performance gains.	2024-09-11 13:41:29 -07:00
Andrew Kelley	13b5cee4cc	fuzzing: fix entry address logic * the pcs list is unsorted * use the function address Fixes entry points in ReleaseSafe mode.	2024-08-28 18:07:13 -07:00
mlugg	0fe3fd01dd	std: update `std.builtin.Type` fields to follow naming conventions The compiler actually doesn't need any functional changes for this: Sema does reification based on the tag indices of `std.builtin.Type` already! So, no zig1.wasm update is necessary. This change is necessary to disallow name clashes between fields and decls on a type, which is a prerequisite of #9938.	2024-08-28 08:39:59 +01:00
Andrew Kelley	e64a00950e	fuzzer web ui: introduce entry points so you can have somewhere to start browsing	2024-08-07 00:48:32 -07:00
Andrew Kelley	e0ffac4e3c	introduce a web interface for fuzzing * new .zig-cache subdirectory: 'v' - stores coverage information with filename of hash of PCs that want coverage. This hash is a hex encoding of the 64-bit coverage ID. * build runner * fixed bug in file system inputs when a compile step has an overridden zig_lib_dir field set. * set some std lib options optimized for the build runner - no side channel mitigations - no Transport Layer Security - no crypto fork safety * add a --port CLI arg for choosing the port the fuzzing web interface listens on. it defaults to choosing a random open port. * introduce a web server, and serve a basic single page application - shares wasm code with autodocs - assets are created live on request, for convenient development experience. main.wasm is properly cached if nothing changes. - sources.tar comes from file system inputs (introduced with the `--watch` feature) * receives coverage ID from test runner and sends it on a thread-safe queue to the WebServer. * test runner - takes a zig cache directory argument now, for where to put coverage information. - sends coverage ID to parent process * fuzzer - puts its logs (in debug mode) in .zig-cache/tmp/libfuzzer.log - computes coverage_id and makes it available with `fuzzer_coverage_id` exported function. - the memory-mapped coverage file is now namespaced by the coverage id in hex encoding, in `.zig-cache/v` * tokenizer - add a fuzz test to check that several properties are upheld	2024-08-07 00:48:32 -07:00
Andrew Kelley	97643c1ecc	fuzzer: track code coverage from all runs When a unique run is encountered, track it in a bit set memory-mapped into the fuzz directory so it can be observed by other processes, even while the fuzzer is running.	2024-08-07 00:48:32 -07:00
Andrew Kelley	5c6f5e6cf2	test runner: avoid spawning progress thread when instrumented because it causes unwanted concurrent accesses to pc tracking	2024-08-01 13:47:09 -07:00
David Rubin	1a7d89a84d	riscv: clean up and unify encoding logic	2024-07-26 04:19:13 -07:00
David Rubin	8d30fc45c4	riscv: implement more operators we can run `std.debug.print` now, with both run-time strings and integers!	2024-07-26 04:05:39 -07:00
Andrew Kelley	90dfd86ebe	test runner: always report fuzz tests This way they can be smoke tested.	2024-07-25 18:52:21 -07:00
Andrew Kelley	7366b4b9e2	test runner: handle start_fuzzing message	2024-07-25 18:52:21 -07:00
Andrew Kelley	047640383e	add `--fuzz` CLI argument to `zig build` This flag makes the build runner rebuild unit tests after the pipeline finishes, if it finds any unit tests. I did not make this integrate with file system watching yet. The test runner is updated to detect which tests are fuzz tests. Run step is updated to track which test indexes are fuzz tests.	2024-07-25 18:52:20 -07:00
Andrew Kelley	6f3767862d	implement std.testing.fuzzInput For now this returns a dummy fuzz input.	2024-07-25 18:52:20 -07:00
Andrew Kelley	33c7984183	add std.testing.random_seed closes #17609	2024-07-23 11:43:12 -07:00
Andrew Kelley	5f78e28899	test runner: disable stderr printing for riscv64 Make it a little easier for contributors to see CI failures in the logs.	2024-07-20 19:16:00 -07:00
David Rubin	3530308476	test: refactor `mainSimple` added some comments to make it easier for future contributors.	2024-06-13 02:24:39 -07:00
David Rubin	6603a9c26c	testing: fix test runner	2024-06-13 02:20:48 -07:00
David Rubin	39c95e8930	riscv: switch the test runner to `mainSimple`	2024-06-13 02:20:47 -07:00
Andrew Kelley	795c5791a9	test runner: update to new std.Progress API	2024-05-27 20:56:48 -07:00

57 commits