PublicArchive/zig

mirror of https://codeberg.org/ziglang/zig.git synced 2026-06-27 02:55:26 -04:00

Author	SHA1	Message	Date
Matthew Lugg	fecd28371d	Sema: fix crash bitcasting undefined to bitpack type Resolves: https://codeberg.org/ziglang/zig/issues/31944	2026-05-07 06:22:47 +02:00
Matthew Lugg	4c330e053b	compiler: use 'std.lang' instead of 'std.builtin'	2026-05-03 12:23:30 +01:00
Matthew Lugg	e133f793ee	compiler: depend on 'std.lang' instead of 'std.builtin'	2026-05-03 12:23:29 +01:00
Justus Klausecker	1f22b2cbb2	LowerZon: fix `packed` containers Since `packed` containers are now internally represented by a `bitpack`, they need special handling on initialization: they need to be either bitpacked or bitcasted to their backing integer. `Sema` already did this, but `LowerZon` didn't yet.	2026-05-02 20:05:33 +02:00
Matthew Lugg	57634b7809	compiler: remove `i0` from the language Resolves: https://github.com/ziglang/zig/issues/1593	2026-04-30 08:57:51 +01:00
glowsquid	8111d3d63c	fix comptime @ptrcasting from a larger type to a smaller one (#31774 ) closes #30180 Note from mlugg: this fix is very much a hack, but it definitely won't break anything and it demonstrably fixes one case, so I'm merging it for now with the expectation that I'll be replacing the broken code soon. Reviewed-on: https://codeberg.org/ziglang/zig/pulls/31774 Reviewed-by: mlugg <mlugg@noreply.codeberg.org> Co-authored-by: glowsquid <sachabarsayuracko@gmail.com> Co-committed-by: glowsquid <sachabarsayuracko@gmail.com>	2026-04-10 23:31:21 +02:00
Matthew Lugg	c0f3a23831	llvm: get rid of a bunch of `PerThread` usages Also, notably, remove `Air.value`! The `onePossibleValue` check was actually dead code, because it is a bug if Sema ever emits code which considers a value of OPV type to be runtime-known---and at that point `Air.value` is just a thin wrapper around `Air.Ref.toInterned`.	2026-03-28 16:46:59 +00:00
Justus Klausecker	524345b635	Sema: handle containers as inline assembly output types correctly Inline assembly now rejects output types that don't have a well-defined in-memory layout and correctly resolves the layout of the ones that do.	2026-03-17 01:51:55 +01:00
Matthew Lugg	5d215838a7	InternPool.Nav: fix race, refactor I've realised that the cause of at least some of our weird CI flakiness was a bug in how `Nav` values were resolved. Consider this scenario: the frontend resolves the type of a `Nav`, and then sends a function to the backend, which requires the backend to lower a pointer to that `Nav`. The backend calls `InternPool.getNav` to determine the `Nav`'s type. However, this races with the frontend resolving the value of that `Nav`. This involves writing separately to two fields, `bits` and `type_or_value`. If only one of these changes is observed, then the backend will incorrectly interpret the type as the value or vice versa, leading to a crash or even a miscompilation. (Of course, there's also the straightforward issue that the racing loads were non-atomic, making them illegal). The only good solution to this was to make `Nav` 4 bytes bigger, giving it separate `type` and `value` fields. In theory that's a quite small change, but it ended up having a bunch of nice consequences which led to this diff being a bit bulkier than expected: * `Nav.Repr.Bits` was simplified, because it no longer has to track "resolution status": we can use `.none` for that. This frees up some bits to make things more consistent between the "type resolved" and "fully resolved" states. * This consistency allowed the `Nav.status` union to be replaced with a simpler field `Nav.resolved`, which is a bit nicer to work with. * Most of the "getter" functions were able to be removed from `Nav` because the state they were fetching had been moved to simple fields on `Nav.resolved`. * There were still a handful of free bits in `Nav.Repr.Bits`, which could be used to represent the "const" and "threadlocal" flags rather than these being stored on `Key.Extern` and `Key.Variable`. This is a bit more convenient for linkers. * With those bits gone, `Key.Variable` is a trivial wrapper around a type and an initial value, and the fact that a declaration is mutable can be represented solely through the "const" flag. Therefore, `Key.Variable` no longer served a purpose, and could be eliminated entirely in favour of storing the variable's initial value directly in the "value" field of the `Nav`. So, I'm quite pleased with this refactor! But anyway, regarding the bug fix which actually motivated this: if I've done my job correctly, this should solve some crashes, such as these (which were what tipped me off to this bug in the first place): https://codeberg.org/ziglang/zig/actions/runs/2306/jobs/7/attempt/1 https://codeberg.org/ziglang/zig/actions/runs/2173/jobs/6/attempt/1 ...and, who knows, perhaps even the random SIGSEGVs we've seen on some targets! Probably not, but one can hope.	2026-03-15 11:47:14 +00:00
Matthew Lugg	4eb8360911	compiler: various lil' fixes	2026-03-10 10:26:14 +00:00
Matthew Lugg	d462794e20	get the compiler building The change in codegen/x86_64/CodeGen.zig was not strictly necessary (the Sema change I did solves the error I was getting there), I just think it's better style anyway.	2026-03-10 10:26:13 +00:00
Matthew Lugg	51c23f7ba4	compiler: split default field values back out from layout resolution I was trying out combining struct layout resolution with resolution of default field values, but it broke a few cases which it's not clear we want to break. The simplest such case was a struct with a field which was a slice of itself, with a default value of `&.{}`. So, at least for now, I'm accepting defeat and splitting this back out. This allows a couple of behavior tests which were removed to be re-introduced---I will do that in the commit following this one. I have not made this separate phase of resolution "lazy": instead, it is tied to layout resolution, in the sense that if a struct's layout is referenced, then its default field values are also referenced. I chose this approach for simplicity---not of the implementation (it's actually slightly more code to do it this way!), but in terms of the language specification. I think this behavior is easier to understand and keep in your head. It can be easily changed in future if we decide we want to. This partially reverts the commit titled "compiler: merge struct default value resolution into layout resolution".	2026-03-10 10:26:13 +00:00
Matthew Lugg	7170e0f020	Sema: small fixes	2026-03-10 10:26:11 +00:00
Matthew Lugg	5865abf7f5	Sema: defer extern function type validation to declaration or call Because of packed structs, checking whether a type is extern-compatible requires that its layout be resolved. For functions to do this validation as soon as the function type is created would lead to dependency loops in cases like 'const fn (@This()) void callconv(.c)`. Therefore, when creating a function type, we no longer perform this check immediately, instead waiting until the function is called.	2026-03-10 10:26:11 +00:00
Matthew Lugg	031d109310	Sema: small error message fix	2026-03-10 10:26:10 +00:00
Matthew Lugg	5c41b6db87	Sema: disallow empty extern/packed unions These types don't really make much sense: you can't pack together bits of a type which cannot exist, nor can you pass it over an ABI boundary.	2026-03-10 10:26:09 +00:00
Matthew Lugg	03e23bcbde	resolve some of my TODOs	2026-03-10 10:26:09 +00:00
Matthew Lugg	1826ba69d8	compiler: make dependency loop errors good	2026-03-10 10:26:09 +00:00
Matthew Lugg	38fdced8bb	Sema: small cleanup	2026-03-10 10:26:09 +00:00
Matthew Lugg	650185692d	compiler: merge struct default value resolution into layout resolution This actually doesn't cause any dependency loops in std, which is pretty much my benchmark for it being acceptable. This can be reverted if it turns out to be problematic, but for now, let's err on the side of language simplicity. To be clear, this does regress some cases which previously worked: I will have to remove some behavior tests as a result of this commit. To be honest, the tests which look to be failing as a result of this are things which I think are generally unadvisable; I actually reckon a bit more friction to use default field values in non-trivial ways might be a good thing to stop people from misusing them as much. Struct fields should very rarely have default values; about the only common situation where they make sense is "options" structs.	2026-03-10 10:26:08 +00:00
Matthew Lugg	187fef209f	compiler: rework OPV and noreturn-like types	2026-03-10 10:26:08 +00:00
Matthew Lugg	b19074d252	compiler: represent bitpacks as their backing integer Now that https://github.com/ziglang/zig/issues/24657 has been implemented, the compiler can simplify its internal representation of comptime-known `packed struct` and `packed union` values. Instead of storing them field-wise, we can simply store their backing integer value. This simplifies many operations and improves efficiency in some cases.	2026-03-10 10:26:08 +00:00
Matthew Lugg	911294116d	compiler: make type resolution lazy ...and rework some of the incremental reference tracking. Almost all kinds of AnalUnit have one property in common: they might never be referenced in any update despite conceptually "existing", in which case we don't want to waste time semantically analyzing them. As of the lazy type resolution introduced in this commit, the only units to which this does not apply are `memoized_state` and `@"comptime"`. Previously, I had a somewhat hacky system in `Zcu` for dealing with this, but I now have a better understanding of the design incremental compilation is converging on, so can implement a better solution. By finding a few unused bits lying around (...or making them), we can represent a single bit of state indicating whether something's corresponding units have ever been referenced. This is akin to the units being in `Zcu.outdated`, with the key difference being that the compiler will not attempt to analyze units which are in this state. Once they are first referenced or depended on, the flag is set to true and the unit is added to `outdated` so that it can participate in the normal dependency resolution logic.	2026-03-10 10:26:08 +00:00
Matthew Lugg	334189ce6d	compiler: simplify IESes It is always a bug in Sema to check whether an IES is resolved. This is because whether the IES is resolved depends on whether the function which owns it has been analyzed yet, which depends on the order the compiler analyzes declarations in, which it is incorrect to have any dependency on. Instead, we must always either not look at the resolved set, or resolve it first (with `Sema.ensureFuncIesResolved`) and then look at the definitely-resolved concrete error set. Luckily, removing a bunch of the buggy logic which tried to opportunistically use already-resolved inferred error sets actually didn't regress anything! It seems this logic was mostly left over from before Andrew reworked inferred error sets, and had become essentially dead code. This is because inferred error sets are stricter than they used to be, and in particular, we make no attempt to support mutual recursion. I suspect that most of the logic touching IESes can be simplified even further than I have done here without regressing any existing code; my goal in this commit was just to remove any buggy code I could find.	2026-03-10 10:26:08 +00:00
Matthew Lugg	3086c7977b	type resolution progress	2026-03-10 10:26:07 +00:00
Matthew Lugg	510ea6f61f	type resolution progress	2026-03-10 10:26:07 +00:00
Matthew Lugg	18bc7e802f	compiler: replace thread pool with `std.Io` Eliminate the `std.Thread.Pool` used in the compiler for concurrency and asynchrony, in favour of the new `std.Io.async` and `std.Io.concurrent` primitives. This removes the last usage of `std.Thread.Pool` in the Zig repository.	2025-12-22 12:55:16 +00:00
Matthew Lugg	c091e27aac	compiler: spring cleaning I started this diff trying to remove a little dead code from the C backend, but ended up finding a bunch of dead code sprinkled all over the place: * `packed` handling in the C backend which was made dead by `Legalize` * Representation of pointers to runtime-known vector indices * Handling for the `vector_store_elem` AIR instruction (now removed) * Old tuple handling from when they used the InternPool repr of structs * Straightforward unused functions * TODOs in the LLVM backend for features which Zig just does not support	2025-11-12 16:00:15 +00:00
Justus Klausecker	277e4a8337	fix: emit vector instead of scalar u1_zero in shl_with_overflow logic	2025-08-12 16:33:58 +02:00
Justus Klausecker	4ec421372f	add remaining undef value tests ; fix `@truncate` undef retval type	2025-08-12 16:33:58 +02:00
Justus Klausecker	79e5c138c6	replace even more aggregate interns	2025-08-12 16:33:57 +02:00
Justus Klausecker	05762ca02f	address most comments	2025-08-12 16:33:57 +02:00
Justus Klausecker	0ef26d113a	make `>>` a compile error with any undef arg ; add a bunch of test cases	2025-08-12 16:33:57 +02:00
Justus Klausecker	d0586da18e	Sema: Improve comptime arithmetic undef handling This commit expands on the foundations laid by https://github.com/ziglang/zig/pull/23177 and moves even more `Sema`-only functionality from `Value` to `Sema.arith`. Specifically all shift and bitwise operations, `@truncate`, `@bitReverse` and `@byteSwap` have been moved and adapted to the new rules around `undefined`. Especially the comptime shift operations have been basically rewritten, fixing many open issues in the process. New rules applied to operators: * `<<`, `@shlExact`, `@shlWithOverflow`, `>>`, `@shrExact`: compile error if any operand is undef * `<<\|`, `~`, `^`, `@truncate`, `@bitReverse`, `@byteSwap`: return undef if any operand is undef * `&`, `\|`: Return undef if both operands are undef, turn undef into actual `0xAA` bytes otherwise Additionally this commit canonicalizes the representation of aggregates with all-undefined members in the `InternPool` by disallowing them and enforcing the usage of a single typed `undef` value instead. This reduces the amount of edge cases and fixes a bunch of bugs related to partially undefined vecs. List of operations directly affected by this patch: * `<<`, `<<\|`, `@shlExact`, `@shlWithOverflow` * `>>`, `@shrExact` * `&`, `\|`, `~`, `^` and their atomic rmw + reduce pendants * `@truncate`, `@bitReverse`, `@byteSwap`	2025-08-12 16:33:57 +02:00
Andrew Kelley	749f10af49	std.ArrayList: make unmanaged the default	2025-08-11 15:52:49 -07:00
Andrew Kelley	7e2a26c0c4	std.io.Writer.printValue: rework logic Alignment and fill options only apply to numbers. Rework the implementation to mainly branch on the format string rather than the type information. This is more straightforward to maintain and more straightforward for comptime evaluation. Enums support being printed as decimal, hexadecimal, octal, and binary. `formatInteger` is another possible format method that is unconditionally called when the value type is struct and one of the integer-printing format specifiers are used.	2025-07-07 22:43:53 -07:00
Andrew Kelley	30c2921eb8	compiler: update a bunch of format strings	2025-07-07 22:43:52 -07:00
Andrew Kelley	f409457925	compiler: fix a bunch of format strings	2025-07-07 22:43:52 -07:00
Andrew Kelley	0e37ff0d59	std.fmt: breaking API changes added adapter to AnyWriter and GenericWriter to help bridge the gap between old and new API make std.testing.expectFmt work at compile-time std.fmt no longer has a dependency on std.unicode. Formatted printing was never properly unicode-aware. Now it no longer pretends to be. Breakage/deprecations: * std.fs.File.reader -> std.fs.File.deprecatedReader * std.fs.File.writer -> std.fs.File.deprecatedWriter * std.io.GenericReader -> std.io.Reader * std.io.GenericWriter -> std.io.Writer * std.io.AnyReader -> std.io.Reader * std.io.AnyWriter -> std.io.Writer * std.fmt.format -> std.fmt.deprecatedFormat * std.fmt.fmtSliceEscapeLower -> std.ascii.hexEscape * std.fmt.fmtSliceEscapeUpper -> std.ascii.hexEscape * std.fmt.fmtSliceHexLower -> {x} * std.fmt.fmtSliceHexUpper -> {X} * std.fmt.fmtIntSizeDec -> {B} * std.fmt.fmtIntSizeBin -> {Bi} * std.fmt.fmtDuration -> {D} * std.fmt.fmtDurationSigned -> {D} * {} -> {f} when there is a format method * format method signature - anytype -> std.io.Writer - inferred error set -> error{WriteFailed} - options -> (deleted) std.fmt.Formatted - now takes context type explicitly - no fmt string	2025-07-07 22:43:51 -07:00
Jacob Young	6b41beb370	big.int: implement float conversions These conversion routines accept a `round` argument to control how the result is rounded and return whether the result is exact. Most callers wanted this functionality and had hacks around it being missing. Also delete `std.math.big.rational` because it was only being used for float conversion, and using rationals for that is a lot more complex than necessary. It also required an allocator, whereas the new integer routines only need to be passed enough memory to store the result.	2025-06-15 14:15:18 -04:00
mlugg	71baa5e769	compiler: improve progress output Update the estimated total items for the codegen and link progress nodes earlier. Rather than waiting for the main thread to dispatch the tasks, we can add the item to the estimated total as soon as we queue the main task. The only difference is we need to complete it even in error cases.	2025-06-12 17:51:31 +01:00
mlugg	c4ec382fc8	InternPool: store the Nav types are named after When the name strategy is `.parent`, the DWARF info really wants to know what `Nav` we were named after to emit a more optimal hierarchy.	2025-06-12 13:55:41 +01:00
mlugg	424e6ac54b	compiler: minor refactors to ZCU linking * The `codegen_nav`, `codegen_func`, `codegen_type` tasks are renamed to `link_nav`, `link_func`, and `link_type`, to more accurately reflect their purpose of sending data to the linker. Currently, `link_func` remains responsible for codegen; this will change in an upcoming commit. * Don't go on a pointless detour through `PerThread` when linking ZCU functions/`Nav`s; so, the `linkerUpdateNav` etc logic now lives in `link.zig`. Currently, `linkerUpdateFunc` is an exception, because it has broader responsibilities including codegen, but this will be solved in an upcoming commit.	2025-06-12 13:55:39 +01:00
Jacob Young	b483defc5a	Legalize: implement scalarization of binary operations	2025-05-31 18:54:28 -04:00
mlugg	d717c96877	compiler: include inline calls in the reference trace Inline calls which happened in the erroring `AnalUnit` still show as error notes, because they tend to make very important context (e.g. to see how comptime values propagate through them). However, "earlier" inline calls are still useful to see to understand how something is being referenced, so we should include them in the reference trace.	2025-05-16 13:28:15 +01:00
mlugg	f83fe2714b	compiler: fix comptime memory store bugs * When storing a zero-bit type, we should short-circuit almost immediately. Zero-bit stores do not need to do any work. * The bit size computation for arrays is incorrect; the `abiSize` will already be appropriately aligned, but the logic to do so here incorrectly assumes that zero-bit types have an alignment of 0. They don't; their alignment is 1. Resolves: #21202 Resolves: #21508 Resolves: #23307	2025-05-03 20:10:26 +01:00
Mun Maks	4fc783670a	Sema/arith.zig: Fixing more typos from #23177 . This is a complementary PR to #23487 (I had only found one typo before). Now I've looked at the whole `arith.zig` file, trying to find other potential problems. Discussion about these changes: https://github.com/ziglang/zig/pull/23177#discussion_r1997957095	2025-04-09 12:53:11 +01:00
Maksat	4995509028	#23177 , maintainter 'mlugg' wanted to fix that typo, 4 weeks without changes, might be forgotten	2025-04-07 16:50:28 +01:00
Mason Remaley	06ee383da9	compiler: allow `@import` of ZON without a result type In particular, this allows importing `build.zig.zon` at comptime.	2025-04-02 05:53:22 +01:00
mlugg	2a4e06bcb3	Sema: rewrite comptime arithmetic This commit reworks how Sema handles arithmetic on comptime-known values, fixing many bugs in the process. The general pattern is that arithmetic on comptime-known values is now handled by the new namespace `Sema.arith`. Functions handling comptime arithmetic no longer live on `Value`; this is because some of them can emit compile errors, so some can't go on `Value`. Only semantic analysis should really be doing arithmetic on `Value`s anyway, so it makes sense for it to integrate more tightly with `Sema`. This commit also implements more coherent rules surrounding how `undefined` interacts with comptime and mixed-comptime-runtime arithmetic. The rules are as follows. * If an operation cannot trigger Illegal Behavior, and any operand is `undefined`, the result is `undefined`. This includes operations like `0 \| undef`, where the LHS logically could* be used to determine a defined result. This is partly to simplify the language, but mostly to permit codegen backends to represent `undefined` values as completely invalid states. * If an operation can trigger Illegal Behvaior, and any operand is `undefined`, then Illegal Behavior results. This occurs even if the operand in question isn't the one that "decides" illegal behavior; for instance, `undef / 1` is undefined. This is for the same reasons as described above. * An operation which would trigger Illegal Behavior, when evaluated at comptime, instead triggers a compile error. Additionally, if one operand is comptime-known undef, such that the other (runtime-known) operand isn't needed to determine that Illegal Behavior would occur, the compile error is triggered. * The only situation in which an operation with one comptime-known operand has a comptime-known result is if that operand is undefined, in which case the result is either undefined or a compile error per the above rules. This could potentially be loosened in future (for instance, `0 * rt` could be comptime-known 0 with a runtime assertion that `rt` is not undefined), but at least for now, defining it more conservatively simplifies the language and allows us to easily change this in future if desired. This commit fixes many bugs regarding the handling of `undefined`, particularly in vectors. Along with a collection of smaller tests, two very large test cases are added to check arithmetic on `undefined`. The operations which have been rewritten in this PR are: * `+`, `+%`, `+\|`, `@addWithOverflow` * `-`, `-%`, `-\|`, `@subWithOverflow` * ``, `%`, `\|`, `@mulWithOverflow` `/`, `@divFloor`, `@divTrunc`, `@divExact` * `%`, `@rem`, `@mod` Other arithmetic operations are currently unchanged. Resolves: #22743 Resolves: #22745 Resolves: #22748 Resolves: #22749 Resolves: #22914	2025-03-16 08:17:50 +00:00

1 2

68 Commits