SpacetimeDB

mirror of https://github.com/clockworklabs/SpacetimeDB.git synced 2026-05-12 18:57:51 -04:00

Author	SHA1	Message	Date
Zeke Foppa	4ee0e9ce09	[tyler/translate-smoketests]: debug changes	2026-01-29 10:21:51 -08:00
Tyler Cloutier	45498b6f1a	Merge branch 'master' into tyler/translate-smoketests Signed-off-by: Tyler Cloutier <cloutiertyler@users.noreply.github.com>	2026-01-27 17:12:28 -05:00
John Detter	c3848d53db	CI: Fix of C# tests failures (#4121 ) # Description of Changes (jdetter): Github marked Ryan's PR as merged because I rebased it without updating the base branch first - my bad. Original PR: https://github.com/clockworklabs/SpacetimeDB/pull/4120 This PR fixes C#-related CI failures when testing against `v1.12.0` before the corresponding NuGet packages are available on nuget.org. * CI: Ensure local C# NuGet packages are used consistently * Pack `crates/bindings-csharp/{BSATN.Runtime,Runtime}` with `-c Release` so the configured local package sources (`bin/Release`) actually contain the `.nupkg`s. * Run `./sdks/csharp/tools~/write-nuget-config.sh` in CI to generate a `NuGet.Config` that maps `SpacetimeDB.BSATN.Runtime` / `SpacetimeDB.Runtime` to those local sources (with `nuget.org` as fallback for non-SpacetimeDB dependencies). * Add an explicit `dotnet restore --configfile NuGet.Config` step before tests, and `run dotnet test --no-restore`, so restore always uses the intended config and does not “accidentally” restore from `nuget.org`. * `write-nuget-config.sh`: Make NuGet config discovery + mapping deterministic * Write `NuGet.Config` (capitalized) and include `<clear />` + an explicit `nuget.org` source to avoid inherited sources and to keep `PackageSourceMapping` functional. * Also write a repo-root `NuGet.Config` so `dotnet publish/pack` invoked from templates (outside `sdks/csharp/`) still discovers the same override behavior. * Smoketests quickstart: avoid `PackageSourceMapping` restore failures * Ensure smoketest helper packing uses `Release`, and when repo-root `NuGet.Config` exists, perform `dotnet restore --configfile ...` + `dotnet pack --no-restore` so pack/restore are evaluated with the same sources/mapping. * When editing configs, prefer `NuGet.Config` casing for Linux discovery and ensure `nuget.org` exists as a source for the fallback mapping. # API and ABI breaking changes No changes # Expected complexity level and risk 1 # Testing - [X] Locally tested concept after simulating local repro. - [x] Confirmed CI passes in this branch --------- Co-authored-by: rekhoff <r.ekhoff@clockworklabs.io> Co-authored-by: Zeke Foppa <bfops@users.noreply.github.com>	2026-01-27 05:22:47 +00:00
Tyler Cloutier	7414be7b71	Add permissions block to warn-python-smoketests job	2026-01-26 22:40:27 -05:00
Tyler Cloutier	ed8660067e	Apply suggestions from code review Co-authored-by: Zeke Foppa <196249+bfops@users.noreply.github.com> Signed-off-by: Tyler Cloutier <cloutiertyler@users.noreply.github.com>	2026-01-26 22:39:11 -05:00
Tyler Cloutier	b5d01f3de9	Match Python smoketests job to master's docker_smoketests configuration - Add smoketest_args matrix (--docker for Linux, --no-build-cli for Windows) - Add excluded tests: -x clear_database replication teams - Add Build crates step - Add Docker daemon startup and database setup for Linux - Add database startup for Windows - Add container cleanup step - Match Python version to 3.12 on Windows like master	2026-01-26 21:39:08 -05:00
Tyler Cloutier	fa54086f7d	Uncomment CARGO_TARGET_DIR for smoketest jobs	2026-01-26 21:37:52 -05:00
Tyler Cloutier	0f67c824e8	Fix Python smoketests to use python3/python like master's cargo ci smoketests	2026-01-26 15:59:46 -05:00
Tyler Cloutier	5b82570fdc	Make Python smoketests job match original smoketests job setup	2026-01-26 15:56:37 -05:00
Tyler Cloutier	236b3d787a	Simplify Python smoketest check to fail instead of commenting	2026-01-26 12:51:10 -05:00
Tyler Cloutier	363f9212aa	Add Rust smoketests to CI with nextest, keep Python smoketests running - Rename existing smoketests job to use cargo smoketest (xtask) which handles pre-building binaries, using nextest, and optimal parallelism - Add cargo-nextest installation step for better parallel execution - Add new "Smoketests (Python Legacy)" job that runs the old Python smoketests alongside the Rust ones for comparison during transition - Add warn-python-smoketests job that posts a PR comment when Python smoketests are modified, encouraging use of Rust smoketests instead - Update smoketests/README.md to note the transition to Rust smoketests	2026-01-26 12:46:41 -05:00
Zeke Foppa	27be9ed84e	Merge branch 'master' into tyler/translate-smoketests	2026-01-23 15:29:03 -08:00
Zeke Foppa	0f343aaacc	fix	2026-01-23 15:22:45 -08:00
Zeke Foppa	3c69f75fa7	re-enable lints	2026-01-23 15:14:05 -08:00
Zeke Foppa	ba1f9923e9	temp fix	2026-01-23 15:13:49 -08:00
Ryan	0f0a867d90	Removing1.5.0 DLLs (#4100 ) # Description of Changes Removes 1.5.0 DLLs from repo # API and ABI breaking changes Unity will complain locally until you regenerate the DLLs # Expected complexity level and risk 1 - Trivial # Testing - [x] Confirm CI pass - [X] Local tests are passing --------- Signed-off-by: Ryan <r.ekhoff@clockworklabs.io> Co-authored-by: Zeke Foppa <196249+bfops@users.noreply.github.com> Co-authored-by: John Detter <4099508+jdetter@users.noreply.github.com> Co-authored-by: clockwork-labs-bot <clockwork-labs-bot@users.noreply.github.com>	2026-01-23 21:28:24 +00:00
Zeke Foppa	03bbd19a99	[tyler/translate-smoketests]: actually fix windows ci	2026-01-23 11:08:23 -08:00
Zeke Foppa	8ebf8e6315	[tyler/translate-smoketests]: fix windows ci	2026-01-23 11:01:01 -08:00
Zeke Foppa	017a744052	[tyler/translate-smoketests]: windows CI	2026-01-23 10:56:52 -08:00
Zeke Foppa	09b53ded90	[tyler/translate-smoketests]: ci	2026-01-23 10:47:11 -08:00
Zeke Foppa	fdba9e51a7	[tyler/translate-smoketests]: CI fixes?	2026-01-23 10:47:00 -08:00
Zeke Foppa	95308f2797	[tyler/translate-smoketests]: update ci stuff	2026-01-23 10:39:52 -08:00
John Detter	afb0a71502	cargo fmt + don't block on lints for now	2026-01-23 02:50:04 -06:00
John Detter	2a03818d1b	Do not block CI on llm benchmarks (#4095 ) # Description of Changes <!-- Please describe your change, mention any related tickets, and so on here. --> @cloutiertyler when you fix the llm benchmarks feel free to revert this PR. # API and ABI breaking changes <!-- If this is an API or ABI breaking change, please apply the corresponding GitHub label. --> None # Expected complexity level and risk 0 <!-- How complicated do you think these changes are? Grade on a scale from 1 to 5, where 1 is a trivial change, and 5 is a deep-reaching and complex change. This complexity rating applies not only to the complexity apparent in the diff, but also to its interactions with existing and future code. If you answered more than a 2, explain what is complex about the PR, and what other components it interacts with in potentially concerning ways. --> # Testing <!-- Describe any testing you've done, and any testing you'd like your reviewers to do, so that you're confident that all the changes work as expected! --> - [x] CI runs even if llm benchmark fails	2026-01-22 10:09:28 +00:00
Tyler Cloutier	b5a7b37660	Fixes C# benchmark test failures caused by table naming convention mismatches (#4059 ) # Description of Changes Adds TypeScript as a third language for LLM benchmark tests alongside Rust and C#, and fixes table naming convention mismatches. TypeScript Support: - Added `Lang::TypeScript` variant with camelCase naming conventions - Created TypeScript project template (`templates/typescript/server/`) with package.json, tsconfig.json, and index.ts - Added TypeScript publisher that uses `spacetime build` and `spacetime publish` - Created 22 TypeScript task prompts and golden answer files for all benchmark tests - Updated prompt discovery to find `tasks/typescript.txt` files Table Naming Fix: - Standardized on singular table names across all languages: - Rust: `user` (snake_case singular) - C#: `User` (PascalCase singular) - TypeScript: `user` (camelCase singular) - Updated `table_name()` helper to convert singular names to appropriate case per language - Updated all spec.rs files to use `table_name("user", lang)` instead of hardcoded `"users"` CI/Hashing Improvements: - Added `compute_processed_context_hash()` for language-specific hash computation after tab filtering - Updated CI check to verify both `rustdoc_json` and `docs` modes for Rust - Fixed `--hash-only` mode to skip golden builds # API and ABI breaking changes None - these are internal benchmark tooling changes only. # Expected complexity level and risk Complexity: 2 The changes add a new language following existing patterns for Rust and C#. The table naming fixes are straightforward find-and-replace style updates. Low risk since this only affects the benchmark tooling, not the core SpacetimeDB codebase. # Testing - [x] `cargo build -p xtask-llm-benchmark` compiles successfully - [x] All 22 TypeScript golden modules build and publish successfully - [x] Rust and C# benchmarks unaffected by changes --------- Signed-off-by: Tyler Cloutier <cloutiertyler@users.noreply.github.com> Co-authored-by: clockwork-labs-bot <clockwork-labs-bot@users.noreply.github.com> Co-authored-by: John Detter <4099508+jdetter@users.noreply.github.com>	2026-01-21 03:51:33 +00:00
John Detter	b68921c061	Move unity testsuite onto unity runner (#4061 ) # Description of Changes <!-- Please describe your change, mention any related tickets, and so on here. --> - This will eliminate a lot of the test flakes with the unity testsuite. A huge amount of the test flakes here were that we were trying to acquire more than 2 unity license seats at a time which is not allowed. This will have the largest impact when the runners are busy. The downside here: This will cause jobs to wait which is what we want right now. Jobs may wait to queue for several minutes while other jobs finish but I don't expect this to have a big impact on the total time a PR will spend in the merge queue. # API and ABI breaking changes <!-- If this is an API or ABI breaking change, please apply the corresponding GitHub label. --> None # Expected complexity level and risk 1 <!-- How complicated do you think these changes are? Grade on a scale from 1 to 5, where 1 is a trivial change, and 5 is a deep-reaching and complex change. This complexity rating applies not only to the complexity apparent in the diff, but also to its interactions with existing and future code. If you answered more than a 2, explain what is complex about the PR, and what other components it interacts with in potentially concerning ways. --> # Testing <!-- Describe any testing you've done, and any testing you'd like your reviewers to do, so that you're confident that all the changes work as expected! --> - [x] Unity CI passes even if multiple jobs are running at the same time	2026-01-19 22:00:49 +00:00
bradleyshep	b75bf6decf	LLM Benchmarking (#3486 ) # Description of Changes Introduce a new LLM benchmarking app and supporting code. * CLI: `llm` with subcommands `run`, `routes list`, `diff`, `ci-check`. * Runner: executes globally numbered tasks; filters by `--lang`, `--categories`, `--tasks`, `--providers`, `--models`. * Providers/clients: route layer (`provider:model`) with HTTP LLM Vendor clients; env-driven keys/base URLs. * Evaluation: deterministic scorers (hash/equality, JSON shape/count, light schema/reducer parity) with clear failure messages. * Results: stable JSON schema; single-file HTML viewer to inspect/filter/export CSV. * Build & guards: build script for compile-time setup; * Docs: `DEVELOP.md` includes `cargo llm …` usage. This PR is the initial addition of the app and its modules (runner, config, routes, prompt/segmentation, scorers, schema/types, defaults/constants/paths/hashing/combine, publishers, spacetime guard, HTML stats viewer). ### How it works 1. Pick what to run * Choose tasks (`--tasks 0,7,12`), or a language (`--lang rust\|csharp`), or categories (`--categories basics,schema`). * Optionally limit vendors/models (`--providers …`, `--models …`). 2. Resolve routes * Read env (API keys + base URLs) and build the active set (e.g., `openai:gpt-5`). 3. Build context * Start Spacetime * Publish golden answer modules * Prepare prompts and send to LLM model * Attempt to publish LLM module 4. Execute calls * Run the selected tasks within each test against selected models and languages. 5. Score outputs * Apply deterministic scorers (hash/equality, JSON shape/count, simple schema/reducer checks). * Record the score and any short failure reason. 6. Update results file * Write/update the single results JSON with task/route outcomes, timings, and summaries. # API and ABI breaking changes None. New application and modules; no existing public APIs/ABIs altered. # Expected complexity level and risk 4/5. New CLI, routing, evaluation, and artifact format. * External model APIs may rate-limit/timeout; concurrency tunable via `LLM_BENCH_CONCURRENCY` / `LLM_BENCH_ROUTE_CONCURRENCY`. # Testing I ran the full test matrix and generated results for every task against every vendor, model, and language (rust + C#). I also tested the CI check locally using [act](https://github.com/nektos/act). Please verify * [ ] `llm run --tasks 0,1,2` (explicit `run`) * [ ] `llm run --lang rust --categories basics` (filters) * [ ] `llm run --categories basics,schema` (multiple categories) * [ ] `llm run --lang csharp` (language switch) * [ ] `llm run --providers openai,anthropic --models "openai:gpt-5 anthropic:claude-sonnet-4-5"` (provider/model limits) * [ ] `llm run --hash-only` (dry integrity) * [ ] `llm run --goldens-only` (test goldens only) * [ ] `llm run --force` (skip hash check) * [ ] `llm ci-check` * [ ] Stats viewer loads the JSON; filtering and CSV export work * [ ] CI works as intended --------- Signed-off-by: bradleyshep <148254416+bradleyshep@users.noreply.github.com> Signed-off-by: Tyler Cloutier <cloutiertyler@users.noreply.github.com> Co-authored-by: Copilot Autofix powered by AI <62310815+github-advanced-security[bot]@users.noreply.github.com> Co-authored-by: Tyler Cloutier <cloutiertyler@aol.com> Co-authored-by: Tyler Cloutier <cloutiertyler@users.noreply.github.com> Co-authored-by: spacetimedb-bot <spacetimedb-bot@users.noreply.github.com> Co-authored-by: John Detter <4099508+jdetter@users.noreply.github.com>	2026-01-06 22:22:57 +00:00
Zeke Foppa	264e45eafc	CI - Hackily fix V8 linker errors (#3921 ) # Description of Changes Introduce a hacky workaround to our `csharp-testsuite` to address missing `librusty_v8.a`: manually check for that file and manually build the package if it's missing. # API and ABI breaking changes CI-only change # Expected complexity level and risk 1 # Testing - [x] Locally tested removing the `librusty_v8.a` and then running `cargo clean -p v8 && cargo build -p v8`, and this does seem to repair it. - [x] The CI has run with a cache that is "broken", but successfully passes `csharp-testsuite` --------- Co-authored-by: Zeke Foppa <bfops@users.noreply.github.com>	2025-12-19 21:22:18 +00:00
Zeke Foppa	c38b135038	CI - No cache-on-failure (#3903 ) # Description of Changes Disable `cache-on-failure` as we think that it's contributing to mysterious `rusty_v8` linker issues. I bumped the prefix key so that PRs with this change don't share caches with PRs missing this change. # API and ABI breaking changes CI only. # Expected complexity level and risk 1 # Testing None. We'll have to see if we stop having issues once this is merged. --------- Co-authored-by: Zeke Foppa <bfops@users.noreply.github.com>	2025-12-18 18:20:02 +00:00
Zeke Foppa	70628fb51e	`cargo ci` on windows (#3859 ) # Description of Changes Make `cargo ci` work properly on Windows, in preparation for https://github.com/clockworklabs/SpacetimeDB/pull/3702. # API and ABI breaking changes No. CI-only. # Expected complexity level and risk 2. Not trivial, but not complicated. # Testing - [x] CI output seems to be genuinely running the tests, and it's passing on Windows - [x] Make a change to `crates/bindings-csharp` and see that `cargo ci test` fails - [x] I can manually run a minimal `cargo ci smoketests` invocation on a Windows machine --------- Signed-off-by: Zeke Foppa <196249+bfops@users.noreply.github.com> Co-authored-by: Kasama <robertoaall@gmail.com> Co-authored-by: Zeke Foppa <bfops@users.noreply.github.com>	2025-12-17 17:34:49 +00:00
Zeke Foppa	8967ea7b59	CI - Use rust-toolchain-file everywhere (#3872 ) # Description of Changes We were using `rust-toolchain` in some places, and `rust-toolchain-file` in others. I think Rust released a new version, which made the `rust-toolchain` parts break with: ``` info: downloading component 'rustfmt' info: removing previous version of component 'rust-src' info: rolling back changes error: could not rename component file from '/root/.rustup/toolchains/stable-x86_64-unknown-linux-gnu/lib/rustlib/src/rust/library/test/src/term/terminfo/searcher' to '/root/.rustup/tmp/a4eo07uz83vsyfhk_dir/bk': Invalid cross-device link (os error 18) Error: Process completed with exit code 1. ``` (Separately, this breakage is confusing.. we'll probably run into this again when we roll forward our rust version..) # API and ABI breaking changes None. CI-only change. # Expected complexity level and risk 1 # Testing - [x] CI passes - [x] There are no more instances of `rust-toolchain`: ``` $ grep -rIP 'rust-toolchain(?!-file)' .github/workflows .github/CODEOWNERS:/rust-toolchain.toml @cloutiertyler ``` (on `master`, this finds the instances we changed) --------- Co-authored-by: Zeke Foppa <bfops@users.noreply.github.com>	2025-12-11 18:51:48 +00:00
Roberto Pommella Alegro	5ac65739e5	add initial cargo ci (#3409 ) # Description of Changes This changes the ci runs to execute `cargo ci` instead of running commands directly from the github workflow. The goal here is to unify the commands under `cargo ci` so that it's easier and more intuitive to run locally # API and ABI breaking changes There are no API/ABI changes. <!-- If this is an API or ABI breaking change, please apply the corresponding GitHub label. --> # Expected complexity level and risk Complexity: 1 It is not a complex change as it is mostly localized to the ci runs and is easily reversible if something goes wrong. The biggest risk here is to have future CI runs break, which can be remediated by reverting these changes. <!-- How complicated do you think these changes are? Grade on a scale from 1 to 5, where 1 is a trivial change, and 5 is a deep-reaching and complex change. This complexity rating applies not only to the complexity apparent in the diff, but also to its interactions with existing and future code. If you answered more than a 2, explain what is complex about the PR, and what other components it interacts with in potentially concerning ways. --> # Testing <!-- Describe any testing you've done, and any testing you'd like your reviewers to do, so that you're confident that all the changes work as expected! --> - [x] run `cargo ci` and its subcommands locally - [x] run the github workflow against this branch to check if the CI jobs are working properly. --------- Signed-off-by: Zeke Foppa <196249+bfops@users.noreply.github.com> Signed-off-by: Roberto Pommella Alegro <robertoaall@gmail.com> Co-authored-by: Zeke Foppa <bfops@users.noreply.github.com> Co-authored-by: Zeke Foppa <196249+bfops@users.noreply.github.com>	2025-12-10 19:18:43 +00:00
Zeke Foppa	64aa808957	CI - Skip the Unity testsuite on external PRs (#3805 ) # Description of Changes Bubbling up an `if` since we split out the unity testsuite into its own job. The GitHub docs say (https://docs.github.com/en/pull-requests/collaborating-with-pull-requests/collaborating-on-repositories-with-code-quality-features/about-status-checks): > A job that is skipped will report its status as "Success". It will not prevent a pull request from merging, even if it is a required check. # API and ABI breaking changes None. CI-only change. # Expected complexity level and risk 1 # Testing Unsure how to test this honestly 🤷 Co-authored-by: Zeke Foppa <bfops@users.noreply.github.com>	2025-12-02 21:09:52 +00:00
John Detter	1d3421745c	Split Unity and C# tests into separate jobs (#3779 ) # Description of Changes <!-- Please describe your change, mention any related tickets, and so on here. --> This has 2 benefits: 1. If the Unity test fails because of a license issue then we don't have to re-run the C# tests again as part of this flow. Re-running the Unity tests will be much faster if that's the only thing the job is doing. 2. These tests will run faster because they will now run in parallel as separate CI jobs. # API and ABI breaking changes <!-- If this is an API or ABI breaking change, please apply the corresponding GitHub label. --> None # Expected complexity level and risk 1 <!-- How complicated do you think these changes are? Grade on a scale from 1 to 5, where 1 is a trivial change, and 5 is a deep-reaching and complex change. This complexity rating applies not only to the complexity apparent in the diff, but also to its interactions with existing and future code. If you answered more than a 2, explain what is complex about the PR, and what other components it interacts with in potentially concerning ways. --> # Testing <!-- Describe any testing you've done, and any testing you'd like your reviewers to do, so that you're confident that all the changes work as expected! --> - [x] Both tests pass --------- Signed-off-by: John Detter <4099508+jdetter@users.noreply.github.com> Co-authored-by: Copilot Autofix powered by AI <62310815+github-advanced-security[bot]@users.noreply.github.com> Co-authored-by: Zeke Foppa <196249+bfops@users.noreply.github.com> Co-authored-by: Zeke Foppa <bfops@users.noreply.github.com>	2025-11-26 21:33:35 +00:00
John Detter	9c6e5a5cbe	Set timeout for smoketests (#3772 ) # Description of Changes <!-- Please describe your change, mention any related tickets, and so on here. --> Sets a 2 hour timeout for the smoketests. If the smoketests are taking more than 2 hours they likely will never finish and we should just time them out. # API and ABI breaking changes None <!-- If this is an API or ABI breaking change, please apply the corresponding GitHub label. --> # Expected complexity level and risk 1 <!-- How complicated do you think these changes are? Grade on a scale from 1 to 5, where 1 is a trivial change, and 5 is a deep-reaching and complex change. This complexity rating applies not only to the complexity apparent in the diff, but also to its interactions with existing and future code. If you answered more than a 2, explain what is complex about the PR, and what other components it interacts with in potentially concerning ways. --> # Testing <!-- Describe any testing you've done, and any testing you'd like your reviewers to do, so that you're confident that all the changes work as expected! --> - [x] https://github.com/clockworklabs/SpacetimeDB/actions/runs/19697510980/job/56425630374?pr=3772	2025-11-26 19:08:49 +00:00
Zeke Foppa	462cac9af7	CI - Migrate off of spacetimedb-runner (#3763 ) # Description of Changes Move anything running on `spacetimedb-runner` to `spacetimedb-new-runner` # API and ABI breaking changes None. # Expected complexity level and risk 1 # Testing - [x] CI passes --------- Co-authored-by: Zeke Foppa <bfops@users.noreply.github.com>	2025-11-25 20:16:23 +00:00
Zeke Foppa	f071c213d3	CI - CLI doc check shouldn't fail on `Cargo.lock` diff (#3739 ) # Description of Changes Stop the CLI doc check from failing on `Cargo.lock` diffs, which it was doing before. # API and ABI breaking changes CI only. # Expected complexity level and risk 1 # Testing - [x] CI still fails elsewhere if `Cargo.lock` needs updating --------- Co-authored-by: Zeke Foppa <bfops@users.noreply.github.com>	2025-11-24 20:11:37 +00:00
Zeke Foppa	c829c9d73d	Disable Unreal tests (#3711 ) # Description of Changes These tests have been flaky, so we're disabling them until they're working more consistently. # API and ABI breaking changes None # Expected complexity level and risk 1 # Testing - [x] Tests are indeed skipped? 🤷 Co-authored-by: Zeke Foppa <bfops@users.noreply.github.com>	2025-11-20 19:25:45 +00:00
Zeke Foppa	ca2fd8b709	Add `smoketests/requirements.txt` (#3701 ) # Description of Changes We have added one too many python dependencies. I'm putting them in a shared file, which will also ease installation for devs running them locally. # API and ABI breaking changes None. CI only. # Expected complexity level and risk 1 # Testing Existing CI still passes --------- Co-authored-by: Zeke Foppa <bfops@users.noreply.github.com>	2025-11-19 22:35:44 +00:00
Zeke Foppa	3395b1e905	CI - Cache more of our build outputs (#3521 ) # Description of Changes Changes with cache hit: CI / Smoketests (spacetimedb-runner): 17m -> 10m CI / Smoketests (windows-latest): 39m -> 31m CI / Lints: 3m30s -> 2m unity-testsuite: 23m -> 19m This included creating a stripped-down `Dockerfile` that only runs SpacetimeDB (as opposed to e.g. building tracy, flamegraphs, etc.). That shaves off about 4-5m on the linux smoketests job. # API and ABI breaking changes No. CI only. # Expected complexity level and risk 2 # Testing Ran existing CI and noted the change in times in the description above --------- Signed-off-by: Zeke Foppa <196249+bfops@users.noreply.github.com> Co-authored-by: Zeke Foppa <bfops@users.noreply.github.com> Co-authored-by: John Detter <4099508+jdetter@users.noreply.github.com>	2025-11-19 20:18:26 +00:00
John Detter	05ecc80326	Use new runner for testsuite (#3656 ) # Description of Changes <!-- Please describe your change, mention any related tickets, and so on here. --> Speed improvements: \|CI Job\|Before\|After\|Change\| \|---\|---\|---\|---\| \|CI/Unreal Engine\|29m\|19m\|34% faster\| \|CI/Testsuite\|21m\|11m\|47% faster\| \|C#/Unity\|20m\|16m\|20% faster\| \|CI/Build and test wasm bindings\|10m\|3m\|70% faster\| \|CI/Smoketests (linux)\|17m\|12m\|29% faster\| CI/Unreal Engine - switch to serial test execution CI/Testsuite - partially switch to serial test execution I also haven't seen an unexpected `Operation cancelled`, so I think that problem might be behind us. # API and ABI breaking changes <!-- If this is an API or ABI breaking change, please apply the corresponding GitHub label. --> No user facing changes # Expected complexity level and risk <!-- How complicated do you think these changes are? Grade on a scale from 1 to 5, where 1 is a trivial change, and 5 is a deep-reaching and complex change. This complexity rating applies not only to the complexity apparent in the diff, but also to its interactions with existing and future code. If you answered more than a 2, explain what is complex about the PR, and what other components it interacts with in potentially concerning ways. --> 2 - this touches a lot of our CI. It has been solid so far but if we start getting random CI failures this could be the cause. # Testing <!-- Describe any testing you've done, and any testing you'd like your reviewers to do, so that you're confident that all the changes work as expected! --> - [x] All CI passes (ok as of this update the internal tests are failing but that isn't related to this PR 😅 ) # Future Work There are 2 remaining tests that could be optimized by moving them to faster runners: 1. (~45m) The windows smoketests (difficult because it requires Windows). @bfops has brought up the idea of only running this test on PRs that are in the merge queue and I think that would be the best 80/20 here. There really is not a great reason to be running this on every commit anyway. 2. (36m) The internal tests (easier because it's on Linux) I would also consider moving from instantiating container images to instantiating VM snapshots. This would allow us a lot more flexibility and we wouldn't have to have as much container config in our workflow files. Based on guides I've seen online this is also somewhat common. Also managing docker within docker has been kind of a nightmare so it would be easier to run containers inside of VMs to remove a layer of containerization there. ## Testsuite inconsistencies We have at least 2 different jobs that are inconsistent/flaky: the "testsuite" and the "unreal engine tests". I'd like @JasonAtClockwork to make the unreal engine tests run sequentially so that we have less random failures. I know this will increase the testing time but due to this PR we should have extra headroom there. I've also pinged Joshua about one of the tests that I've seen sometimes fails randomly. This is either an issue in the test or an issue in SpacetimeDB itself, I'm not sure yet. These inconsistencies are present both in our custom runner and in the Github runners, so merging this PR will not make this situation any worse than it already is.	2025-11-17 20:24:09 +00:00
Zeke Foppa	8787fbea00	CI - Fix "Check that packages are publishable" check (#3660 ) # Description of Changes Our `Check that packages are publishable` CI check was silently failing due to a subshell error not getting propagated upward. This PR both fixes that error-swallowing behavior, and fixes the errors that were present. # API and ABI breaking changes No. CI only. # Expected complexity level and risk 1 # Testing - [x] Check fails if the script to find crates fails (https://github.com/clockworklabs/SpacetimeDB/actions/runs/19349995837/job/55359458732?pr=3660) - [x] Check is now passing --------- Co-authored-by: Zeke Foppa <bfops@users.noreply.github.com>	2025-11-14 01:14:56 +00:00
Kim Altintop	a36f7091d5	[teams 3/5] API authorization, CLI, smoketests (#3523 ) This adds authorization to the relevant API endpoints, updates the CLI commands and adds smoketests for the teams feature. Note: Authorizing SQL (incl. subscriptions) is a bit more involved, and submitted as a separate PR in the series. Depends-on: https://github.com/clockworklabs/SpacetimeDB/pull/3519	2025-11-11 14:10:58 +00:00
Zeke Foppa	6bf3efc006	CI - Fix format strings (#3627 ) # Description of Changes I allowed chatgpt to mislead me about how to do these format strings. Apparently this is the wrong syntax. I've now verified the correct syntax: https://docs.github.com/en/actions/reference/workflows-and-actions/expressions#format # API and ABI breaking changes CI only. # Expected complexity level and risk 1 # Testing I honestly don't know how to check what concurrency group something is running in.. Co-authored-by: Zeke Foppa <bfops@users.noreply.github.com>	2025-11-11 00:12:53 +00:00
Kim Altintop	cfd0d4b712	commitlog,durability: Support preallocation of disk space (#3437 ) When a new commitlog segment is created, allocate disk space for it up to the maximum segment size. Also do this when resuming writes to an existing segment, such that segments created without preallocation will allocate as well when the database is opened. Preallocation is gated behind the feature "fallocate", because it is not always desirable to preallocate, e.g. for local `standalone` users. The feature can only be enabled on Linux targets, because allocation is done using the Linux-specific `fallocate(2)` system call. Unlike `ftruncate(2)` or the portable `posix_fallocate(3)`, `fallocate(2)` supports allocating disk space without zeroing. This is currently required, because the commitlog format does not handle padding bytes. If not enough space can be allocated, the commitlog refuses writes. For commitlogs that were created without preallocation, this means that the commitlog cannot even be opened in this situation. The local durability impl will crash if it detects that the commitlog is unable to allocate enough space. This means that a database will eventually crash and be unable to start in an out-of-space situation. Allocated space is not included in the reported size of the commitlog. Instead, allocated blocks are reported separately. # Expected complexity level and risk 3 - Disk size monitoring may need to be adjusted. # Testing - [x] Adds a test that demonstrates the crash behavior of [`spacetimedb_durability::Local`] when there is insufficient space. The test performs I/O against a loop device. - [x] Modified the `repo::Memory` impl so that it can run out of space. No test currently utilizes this, but existing tests assuming infinite space still pass.	2025-11-10 16:55:55 +00:00
Zeke Foppa	7c4c3ddeea	CI - Fix the merge queue (#3571 ) # Description of Changes The merge queue was (partly) getting borked because we were putting all non-PR CI events into the same concurrency group, which meant they all non-PR CI jobs would run sequentially instead of running in parallel. This sometimes caused _painfully_ long delays in the merge queue. This was due to my misunderstanding in https://github.com/clockworklabs/SpacetimeDB/pull/3501#discussion_r2466570395, where I didn't realize that `cancel-in-progress: false` would cause everything to queue up. Now, for non-PR events, we append the commit SHA to the concurrency group. For merge queue events, this should be the SHA of the ephemeral merge commit that GH creates, so it will never conflict. For push events or manual workflow dispatch events, the SHA should be a sane way to recognize/cancel redundant events. # API and ABI breaking changes None. CI-only change. # Expected complexity level and risk 1 # Testing - [x] PR CI passes on this PR - [x] PR CI is still canceled on this PR if a new commit is pushed Unfortunately it's hard to test the behavior for non-PR events without merging and seeing if it works. --------- Co-authored-by: Zeke Foppa <bfops@users.noreply.github.com>	2025-11-05 20:25:00 +00:00
Noa	3cc59de66c	Add typescript quickstart smoketest (#3463 ) # Description of Changes Based on #3461, which fixes a bug encountered in the quickstart. # API and ABI breaking changes <!-- If this is an API or ABI breaking change, please apply the corresponding GitHub label. --> n/a # Expected complexity level and risk <!-- How complicated do you think these changes are? Grade on a scale from 1 to 5, where 1 is a trivial change, and 5 is a deep-reaching and complex change. This complexity rating applies not only to the complexity apparent in the diff, but also to its interactions with existing and future code. If you answered more than a 2, explain what is complex about the PR, and what other components it interacts with in potentially concerning ways. --> 1 # Testing <!-- Describe any testing you've done, and any testing you'd like your reviewers to do, so that you're confident that all the changes work as expected! --> - [x] Yes - [ ] <!-- maybe a test you want a reviewer to do, so they can check it off when they're satisfied. -->	2025-10-31 16:30:11 +00:00
Zeke Foppa	2516357c8d	CI - Skip Internal Tests and Unreal Tests on external PRs (#3522 ) # Description of Changes These tests fail on external PRs, but not for any real reasons - just because GH secrets are missing. "Skipped" is more informative than "failed". # API and ABI breaking changes None. # Expected complexity level and risk 1 # Testing None, but I just copied the logic from the unity testsuite. --------- Co-authored-by: Zeke Foppa <bfops@users.noreply.github.com>	2025-10-28 19:28:12 +00:00
Zeke Foppa	9ad5e7038a	CI - Cancel runs on new pushes (#3501 ) # Description of Changes Add `cancel-in-progress` to our GitHub workflows. # API and ABI breaking changes None # Expected complexity level and risk 1 # Testing - [x] Pushing new commits to this PR causes cancels of previous CI runs --------- Co-authored-by: Zeke Foppa <bfops@users.noreply.github.com>	2025-10-27 19:15:21 +00:00
Zeke Foppa	f726b02c74	CI - C# quickstart test properly uses nuget (#3466 ) # Description of Changes The quickstart smoketest was not correctly fudging our NuGet dependencies to use the local versions of the packages, so it was pulling them from NuGet. This ended up causing issues when we tried to modify local packages and then use them in a way that affected the quickstart (e.g. https://github.com/clockworklabs/SpacetimeDB/pull/3386). We had a few issues: we weren't ensuring that the local packages were built, we weren't using the right directory for those packages, and we weren't adding a "package source mapping" that forced those packages to be used from the local directory. # API and ABI breaking changes None. CI-only. # Expected complexity level and risk 2 # Testing - [x] CI passes when https://github.com/clockworklabs/SpacetimeDB/pull/3386 is merged together with this PR (it wasn't before). --------- Co-authored-by: Zeke Foppa <bfops@users.noreply.github.com>	2025-10-23 19:50:18 +00:00

1 2 3

118 Commits