ci: Improve autoevals CI and fix failing Python CI #195
Merged
Abhijeet Prasad (AbhiPrasad) merged 14 commits on Jun 8, 2026
Update the JS CI matrix to include Node 24 and simplify dependency caching by using setup-node's built-in pnpm cache. Also refresh checkout and setup-node action pins.
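As a rough sketch of the change described above, a JS job along these lines would test on Node 24 and let setup-node handle the pnpm cache. The workflow and job names, the Node versions other than 24, the pnpm version, the `pnpm test` script, and the action major versions are all assumptions for illustration; the actual pins and steps in this PR are not shown here.

```yaml
# Hypothetical workflow; names and versions are illustrative only.
name: js
on:
  pull_request:
  push:
    branches: [main]

jobs:
  test:
    runs-on: ubuntu-latest
    strategy:
      fail-fast: false
      matrix:
        # Node 24 comes from the PR description; 20 and 22 are assumed.
        node-version: [20, 22, 24]
    steps:
      - uses: actions/checkout@v4
      # pnpm must be installed before setup-node can cache its store.
      - uses: pnpm/action-setup@v4
        with:
          version: 10 # assumed; omit if package.json sets "packageManager"
      - uses: actions/setup-node@v4
        with:
          node-version: ${{ matrix.node-version }}
          cache: pnpm # built-in cache, replaces a manual actions/cache step
      - run: pnpm install --frozen-lockfile
      - run: pnpm test
```

With `cache: pnpm`, setup-node keys the cache on the lockfile, so a separate cache step for the pnpm store should no longer be needed.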
Braintrust eval report: Autoevals (abhi-autoevals-ci-improvements-1780938254)
pull_request:
  # Uncomment to run only when files in the 'evals' directory change
  # paths:
  #   - "evals/**"
I think this means fork PRs would fail since they don't have our secrets. Not sure if that's a big deal but wanted to point it out just in case
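One common way to address this concern, sketched here as an assumption rather than something this PR does, is to skip the secret-dependent job when the pull request comes from a fork. The job name, the placeholder step, and the secret name `EXAMPLE_API_KEY` are hypothetical.

```yaml
jobs:
  eval:
    # Run on non-PR events, and on pull requests only when the head branch
    # lives in this repository (fork PRs do not receive repository secrets).
    if: github.event_name != 'pull_request' || github.event.pull_request.head.repo.full_name == github.repository
    runs-on: ubuntu-latest
    steps:
      - uses: actions/checkout@v4
      - run: echo "run evals here" # placeholder for the real eval step
        env:
          EXAMPLE_API_KEY: ${{ secrets.EXAMPLE_API_KEY }} # hypothetical secret name
```

With a guard like this, fork PRs would show the job as skipped instead of failed, at the cost of not running evals on outside contributions.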
Andrew Kent (realark) approved these changes on Jun 8, 2026
This branch focuses on CI and Python dependency modernization:
- […] uv, adding pyproject.toml, uv.lock, and updating Makefile/env.sh. Also adds Python 3.13 and 3.14 to the test matrix.
- […] uv commands.
- […] AGENTS.md and CLAUDE.md.
- […] package.json to align JS tooling expectations.
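For illustration, a Python job built around uv and the new matrix entries could look roughly like the sketch below. The job name, the action major versions, the use of `pytest` as the test runner, and the omission of older Python versions are assumptions; the PR's actual workflow and Makefile targets may differ.

```yaml
jobs:
  python:
    runs-on: ubuntu-latest
    strategy:
      fail-fast: false
      matrix:
        # 3.13 and 3.14 come from the PR summary; other versions omitted here.
        python-version: ["3.13", "3.14"]
    env:
      # Tells uv which interpreter to use for sync and run.
      UV_PYTHON: ${{ matrix.python-version }}
    steps:
      - uses: actions/checkout@v4
      - uses: astral-sh/setup-uv@v6
      # Install exactly what uv.lock specifies; fail if the lockfile is stale.
      - run: uv sync --locked
      # Assumes pytest is declared as a dependency in pyproject.toml.
      - run: uv run pytest
```

Using `--locked` keeps CI tied to the committed uv.lock, so a dependency change that was not re-locked shows up as a CI failure instead of a silent re-resolve.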