feat: `annotation` queue `testset` export by ashrafchowdury · Pull Request #4261 · Agenta-AI/agenta

ashrafchowdury · 2026-05-05T07:27:35Z

What's new

Added a feature to create testset from the annotation queue, from both traces and test case types
Extend the toast message function to receive links and link-text for or demand entity navigation
Improved the annotation focus view ui to display the keyboard shortcuts

Fixes

Fix chat view rendering issue on trace drawer (that changes reflected on annotation page)
Fix the table column rendering issue to avoid redundant information display for messages specifically
Fixed the table to render the evaluator column properly

QA scope (Annotation-queue)

Create/update testset from both the focus and all tab
From the focus tab you should be able to add the currently viewed scenario to testsets
From the all view, you should be able to select scenarios from the table add those to testset
From the all view, you should be able to add the entire queue to the test set if you don't select any scenarios from the table
It should remember the last used testset on the commit modal

Adds PRD, RFC, and README to docs/designs/testset-annotation-queue/. The design covers: - Per-scenario "Add to Testset" button in the annotate tab - "Done with queue" screen supporting both trace and testcase queues - All-annotations tab with row selection and upgraded commit modal - New AddToTestsetModal component with EntityPicker integration - Controller actions for addScenariosToTestset (trace + testcase paths) - Default target testset heuristics and last-used persistence https://claude.ai/code/session_01B2uQKidAr1KJ4CR9sroY2H

Key corrections: - Replace new AddToTestsetModal with EntityCommitModal + renderModeContent - Replace useState for selected testset with pendingTestsetSelectionAtom in annotationSessionController — survives re-renders and can be read imperatively by addScenariosToTestset without closure capture - selectedScenarioIdsAtom for row selection (also atom, not useState) - openAddToTestsetModal seeds pendingTestsetSelectionAtom from default - addScenariosToTestset reads target testset from atom, not payload - Add state atom summary table documenting all atom lifecycles https://claude.ai/code/session_01B2uQKidAr1KJ4CR9sroY2H

Inputs (agData.inputs) spread into N columns — one per input key. Outputs (agData.outputs) always map to a single "output" column regardless of value shape, matching extractOutputs() behaviour in trace/utils/selectors.ts which treats outputs as a leaf. Annotation values add one column per evaluator slug.

…n reset 1. Trace annotation resolution: use scenarioAnnotationsAtomFamily(scenarioId) not a raw query by traceId. The atom handles step-based resolution to avoid cross-queue bleed (as documented in the controller). 2. Scope label when scope="all": addToTestsetScenarioIds() is empty for the "all" case — read actual count from scenarioIds() instead of falling back to the "all" string. 3. Row selection reset: addScenariosToTestset clears selectedScenarioIdsAtom after a successful export so stale selections don't persist.

vercel · 2026-05-05T07:27:40Z

The latest updates on your projects. Learn more about Vercel for GitHub.

Project	Deployment	Actions	Updated (UTC)
agenta-documentation	Ready	Preview, Comment	May 7, 2026 11:03am

coderabbitai · 2026-05-05T07:27:43Z

Note

Reviews paused

It looks like this branch is under active development. To avoid overwhelming you with review comments due to an influx of new commits, CodeRabbit has automatically paused this review. You can configure this behavior by changing the reviews.auto_review.auto_pause_after_reviewed_commits setting.

Use the following commands to manage reviews:

@coderabbitai resume to resume automatic reviews.
@coderabbitai review to trigger a single review.

Use the checkboxes below for quick actions:

▶️ Resume reviews
🔍 Trigger review

📝 Walkthrough

Walkthrough

Adds an "Add to Testset" feature: controller atoms/actions and types for modal/export jobs, background export implementation with row builders/remapping for traces/testcases, UI wiring and modal integration, annotation column child support, and related testset API adjustments.

Changes

Add to Testset Feature

Layer / File(s)	Summary
Type Shape `web/packages/agenta-annotation/src/state/types.ts`, `web/packages/agenta-annotation-ui/src/components/AnnotationSession/type.ts`	Adds `AnnotationOutputColumnDef` and optional `outputColumns` on `AnnotationDataColumnDef`; introduces `AddToTestsetTargetSelection` type for testset picker.
Controller State & API Types `web/packages/agenta-annotation/src/state/controllers/annotationSessionController.ts`, `web/packages/agenta-annotation/src/state/index.ts`, `web/packages/agenta-annotation/src/index.ts`	New exported types `AddToTestsetExportJob`/`AddToTestsetScope`; adds atoms/selectors/actions/get/set for modal lifecycle, pending selection, selected scenario IDs, export job state, `canAddToTestset`, and session-close cleanup.
Export Row Builders & Normalization `web/packages/agenta-annotation/src/state/testsetSync.ts`	Introduces helpers to read/flatten annotation outputs, build evaluator slugs, apply output entries, normalize trace content, and exported builders `buildTraceTestsetRows` and `buildTestcaseExportRows`; refactors `buildTestsetSyncPreview` to use shared helpers.
Export Execution & Remapping `web/packages/agenta-annotation/src/state/controllers/annotationSessionController.ts`	Implements `addScenariosToTestsetAtom` background flow: resolves export scope, waits for annotation/trace data, builds rows, queries existing testset revision columns, remaps rows to existing columns or creates/patches revisions, updates job state, invalidates queries, and reports status.
Annotation Column Refactor `web/packages/agenta-annotation-ui/src/components/AnnotationSession/ScenarioListView.tsx`	Annotation parent columns may now expose concrete `outputColumns` (fallback to `outputKeys`); export column state carries per-child `annotationDef`, `outputKey`, and `title`; export label/value resolution uses `title`.
UI Wiring & Modal Integration `web/packages/agenta-annotation-ui/src/components/AnnotationSession/index.tsx`, `.../SessionNavigation.tsx`, `.../FocusView.tsx`, `.../ScenarioListView.tsx`	Integrates EntityCommitModal for Add-to-Testset, wires `openAddToTestsetModal`, pending selection handlers, submit flow, one-time success toast, and controller actions; adds Add-to-Testset buttons (FocusView, SessionNavigation, ScenarioListView); FocusView props signature reduced (removed onSyncToTestset/isSyncing).
New UI Components & Constants `web/packages/agenta-annotation-ui/src/components/AnnotationSession/constants.ts`, `.../EmptyQueueState.tsx`, `.../SessionHeaderRight.tsx`, `.../SessionTitle.tsx`	Adds session tabs/constants, commit modes, `ADD_TO_TESTSET_TARGET_ADAPTER`, and new components EmptyQueueState, SessionHeaderRight, SessionTitle.
EntityPicker / Commit UX Changes `web/packages/agenta-entity-ui/src/selection/.../types.ts`, `.../CascadingVariant.tsx`, `.../useCascadingMode.ts`, `.../modals/commit/*`	Adds optional `onDeselect` to cascading picker props and hooks; `useCascadingMode` invokes `onDeselect` when selection becomes null after user interaction; reorders EntityCommitContent elements; commit modal success path now closes modal without immediate reset.
Testset API Changes `web/packages/agenta-entities/src/testset/api/api.ts`, `.../mutations.ts`, `.../api/index.ts`, `.../index.ts`	`fetchRevisionWithTestcases` accepts optional `testcaseLimit` and may use `/retrieve`; adds `fetchLatestRevisionWithTestcases`; `commitRevision` accepts `testsetVariantId` and forwards it in payload.
Public Export Propagation `web/packages/agenta-annotation/src/state/controllers/index.ts`, `web/packages/agenta-annotation/src/state/index.ts`, `web/packages/agenta-annotation/src/index.ts`	Re-exports `AddToTestsetExportJob` and `AddToTestsetScope` across controller/state package entry points.

sequenceDiagram
participant User
participant UI as AnnotationSession UI
participant State as annotationSessionController
participant API as Testset API
participant DB as Database

User->>UI: Click "Add to Testset" (single/selected/all)
UI->>State: openAddToTestsetModal({scope, scenarioIds})
State->>UI: Modal opens
User->>UI: Choose existing testset or "New" and submit
UI->>State: addScenariosToTestset({scope, mode, selection})
State->>State: Resolve scenarios, wait for annotations/traces
State->>State: buildTraceTestsetRows / buildTestcaseExportRows
State->>API: fetch revision(s) / latest revision with testcases
API->>DB: retrieve testset revision & columns
State->>State: remap rows to existing columns or prepare new revision
State->>API: create or patch revision (commit)
API->>DB: persist revision changes
API-->>State: success
State->>UI: set job status "success", close modal, invalidate queries
UI-->>User: show success toast

Estimated Code Review Effort

🎯 4 (Complex) | ⏱️ ~60 minutes

Possibly Related PRs

Agenta-AI/agenta#4237: Touches Add-to-Testset export/mapping workflow and overlaps on testset export logic.

🚥 Pre-merge checks | ✅ 3 | ❌ 2

❌ Failed checks (1 warning, 1 inconclusive)

Check name	Status	Explanation	Resolution
Docstring Coverage	⚠️ Warning	Docstring coverage is 16.00% which is insufficient. The required threshold is 60.00%.	Write docstrings for the functions missing them to satisfy the coverage threshold.
Description check	❓ Inconclusive	No pull request description was provided by the author, making it impossible to assess whether the description relates to the changeset.	Add a pull request description explaining the feature, scope of changes, and any relevant implementation details or testing notes.

✅ Passed checks (3 passed)

Check name	Status	Explanation
Title check	✅ Passed	The title 'feat: `annotation` queue `testset` export' directly describes the main feature added: enabling testset export functionality within annotation queues.
Linked Issues check	✅ Passed	Check skipped because no linked issues were found for this pull request.
Out of Scope Changes check	✅ Passed	Check skipped because no linked issues were found for this pull request.

_{✏️ Tip: You can configure your own custom pre-merge checks in the settings.}

✨ Finishing Touches

🧪 Generate unit tests (beta)

Create PR with unit tests
Commit unit tests in branch feat/frontend-annotation-queue-testset

Tip

💬 Introducing Slack Agent: The best way for teams to turn conversations into code.

Slack Agent is built on CodeRabbit's deep understanding of your code, so your team can collaborate across the entire SDLC without losing context.

Generate code and open pull requests
Plan features and break down work
Investigate incidents and troubleshoot customer tickets together
Automate recurring tasks and respond to alerts with triggers
Summarize progress and report instantly

Built for teams:

Shared memory across your entire org—no repeating context
Per-thread sandboxes to safely plan and execute work
Governance built-in—scoped access, auditability, and budget controls

One agent for your entire SDLC. Right inside Slack.

👉 Get started

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

coderabbitai

🧹 Nitpick comments (2)

docs/designs/testset-annotation-queue/PRD.md (1)
42-48: 💤 Low value

Add blank lines around tables for markdown compliance.

The tables in this document are missing surrounding blank lines, which causes markdownlint warnings. This affects tables at lines 42, 89, 107, and 118.
📝 Example fix for the competitive reference table
 ## Competitive Reference
 
 ### LangSmith
+
 | Capability | Supported |
 |------------|-----------|
 | Create or extend a testset from both test cases and traces | Yes |
 | Add a single scenario to a testset | Yes |
 | Add to a testset from the focused/annotate tab | Yes |
 | Select table rows and add them to a testset | Yes |
+
 LangSmith supports all four capabilities; Agenta currently supports none of them in full.
docs/designs/testset-annotation-queue/RFC.md (1)
18-57: 💤 Low value

Add language specifiers to fenced code blocks.

Several code blocks in this RFC lack language specifiers, which affects syntax highlighting and readability. The architecture diagram at line 18 and examples at lines 244, 361, 366, 385, 391, 485, 500, 517 should specify a language (use text or plaintext for ASCII diagrams).
📝 Example fix for ASCII diagram
-```
+```text
 User action (button click)
         │
         ▼

ℹ️ Review info

⚙️ Run configuration

Configuration used: Organization UI

Review profile: CHILL

Plan: Pro Plus

Run ID: 22636d88-f5f8-408b-a088-51bb814f81e5

📥 Commits

Reviewing files that changed from the base of the PR and between bd447c0 and ee683a1.

📒 Files selected for processing (11)

docs/designs/testset-annotation-queue/PRD.md
docs/designs/testset-annotation-queue/README.md
docs/designs/testset-annotation-queue/RFC.md
web/packages/agenta-annotation-ui/src/components/AnnotationSession/FocusView.tsx
web/packages/agenta-annotation-ui/src/components/AnnotationSession/ScenarioListView.tsx
web/packages/agenta-annotation-ui/src/components/AnnotationSession/SessionNavigation.tsx
web/packages/agenta-annotation-ui/src/components/AnnotationSession/index.tsx
web/packages/agenta-entity-ui/src/modals/commit/components/EntityCommitContent.tsx
web/packages/agenta-entity-ui/src/selection/components/UnifiedEntityPicker/types.ts
web/packages/agenta-entity-ui/src/selection/components/UnifiedEntityPicker/variants/CascadingVariant.tsx
web/packages/agenta-entity-ui/src/selection/hooks/modes/useCascadingMode.ts

coderabbitai

Actionable comments posted: 4

Caution

Some comments are outside the diff and can’t be posted inline due to platform limitations.

⚠️ Outside diff range comments (1)

web/packages/agenta-annotation-ui/src/components/AnnotationSession/ScenarioListView.tsx (1)

552-569: ⚠️ Potential issue | 🟡 Minor | ⚡ Quick win

Keep the testcase fallback key separate from the metric path.

AnnotationOutputKeyCell passes outputKey into useAnnotationCellFallback(), and after Line 912 that value is often the full mapped path (attributes.ag.data.outputs.score, etc.). The fallback hook indexes the testcase container object by that same string, so grouped annotation cells now miss synced testcase values and render — until an annotation exists.

Possible fix

 const AnnotationOutputKeyCell = memo(function AnnotationOutputKeyCell({
     scenarioId,
     def,
     outputKey,
+    fallbackOutputKey,
     fallbackDataKey,
 }: {
     scenarioId: string
     def: AnnotationColumnDef
     outputKey: string
+    fallbackOutputKey?: string
     fallbackDataKey?: string | null
 }) {
     const runId = useAtomValue(annotationSessionController.selectors.activeRunId()) ?? undefined
     const PopoverWrapper = useMetricPopoverWrapper()
     const {fallbackValue, isPending} = useAnnotationCellFallback(
         scenarioId,
         fallbackDataKey,
-        outputKey,
+        fallbackOutputKey ?? outputKey,
     )

                         render: (_value: unknown, record: ScenarioTableRow) => (
                             <AnnotationOutputKeyCell
                                 scenarioId={record.scenarioId}
                                 def={outputColumn.annotationDef}
                                 outputKey={outputColumn.annotationDef.path ?? outputColumn.title}
+                                fallbackOutputKey={outputColumn.title}
                                 fallbackDataKey={def.fallbackDataKey}
                             />
                         ),

Also applies to: 899-913

ℹ️ Review info

⚙️ Run configuration

Configuration used: Organization UI

Review profile: CHILL

Plan: Pro Plus

Run ID: e691f741-e404-4527-b6c9-1dbc7e659952

📥 Commits

Reviewing files that changed from the base of the PR and between ee683a1 and 7bea0bc.

📒 Files selected for processing (12)

web/packages/agenta-annotation-ui/src/components/AnnotationSession/ScenarioListView.tsx
web/packages/agenta-annotation/src/index.ts
web/packages/agenta-annotation/src/state/controllers/annotationSessionController.ts
web/packages/agenta-annotation/src/state/controllers/index.ts
web/packages/agenta-annotation/src/state/index.ts
web/packages/agenta-annotation/src/state/testsetSync.ts
web/packages/agenta-annotation/src/state/types.ts
web/packages/agenta-entities/src/testset/api/api.ts
web/packages/agenta-entities/src/testset/api/index.ts
web/packages/agenta-entities/src/testset/index.ts
web/packages/agenta-entity-ui/src/modals/commit/components/EntityCommitModal.tsx
web/packages/agenta-entity-ui/src/modals/commit/state.ts

💤 Files with no reviewable changes (2)

web/packages/agenta-entity-ui/src/modals/commit/state.ts
web/packages/agenta-entity-ui/src/modals/commit/components/EntityCommitModal.tsx

✅ Files skipped from review due to trivial changes (1)

web/packages/agenta-annotation/src/state/index.ts

github-actions · 2026-05-05T14:11:35Z

Railway Preview Environment


Preview URL	https://gateway-production-eac2.up.railway.app/w
Project	`agenta-oss-pr-4261`
Image tag	`pr-4261-931b7f4`
Status	Deployed
Railway logs	Open logs
Workflow logs	View workflow run
Updated at 2026-05-06T19:39:14.604Z

coderabbitai

Actionable comments posted: 1

Caution

Some comments are outside the diff and can’t be posted inline due to platform limitations.

⚠️ Outside diff range comments (1)

web/packages/agenta-entities/src/testset/api/mutations.ts (1)

47-73: ⚠️ Potential issue | 🟠 Major | ⚡ Quick win

Keep createTestset’s return shape consistent.

Line 73 still returns raw response.data, but the callers in web/packages/agenta-entities/src/testset/state/testsetMolecule.ts:685-705 and web/packages/agenta-entities/src/testset/state/mutations.ts:489-515 only understand { testset, revisionId }. If the backend ever omits data.testset, those flows fall into the "No revision ID returned from API" path even though the request itself may have succeeded. Throw here or normalize the fallback into the same shape.

🧹 Nitpick comments (2)

web/packages/agenta-annotation-ui/src/components/AnnotationSession/EmptyQueueState.tsx (1)

39-47: 💤 Low value

Consider using theme variables for button colors.

The hardcoded hex colors (#051729, #0a2540) bypass the theme system. If these are intentional brand colors for the observability feature, consider extracting them to CSS variables or a shared constant for consistency across the codebase.
web/packages/agenta-annotation-ui/src/components/AnnotationSession/index.tsx (1)
206-224: 💤 Low value

Inline array creation in render callback.

Line 213 creates [pendingTestsetSelection] on each invocation. If EntityPicker performs shallow comparison on initialSelections, this could cause unnecessary re-renders. Consider memoizing the array:
♻️ Suggested optimization
+ const initialTestsetSelections = useMemo(
+     () => [pendingTestsetSelection],
+     [pendingTestsetSelection],
+ )
+
  const renderAddToTestsetModeContent = useCallback(
      ({mode}: {mode?: string}) => (
          <div className="flex flex-col gap-3">
              {mode !== "new" && (
                  <EntityPicker<AddToTestsetTargetSelection>
                      variant="cascading"
                      adapter={ADD_TO_TESTSET_TARGET_ADAPTER}
-                     initialSelections={[pendingTestsetSelection]}
+                     initialSelections={initialTestsetSelections}
                      showLabels
                      showAutoIndicator={false}
                      placeholders={["Select testset"]}
                      onSelect={handleTestsetSelect}
                      onDeselect={handleTestsetDeselect}
                  />
              )}
          </div>
      ),
-     [handleTestsetSelect, handleTestsetDeselect, pendingTestsetSelection],
+     [handleTestsetSelect, handleTestsetDeselect, initialTestsetSelections],
  )

ℹ️ Review info

⚙️ Run configuration

Configuration used: Organization UI

Review profile: CHILL

Plan: Pro Plus

Run ID: 403b6771-e83c-477b-bbb2-86059983e95e

📥 Commits

Reviewing files that changed from the base of the PR and between 7bea0bc and 077deb8.

📒 Files selected for processing (8)

web/packages/agenta-annotation-ui/src/components/AnnotationSession/EmptyQueueState.tsx
web/packages/agenta-annotation-ui/src/components/AnnotationSession/FocusView.tsx
web/packages/agenta-annotation-ui/src/components/AnnotationSession/SessionHeaderRight.tsx
web/packages/agenta-annotation-ui/src/components/AnnotationSession/SessionTitle.tsx
web/packages/agenta-annotation-ui/src/components/AnnotationSession/constants.ts
web/packages/agenta-annotation-ui/src/components/AnnotationSession/index.tsx
web/packages/agenta-annotation-ui/src/components/AnnotationSession/type.ts
web/packages/agenta-entities/src/testset/api/mutations.ts

coderabbitai

Actionable comments posted: 1

ℹ️ Review info

⚙️ Run configuration

Configuration used: Organization UI

Review profile: CHILL

Plan: Pro Plus

Run ID: f72c55ad-f996-4d48-b99d-45e5d8841ed3

📥 Commits

Reviewing files that changed from the base of the PR and between 077deb8 and 5e471f0.

📒 Files selected for processing (1)

web/packages/agenta-entities/src/testset/api/mutations.ts

coderabbitai

Actionable comments posted: 3

♻️ Duplicate comments (3)

web/packages/agenta-annotation/src/state/controllers/annotationSessionController.ts (3)
2548-2562: ⚠️ Potential issue | 🟠 Major | 🏗️ Heavy lift

Schema inference from a single testcase row may miss existing columns.

fetchLatestRevisionWithRows requests testcaseLimit: 1, but extractExistingColumns (Line 2829) uses this single row to build the set of existing columns. If that sampled row is sparse or doesn't contain all leaf paths present in the target testset, remapRowsToExistingLeafColumns won't recognize valid existing columns. This can cause patchRevision to add duplicate or misaligned column paths.

Consider fetching the testset's column schema directly (if available via API) or increasing the sample size to union columns across multiple rows:

Also applies to: 2823-2830

2633-2657: ⚠️ Potential issue | 🟠 Major | ⚡ Quick win

Step and annotation query waits don't validate returned state before building rows.

Unlike the trace query (Lines 2616-2621), the waits for stepsQueryAtom and annotationsQueryAtom (Lines 2633-2644) don't check if the query succeeded or errored. When these queries are still pending at timeout or have errored, buildTraceAnnotationOutputs (Line 2648) reads from scenarioAnnotationsAtomFamily which returns an empty array on failure. This silently produces export rows with missing annotation outputs.

Add validation after each wait to fail the export or skip the scenario with a warning:
Proposed fix to validate query results
         const stepsQueryAtom = scenarioStepsQueryStateAtomFamily(scenarioId)
-        await waitForStoreAtomValue<QueryStateLike | null | undefined>(
+        const stepsQuery = await waitForStoreAtomValue<QueryStateLike | null | undefined>(
             stepsQueryAtom,
             isQuerySettledOrNullForExport,
         )
+        if (stepsQuery?.error) {
+            console.warn(`[export] Skipping scenario ${scenarioId}: steps query failed`)
+            processed += 1
+            params.setProcessed(processed)
+            continue
+        }

         const annotationsQueryAtom = scenarioAnnotationsQueryStateAtomFamily(scenarioId)
-        await waitForStoreAtomValue<QueryStateLike | null | undefined>(
+        const annotationsQuery = await waitForStoreAtomValue<QueryStateLike | null | undefined>(
             annotationsQueryAtom,
             isQuerySettledOrNullForExport,
             2500,
         )
+        if (annotationsQuery?.error) {
+            console.warn(`[export] Skipping scenario ${scenarioId}: annotations query failed`)
+            processed += 1
+            params.setProcessed(processed)
+            continue
+        }
2339-2365: ⚠️ Potential issue | 🟠 Major | 🏗️ Heavy lift

Timeout resolution silently returns potentially invalid state.

When the timeout expires, waitForStoreAtomValue resolves with the current atom value regardless of whether isReady() passes. This allows callers to proceed with stale or pending data when queries take longer than expected.

This relates to the past review concern: if annotation/step queries are still pending when the timeout fires, Lines 2646-2653 will build export rows from incomplete store state, potentially exporting rows with missing annotation outputs.

Consider rejecting on timeout or returning a discriminated result indicating whether the wait succeeded:
Proposed fix to reject on timeout
 async function waitForStoreAtomValue<T>(
     atomToWatch: unknown,
     isReady: (value: T) => boolean,
     timeoutMs = 5000,
 ): Promise<T> {
     const store = getStore()
     const atomRef = atomToWatch as unknown as Parameters<typeof store.get>[0]
     const subRef = atomToWatch as unknown as Parameters<typeof store.sub>[0]
     const current = store.get(atomRef) as T
     if (isReady(current)) return current

     return await new Promise<T>((resolve, reject) => {
         const timeout = setTimeout(() => {
             unsubscribe()
-            resolve(store.get(atomRef) as T)
+            const final = store.get(atomRef) as T
+            if (isReady(final)) {
+                resolve(final)
+            } else {
+                reject(new Error("Timed out waiting for atom value"))
+            }
         }, timeoutMs)

         const unsubscribe = store.sub(subRef, () => {
             const next = store.get(atomRef) as T
             if (isReady(next)) {
                 clearTimeout(timeout)
                 unsubscribe()
                 resolve(next)
             }
         })
     })
 }

🧹 Nitpick comments (1)

web/packages/agenta-annotation/src/state/controllers/annotationSessionController.ts (1)
1142-1219: 💤 Low value

Remove or gate debug logging in production code.

The scenarioAnnotationTraceIdsAtomFamily atom contains extensive console.log statements with [annot-debug] prefix (Lines 1142-1144, 1151-1153, 1180-1183, 1192-1218). These appear to be development debugging aids that would create noise in production console output.

Consider removing these logs or gating them behind a debug flag:
Suggested cleanup
+const DEBUG_ANNOTATIONS = false // or process.env.NODE_ENV === 'development'
+
 const scenarioAnnotationTraceIdsAtomFamily = atomFamily((scenarioId: string) =>
     atom<string[]>((get) => {
         const runId = get(activeRunIdAtom)
         if (!runId || !scenarioId) {
-            console.log(
-                `[annot-debug] traceIds(${scenarioId?.slice(0, 8)}): no runId or scenarioId`,
-            )
+            if (DEBUG_ANNOTATIONS) {
+                console.log(
+                    `[annot-debug] traceIds(${scenarioId?.slice(0, 8)}): no runId or scenarioId`,
+                )
+            }
             return []
         }
         // ... similar changes for other debug logs

ℹ️ Review info

⚙️ Run configuration

Configuration used: Organization UI

Review profile: CHILL

Plan: Pro Plus

Run ID: c15582c6-c412-43ae-80cd-a8c6c4f664cf

📥 Commits

Reviewing files that changed from the base of the PR and between 5e471f0 and f79a470.

📒 Files selected for processing (2)

web/packages/agenta-annotation/src/state/controllers/annotationSessionController.ts
web/packages/agenta-annotation/src/state/testsetSync.ts

ardaerzin · 2026-05-06T13:51:42Z

+    newTestsetSlug?: string
+}
+
+const lastUsedTestsetByProjectAtom = atom<Record<string, string | null>>({})


atom with storage?

…xt acceptance to provide entity link easily on the message for faster and better navigation

- refactor code - rendering json data in beautified mode by default

… annotated yet

claude and others added 9 commits May 4, 2026 14:56

fix

f1894f5

uodated the casecade variant to have deselect function

06448be

updated the element placement

f1d301f

update view

97c507e

updated the main view ui and comit modal

ee683a1

coderabbitai Bot reviewed May 5, 2026

View reviewed changes

ashrafchowdury added 2 commits May 5, 2026 16:53

added the mechanisam for adding the testset

7a083c4

fix annotation table evaluator render and testset outputs unification

077e9fe

vercel Bot deployed to Preview May 5, 2026 11:39 View deployment

fix

85329ce

vercel Bot deployed to Preview May 5, 2026 12:10 View deployment

Merge branch 'main' into feat/frontend-annotation-queue-testset

7bea0bc

ashrafchowdury marked this pull request as ready for review May 5, 2026 12:35

dosubot Bot added size:XXL This PR changes 1000+ lines, ignoring generated files. feature Frontend labels May 5, 2026

vercel Bot deployed to Preview May 5, 2026 12:40 View deployment

coderabbitai Bot reviewed May 5, 2026

View reviewed changes

cleanup

dceaef4

vercel Bot deployed to Preview May 5, 2026 14:05 View deployment

ashrafchowdury and others added 2 commits May 6, 2026 11:56

cleanup

91b823e

Merge branch 'main' into feat/frontend-annotation-queue-testset

077deb8

vercel Bot deployed to Preview May 6, 2026 05:58 View deployment

coderabbitai Bot reviewed May 6, 2026

View reviewed changes

Comment thread web/packages/agenta-entities/src/testset/api/mutations.ts Outdated

revert

5e471f0

vercel Bot deployed to Preview May 6, 2026 06:41 View deployment

coderabbitai Bot reviewed May 6, 2026

View reviewed changes

Comment thread web/packages/agenta-entities/src/testset/api/mutations.ts

remove doted column created path for testset

f79a470

vercel Bot deployed to Preview May 6, 2026 07:10 View deployment

coderabbitai Bot reviewed May 6, 2026

View reviewed changes

Comment thread web/packages/agenta-annotation/src/state/controllers/annotationSessionController.ts

Comment thread web/packages/agenta-annotation/src/state/testsetSync.ts

Comment thread web/packages/agenta-annotation/src/state/testsetSync.ts

cleanup and fixes

842007f

vercel Bot deployed to Preview May 6, 2026 07:27 View deployment

ashrafchowdury linked an issue May 6, 2026 that may be closed by this pull request

Implement annotation queue test set flow #4248

Open

ashrafchowdury requested a review from ardaerzin May 6, 2026 10:05

ardaerzin requested changes May 6, 2026

View reviewed changes

ashrafchowdury added 3 commits May 7, 2026 00:01

added keyboard shortcut keys on the ui for better understanding

1ef6d36

cleanup and fix retrivel for the annotated scenario on the table view

f49aa3a

feat: updated the toast message component to support link and link-te…

a08ab76

…xt acceptance to provide entity link easily on the message for faster and better navigation

vercel Bot deployed to Preview May 6, 2026 19:28 View deployment

- fix message strcture issue

692de32

- refactor code - rendering json data in beautified mode by default

vercel Bot deployed to Preview May 7, 2026 07:08 View deployment

disabled testset button when we added annotation value but didn't not…

0f31316

… annotated yet

vercel Bot deployed to Preview May 7, 2026 08:43 View deployment

ashrafchowdury added 2 commits May 7, 2026 16:28

fixed table messages and input rendering issues

f6a84dd

improved the logics

e2fddee

vercel Bot deployed to Preview May 7, 2026 10:41 View deployment

Merge branch 'main' into feat/frontend-annotation-queue-testset

72a1e43

vercel Bot deployed to Preview May 7, 2026 11:03 View deployment

ashrafchowdury marked this pull request as draft May 7, 2026 11:11

Conversation

ashrafchowdury commented May 5, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What's new

Fixes

QA scope (Annotation-queue)

Uh oh!

vercel Bot commented May 5, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

coderabbitai Bot commented May 5, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Reviews paused

Walkthrough

Changes

Estimated Code Review Effort

Possibly Related PRs

❌ Failed checks (1 warning, 1 inconclusive)

Uh oh!

coderabbitai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

coderabbitai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

github-actions Bot commented May 5, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Railway Preview Environment

Uh oh!

coderabbitai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

coderabbitai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

coderabbitai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

ardaerzin May 6, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

ashrafchowdury commented May 5, 2026 •

edited

Loading

vercel Bot commented May 5, 2026 •

edited

Loading

coderabbitai Bot commented May 5, 2026 •

edited

Loading

github-actions Bot commented May 5, 2026 •

edited

Loading