Parallelize LLM as Judge #3958

@allenli873

Description

Is your feature request related to a problem? Please describe.
Right now, LLM-as-judge runs in series: N * M LLM calls one after another, where N is the number of samples and M is the number of eval cases. This part can take a long time.

Describe the solution you'd like
I would like it to be possible for these to run in parallel, either by default or via a flag that can be passed into the agent evaluator.

Additional context
I've monkey-patched this in my own project. Before, it took close to 5 minutes to eval one test case with 5 samples and 2 different rubrics with Gemini 3 Pro. After the patch it was down to 1 minute.
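For reference, a minimal sketch of the kind of change I mean, assuming the judge call can be made async. The function names (`judge_sample`, `judge_all`) and the semaphore-based concurrency cap are illustrative, not the actual evaluator API:

```python
import asyncio


async def judge_sample(sample: str, rubric: str) -> str:
    """Placeholder for a single LLM-as-judge call (hypothetical)."""
    await asyncio.sleep(0.01)  # stand-in for model latency
    return f"score({sample}, {rubric})"


async def judge_all(samples, rubrics, max_concurrency: int = 8):
    """Issue all N * M judge calls concurrently instead of in series."""
    # Bound concurrency so we don't blow past provider rate limits.
    sem = asyncio.Semaphore(max_concurrency)

    async def bounded(s, r):
        async with sem:
            return await judge_sample(s, r)

    return await asyncio.gather(
        *(bounded(s, r) for s in samples for r in rubrics)
    )


results = asyncio.run(judge_all(["s1", "s2"], ["r1", "r2"]))
```

With serial execution the wall-clock time is roughly N * M * latency; with `asyncio.gather` it approaches max(latency) per batch, which matches the ~5x speedup I saw. `max_concurrency` could be the flag passed into the agent evaluator.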

Metadata

Labels

eval [Component] This issue is related to evaluation
needs review [Status] The PR/issue is awaiting review from the maintainer
