starting-ragchatbot-codebase/backend-tool-refactor.md at 411d98ca483097e8e41dc5a2482933ea562fa098 · https-deeplearning-ai/starting-ragchatbot-codebase · GitHub

28 lines (22 loc) · 1.48 KB

Refactor @backend/ai_generator.py to support sequential tool calling where Claude can make up to 2 tool calls in separate API rounds.

Current behavior:

Claude makes 1 tool call → tools are removed from API params → final response
If Claude wants another tool call after seeing results, it can't (gets empty response)

Desired behavior:

Each tool call should be a separate API request where Claude can reason about previous results
Support for complex queries requiring multiple searches for comparisons, multi-part questions, or when information from different courses/lessons is needed

Example flow:

User: "Search for a course that discusses the same topic as lesson 4 of course X"
Claude: get course outline for course X → gets title of lesson 4
Claude: uses the title to search for a course that discusses the same topic → returns course information
Claude: provides complete answer

Requirements:

Maximum 2 sequential rounds per user query
Terminate when: (a) 2 rounds completed, (b) Claude's response has no tool_use blocks, or (c) tool call fails
Preserve conversation context between rounds
Handle tool execution errors gracefully

Notes:

Update the system prompt in @backend/ai_generator.py
Update the test @backend/tests/test_ai_generator.py
Write tests that verify the external behavior (API calls made, tools executed, results returned) rather than internal state details.

Use two parallel subagents to brainstorm possible plans. Do not implement any code.