feat: add retry logic for chat completion streaming #1381

Open
PortRoyale wants to merge 1 commit into elie222:main from PortRoyale:feat/streaming-retry-logic

Conversation

@PortRoyale

Summary

Adds retry with exponential backoff for transient errors in chatCompletionStream. This helps with smaller models (e.g. 8B-parameter models), which may occasionally produce schema mistakes or encounter transient network issues.

Changes

  • Add retry loop with max 2 retries and exponential backoff (1s, 2s)
  • Detect JSON parse errors (SyntaxError, "Unexpected token")
  • Detect tool call/schema validation errors
  • Detect transient network errors (via existing isTransientNetworkError)
  • Improve error logging with error type classification
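The error detection described in the list above could be sketched roughly as follows. This is a hypothetical illustration, not the PR's actual code: the function name `isRetryableStreamError` and the exact match strings are assumptions based on the bullet points.

```typescript
// Hypothetical sketch of the error classification described above.
// Names and match strings are assumptions, not the PR's implementation.
function isRetryableStreamError(error: unknown): boolean {
  // JSON parse errors surface as SyntaxError or "Unexpected token" messages
  if (error instanceof SyntaxError) return true;
  const message = error instanceof Error ? error.message : String(error);
  if (message.includes("Unexpected token")) return true;
  // Tool call / schema validation failures from smaller models
  if (/tool call|schema validation/i.test(message)) return true;
  // In the real code path, transient network errors would be delegated
  // to the existing isTransientNetworkError helper.
  return false;
}
```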

Motivation

The existing createGenerateObject and createGenerateText functions already have retry logic via withLLMRetry and withNetworkRetry. This brings similar resilience to the streaming chat completion path.

When using smaller local models (like 8B-parameter models via Ollama), occasional JSON malformation or schema validation failures can occur. Adding retry logic allows the system to recover gracefully from these transient failures.
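The retry behavior described above (max 2 retries, 1s then 2s backoff) could look roughly like this. This is a minimal sketch, not the PR's actual code: `withStreamRetry`, its signature, and the `baseDelayMs` parameter are assumptions for illustration.

```typescript
// Hypothetical sketch of the retry loop: max 2 retries with exponential
// backoff (1s, then 2s). withStreamRetry is an assumed name, not the PR's.
const MAX_RETRIES = 2;

async function withStreamRetry<T>(
  operation: () => Promise<T>,
  isRetryable: (error: unknown) => boolean,
  baseDelayMs = 1_000,
): Promise<T> {
  for (let attempt = 0; ; attempt++) {
    try {
      return await operation();
    } catch (error) {
      // Give up once the retry budget is spent, or on non-transient errors
      if (attempt >= MAX_RETRIES || !isRetryable(error)) throw error;
      // 1x the base delay on the first retry, 2x on the second
      const delayMs = baseDelayMs * 2 ** attempt;
      await new Promise((resolve) => setTimeout(resolve, delayMs));
    }
  }
}
```

In the actual change, the operation would be the streaming chat completion call, and the retryable check would combine the JSON/schema detection with the existing isTransientNetworkError helper.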

Test plan

  • Verified TypeScript compiles without errors
  • Test with a local Ollama model to verify retry behavior on transient failures
  • Verify no regression in normal streaming behavior

🤖 Generated with Claude Code


Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

vercel bot commented Jan 23, 2026

@PortRoyale is attempting to deploy a commit to the Inbox Zero OSS Program Team on Vercel.

A member of the Team first needs to authorize it.


CLAassistant commented Jan 23, 2026

CLA assistant check
All committers have signed the CLA.


@cubic-dev-ai cubic-dev-ai bot left a comment


No issues found across 1 file
