feat(js): implemented generate middleware by pavelgj · Pull Request #4515 · firebase/genkit

pavelgj · 2026-02-09T01:28:21Z

Checklist (if applicable):

PR title is following https://www.conventionalcommits.org/en/v1.0.0/
Tested (manually, unit tested, etc.)
Docs updated (updated docs or a docs bug required)

gemini-code-assist · 2026-02-09T01:28:50Z

Summary of Changes

Hello @pavelgj, I'm Gemini Code Assist¹! I'm currently reviewing this pull request and will post my feedback shortly. In the meantime, here's a summary to help you and other reviewers quickly get up to speed!

This pull request introduces a powerful new middleware system for Genkit's generate functionality. This system enables developers to inject custom logic at various stages of the generation process, including before and after the main generate action, model calls, and individual tool executions. It provides a flexible mechanism for cross-cutting concerns like logging, caching, and dynamic tool injection, enhancing the extensibility and observability of AI applications built with Genkit.

Highlights

Introduction of Generate Middleware: A new system is implemented allowing interception and modification of the generate action, model calls, and tool calls within Genkit AI.
Middleware Definition and Usage: Developers can now define middleware using the generateMiddleware helper with optional configuration schemas and apply them via the use option in generate calls.
Middleware Hooks: Middleware can provide specific hooks for different stages: generate (wrapping the entire process), model (intercepting model execution), and tool (intercepting individual tool calls).
Dynamic Tool Injection: Middleware can dynamically inject tools into the generation request, expanding the capabilities of AI models on the fly.
Plugin Integration and Backward Compatibility: The new middleware system is fully integrated with the Genkit plugin architecture for global registration and maintains backward compatibility with older functional middleware patterns.
Reflection API Support: The Genkit reflection server now exposes registered middleware, allowing for introspection and discovery of available middleware.

🧠 New Feature in Public Preview: You can now enable Memory to help Gemini Code Assist learn from your team's feedback. This makes future code reviews more consistent and personalized to your project's style. Click here to enable Memory in your admin console.

Changelog

js/ai/src/generate.ts
- Added imports for new middleware-related types and functions.
- Updated GenerateOptions.use to accept new middleware types.
- Introduced normalizeMiddleware to standardize middleware representations.
- Modified the generate function to create a child registry, normalize and resolve middleware, and pass resolved middleware to generateHelper.
- Added maybeRegisterDynamicMiddlewareTools to register tools provided by middleware.
- Refactored maybeRegisterDynamicTools and maybeRegisterDynamicResources to operate on the child registry.
- Updated toGenerateActionOptions to include use in the generated parameters.
js/ai/src/generate/action.ts
- Updated imports for Registry, maybeRegisterDynamicMiddlewareTools, normalizeMiddleware, GenerateMiddlewareDef, and resolveMiddleware.
- Removed old ModelMiddlewareArgument and ModelMiddlewareWithOptions types.
- Modified defineGenerateAction to create a child registry, normalize and resolve middleware, register dynamic middleware tools, and pass resolved middleware to generateActionImpl.
- Renamed generate to generateActionImpl and introduced a new generateActionTurn function.
- Implemented middleware dispatch logic within generateActionImpl for the generate hook.
- Updated generateHelper and generateActionImpl to use GenerateMiddlewareDef[] for middleware.
- Added logic to append tools supplied by middleware to the request.
- Modified resolveResumeOption to accept middleware.
- Updated dispatchModel logic to use currentMiddleware.model for model interception.
- Modified resolveToolRequests to accept tools and middleware arguments.
js/ai/src/generate/middleware.ts
- New file: Defines MiddlewareDescSchema, MiddlewareRefSchema, GenerateMiddleware interface, and GenerateMiddlewareDef interface.
- Introduces generateMiddleware function for creating configurable middleware.
- Introduces resolveMiddleware function to instantiate middleware definitions from references.
js/ai/src/generate/resolve-tool-requests.ts
- Added import for ActionRunOptions and GenerateMiddlewareDef.
- Modified resolveToolRequest to accept middleware and implemented dispatch logic for the tool hook.
- Modified resolveToolRequests to accept tools and middleware arguments, and removed registry parameter.
- Modified resolveResumedToolRequest, resolveResumeOption, and resolveRestartedTools to accept middleware.
js/ai/src/index.ts
- Exported new middleware-related types and functions from generate/middleware.ts.
- Exported GenkitPluginV2.
js/ai/src/model-types.ts
- Imported MiddlewareRefSchema.
- Updated GenerateActionOptionsSchema to include an optional use array of MiddlewareRefSchema.
js/ai/src/plugin.ts
- New file: Defines GenkitPluginV2 interface with generateMiddleware method.
js/ai/tests/generate/action_test.ts
- Added a new test suite generateAction middleware injection.
- Added a test case to verify that tools injected via middleware are passed to the model.
js/ai/tests/generate/middleware_test.ts
- New file: Comprehensive tests for generateMiddleware functionality.
- Tests execution order of generate, model, and tool hooks.
- Tests configuration and old-style functional middleware, and pre-registered middleware via plugins.
- Tests error handling for incorrect middleware usage, stream interception and modification, and multiple middleware execution order.
- Tests tool injection and execution via middleware.
js/core/src/plugin.ts
- Updated BaseGenkitPluginV2 interface to include generateMiddleware method.
- Added ResolvableAction type.
js/core/src/reflection.ts
- Modified /api/values endpoint to support middleware type.
- Added logic to call toJson() method on middleware objects if available for serialization.
js/core/tests/reflection_test.ts
- New file: Tests for the ReflectionServer API.
- Tests handling of missing and unsupported type parameters.
- Tests retrieval of defaultModel values and middleware values, including toJson() serialization.
js/docs/generate-middleware.md
- New file: Documentation explaining how to define and use generate middleware, including plugin integration.
js/genkit/src/common.ts
- Exported new middleware-related types and functions: MiddlewareDescSchema, MiddlewareRefSchema, generateMiddleware, GenerateMiddleware, GenerateMiddlewareDef, MiddlewareDesc, and MiddlewareRef.
js/genkit/src/genkit.ts
- Modified Genkit class to process generateMiddleware from plugins and register them in the registry.
js/genkit/src/plugin.ts
- Updated GenkitPluginV2 interface to include generateMiddleware method.
- Updated GenkitPluginV2Instance to implement generateMiddleware.
js/genkit/tests/generate_test.ts
- Added a test case to verify middleware plugin integration with genkit.generate.

Activity

The author, pavelgj, has implemented a new feature: generate middleware.
The pull request includes comprehensive unit tests for the new middleware system.
New documentation for generate middleware has been added.
The PR title follows conventional commits specification.
The changes have been manually and unit tested.

Using Gemini Code Assist

The full guide for Gemini Code Assist can be found on our documentation page, here are some quick tips.

Invoking Gemini

You can request assistance from Gemini at any point by creating a comment using either /gemini <command> or @gemini-code-assist <command>. Below is a summary of the supported commands on the current page.

Feature	Command	Description
Code Review	`/gemini review`	Performs a code review for the current pull request in its current state.
Pull Request Summary	`/gemini summary`	Provides a summary of the current pull request in its current state.
Comment	@gemini-code-assist	Responds in comments when explicitly tagged, both in pull request comments and review comments.
Help	`/gemini help`	Displays a list of available commands.

Customization

To customize Gemini Code Assist for GitHub experience, repository maintainers can create a configuration file and/or provide a custom code review style guide (such as PEP-8 for Python) by creating and adding files to a .gemini/ folder in the base of the repository. Detailed instructions can be found here.

Limitations & Feedback

Gemini Code Assist may make mistakes. Please leave feedback on any instances where its feedback is incorrect or counter productive. You can react with 👍 and 👎 on @gemini-code-assist comments. If you're interested in giving your feedback about your experience with Gemini Code Assist for Github and other Google products, sign up here.

You can also get AI-powered code generation, chat, as well as code reviews directly in the IDE at no cost with the Gemini Code Assist IDE Extension.

Review the Privacy Notices, Generative AI Prohibited Use Policy, Terms of Service, and learn how to configure Gemini Code Assist in GitHub here. Gemini can make mistakes, so double check it and use code with caution. ↩

gemini-code-assist

Code Review

This pull request introduces a comprehensive middleware system for the generate function, which is a great feature for adding cross-cutting concerns like logging, caching, and telemetry. The implementation is well-structured, with clear separation of concerns and good test coverage. I've left a few comments on minor improvements and a potential design smell in the plugin interface. Overall, this is a solid contribution.

js/ai/src/generate.ts

js/ai/src/generate/action.ts

js/ai/src/generate/middleware.ts

js/core/src/reflection.ts

…te the test to verify multi-turn execution.

…rity audit - Add 7 new gaps (G38-G44) for Middleware V2, Bidi Action/Flow/Model, Agent primitive, Plugin V2, and Reflection API V2 - Mark middleware gaps G1-G3, G12-G16 as PAUSED (blocked on upstream JS #4515 and Go #4422 Middleware V2 RFCs) - Mark G19 (Model API V2) as SUPERSEDED by G38 + G41 - Add deep-dive sections §8l-8p covering all 5 active RFC designs - Update dependency graph: critical path now G38→G2→G1→G3 (4 levels) - Restructure phased roadmap: Phase 1 (unblocked), Phase 2-3 (paused), Phase 4 (bidi/agent, blocked), Phase 5 (integration), Phase 6 (deferred) - Update §5g cross-SDK gaps table with new primitives - Update §9b status tracker and §9c dependency matrix - Update summary metrics: 36 total gaps, 8 paused, 6 upstream-blocked Co-authored-by: Cursor <cursoragent@cursor.com>

…interactions including generate, stream, and embed.

pavelgj · 2026-02-10T00:21:57Z

/gemini review

gemini-code-assist

Code Review

This pull request introduces a powerful middleware system for the generate function, allowing interception of different stages of the generation process (generate, model, tool). This is a significant feature that enables cross-cutting concerns like logging, caching, and telemetry. The implementation is well-structured, with a new generateMiddleware helper, clear definitions for middleware hooks, and support for both new object-based middleware and backward compatibility with old function-based middleware.

The PR also includes a significant and beneficial refactoring, extracting core AI functionalities into a new GenerateAPI base class, which the main Genkit class now extends. This improves code organization and separation of concerns.

The changes are accompanied by comprehensive tests covering various middleware scenarios and documentation to guide users on the new feature.

I have one minor suggestion to improve an error message in the reflection API to make it more accurate. Overall, this is an excellent contribution.

js/core/src/reflection.ts

Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>

apascal07 · 2026-02-10T17:24:53Z

js/ai/src/generate-api.ts

+/**
+ * `GenerateAPI` encapsulates model generate APIs.
+ */
+export class GenerateAPI {


☹️

What use cases were you thinking of?

Some use-cases:

Automatic Context compression middleware (requires access to an llm)

LLM classifier based model routing (simple tasks use small model, complex models use large model)

agentic RAG use-case require embedders

apascal07 · 2026-02-10T18:23:58Z

js/ai/src/generate/resolve-tool-requests.ts

+  ): Promise<{
+    response?: ToolResponsePart;
+    interrupt?: ToolRequestPart;
+    preamble?: GenerateActionOptions;


This is just for backwards compat, right?

Updates the generation loop and middleware logic to safely handle cases where tool execution yields no result. - Updates `generateActionTurn` to verify `toolMessage` existence before streaming chunks or adding to the message history, preventing runtime errors on undefined values. - Updates middleware types and `resolveToolRequest` to allow `void` return values. - Ensures `resolveToolRequests` returns an empty object if no response parts or transfer preamble are generated, rather than constructing a malformed tool message.

- Simplify `GenerateMiddlewareDef` to return `ToolResponsePart | void` directly, removing the complex wrapper object that included `interrupt` and `preamble`. - Move `isPromptAction` check in `resolveToolRequest` to execute before the middleware chain, ensuring prompt actions are handled immediately without passing through tool middleware. - Update `executeTool` implementation and return types to align with the simplified middleware signature.

… and merge order Comprehensive update reflecting current state of all merged and open PRs: Merged PRs reflected: - #4511 (G5+G6: span_id + X-Genkit-Span-Id) → marked Done - #4507+#4508 (G11: CHANGELOG.md) → marked Done - #4509 (plugin test coverage uplift) → marked Done - #4505 (PARITY_AUDIT.md baseline) → marked Done - #4488 (sample naming, py.typed, check_consistency) → marked Done Open PRs tracked: - #4516 (G1+G2: model middleware storage) - #4514 (Transfer-Encoding fix) - #4513 (G18: multipart tools) - #4512 (G20-G22: constructor parity) - #4510 (G3+G12-G16: middleware functions) - #4504 (Checks plugin) - #4495, #4494, #4401 (bug fixes + reflection v2) New content: - Middleware Taxonomy section with 4-layer diagram - Decision guide for which middleware layer to use - G38 gap (auto-wiring via get_model_middleware) - PR merge order graph with file conflict matrix - Updated summary metrics and phase tables fix: address review comments — fix G1/G2 status consistency, middleware level count, #4514 merged docs(py): update PARITY_AUDIT.md — mark #4494 and #4514 merged, add #4518 Cohere Update PR status tracking: - #4494 (RedactedSpan fix): marked merged - #4514 (Transfer-Encoding fix): marked merged - #4518 (Cohere provider plugin): added as open PR - Updated dependency chain, summary metrics, and open PR counts docs(py): update PARITY_AUDIT.md — add #4519 (Core fix) docs(py): update PARITY_AUDIT.md with latest merged PR status Rationale: Several PRs have merged since the last update. This syncs the document with the current state of all PRs. Changes: - Mark #4495, #4518, #4520 as merged in all tables - Move #4495, #4518 from INDEPENDENT to MERGED in dependency chain - Add #4520 (converter extraction) to merged list - Update summary metrics: 6 open PRs (down from 8) - Simplify Layer 3 deps since #4495 is now merged docs(py): add issue tracker analysis, dependency graph, model conformance roadmap, and sample flow test plan Rationale: Comprehensive update to PARITY_AUDIT.md with 5 new sections (§12–§16) covering: - Cross-SDK issue tracker analysis verified against Python source code - Dependency-aware reverse topological sort roadmap for prioritized fixes - Model conformance testing roadmap with provider parity matrix - Sample flow test plan with optimal execution order for error detection Changes: - §12: Fixability assessment of 9 'likely' issues with code-level verdicts - §13: Dependency graph (W1–W14), file conflict matrix, PR manifest with regression test specifications, sprint-based execution plan - §14: Model conformance roadmap (Phases 0–4), plugin parity matrix, conformance PR mapping, JS-only plugin gaps - §15: Combined roadmap unifying parity gaps, issue fixes, and conformance - §16: Sample flow test plan with 5-phase error detection priority pyramid, 36 samples ordered by feature coverage, quick-start commands, and env var reference table docs(py): add active RFC redesigns (Middleware V2, Bidi, Agent) to parity audit - Add 7 new gaps (G38-G44) for Middleware V2, Bidi Action/Flow/Model, Agent primitive, Plugin V2, and Reflection API V2 - Mark middleware gaps G1-G3, G12-G16 as PAUSED (blocked on upstream JS #4515 and Go #4422 Middleware V2 RFCs) - Mark G19 (Model API V2) as SUPERSEDED by G38 + G41 - Add deep-dive sections §8l-8p covering all 5 active RFC designs - Update dependency graph: critical path now G38→G2→G1→G3 (4 levels) - Restructure phased roadmap: Phase 1 (unblocked), Phase 2-3 (paused), Phase 4 (bidi/agent, blocked), Phase 5 (integration), Phase 6 (deferred) - Update §5g cross-SDK gaps table with new primitives - Update §9b status tracker and §9c dependency matrix - Update summary metrics: 36 total gaps, 8 paused, 6 upstream-blocked docs(py): update PARITY_AUDIT.md PR status to 2026-02-11 Comprehensive status update reflecting 23 PRs merged since last update (2026-02-09), 3 PRs closed/superseded, and 11 currently open PRs including releasekit tooling, dotprompt fixes, and CI workflow migration. Key changes: - §14f: Checks plugin marked as merged (#4504) - §15b: Split into Recently Merged / Closed / Currently Open - §15c: Updated metrics (31 merged, 11 open, 3 closed) - Added releasekit PRs (14 merged, 3 open) as new workstream

feat: implemented generate middleware

cd9ccd7

github-project-automation bot added this to Genkit Backlog Feb 9, 2026

github-actions bot added docs Improvements or additions to documentation js labels Feb 9, 2026

gemini-code-assist bot reviewed Feb 9, 2026

View reviewed changes

js/ai/src/generate.ts Show resolved Hide resolved

js/ai/src/generate/action.ts Outdated Show resolved Hide resolved

js/ai/src/generate/middleware.ts Show resolved Hide resolved

js/core/src/reflection.ts Show resolved Hide resolved

feat: Apply middleware to all turns of multi-turn generation and upda…

2c31fa9

…te the test to verify multi-turn execution.

yesudeep mentioned this pull request Feb 9, 2026

[SUPERSEDED] feat(py/genkit): add model-level middleware support via define_model(use=[...]) #4516

Draft

feat: Introduce a unified GenerateAPI class to centralize AI model …

8ce3844

…interactions including generate, stream, and embed.

github-actions bot added the root label Feb 10, 2026

gemini-code-assist bot reviewed Feb 10, 2026

View reviewed changes

js/core/src/reflection.ts Outdated Show resolved Hide resolved

pavelgj changed the title ~~feat: implemented generate middleware~~ feat(js): implemented generate middleware Feb 10, 2026

pavelgj and others added 2 commits February 9, 2026 19:27

Apply suggestion from @gemini-code-assist[bot]

c327325

Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>

Apply suggestion from @gemini-code-assist[bot]

781f574

Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>

pavelgj requested a review from apascal07 February 10, 2026 00:36

feat: pass the ai object to generate middleware functions.

87c56c1

pavelgj requested a review from MichaelDoyle February 10, 2026 15:57

apascal07 reviewed Feb 10, 2026

View reviewed changes

pavelgj added 2 commits February 10, 2026 19:12

pavelgj added 3 commits February 11, 2026 15:42

fixed resume middleware

577916f

Merge branch 'main' into pj/generate-middleware

b8730bc

allow dotprompt to use middleware

2f57f05

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(js): implemented generate middleware#4515

feat(js): implemented generate middleware#4515
pavelgj wants to merge 11 commits intomainfrom
pj/generate-middleware

pavelgj commented Feb 9, 2026

Uh oh!

gemini-code-assist bot commented Feb 9, 2026

Uh oh!

gemini-code-assist bot left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

pavelgj commented Feb 10, 2026

Uh oh!

gemini-code-assist bot left a comment

Uh oh!

Uh oh!

apascal07 Feb 10, 2026

Uh oh!

pavelgj Feb 10, 2026

Uh oh!

apascal07 Feb 10, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

pavelgj commented Feb 9, 2026

Uh oh!

gemini-code-assist bot commented Feb 9, 2026

Summary of Changes

Highlights

Footnotes

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

pavelgj commented Feb 10, 2026

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

apascal07 Feb 10, 2026

Choose a reason for hiding this comment

Uh oh!

pavelgj Feb 10, 2026

Choose a reason for hiding this comment

Uh oh!

apascal07 Feb 10, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants