
Commit 3d4e08c

Authored by thomashebrard, lchoquel, and claude
feature/temporal-renaming-finish (#772)
* feature/temporal-renaming-finish
* add changelog
* add config
* update config
* update Pipelex.make
* add .md temporal
* Stabilize Temporal integration: worker CLI, workflow utils, and docs
  - Add worker CLI commands and temporal workflow utilities
  - Update temporal hub, manager, and task manager for stability
  - Add pipe routing and execution documentation
  - Add temporal-diagnose skill for debugging worker issues
  - Update Makefile with temporal-related targets
  - Clean up removed temporal-problm.md (moved to skill references)
* Problems, solutions, plans
* Improve Temporal dev workflow: portable paths, env deps, and docs fixes
  Remove hardcoded absolute paths in temporal-diagnose skill, add env dependency
  to Makefile temporal targets, fix workflow/activity names in docs, and add
  pipe routing page to mkdocs nav.
* move wip docs
* Refactor documentation references and update LibraryContext structure

Co-authored-by: Louis Choquel <lchoquel@users.noreply.github.com>
Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
1 parent fa9e2bd commit 3d4e08c

69 files changed

Lines changed: 2215 additions & 202 deletions

Lines changed: 287 additions & 0 deletions
---
name: temporal-diagnose
description: >
  Two modes for working on the Temporal worker library loading bug.
  DIAGNOSE mode: run the 3-terminal Temporal dev setup (server + worker + job),
  observe the failure, interpret errors. Use when the user says "test temporal",
  "run temporal", "diagnose temporal", "temporal dev", "reproduce the temporal bug",
  "check if temporal works", or pastes Temporal worker/submitter output to interpret.
  FIX mode: discuss architecture, design the solution, plan implementation, make
  code changes. Use when the user says "fix temporal", "let's discuss a fix",
  "design the temporal fix", "implement the temporal fix", "plan the temporal
  solution", or wants to iterate on the worker library loading solution.
  Always use this skill when the conversation touches the Temporal worker library
  problem, get_required_pipe failures on the worker, or mthds_contents not reaching
  the worker.
---

# Temporal Worker Library — Diagnose & Fix

This skill has two modes. Determine which one from the user's prompt:

- **"diagnose"**, **"test"**, **"run"**, **"reproduce"**, **"check"** → DIAGNOSE mode
- **"fix"**, **"discuss"**, **"design"**, **"implement"**, **"plan"**, **"solution"** → FIX mode

If ambiguous, ask the user: "Do you want to diagnose (run the setup and observe) or discuss the fix?"
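The dispatch rule above can be made explicit in a short sketch. This is only an illustration of the keyword logic, not part of the skill's actual machinery; `pick_mode` and the word sets are assumptions for this example:

```python
from typing import Optional

# Keyword sets from the mode table above.
DIAGNOSE_WORDS = {"diagnose", "test", "run", "reproduce", "check"}
FIX_WORDS = {"fix", "discuss", "design", "implement", "plan", "solution"}

def pick_mode(prompt: str) -> Optional[str]:
    # Tokenize crudely; a real dispatcher would be fuzzier.
    words = set(prompt.lower().replace(",", " ").split())
    diagnose = bool(words & DIAGNOSE_WORDS)
    fix = bool(words & FIX_WORDS)
    if diagnose and not fix:
        return "DIAGNOSE"
    if fix and not diagnose:
        return "FIX"
    return None  # ambiguous: ask the user which they want
```

A prompt like "run the fix" matches both sets and returns `None`, which is exactly the case where the skill should ask the clarifying question.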
Read `references/temporal-worker-problem.md` before proceeding — it explains the
root cause, code paths, and expected error patterns.

## How Claude Code runs everything

Claude Code handles all three processes. Do NOT ask the user to open terminals
or run commands — do it yourself.

Use **tmux** to manage the long-running processes (server and worker) in named
sessions. This lets you start them, run the job submitter, and then capture
output from all three to diagnose.

| Process | tmux session | Raw command | Lifecycle |
|---------|-------------|-------------|-----------|
| Temporal server | `temporal-server` | `temporal server start-dev` | Long-running, stays up across iterations |
| Temporal worker | `temporal-worker` | `PIPELEXPATH=<bundle_dir> .venv/bin/python -m pipelex.temporal.worker_cli --is-not-sandboxed` | Long-running, restart after code changes |
| Job submitter | (inline Bash) | `make trund` / `make trun` | Runs and exits |

**Important**: The server and worker are **long-running processes that never exit**.
They block the shell they run in. That is why they run inside tmux sessions, not
inline. The submitter (`make trund` / `make trun`) is the only process that runs to
completion and exits — run it directly via Bash, not in tmux.

**Why raw commands in tmux**: tmux sessions run in a bare shell without the
Makefile's variable resolution (`$(VENV_PYTHON)`, `$(call PRINT_TITLE,...)`).
Using `make ts` or `make tw` inside tmux will fail. Always use the raw commands
shown above for tmux sessions. The `make` targets are only for the job submitter,
which runs in Claude Code's own shell.
### tmux cheatsheet

**Start a session:**
```bash
tmux new-session -d -s temporal-server 'temporal server start-dev'
```

**Check if running:**
```bash
tmux has-session -t temporal-server 2>/dev/null && echo "running" || echo "not running"
```

**Read output** (last N lines):
```bash
tmux capture-pane -t temporal-worker -p -S -100
```

**Kill and restart** (e.g., to pick up code changes):
```bash
tmux kill-session -t temporal-worker
tmux new-session -d -c "$PWD" -s temporal-worker 'PIPELEXPATH=tests/integration/pipelex/pipes/controller/pipe_sequence .venv/bin/python -m pipelex.temporal.worker_cli --is-not-sandboxed'
```

If tmux is not installed, fall back to asking the user to run the server and
worker in separate terminals.

---
## DIAGNOSE Mode

Run the 3-process Temporal development setup and interpret results.

### Prerequisites

Verify these yourself (via Bash):
1. `tmux` installed: `which tmux`
2. `temporal` CLI installed: `which temporal`

### Step 1: Start the Temporal server

First check if a server is already running (possibly outside tmux from a previous
session or another terminal):
```bash
curl -s http://localhost:8233 > /dev/null && echo "running" || echo "not running"
```

If **running**: skip to step 2. The server is already up — no need to start it again.

If **not running**: start it in a tmux session:
```bash
tmux new-session -d -s temporal-server 'temporal server start-dev'
```
Sleep **3 seconds**, then verify:
```bash
sleep 3 && curl -s http://localhost:8233 > /dev/null && echo "running" || echo "not running"
```

Do NOT try to start the server if port 7233 is already in use — it will fail with
a bind error, the tmux session will exit immediately, and subsequent `capture-pane`
calls will fail.
### Step 2: Start the worker

```bash
tmux has-session -t temporal-worker 2>/dev/null || \
  tmux new-session -d -c "$PWD" -s temporal-worker \
    'PIPELEXPATH=tests/integration/pipelex/pipes/controller/pipe_sequence .venv/bin/python -m pipelex.temporal.worker_cli --is-not-sandboxed'
```

Use `-c "$PWD"` to pin the session's working directory to the repo: a
`cd $PWD` inside the single-quoted command string would be expanded by the
shell tmux spawns, whose working directory is not guaranteed to be the repo.

The worker is also long-running and never exits. Sleep **4 seconds** (no more),
then capture the pane:
```bash
sleep 4 && tmux capture-pane -t temporal-worker -p -S -30
```
Look for `Temporal Worker started for 'temporal_task_queue'`.
### Step 3: Submit a job

Run the job submitter. It connects to Temporal, submits the workflow, and **waits
for the result**. If the worker fails to process the job (e.g., deserialization
error), the submitter may hang for a long time waiting for a response that never
comes. Run it in the background so you can check worker output while it's waiting.

Dry run (no real LLM calls):
```bash
TEMPORAL_BUNDLE="tests/integration/pipelex/pipes/controller/pipe_sequence/pipe_sequence_1.mthds"
tmux has-session -t temporal-submitter 2>/dev/null || \
  tmux new-session -d -s temporal-submitter \
    "cd $PWD && .venv/bin/pipelex run bundle $TEMPORAL_BUNDLE --temporal --dry-run --mock-inputs --no-logo"
```

Or for real LLM execution:
```bash
TEMPORAL_BUNDLE="tests/integration/pipelex/pipes/controller/pipe_sequence/pipe_sequence_1.mthds"
tmux has-session -t temporal-submitter 2>/dev/null || \
  tmux new-session -d -s temporal-submitter \
    "cd $PWD && .venv/bin/pipelex run bundle $TEMPORAL_BUNDLE --temporal --mock-inputs --no-logo"
```

Both default to `pipe_sequence_1.mthds`. To target a specific pipe, add `--pipe <pipe_code>`.
Override the bundle by changing `TEMPORAL_BUNDLE`.
### Step 4: Diagnose the output

Read the submitter output (from step 3) AND the worker output:
```bash
tmux capture-pane -t temporal-worker -p -S -200
```

**Expected failure (bug not yet fixed):**

There are two failure layers, both caused by the missing library on the worker.
See `references/temporal-worker-problem.md` for details.

**Layer 1 — Deserialization failure** (hits first):
1. The PipeJob's WorkingMemory contains Stuff objects with dynamically-generated
   concept content classes (e.g., `RawText` inheriting from `TextContent`)
2. These classes are generated during library loading by `ConceptFactory` /
   `StructureGenerator` and registered with Kajson's class registry
3. On the worker, the library was never loaded → these classes don't exist →
   Kajson fails with `KajsonDecoderError: Class 'RawText' not found in module 'builtins'`
4. Temporal wraps this as `RuntimeError: Failed decoding arguments`
5. The submitter may hang waiting for a result that never comes

**Layer 2 — Library resolution failure** (would hit after Layer 1 is fixed):
1. `WfPipeRouter.run()` receives the PipeJob with the top-level PipeSequence
2. `PipeSequence.run_pipe()` calls `get_required_pipe("clean_text")`
3. The `library_manager` singleton is empty on the worker → error
4. Propagates as `TemporalError` / `ActivityError` to the submitter

The submitter output will show a Temporal workflow failure (or hang indefinitely
for Layer 1 failures).

**After fix is applied (success looks like):**
- Submitter: successful pipeline result printed to stdout
- Worker (`tmux capture-pane`): logs showing pipe execution steps
- Temporal UI (http://localhost:8233): completed workflow with result
### Step 5: Iterate

1. Make code changes
2. Kill and restart the worker (to pick up the changes):
   ```bash
   tmux kill-session -t temporal-worker
   tmux new-session -d -c "$PWD" -s temporal-worker 'PIPELEXPATH=tests/integration/pipelex/pipes/controller/pipe_sequence .venv/bin/python -m pipelex.temporal.worker_cli --is-not-sandboxed'
   sleep 5
   ```
3. Run `make trund` again and read the result
4. Capture worker output: `tmux capture-pane -t temporal-worker -p -S -200`
5. Repeat

The server session (`temporal-server`) stays running across iterations.
### Cleanup

When done with the entire session:
```bash
tmux kill-session -t temporal-worker 2>/dev/null
tmux kill-session -t temporal-server 2>/dev/null
```

### Test bundles for different pipe controllers

| Controller | Bundle path |
|------------|-------------|
| PipeSequence | `tests/integration/pipelex/pipes/controller/pipe_sequence/pipe_sequence_1.mthds` |
| PipeCondition | `tests/integration/pipelex/pipes/controller/pipe_condition/pipe_condition_1.mthds` |
| PipeBatch | `tests/integration/pipelex/pipes/controller/pipe_batch/uppercase_transformer.mthds` |
| PipeParallel | `tests/integration/pipelex/pipes/controller/pipe_parallel/pipe_parallel_1.mthds` |

---
## FIX Mode

Discuss architecture, design choices, and implementation for solving the worker
library loading problem. Stay in discussion/planning territory — do NOT jump to
code changes unless the user explicitly says to implement.

### What you must understand first

Read `references/temporal-worker-problem.md` thoroughly. The core tension:
- `pipeline_run_setup()` loads the library into `library_manager` — but only in the API process
- PipeJob carries the serialized top-level pipe, but child pipes are resolved by code at runtime
- On the worker, `get_required_pipe()` finds an empty library

### Design dimensions to discuss with the user

1. **Where does the library load on the worker?**
   - At worker startup (base library from PIPELEXPATH)?
   - Per-workflow in an Activity (custom bundles from mthds_contents)?
   - Both (two-tier cache)?

2. **What travels with the workflow input?**
   - Today: a pre-resolved `PipeJob` with the top-level pipe object
   - Option A: send `mthds_contents` + `pipe_code` instead, resolve on worker
   - Option B: send `PipeJob` but also include `mthds_contents` for the worker to load

3. **Replay safety**
   - Library loading is I/O — it belongs in Activities, not workflow code
   - Side-effect state (loading into a singleton) is lost on replay
   - Activities re-execute cleanly on replay

4. **Caching strategy**
   - Tier 1: base library at worker startup (same for all executions)
   - Tier 2: per-request overlay cached by content hash of mthds_contents
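To ground the discussion of dimension 2, Option B's workflow input can be sketched as a plain data shape. The type name and fields here (`TemporalPipeInput`, `pipe_job_json`) are illustrative assumptions, not the real Pipelex types:

```python
from dataclasses import dataclass, field

# Hypothetical sketch of Option B's workflow input: keep the pre-resolved
# PipeJob, but also carry the bundle sources so the worker can load the
# library BEFORE deserializing the job and resolving child pipes.
@dataclass
class TemporalPipeInput:
    pipe_job_json: str  # Kajson-serialized PipeJob (opaque until library is loaded)
    mthds_contents: dict[str, str] = field(default_factory=dict)  # bundle name -> MTHDS source
    pipe_code: str = ""  # would let an Option A worker resolve the pipe by code instead

# The submitter would build this on the API side and hand it to Temporal.
inp = TemporalPipeInput(pipe_job_json="{}", mthds_contents={"main.mthds": "..."})
```

The design choice this makes concrete: the expensive part (library loading) keys off `mthds_contents`, while the cheap part (deserializing `pipe_job_json`) is deferred until that load has happened.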
### Key files to read and discuss

| What | Where |
|------|-------|
| Library loading (API-side) | `pipelex/pipeline/pipeline_run_setup.py` |
| Hub singleton + get_required_pipe | `pipelex/hub.py` |
| Workflow definition | `pipelex/temporal/tprl_pipe/wf_pipe_router.py` |
| Router (Temporal) | `pipelex/temporal/tprl_pipe/pipe_router_top.py` |
| Router (local, works fine) | `pipelex/pipe_run/pipe_router.py` |
| Worker startup | `pipelex/temporal/worker_cli.py` |
| All controllers that break | `pipelex/pipe_controllers/` (sequence, condition, batch, parallel, sub_pipe) |
| Library manager | `pipelex/libraries/library_manager.py` |

### When the user says "implement"

Only then shift to making code changes. Use the diagnose loop to verify each change:
1. Make code changes yourself
2. Restart the worker:
   ```bash
   tmux kill-session -t temporal-worker
   tmux new-session -d -c "$PWD" -s temporal-worker 'PIPELEXPATH=tests/integration/pipelex/pipes/controller/pipe_sequence .venv/bin/python -m pipelex.temporal.worker_cli --is-not-sandboxed'
   sleep 5
   ```
3. Run `make trund` via Bash and read the output
4. Capture worker output: `tmux capture-pane -t temporal-worker -p -S -200`
5. Repeat
Lines changed: 103 additions & 0 deletions
# Temporal Worker Library Problem

## The Bug

When pipelex runs pipe controllers (PipeSequence, PipeCondition, PipeBatch, PipeParallel, SubPipe) via Temporal, they fail because the library is not loaded on the worker process. The missing library causes two cascading failures.

## Root Cause

```
API Process                                 Temporal Worker
─────────────────────────────────────────   ──────────────────────────────
PipelexRunner.execute_pipeline()
├─ pipeline_run_setup()
│  ├─ Loads library (library_manager)       (empty here)
│  ├─ Generates dynamic concept classes     (classes don't exist here)
│  ├─ Registers them with Kajson            (Kajson registry incomplete here)
│  ├─ Resolves pipe by code                 (can't resolve here)
│  └─ Creates PipeJob (top-level pipe)
└─ PipeRouterTop sends PipeJob ───────────► WfPipeRouter.run(pipe_job)
   to Temporal                              ├─ Kajson deserializes PipeJob
                                            │  └─ FAILS (Layer 1): unknown class
                                            └─ pipe.run_pipe()
                                               └─ get_required_pipe() FAILS (Layer 2)
```

1. `pipeline_run_setup()` loads the library into an in-memory `library_manager` singleton — **only in the API process**.
2. During library loading, `ConceptFactory` dynamically generates Python classes for concepts defined in `.mthds` bundles (e.g., `RawText = "Raw input text..."` generates a `RawText` class inheriting from `TextContent`) and registers them with Kajson's class registry.
3. The `PipeJob` is serialized via Kajson, which embeds `__class__` / `__module__` metadata for all Pydantic objects — including these dynamically-generated concept classes.
4. On the worker, the library was never loaded → these dynamic classes don't exist → **Kajson deserialization fails** before the workflow even starts (Layer 1).
5. Even if deserialization succeeded, child pipes are referenced **by code** via `get_required_pipe()`, which queries the empty `library_manager` singleton (Layer 2).
6. Temporal can replay workflows on different workers, so any side-effect library state is lost.
## Key Code Paths

| What | Where |
|------|-------|
| Library loading | `pipelex/pipeline/pipeline_run_setup.py` → `library_manager` |
| Dynamic concept class generation | `pipelex/core/concepts/concept_factory.py` → `_handle_basic_blueprint()` |
| Structure generator (creates the classes) | `pipelex/core/concepts/structure_generation/generator.py` |
| Kajson class registration | `pipelex/pipelex.py:353` (CoreRegistryModels) + `concept_factory.py:359` (dynamic) |
| Kajson data converter (Temporal serde) | `pipelex/temporal/temporal_data_converter.py` |
| `get_required_pipe()` | `pipelex/hub.py:511` |
| Callers that break (Layer 2) | `pipelex/pipe_controllers/sequence/pipe_sequence.py`, `condition/pipe_condition.py`, `batch/pipe_batch.py`, `parallel/pipe_parallel.py`, `sub_pipe.py` |
| Workflow definition | `pipelex/temporal/tprl_pipe/wf_pipe_router.py` |
| Router (Temporal) | `pipelex/temporal/tprl_pipe/pipe_router_top.py` |
| Router (local) | `pipelex/pipe_run/pipe_router.py` |
| Worker CLI | `pipelex/temporal/worker_cli.py` |
## What the Library Contains

- **Base libraries** — shared pipe/concept definitions from `PIPELEXPATH` directories. Same for all executions.
- **Custom bundles** — per-request `mthds_contents` (MTHDS bundle strings). Each API call can bring its own definitions.

## What Library Loading Does (beyond populating pipes)

Loading a library also **generates dynamic Python classes** for concepts. When a `.mthds` file
declares a simple concept like `RawText = "Raw input text..."`, `ConceptFactory._handle_basic_blueprint()`
calls `StructureGenerator.generate_from_structure_blueprint()` to create a new Python class named
`RawText` inheriting from `TextContent`, then registers it with Kajson's class registry. These
dynamically-generated classes are used as the `content` type of `Stuff` objects in the `WorkingMemory`.

When the PipeJob is serialized via Kajson for Temporal transport, these objects carry
`__class__: "RawText"` and `__module__: "builtins"` metadata. The worker must have these classes
registered before it can deserialize the PipeJob.
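The failure mode can be reproduced in miniature with no Pipelex or Kajson code at all. This is only an analogy — the real `ConceptFactory`/Kajson APIs differ, and `registry`, `load_library`, and `decode` are stand-in names:

```python
class TextContent:
    """Stand-in for the TextContent base class."""

registry: dict[str, type] = {}  # stand-in for Kajson's class registry

def load_library() -> None:
    # What library loading does, in miniature: create the concept class at
    # runtime and register it so decoding can find it by name later.
    raw_text = type("RawText", (TextContent,), {})
    registry[raw_text.__name__] = raw_text

def decode(class_name: str) -> type:
    # What deserialization needs: resolve a class name back to a class.
    if class_name not in registry:
        # Analogous to: KajsonDecoderError: Class 'RawText' not found
        raise LookupError(f"Class '{class_name}' not found")
    return registry[class_name]

# "API process": library loaded, the name resolves.
load_library()
assert decode("RawText").__name__ == "RawText"

# "Worker process": the registry was never populated, so decoding fails.
registry.clear()
try:
    decode("RawText")
except LookupError as exc:
    print(exc)  # Class 'RawText' not found
```

The point of the analogy: registration is a side effect of running the loading code, so any process that skips that step — like the worker — cannot resolve the names embedded in the serialized PipeJob.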
## Why Tests Don't Catch It

- Integration tests use local `PipeRouter` (in-process), not Temporal. Library is shared.
- Temporal tests only test leaf workflows (text gen, jinja2) that don't call `get_required_pipe()`.
- No test sends a pipe controller through `WfPipeRouter`.

## Expected Error Patterns

The bug manifests in two layers. Layer 1 hits first and prevents Layer 2 from being reached.

### Layer 1: Kajson deserialization failure (hits first)

On the **worker** stderr:
- `KajsonDecoderError: Class '<ConceptCode>' not found in module 'builtins'`
  (e.g., `Class 'RawText' not found in module 'builtins'`)
- Wrapped as `RuntimeError: Failed decoding arguments` by Temporal's workflow instance
- The concept code (e.g., `RawText`) is a dynamically-generated Python class created by
  `ConceptFactory._handle_basic_blueprint()` during library loading — which never ran on the worker
- The submitter may **hang indefinitely** waiting for a workflow result that will never arrive

### Layer 2: Library resolution failure (would hit after Layer 1 is fixed)

On the **worker** stderr:
- Errors from `get_required_pipe()` — pipe code not found in library
- The error originates inside `run_pipe()` of a controller (PipeSequence, PipeCondition, etc.)
- The API/submitter side will see a `TemporalError` or `ActivityError` wrapping it
## Proposed Fix Direction

Loading the library on the worker fixes both layers: it generates and registers the dynamic
concept classes (fixing Layer 1 deserialization) and populates `library_manager` with pipe
definitions (fixing Layer 2 resolution).

1. Workers load base library at startup from PIPELEXPATH (Tier 1 cache) — this generates
   dynamic concept classes and registers them with Kajson, enabling deserialization.
2. `mthds_contents` travels with workflow input (not consumed on API side).
3. Library loading happens in Activities (replay-safe), not workflow code.
4. Per-request overlay cached by content hash (Tier 2 cache).
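The Tier 2 idea in point 4 can be sketched as a content-addressed cache. The function names and the dict-of-strings shape are assumptions for illustration only; in the real fix the loading would happen inside a Temporal Activity:

```python
import hashlib

_overlay_cache: dict[str, dict] = {}  # content hash -> loaded overlay (placeholder)

def content_hash(mthds_contents: dict[str, str]) -> str:
    # Order-independent digest over bundle names and sources, so the cache
    # key depends only on content, not on dict insertion order.
    digest = hashlib.sha256()
    for name in sorted(mthds_contents):
        digest.update(name.encode())
        digest.update(b"\x00")
        digest.update(mthds_contents[name].encode())
        digest.update(b"\x00")
    return digest.hexdigest()

def get_or_load_overlay(mthds_contents: dict[str, str]) -> dict:
    # Identical per-request bundles hit the cache; any content change
    # produces a new key and triggers a fresh load.
    key = content_hash(mthds_contents)
    if key not in _overlay_cache:
        # Real fix: parse the bundles and build a library overlay here
        # (inside an Activity). A placeholder stands in for that work.
        _overlay_cache[key] = {"key": key}
    return _overlay_cache[key]
```

Hashing name and source with a separator byte avoids collisions between, say, `{"a": "bc"}` and `{"ab": "c"}`, which a naive concatenation would conflate.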
