UiPath
diff --git a/‎samples/tool-calling-suspend-resume/MANUAL_TEST_GUIDE.md‎
Lines changed: 214 additions & 0 deletions b/‎samples/tool-calling-suspend-resume/MANUAL_TEST_GUIDE.md‎
Lines changed: 214 additions & 0 deletions
diff --git a/‎samples/tool-calling-suspend-resume/README.md‎
Lines changed: 41 additions & 11 deletions b/‎samples/tool-calling-suspend-resume/README.md‎
Lines changed: 41 additions & 11 deletions
@@ -0,0 +1,214 @@
+# Manual Testing Guide for Suspend/Resume
+
+This guide shows how to manually test the suspend/resume functionality using CLI commands.
+
+## Step 1: Initial Execution (Suspend Phase)
+
+Run the agent - it will suspend at the `interrupt()` call:
+
+```bash
+uv run uipath run agent-simple --input '{"query": "test manual suspend"}'
+```
+
+Expected output:
+```
+Status: SUSPENDED
+Output: {
+  'abc123...': {
+    'message': 'Waiting for external completion',
+    'query': 'test manual suspend'
+  }
+}
+```
+
+The key here is the **interrupt_id** (the long hash like `abc123...`). This is needed for resume.
+
+## Step 2: Inspect What Was Saved
+
+### Check the checkpoint:
+```bash
+uv run python -c "
+from langgraph.checkpoint.sqlite.aio import AsyncSqliteSaver
+from graph_simple import builder
+import asyncio
+
+async def check():
+    async with AsyncSqliteSaver.from_conn_string('__uipath/state.db') as saver:
+        graph = builder.compile(checkpointer=saver)
+        state = await graph.aget_state({'configurable': {'thread_id': 'default'}})
+        print('State values:', state.values)
+        print('Next tasks:', state.next)
+
+asyncio.run(check())
+"
+```
+
+### Check triggers in database:
+```bash
+sqlite3 __uipath/state.db "SELECT runtime_id, interrupt_id FROM __uipath_resume_triggers"
+```
+
+## Step 3: Resume Execution
+
+### Option A: Using CLI Resume (If Available)
+
+```bash
+# If the uipath CLI supports resume with data:
+uv run uipath resume agent-simple \
+  --thread-id default \
+  --resume-data '{"<interrupt_id>": "MY RESUME DATA"}'
+```
+
+Replace `<interrupt_id>` with the actual interrupt ID from Step 1.
+
+### Option B: Using Python Script (Recommended)
+
+Create a resume script:
+
+```bash
+cat > test_manual_resume.py << 'EOF'
+import asyncio
+from uipath.runtime import UiPathRuntimeContext, UiPathExecuteOptions
+from uipath_langchain.runtime.factory import UiPathLangGraphRuntimeFactory
+
+async def main():
+    # Prompt user for interrupt_id
+    print("Enter the interrupt_id from the suspend output:")
+    interrupt_id = input("> ").strip()
+
+    print("\nEnter the data you want to provide for resume:")
+    resume_data_value = input("> ").strip()
+
+    # Create runtime
+    ctx = UiPathRuntimeContext()
+    factory = UiPathLangGraphRuntimeFactory(ctx)
+    runtime = await factory.new_runtime(entrypoint="agent-simple", runtime_id="default")
+
+    # Resume with provided data
+    resume_input = {interrupt_id: resume_data_value}
+    options = UiPathExecuteOptions(resume=True)
+
+    print(f"\nResuming with data: {resume_input}")
+    result = await runtime.execute(input=resume_input, options=options)
+
+    print(f"\n✅ Status: {result.status}")
+    print(f"Output: {result.output}")
+
+    await factory.dispose()
+
+if __name__ == "__main__":
+    asyncio.run(main())
+EOF
+
+uv run python test_manual_resume.py
+```
+
+Example interaction:
+```
+Enter the interrupt_id from the suspend output:
+> abc123def456...
+
+Enter the data you want to provide for resume:
+> Completed by manual testing
+
+✅ Status: SUCCESSFUL
+Output: {'query': 'test manual suspend', 'result': 'Completed with resume data: Completed by manual testing'}
+```
+
+## Step 4: Verify Final State
+
+Check that the execution completed:
+
+```bash
+uv run python -c "
+from langgraph.checkpoint.sqlite.aio import AsyncSqliteSaver
+from graph_simple import builder
+import asyncio
+
+async def check():
+    async with AsyncSqliteSaver.from_conn_string('__uipath/state.db') as saver:
+        graph = builder.compile(checkpointer=saver)
+        state = await graph.aget_state({'configurable': {'thread_id': 'default'}})
+        print('Final state:', state.values)
+        print('Next tasks:', state.next)  # Should be empty
+
+asyncio.run(check())
+"
+```
+
+Expected output:
+```
+Final state: {'query': 'test manual suspend', 'result': 'Completed with resume data: Completed by manual testing'}
+Next tasks: ()
+```
+
+## Full End-to-End Test Script
+
+For convenience, here's a complete script that does both phases:
+
+```bash
+cat > test_full_cycle.py << 'EOF'
+import asyncio
+from uipath.runtime import UiPathRuntimeContext, UiPathExecuteOptions
+from uipath_langchain.runtime.factory import UiPathLangGraphRuntimeFactory
+
+async def main():
+    ctx = UiPathRuntimeContext()
+    factory = UiPathLangGraphRuntimeFactory(ctx)
+    runtime = await factory.new_runtime(entrypoint="agent-simple", runtime_id="manual_test")
+
+    print("=" * 80)
+    print("PHASE 1: Execute and Suspend")
+    print("=" * 80)
+
+    result1 = await runtime.execute(input={"query": "test full cycle"})
+    print(f"Status: {result1.status}")
+    print(f"Interrupts: {result1.output}")
+
+    if result1.status.name != "SUSPENDED":
+        print("ERROR: Expected SUSPENDED status")
+        return
+
+    interrupt_id = list(result1.output.keys())[0]
+    print(f"\n✓ Got interrupt_id: {interrupt_id[:16]}...")
+
+    print("\n" + "=" * 80)
+    print("PHASE 2: Resume")
+    print("=" * 80)
+
+    user_data = input("Enter data to provide for resume (or press Enter for default): ").strip()
+    if not user_data:
+        user_data = "Manual test completed"
+
+    resume_input = {interrupt_id: user_data}
+    options = UiPathExecuteOptions(resume=True)
+    result2 = await runtime.execute(input=resume_input, options=options)
+
+    print(f"\n✅ Status: {result2.status}")
+    print(f"Final output: {result2.output}")
+
+    await factory.dispose()
+
+if __name__ == "__main__":
+    asyncio.run(main())
+EOF
+
+uv run python test_full_cycle.py
+```
+
+## Common Issues
+
+### Issue: "No checkpoint found"
+- Make sure you're using the same `thread_id` / `runtime_id` for both suspend and resume
+- Default is `"default"` for `uipath run`
+
+### Issue: "Field required" validation error
+- This was the bug we just fixed - make sure `graph_simple.py` returns a dict, not a State object
+
+### Issue: "No triggers found in database"
+- Triggers might have been deleted by a previous failed resume attempt
+- Re-run the suspend phase (Step 1)
+
+### Issue: Empty resume data
+- Make sure you're providing the correct interrupt_id from the suspend output
+- The interrupt_id is the key in the output dict from the suspend phase
@@ -101,10 +101,14 @@ async def invoke_process_node(state: State) -> State:
 - **`test_suspend_step2.py`** - Resume step (can be run separately)
 - **`inspect_state.py`** - Utility to decode and inspect checkpoint database
 
-### Configuration
+### Configuration & Evaluations
 - **`pyproject.toml`** - Python dependencies
 - **`uipath.json`** - Agent configuration
-- **`evaluations/`** - Evaluation sets for testing suspend/resume behavior
+- **`langgraph.json`** - Graph definitions (graph and agent-simple)
+- **`evaluations/`** - Evaluation framework for validating suspend/resume
+  - `eval-sets/test_simple_no_auth.json` - Test cases for suspend/resume validation
+  - `evaluators/resume-completed-evaluator.json` - Contains evaluator checking completion text
+  - `evaluators/suspend-resume-trajectory-evaluator.json` - LLM judge for trajectory validation
 
 ## Running the Demo
 
@@ -133,24 +137,50 @@ uv run python demo_suspend_resume.py resume
 
 ## Using with UiPath Evaluation Runtime
 
-Test with the evaluation runtime to see how triggers are extracted:
+Test with the evaluation runtime to see how suspend/resume is validated:
 
 ```bash
-# Simple variant (no authentication required)
-uv run uipath eval agent-simple evaluations/eval-sets/test_simple_no_auth.json
+# IMPORTANT: Clean previous checkpoint state first!
+rm -rf __uipath/state.db
 
-# Full RPA variant (requires authentication)
-uv run uipath eval graph evaluations/eval-sets/test_suspend_resume.json
+# Run evaluation - agent will suspend
+uv run uipath eval agent-simple evaluations/eval-sets/test_simple_no_auth.json
 ```
 
+**What this tests**:
+- Agent executes and calls `interrupt()` → suspends
+- Evaluation runtime detects SUSPENDED status
+- Triggers are extracted (API resume triggers with inbox IDs)
+- Evaluators are skipped during suspend (they run after resume)
+- Status propagates to orchestrator
+- Checkpoint saved to `__uipath/state.db` (40KB)
+
 Expected output:
 ```
-🔴 DETECTED SUSPENSION → Runtime detects SUSPENDED status
-📋 Extracted 1 trigger(s) → Shows InvokeProcess trigger details
-⏭️ Skipping evaluators → Evaluators run after resume
-✅ Result: SUSPENDED with triggers
+EVAL RUNTIME: Resume mode: False
+🔴 EVAL RUNTIME: DETECTED SUSPENSION
+EVAL RUNTIME: Agent returned SUSPENDED status
+EVAL RUNTIME: Extracted 2 trigger(s) from suspended execution
+EVAL RUNTIME: Propagating SUSPENDED status from inner runtime
+✓ Basic suspend/resume with query - No evaluators
 ```
 
+**Note**: The `--resume` flag exists in the eval command, but the full resume flow (providing resume data to `interrupt()`) is handled by the orchestrator in production. For local testing of the complete suspend/resume cycle, use `demo_suspend_resume.py`.
+
+### Evaluators
+
+The sample includes two evaluators that validate suspend/resume behavior:
+
+**`ResumeCompletedEvaluator`** (ContainsEvaluator)
+- Checks that the result contains "Completed with resume data"
+- Validates the agent completed successfully after resume
+
+**`SuspendResumeTrajectoryEvaluator`** (LLM Judge)
+- Uses GPT-4 to evaluate the entire suspend/resume trajectory
+- Assesses whether the agent properly suspended and resumed
+
+These evaluators run AFTER resume completes, not during the suspend phase.
+
 ## Inspecting the State Database
 
 Want to see what's stored in the checkpoint?