A Python SDK for building AI agents that perform knowledge work: research, analysis, writing, and decision-making tasks that require iteration, verification, and structured thinking.
Why Knowledge Work is Different from Code
Code has a tight feedback loop: write code → run tests → fix errors → repeat. The solution space is constrained: there's usually one correct answer, and automated tests tell you if you found it.
Knowledge work is fundamentally different. The solution space is vast and underspecified. A "market analysis" could be a two-paragraph summary or a 50-page deep dive. A "strategy recommendation" could emphasize cost, speed, risk, innovation, or any combination. There's no test suite that returns pass/fail.
Our approach: Since knowledge work lacks natural verification, we synthesize one using rubrics. A rubric defines what "good" looks like before execution begins, enabling:
Self-verification: The agent checks its own work against explicit criteria
Transparent evaluation: Humans can audit the rubric and verification process
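The loop this enables can be sketched in plain Python. This is a hypothetical illustration, not the SDK's API: `verify` and `self_verifying_loop` are made-up names, and the keyword check is a toy stand-in for a model-based judgment.

```python
# Hypothetical sketch, not the SDK's API: a rubric is an ordered list of
# explicit criteria, and verification walks it to find unmet criteria,
# driving another revision pass.

def verify(answer: str, rubric: list[str]) -> list[str]:
    """Return the rubric criteria the answer does not yet satisfy.
    A criterion 'passes' here if its text appears in the answer --
    a toy stand-in for a model-based check."""
    return [c for c in rubric if c.lower() not in answer.lower()]

def self_verifying_loop(draft: str, rubric: list[str], max_iters: int = 3) -> str:
    answer = draft
    for _ in range(max_iters):
        failures = verify(answer, rubric)
        if not failures:
            break  # every criterion satisfied -> submit
        # Toy revision step: fold the missing material back in.
        answer += " " + " ".join(failures)
    return answer

rubric = ["cites sources", "states assumptions"]
final = self_verifying_loop("Remote work shrinks office demand.", rubric)
print(verify(final, rubric))  # []
```

The key property is that the rubric exists before execution, so both the agent and a human reviewer can check the answer against the same explicit criteria.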
This SDK implements a self-verifying agentic loop that brings structure to the inherently open-ended nature of knowledge work. The agent can search the web, read and write files, execute code, generate artifacts, and ask the user for clarification, all coordinated through an orchestrator that verifies its own output.
Why I'm Sharing This
This started as a harness for running RL training on knowledge tasks. I'm open-sourcing it because:
Knowledge workflows are underexplored. Most AI tooling focuses on code. But knowledge work (research, analysis, strategy, writing) is where most professionals spend their time. The primitives for building these systems aren't well established yet.
This could be a useful building block. If you're building products that involve AI doing research, making recommendations, or producing documents, this verification loop might save you weeks of iteration.
Models still struggle with verification. The self-check step is the weakest link. If this gets adoption, an open-source model provider could train specifically on rubric-based verification, improving the entire ecosystem.
I'd rather see these ideas spread than keep them proprietary.
from verif import RLHarness
harness = RLHarness(provider="gemini") # or "openai" or "anthropic"
result = harness.run_single("Analyze the economic impact of remote work on urban real estate.")
print(result.answer) # The analysis
print(result.rubric) # Auto-generated evaluation criteria
import asyncio
from verif import AsyncRLHarness
async def main():
    harness = AsyncRLHarness(provider="openai", enable_search=True)
    result = await harness.run_single("Analyze the economic impact of remote work on urban real estate.")
    print(result.answer)

asyncio.run(main())
Execution Modes
The SDK provides different modes optimized for different types of knowledge work:
| Mode | Best For | Rubric Strategy |
|------|----------|-----------------|
| standard | General research & analysis | Auto-created during execution |
| plan | Complex multi-step tasks | User-provided or auto-created |
| explore | Creative/divergent thinking | Quality checklist (no accuracy rubric) |
| iterate | Refining existing work | Uses existing rubric + feedback |
Supported Providers
| Provider | Config | Thinking Control |
|----------|--------|-----------------|
| Gemini | provider="gemini" | thinking_level: LOW / MEDIUM / HIGH |
| OpenAI | provider="openai" | reasoning_effort: low / medium / high |
| Anthropic | provider="anthropic" | thinking_budget: token count (default 10000) |
Standard Mode (Default)
For general tasks. The orchestrator creates the brief and rubric automatically.
from verif import RLHarness
harness = RLHarness(provider="gemini", enable_search=True)
result = harness.run_single(
    "Compare carbon tax vs cap-and-trade for reducing industrial emissions."
)
print(result.answer)
print(result.rubric) # Auto-generated
Explore Mode
For divergent thinking: generate multiple distinct perspectives. Unlike standard mode, explore doesn't optimize for a single "right" answer. It maps the solution space.
How explore differs from standard:
No accuracy rubric. Standard mode creates a rubric to verify correctness. Explore uses a quality checklist: are the takes distinct? Do they cover different assumptions?
Forces gap identification. Each take must state its assumptions and what would break it. This surfaces blind spots you wouldn't find with a single answer.
Quantity over convergence. Standard iterates toward one verified answer. Explore produces N parallel answers that may contradict each other; that's the point.
from verif import RLHarness
harness = RLHarness(provider="gemini", enable_search=True)
result = harness.run_single(
    task="""Explore database architectures for a fintech handling 10K TPS
    with strong consistency and multi-region deployment.""",
    mode="explore",
    num_takes=3,  # Generate 3 distinct approaches
)
# Result contains multiple takes separated by ===
takes = result.answer.split("===")
for i, take in enumerate(takes, 1):
    print(f"--- Approach {i} ---\n{take[:500]}...")
Each take includes:
The solution/recommendation
Assumptions: What must be true for this to work (e.g., "assumes budget for multi-region replication")
Counterfactual: What could make this fail (e.g., "breaks if latency requirements tighten to <10ms")
The output ends with set-level gaps: what's missing from the entire set? This tells you which angles weren't covered (maybe all takes assumed a single cloud provider, or none considered regulatory constraints). The gaps are often more valuable than the takes themselves.
Use explore when you're not sure what the right question is, or when the "best" answer depends on unstated constraints.
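Once you have the takes, you may want their structured parts programmatically. A minimal parser sketch follows; the "Assumptions:" and "Counterfactual:" labels are an assumption about the output format, inferred from the take structure described above, so adjust to what your runs actually emit.

```python
# Minimal parser for one explore take. The "Assumptions:" / "Counterfactual:"
# labels are assumed from the take structure described in the docs, not a
# guaranteed output format.

def parse_take(take: str) -> dict[str, str]:
    sections: dict[str, list[str]] = {"solution": [], "assumptions": [], "counterfactual": []}
    current = "solution"
    for line in take.strip().splitlines():
        if line.startswith("Assumptions:"):
            current, line = "assumptions", line[len("Assumptions:"):]
        elif line.startswith("Counterfactual:"):
            current, line = "counterfactual", line[len("Counterfactual:"):]
        sections[current].append(line.strip())
    return {k: " ".join(v).strip() for k, v in sections.items()}

take = """Use a globally replicated SQL store across three regions.
Assumptions: budget for multi-region replication.
Counterfactual: breaks if latency requirements tighten to <10ms."""
parsed = parse_take(take)
print(parsed["counterfactual"])  # breaks if latency requirements tighten to <10ms.
```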
Checkpointing & Resume
Save execution state at every step. Resume from any checkpoint, with optional feedback and rubric updates.
from verif import RLHarness
harness = RLHarness(provider="gemini", enable_search=True)
# Run with checkpointing
result = harness.run_single(
    "Analyze the power dynamics among Olympian gods.",
    checkpoint=True,
)
# List checkpoints
for snap_id, snap in harness.snapshots.items():
    print(f"{snap_id} (step {snap.step})")
# Resume from any checkpoint with new direction
resumed = harness.resume(
    checkpoint_id="<snap_id>",
    feedback="Focus more on the Trojan War.",
    rubric_update="Must include analysis of divine intervention in the Iliad.",
)
from verif import RLHarness, Attachment, Prompt
# Create attachment with preview
attachment = Attachment(
    content="/path/to/data.csv",
    mime_type="text/csv",
    name="data.csv",
    preview="col1,col2\n1,2\n3,4...",  # First N lines
)
# Build multimodal prompt
prompt: Prompt = [
    "Analyze the attached sales data and create a summary.",
    attachment,
]
result = harness.run_single(prompt)
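The `preview` string can be produced with a small helper. This is not part of the SDK, just a sketch of one way to take the first few lines of a text file and mark truncation:

```python
# Helper sketch (not part of the SDK) for building the Attachment preview
# string from the first few lines of a text file.
from itertools import islice

def make_preview(path: str, max_lines: int = 3) -> str:
    """First max_lines lines of the file, with '...' appended if it continues."""
    with open(path, "r", encoding="utf-8") as f:
        lines = list(islice(f, max_lines + 1))  # read one extra line to detect truncation
    truncated = len(lines) > max_lines
    preview = "".join(lines[:max_lines]).rstrip("\n")
    return preview + "..." if truncated else preview

# Demo on a throwaway CSV:
import os, tempfile
with tempfile.NamedTemporaryFile("w", suffix=".csv", delete=False) as tmp:
    tmp.write("col1,col2\n1,2\n3,4\n5,6\n")
print(make_preview(tmp.name, max_lines=3))
os.unlink(tmp.name)
```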
from verif import RLHarness, ProviderConfig, CompactionConfig
from verif.executor import SubprocessExecutor
harness = RLHarness(
    # Provider: "gemini" | "openai" | "anthropic" | ProviderConfig
    provider=ProviderConfig(
        name="gemini",
        thinking_level="MEDIUM",  # Gemini: LOW | MEDIUM | HIGH
        # Optional google-genai HttpOptions pass-through:
        # gemini_async_client_args={"ssl": True, "cookies": {}},
        # gemini_http_options={"async_client_args": {"ssl": True}},
        # OR for OpenAI:
        # name="openai",
        # reasoning_effort="medium",  # low | medium | high
        # OR for Anthropic:
        # name="anthropic",
        # thinking_budget=10000,  # token budget for extended thinking
    ),
    # Tool Capabilities
    enable_search=True,     # Web search tool
    enable_bash=False,      # File system navigation
    enable_code=False,      # Python code execution
    enable_ask_user=False,  # User clarification tool
    # Code Execution (required if enable_code=True)
    code_executor=SubprocessExecutor("./artifacts"),
    artifacts_dir="./artifacts",
    # Execution Limits
    max_iterations=30,
    # Mode Selection
    default_mode="standard",  # "standard" | "plan" | "explore"
    # Pre-set Rubric (optional)
    rubric="1. Must be accurate\n2. Must cite sources",
    # Event Streaming
    on_event=lambda e: print(f"[{e.entry_type}] {e.content[:100]}"),
    stream=True,
    stream_subagents=True,
    # Context Compaction (for long tasks)
    compaction_config=CompactionConfig(
        enabled=True,
        threshold=0.8,  # Trigger at 80% context capacity
        keep_recent_turns=3,
    ),
)
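The `on_event` callback above just prints everything. A sketch of a more structured handler follows; only the `entry_type` and `content` fields come from the docs above, while the `Event` stub and the type names `"answer"` and `"verification"` are illustrative assumptions:

```python
# Sketch of a structured on_event handler: count everything, print only the
# event types you care about. The Event stub and the type names "answer" /
# "verification" are assumptions for illustration.
from dataclasses import dataclass

@dataclass
class Event:  # stand-in for the SDK's event object
    entry_type: str
    content: str

def make_handler(verbose: bool = False):
    seen: dict[str, int] = {}
    def on_event(e) -> None:
        seen[e.entry_type] = seen.get(e.entry_type, 0) + 1
        if verbose or e.entry_type in ("answer", "verification"):
            print(f"[{e.entry_type}] {e.content[:100]}")
    on_event.seen = seen  # expose counts for post-run inspection
    return on_event

handler = make_handler()
handler(Event("search", "querying the web..."))
handler(Event("verification", "rubric item 2 not satisfied"))
print(handler.seen)  # {'search': 1, 'verification': 1}
```

A handler like this would be passed as `on_event=handler` in the configuration above.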
Result Objects
RunResult
result = harness.run_single(task)
result.task # Original task text
result.answer # Final submitted answer
result.rubric # Evaluation rubric used
result.history # List[HistoryEntry] - full execution trace
result.mode # Mode used: "standard" | "plan" | "explore"
result.plan # Plan (if plan mode)
result.brief # Brief (if available)
Execution Trace
# Get formatted history
print(harness.get_history_markdown())
print(harness.get_history_text())
# Access raw entries
for entry in result.history:
    print(f"[{entry.timestamp}] {entry.entry_type}: {entry.content[:100]}")
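For a quick post-run summary, the trace can be bucketed by entry type. The `SimpleNamespace` objects here are stubs standing in for real `HistoryEntry` objects:

```python
# Quick post-run summary: bucket history entries by type to see where the
# iterations went. SimpleNamespace stands in for real HistoryEntry objects.
from collections import Counter
from types import SimpleNamespace

history = [
    SimpleNamespace(entry_type="search", content="..."),
    SimpleNamespace(entry_type="search", content="..."),
    SimpleNamespace(entry_type="verification", content="..."),
]
counts = Counter(entry.entry_type for entry in history)
print(counts.most_common())  # [('search', 2), ('verification', 1)]
```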
Done
[x] State checkpointing & resume: Save at every step, fork from any checkpoint with feedback + rubric updates. See Checkpointing & Resume.
[x] Anthropic provider: Claude with streaming, extended thinking, native web search.
[x] Context compaction: Summarize the middle of the context when approaching token limits.
[x] Explore mode: Generate N distinct approaches with assumptions, counterfactuals, and set-level gaps.
[x] Iterate mode: Stateless refinement with feedback classification (rubric vs answer level).
[x] Custom modes: Register new execution modes at runtime.
[x] Remote executor: Delegate code execution to frontend/browser via SSE.
[x] ask_user tool: Orchestrator can request clarification; verification blocks until answered.
In Progress
[ ] Anthropic checkpointing: Checkpointing works for Gemini and OpenAI; Anthropic is not fully tested. The complexity is interleaved thinking blocks (thinking + signature pairs) that need to survive deep copy and context replay correctly.
[ ] Compaction for Anthropic: The SDK does its own compaction (summarize the middle, keep recent turns) rather than using server-side context caching. Not stress-tested with Anthropic's 200K window.
Planned
[ ] Computer use subagent: Attach a computer-use capable subagent for GUI interaction (filling forms, navigating apps, extracting data from web interfaces).
[ ] Multi-app workflows: Work across browsers, spreadsheets, and documents in a single run.
[ ] Parallel verification: Run multiple verification passes and take consensus, reducing single-verifier bias.
[ ] Rubric quality scoring: Meta-evaluation that scores the rubric itself before using it for verification; catches "always-pass" rubrics early.
[ ] Structured output from runs: Return typed sections (executive summary, recommendations, evidence) instead of a single answer string.
[ ] Eval framework: Systematic comparison across providers/modes/rubric strategies on a benchmark task set. run_eval exists but needs scoring and reporting.
[ ] Token usage tracking: Surface per-run token counts by phase (brief, rubric, execution, verification) for cost analysis.
[ ] Mixed-model orchestration: Use different models for the orchestrator vs subagents (e.g., Opus for orchestration, Flash for search subagents). Currently the same provider handles both; I kept it this way because RL training benefits from a single policy, but for production use, routing cheap tasks to smaller models would yield significant cost savings.
For RL training, leave subagent outputs, search results, and code execution out of the training signal, even if they're generated by the same policy. The goal is to improve the orchestration and verification layers. Everything else is downstream: if the orchestrator gets better at decomposition and the rubric gets better at capturing intent, the subagents benefit automatically.
Verification is the bottleneck. Most training gains come from improving the verify step. A model that can accurately assess its own work against a rubric is more valuable than one that generates slightly better first drafts.
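The masking rule above can be sketched as a per-entry mask over the execution trace. The `entry_type` values here are illustrative, not the SDK's actual ones:

```python
# Sketch of the masking rule: keep orchestration and verification entries in
# the training signal, mask downstream tool output. The entry_type values are
# illustrative, not the SDK's actual ones.

MASKED_TYPES = {"subagent_output", "search_result", "code_execution"}

def training_mask(history: list[dict]) -> list[int]:
    """1 = entry contributes to the loss, 0 = masked out."""
    return [0 if e["entry_type"] in MASKED_TYPES else 1 for e in history]

history = [
    {"entry_type": "plan"},
    {"entry_type": "search_result"},
    {"entry_type": "subagent_output"},
    {"entry_type": "verification"},
]
print(training_mask(history))  # [1, 0, 0, 1]
```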
Limitations
Verification is only as good as the model. The rubric is generated by the same model that does the work. If the model has blind spots, the rubric will too. This is a fundamental constraint of self-verification.
External grounding happens at brief level, not verification. If you need external validation (e.g., checking facts against a database), you can provide your own rubric. But be careful: the verifier is intentionally limitedβit doesn't have access to search or filesystem. The design assumes grounding happens during task execution (via the brief and subagents), not during verification. The verifier checks internal consistency against the rubric, not external correctness.
Rubrics can be gamed. A sufficiently clever model could write a rubric that's easy to pass. This is why human review of rubrics matters for high-stakes tasks.
Context compaction requires a Gemini API key. Compaction (summarizing mid-context to stay under token limits) uses gemini-3-flash-preview regardless of your chosen provider. If you enable compaction with OpenAI or Anthropic as the orchestrator, you'll still need a GEMINI_API_KEY. Free keys are available from Google AI Studio.
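Since compaction always calls Gemini, a preflight check in your own code can fail fast instead of erroring mid-run. This is a minimal sketch; `GEMINI_API_KEY` is the environment variable named above:

```python
# Preflight sketch: compaction always calls Gemini, so check for the key even
# when OpenAI or Anthropic is the orchestrator.
import os

def compaction_ready() -> bool:
    return bool(os.environ.get("GEMINI_API_KEY"))

if not compaction_ready():
    print("Set GEMINI_API_KEY before enabling compaction_config.")
```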