GitHub - Hemachandra9899/Scout: A recursive AI research operating system. Not a chatbot. A research agent that thinks before it speaks.

███████╗ ██████╗ ██████╗ ██╗   ██╗████████╗
██╔════╝██╔════╝██╔═══██╗██║   ██║╚══██╔══╝
███████╗██║     ██║   ██║██║   ██║   ██║   
╚════██║██║     ██║   ██║██║   ██║   ██║   
███████║╚██████╗╚██████╔╝╚██████╔╝   ██║   
╚══════╝ ╚═════╝ ╚═════╝  ╚═════╝    ╚═╝

A recursive AI research operating system.

Not a chatbot. A research engine that plans, searches, crawls, verifies, remembers, and answers with evidence.

The Problem with Chatbots

Most AI assistants follow a simple loop:

You ask → LLM answers

Fast. But fragile. Sounds confident. Hides uncertainty. Rarely shows where the answer came from.

Scout is different. It runs a full research loop before it answers.

How Scout Thinks

You ask a question
        │
        ▼
  Intent + Query Planning
        │
        ▼
  Multi-query Source Discovery
        │
        ▼
  Memory-aware Source Ranking
        │
        ▼
  Bounded Web Crawl · Scrapling
        │
        ▼
  Clean Markdown Extraction
        │
        ▼
  Vector Ingestion + Chunking
        │
        ▼
  Claim-level Evidence Extraction
        │
        ▼
  Citation Verification
        │
        ▼
  Durable Memory Writes
        │
        ▼
  Evidence-based Answer Synthesis

Scout does not just generate. It researches, indexes, verifies, remembers — and then answers.

Research Engine v2

Scout's Research Engine v2 adds a deterministic research backbone.

Answers should be backed by evidence, not vibes.

Stage	Module	Purpose
Search Planning	`SearchPlannerAgent`	Understand query, infer intent, generate subqueries
Source Planning	`planResources()`	Combine registry sources and web search candidates
Source Ranking	`rankResourceCandidates()`	Rank official, trusted, community, and reference sources
Memory-aware Ranking	`memory-ranking.ts`	Boost useful sources, penalize failed sources
Deep Crawl	`crawl-manager.ts` + Scrapling	Crawl bounded same-domain pages, convert to Markdown
Evidence Extraction	`evidence-extractor.ts`	Convert Markdown into claim-level evidence
Citation Verification	`citation-verifier.ts`	Mark claims as `supported`, `weak`, or `unsupported`
Evidence Pack	`evidence-pack.ts`	Package evidence, citations, coverage, and gaps
Memory	`MemoryManager` + `MemoryAgent`	Store useful sources, failed crawls, and durable facts
Answer Synthesis	`answer-synthesizer.ts`	Build grounded Markdown answers from verified evidence
Answer Modes	`answer-mode.ts`, `answer-renderers.ts`	Format as comparison, how-to, summary, or general

Answer Modes

Scout formats answers based on the user's intent.

Mode	Trigger Examples	Output
`comparison`	"compare A and B", "A vs B", "differences"	Comparison table + key takeaways + evidence notes
`how_to`	"how to", "fix", "debug", "implement", "setup"	Steps + implementation notes + verification checklist
`research_summary`	"overview", "summarize", "what is", "deep dive"	Main points + evidence notes + sources
`general`	Fallback	Grounded answer with numbered citations

Example response shape:

{
  "answer": {
    "mode": "comparison",
    "status": "answered",
    "markdown": "...",
    "citations": [],
    "usedEvidenceCount": 10,
    "supportedEvidenceCount": 8,
    "weakEvidenceCount": 2,
    "omittedUnsupportedCount": 4,
    "confidence": 0.91
  }
}

Unsupported claims are never used in final answer synthesis.

Memory System

Scout uses add-only memory. It does not overwrite past memories. It writes new entries with scope, kind, source URLs, entities, metadata, and confidence.

Memory kinds:

preference      fact      durable_fact
source_quality  source_failure
decision        task_trace

Before research — Scout retrieves relevant memories for source ranking:

source_quality  → boost useful sources
source_failure  → penalize repeatedly failing URLs/domains
durable_fact    → lightly boost related sources/entities

After research — Scout writes new memories from the run:

supported evidence → durable_fact
useful sources     → source_quality
failed crawls      → source_failure

This lets Scout improve across runs without hiding the evidence trail.

Architecture

Layer	Technology	Responsibility
Frontend UI	Next.js, Tailwind CSS	Research chat, job state, answer display, source drawers
Central API	Fastify, TypeScript, Prisma	Projects, tool routes, jobs, documents, orchestration
Worker	Node.js, BullMQ	Background research execution
Runtime	Pyodide / sandboxed execution	Safe dynamic reasoning and tool calls
Model Service	FastAPI, Scrapling, Playwright	Crawling, scraping, Markdown extraction, model utilities
Vector Store	Qdrant	Semantic retrieval over project documents
Database	Postgres / Supabase	Projects, jobs, documents, chunks, memories, reports
Queue / Cache	Redis	Async jobs and status tracking
Knowledge Package	TypeScript	Research pipeline, evidence, memory, answer synthesis

Project Structure

scout/
├── apps/
│   ├── api/                    # Fastify API
│   ├── model-service/          # FastAPI + Scrapling service
│   ├── rlm-runtime/            # Runtime / tool execution layer
│   ├── web/                    # Frontend UI
│   └── worker/                 # Background worker
│
├── packages/
│   ├── knowledge/              # Research engine, agents, memory, evidence
│   ├── retrieval/              # Vector retrieval abstractions
│   ├── database/               # Prisma client / DB utilities
│   ├── clients/                # Shared service clients
│   └── queue/                  # Queue helpers
│
├── prisma/
│   └── schema.prisma
│
├── scripts/
│   └── dev-patches/
│
├── docker-compose.yml
├── run.sh
└── README.md

Key Packages

`packages/knowledge/src/research/`

Core research pipeline.

answer-mode.ts          answer-renderers.ts     answer-synthesizer.ts
citation-verifier.ts    crawl-manager.ts        evidence-extractor.ts
evidence-pack.ts        memory-ranking.ts       query-builder.ts
resource-planner.ts     search-provider.ts      source-ranker.ts
source-types.ts         research-orchestrator.ts

`packages/knowledge/src/agents/`

Small deterministic agents.

search-planner.agent.ts     memory-agent.ts     types.ts

`packages/knowledge/src/memory/`

Add-only memory layer.

memory-manager.ts     memory-types.ts

Getting Started

Prerequisites

Docker + Docker Compose v2+
Node.js
npm

Start the stack

docker compose build
docker compose up

Or use the helper script:

chmod +x ./run.sh
./run.sh

Generate Prisma client

npm run prisma:generate

Web Research Smoke Test

Create or use a valid projectId, then run:

curl -X POST http://localhost:8000/tools/web-research \
  -H "Content-Type: application/json" \
  -d '{
    "projectId":"<PROJECT_ID>",
    "query":"Compare Meta Marketing API and Google Ads API permissions and rate limits",
    "maxResults":5,
    "maxPagesPerSource":3,
    "maxTotalPages":12,
    "maxDepth":1,
    "useOrchestrator":true
  }'

Expected response fields:

subqueries                  resourcesPlanned
documents                   failedCrawls
evidencePack                evidencePack.evidence
evidencePack.citationVerification
memories.retrieved          memories.usedForRanking
memories.written            answer.mode
answer.markdown             answer.citations
answer.confidence

Example Output Shape

{
  "status": "ok",
  "query": "Compare Meta Marketing API and Google Ads API permissions and rate limits",
  "subqueries": [
    {
      "query": "Meta Marketing API permissions",
      "reason": "Find source-specific permission details",
      "priority": 1
    }
  ],
  "resourcesPlanned": [
    {
      "title": "Meta Marketing API Documentation",
      "url": "https://developers.facebook.com/docs/marketing-apis/",
      "tier": "official_docs",
      "score": 128,
      "matchedBy": ["registry", "memory:source_quality:+16"]
    }
  ],
  "evidencePack": {
    "coverage": {
      "hasEvidence": true,
      "claimCount": 14,
      "supportedClaimCount": 11,
      "weakClaimCount": 3,
      "unsupportedClaimCount": 2
    }
  },
  "answer": {
    "mode": "comparison",
    "status": "answered",
    "confidence": 0.91,
    "markdown": "## Answer\n\n...",
    "citations": [
      {
        "id": 1,
        "title": "Meta Marketing API Documentation",
        "url": "https://developers.facebook.com/docs/marketing-apis/",
        "tier": "official_docs",
        "usedClaims": 4
      }
    ]
  }
}

Development Principles

Keep route handlers thin.
Keep modules small and single-purpose.
Prefer deterministic stages before LLM polish.
Do not let final answers introduce unsupported facts.
Apply DRY: shared rendering logic belongs in renderer utilities.
Avoid deeply nested functions.
Avoid large swarms until the research pipeline is stable.
Memory should change future behavior, not just store logs.
Every answer should be traceable back to evidence.

Currently not adopted:

large agent swarms           consensus algorithms
GraphAgent                   unconstrained LLM answer polish
private workspace connectors

Multi-provider search

Scout can search through multiple providers when API keys are configured:

FIRECRAWL_API_KEY
TAVILY_API_KEY
GITHUB_TOKEN

Provider behavior:

Provider	Used for
Firecrawl	Existing general web search fallback
Tavily	Main web search provider
GitHub	Repository discovery for SDKs, clients, examples, and implementation references

Brave Search is intentionally not used for now.

Search providers are optional. Scout uses whatever is configured and deduplicates URLs across providers before ranking.

Roadmap

Now

Add tests for evidence extraction
Add tests for citation verification
Add tests for memory-aware ranking
Add tests for answer mode detection
Add tests for answer renderers
Add UI support for answer.markdown
Add source drawer for answer.citations

Later

Add SKILL.md-style lightweight prompt agents
Add hierarchical CoordinatorAgent
Add private connectors for GitHub, Notion, Slack, Google Drive
Add entity-claim graph visualization
Add streaming traces for each research stage

Project Status

Scout is in active development.

The current branch focuses on Research Engine v2:

planning            source ranking          memory-aware retrieval
Scrapling crawl     evidence extraction     citation verification
answer synthesis    answer modes

The deterministic research pipeline is the foundation. Graph agents, swarms, and LLM polish come later.

Built to think deeper. Research further. Answer with evidence.

Provider smoke tests

Provider smoke tests call real external APIs and are skipped by default.

Run Tavily only:

RUN_PROVIDER_SMOKE=1 TAVILY_API_KEY=... npm run test:providers

Run GitHub only:

RUN_PROVIDER_SMOKE=1 GITHUB_TOKEN=... npm run test:providers

Run Firecrawl + Tavily:

RUN_PROVIDER_SMOKE=1 FIRECRAWL_API_KEY=... TAVILY_API_KEY=... npm run test:providers

Brave is intentionally not used.

Frontend research debug panels

The web app can render the research-response-v1 contract from completed jobs.

Debug tabs:

Summary
Sources
Crawl
Evidence
Grounding
Raw

The UI prefers:

ui.answerMarkdown
ui.citations
ui.evidenceCoverage
ui.crawlTrace
ui.groundingStatus

when a contract is available, while preserving legacy report rendering as fallback.

Benchmark query suite

Scout includes a fixed research benchmark suite.

Run after Docker is up:

API_BASE_URL=http://localhost:8000 \
BENCHMARK_PROJECT_ID=test-project \
npm run benchmark:research

Quick smoke:

BENCHMARK_MAX_QUERIES=3 npm run benchmark:research

Outputs are written to:

benchmark-runs/<timestamp>/

The runner validates:

contractVersion
grounding status
citation count
accepted crawl pages
filtered evidence count

Name		Name	Last commit message	Last commit date
Latest commit History 58 Commits
.github/workflows		.github/workflows
apps		apps
benchmarks		benchmarks
docs		docs
packages		packages
prisma		prisma
scripts/dev-patches		scripts/dev-patches
tasks		tasks
.dockerignore		.dockerignore
.env.example		.env.example
.gitignore		.gitignore
.qdrant-initialized		.qdrant-initialized
LESSONS.md		LESSONS.md
README.md		README.md
SCOUT_RESEARCH_ENGINE_V2_IMPLEMENTATION_PLAN.md		SCOUT_RESEARCH_ENGINE_V2_IMPLEMENTATION_PLAN.md
TODO.md		TODO.md
apply_scout_research_engine_v2_step22_frontend_debug_panels.py		apply_scout_research_engine_v2_step22_frontend_debug_panels.py
apply_scout_research_engine_v2_step23_benchmark_query_suite.py		apply_scout_research_engine_v2_step23_benchmark_query_suite.py
docker-compose.yml		docker-compose.yml
package-lock.json		package-lock.json
package.json		package.json
run.sh		run.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

The Problem with Chatbots

How Scout Thinks

Research Engine v2

Answer Modes

Memory System

Architecture

Project Structure

Key Packages

`packages/knowledge/src/research/`

`packages/knowledge/src/agents/`

`packages/knowledge/src/memory/`

Getting Started

Prerequisites

Start the stack

Generate Prisma client

Web Research Smoke Test

Example Output Shape

Development Principles

Multi-provider search

Roadmap

Now

Next

Later

Project Status

Provider smoke tests

Frontend research debug panels

Benchmark query suite

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

The Problem with Chatbots

How Scout Thinks

Research Engine v2

Answer Modes

Memory System

Architecture

Project Structure

Key Packages

packages/knowledge/src/research/

packages/knowledge/src/agents/

packages/knowledge/src/memory/

Getting Started

Prerequisites

Start the stack

Generate Prisma client

Web Research Smoke Test

Example Output Shape

Development Principles

Multi-provider search

Roadmap

Now

Next

Later

Project Status

Provider smoke tests

Frontend research debug panels

Benchmark query suite

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

`packages/knowledge/src/research/`

`packages/knowledge/src/agents/`

`packages/knowledge/src/memory/`

Packages