Architecture overview

Motivation

A naive “RAG over docs” service is a thin shell around a vector store: embed the query, run a top-k similarity search, and stuff the chunks into a prompt. That design has three structural failures at enterprise scale:

No memory of decisions. It cannot represent “we chose X over Y” — every query re-derives the world from raw text.

No trust gradient. A blog post and a ratified standard are weighted identically.

No safe automation. Any machine-generated content immediately becomes indistinguishable from human-authored truth.

AskMyDocs is architected to fix all three: a typed canonical layer carries decisions and rejections, a reranker firewall enforces a trust gradient, and a self-compiling tier is quarantined behind that firewall.

The chat request lifecycle

A grounded answer is the product of a fixed pipeline. The contract: never answer from parametric knowledge alone — every claim is grounded in retrieved, cited context, or the system refuses.

Three properties are load-bearing here:

3× over-retrieval then fusion. The reranker sees a wider candidate set than the final k, so keyword-strong-but-vector-weak matches survive.

Graph expansion and rejected injection degrade to no-ops. A tenant with zero canonical docs gets identical behaviour to plain hybrid RAG.

Logging never breaks the user path. ChatLogManager::log() is wrapped in try/catch by design.
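
The first two properties can be illustrated with a small, language-neutral sketch. It is not the project's implementation: the function names, the weighted-sum fusion, and the 0.6 / 0.4 weights are assumptions chosen only to show why a wider candidate set changes the final top k, and why an expansion step that finds nothing leaves the result unchanged.

```python
from typing import Callable, Dict, List, Optional

OVER_RETRIEVAL_FACTOR = 3  # the "3x" described above

Scores = Dict[str, float]


def top_n(scores: Scores, n: int) -> Scores:
    """Keep the n highest-scoring chunk ids from one retriever."""
    best = sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))[:n]
    return dict(best)


def fuse(vector_hits: Scores, keyword_hits: Scores,
         vector_weight: float = 0.6, keyword_weight: float = 0.4) -> Scores:
    """Hypothetical fusion: weighted sum, with 0.0 for a chunk one retriever missed."""
    ids = set(vector_hits) | set(keyword_hits)
    return {
        cid: vector_weight * vector_hits.get(cid, 0.0)
        + keyword_weight * keyword_hits.get(cid, 0.0)
        for cid in ids
    }


def retrieve(vector_scores: Scores, keyword_scores: Scores, k: int,
             factor: int = OVER_RETRIEVAL_FACTOR,
             expand: Optional[Callable[[Scores], Scores]] = None) -> List[str]:
    """Over-retrieve factor*k candidates per retriever, fuse, then cut to k."""
    wide = k * factor
    candidates = fuse(top_n(vector_scores, wide), top_n(keyword_scores, wide))
    if expand is not None:
        # An expansion step with nothing to add returns its input: a no-op.
        candidates = expand(candidates)
    ranked = sorted(candidates.items(), key=lambda kv: (-kv[1], kv[0]))
    return [cid for cid, _ in ranked[:k]]


vector = {"a": 0.9, "b": 0.8, "c": 0.7, "d": 0.5}  # "d" is vector-weak
keyword = {"d": 1.0, "a": 0.2}                      # but keyword-strong

print(retrieve(vector, keyword, k=2))            # ['d', 'a']
print(retrieve(vector, keyword, k=2, factor=1))  # ['a', 'b']
```

With `factor=1` the vector retriever never hands chunk `d` to the fusion step, so its keyword score alone is not enough to reach the top two. With the wider set, both of its scores are visible and it ranks first.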

Load-bearing decisions (and why)

No AI SDKs for OpenAI / Anthropic / Gemini / OpenRouter — raw Http::

Provider transport is the raw HTTP client, not vendor SDKs. This is intentional: full control over auth, retries, timeouts, and response parsing, plus trivial testability via Http::fake(). Regolo is the documented exception — it is wired through the in-house padosoft/laravel-ai-regolo SDK adapter on laravel/ai, which ships its own test surface and observability hooks.
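
The testability argument can be sketched outside Laravel as well. The example below is a Python analogy, not the project's PHP code: `ChatClient`, the `/chat` path, and the `reply` response field are hypothetical and do not correspond to any real provider API. It only shows the shape of the idea, which is that when the client owns auth headers, timeouts, retries, and parsing, swapping the transport for a fake is a one-line change.

```python
import json
import urllib.request
from typing import Callable, Dict

# A transport takes (url, headers, body, timeout_seconds) and returns raw bytes.
Transport = Callable[[str, Dict[str, str], bytes, float], bytes]


def urllib_transport(url: str, headers: Dict[str, str], body: bytes,
                     timeout: float) -> bytes:
    """Real transport: a plain HTTP POST using only the standard library."""
    request = urllib.request.Request(url, data=body, headers=headers, method="POST")
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return response.read()


class ChatClient:
    """Hypothetical provider client that owns auth, retries, timeouts, parsing."""

    def __init__(self, base_url: str, api_key: str,
                 transport: Transport = urllib_transport,
                 timeout: float = 30.0, max_attempts: int = 3) -> None:
        self.base_url = base_url
        self.api_key = api_key
        self.transport = transport
        self.timeout = timeout
        self.max_attempts = max_attempts

    def complete(self, prompt: str) -> str:
        body = json.dumps({"prompt": prompt}).encode("utf-8")
        headers = {
            "Authorization": f"Bearer {self.api_key}",
            "Content-Type": "application/json",
        }
        last_error = None
        for _ in range(self.max_attempts):
            try:
                raw = self.transport(self.base_url + "/chat", headers, body, self.timeout)
                return json.loads(raw)["reply"]  # assumed response shape
            except OSError as exc:  # URLError and timeouts are OSError subclasses
                last_error = exc
        raise RuntimeError("provider unreachable") from last_error


def fake_transport(url, headers, body, timeout):
    """Test double: no network, fixed response."""
    return json.dumps({"reply": "stubbed answer"}).encode("utf-8")


client = ChatClient("https://provider.invalid", "test-key", transport=fake_transport)
print(client.complete("hello"))  # stubbed answer
```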

Two ingestion entry points, one execution path

CLI and HTTP both fan into IngestDocumentJob → DocumentIngestor. No third path may bypass this. It guarantees idempotency, canonical handling, and graph indexing happen identically regardless of how a document arrives.
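
A minimal sketch of the fan-in, in Python for illustration only. The names mirror the roles described above, but the bodies are assumptions: in particular, keying idempotency on a SHA-256 of path plus content is a stand-in, since the actual mechanism is not described here.

```python
import hashlib
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class DocumentIngestor:
    """Stand-in for the single execution path."""
    documents: Dict[str, str] = field(default_factory=dict)
    graph_indexed: List[str] = field(default_factory=list)

    def ingest(self, path: str, content: str) -> bool:
        # Assumed idempotency key: hash of path and content.
        key = hashlib.sha256(f"{path}\n{content}".encode("utf-8")).hexdigest()
        if key in self.documents:
            return False  # already ingested, nothing to do
        self.documents[key] = content
        self.graph_indexed.append(path)  # graph indexing happens on every path
        return True


def ingest_document_job(ingestor: DocumentIngestor, path: str, content: str) -> bool:
    """The only function that is allowed to call ingestor.ingest()."""
    return ingestor.ingest(path, content)


def cli_entry(ingestor: DocumentIngestor, path: str, content: str) -> bool:
    """Entry point 1: command line."""
    return ingest_document_job(ingestor, path, content)


def http_entry(ingestor: DocumentIngestor, request: Dict[str, str]) -> bool:
    """Entry point 2: HTTP request body, already decoded."""
    return ingest_document_job(ingestor, request["path"], request["content"])


ingestor = DocumentIngestor()
print(cli_entry(ingestor, "kb/adr-0003.md", "# Promotion"))   # True
print(http_entry(ingestor, {"path": "kb/adr-0003.md",
                            "content": "# Promotion"}))       # False
```

Because both entry points delegate to the same job, the second arrival of the same document is recognised as a duplicate regardless of which door it came through.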

Canonical markdown is the source of truth; the DB is a projection

The canonical kb/ folders in consumer repos are authoritative; the knowledge_documents + kb_nodes + kb_edges rows are rebuildable from Git at any moment via kb:rebuild-graph + re-ingest. No feature may require DB-only state that cannot be reconstructed from the markdown. The one exception is kb_canonical_audit — an immutable forensic trail that survives hard deletes.
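
The projection rule can be pictured with a toy model, again in Python and purely illustrative. `Store`, `rebuild`, and `hard_delete` are hypothetical names, and the real tables carry far more structure than a dictionary. The point is only the invariant: projection state is a pure function of the canonical files, while the audit trail is append-only and is never touched by a rebuild.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class Store:
    # Projection: can be dropped and rebuilt from the canonical files.
    documents: Dict[str, str] = field(default_factory=dict)
    # Audit trail: append-only, deliberately outside the rebuild.
    audit: List[str] = field(default_factory=list)


def rebuild(store: Store, canonical_files: Dict[str, str]) -> None:
    """Recreate the projection from the source of truth."""
    store.documents.clear()
    for path in sorted(canonical_files):
        store.documents[path] = canonical_files[path]


def hard_delete(store: Store, canonical_files: Dict[str, str], path: str) -> None:
    """Remove a document everywhere, but leave a forensic record."""
    canonical_files.pop(path, None)
    store.documents.pop(path, None)
    store.audit.append(f"deleted:{path}")


files = {"kb/a.md": "A", "kb/b.md": "B"}
store = Store()
rebuild(store, files)
hard_delete(store, files, "kb/b.md")

store.documents.clear()   # simulate losing the projection
rebuild(store, files)
print(store.documents)    # {'kb/a.md': 'A'}
print(store.audit)        # ['deleted:kb/b.md']
```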

Promotion is always human-gated (ADR 0003)

Claude skills and the suggest / candidates endpoints produce drafts; only humans (git push → GitHub Action) and operators (kb:promote) can commit to canonical storage. There is no “automatic promotion” shortcut.
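
As a rough sketch of what such a gate looks like in code (Python, illustrative only): the `Actor` values, `Draft`, and `promote` below are hypothetical names, not the project's API. The sketch assumes the gate is expressed as an allow-list of actors, so that a machine caller has no code path to canonical state.

```python
from dataclasses import dataclass, replace
from enum import Enum


class Actor(Enum):
    CLAUDE_SKILL = "claude_skill"          # machine: may only draft
    SUGGEST_ENDPOINT = "suggest_endpoint"  # machine: may only draft
    HUMAN_GIT_PUSH = "human_git_push"      # human: git push -> GitHub Action
    OPERATOR_CLI = "operator_cli"          # human: kb:promote


# Assumed allow-list: only these actors may promote.
PROMOTERS = frozenset({Actor.HUMAN_GIT_PUSH, Actor.OPERATOR_CLI})


class PromotionDenied(Exception):
    """Raised when a non-human actor attempts to promote a draft."""


@dataclass(frozen=True)
class Draft:
    slug: str
    body: str
    canonical: bool = False


def promote(draft: Draft, actor: Actor) -> Draft:
    """Return a canonical copy of the draft, or refuse."""
    if actor not in PROMOTERS:
        raise PromotionDenied(f"{actor.value} may not promote {draft.slug}")
    return replace(draft, canonical=True)


draft = Draft(slug="use-raw-http", body="We chose raw HTTP over vendor SDKs.")
print(promote(draft, Actor.OPERATOR_CLI).canonical)  # True

try:
    promote(draft, Actor.CLAUDE_SKILL)
except PromotionDenied as exc:
    print(exc)  # claude_skill may not promote use-raw-http
```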

For the full editorial record, see the architecture decisions narrative.

Where to go deeper

Retrieval pipeline

Hybrid search, reranker fusion weights, and the boost/penalty knobs.

Canonical graph

kb_nodes / kb_edges, project-scoped composite FKs, and graph rebuild.

Auto-Wiki engine

The self-compiling phases and the firewall that contains them.

Database schema

Every table, column, index, and uniqueness constraint.
