Codename · Chronicle

Retrieval · Generation · Foundation

Ask your documents
a question.

Chronicle is an enterprise document intelligence system. It ingests any format, enriches every file with structured AI analysis, and answers questions across your entire corpus in plain English — with sources, with precision, and with the reliability that regulated environments require.

See how it works ↓Not a vector database bolted onto an LLM. A retrieval-and-generation system built to the precision bar of regulated industries.

Ingestion

Any format

Deployment

Air-gap capable

Query

LLM understanding

Retrieval

Hybrid · RRF

Generation

RAG, grounded

Runtime

Kubernetes-native

01 · The problem

Enterprise knowledge doesn't disappear. It just becomes unreachable.

Most of what your organization knows is written down somewhere no one can reach.

Contracts, policies, research, correspondence, reports, recordings — organizations generate knowledge continuously and lose access to most of it almost immediately. It lives in repositories no one can search effectively, in formats that defeat keyword search, across languages that fragment the corpus, in documents that were never meant to be found years later by someone who wasn't there when they were written.

The consequence is not just inefficiency. It is —

01decisions made without the context that already exists.

02due diligence that misses a clause.

03a compliance question answered from memory instead of from the record.

Keyword search was not built for this. Neither was a vector database bolted onto an LLM. Chronicle is.

02 · Ingestion

From raw file to searchable meaning.

Four phases. Every document passes through all of them before it becomes searchable or answerable. By the time a query arrives, the work is already done.

02 · Ingestion

From raw file to searchable meaning.

Four phases. Every document passes through all of them before it becomes searchable or answerable. By the time a query arrives, the work is already done.

input

Any file

pdf · scan · audio · video · sheet

→

Extract

classify · unpack · route

→

Parse

text · OCR · transcribe · translate

→

Enrich

entities · topics · summaries · captions

→

Embed

dense vectors · text + image, one space

→

output

Searchable corpus

indexed · enriched · answerable

Fig. 1 — The write path. Every file is classified, parsed to clean text, enriched into structured dimensions, and embedded — before any query arrives. By query time, the work is already done.

Extract

classify · unpack · route

Every incoming file is classified by its actual content, not its extension. Archives are unpacked. Each document is routed correctly before any processing begins.

Parse

text · OCR · transcribe · translate

Text is extracted from everything — native documents directly, scanned files through OCR, audio and video through transcription. Non-English content is detected and translated. Every document emerges as clean, chunked, processable text, regardless of how it arrived.

Enrich

entities · topics · summaries · captions

Each document passes through specialist models in sequence. Named entities are extracted, topics classified against a configurable taxonomy, summaries generated, images captioned. Every enrichment output becomes a searchable dimension — a filter, a facet, an indexed field.

Embed

dense vectors · text + image, one space

Every text chunk becomes a dense vector encoding its meaning; every image becomes a vector in the same space as text queries. Two documents discussing the same concept in different words end up geometrically close — which is what makes semantic retrieval, and generation, possible.

03 · Retrieval & generation

Not a list of documents. An answer.

When a query arrives, Chronicle does not run a search. It runs a pipeline — understanding the question, retrieving on two axes at once, and synthesizing an answer grounded in what it found.

03 · Retrieval & generation

Not a list of documents. An answer.

When a query arrives, Chronicle does not run a search. It runs a pipeline — understanding the question, retrieving on two axes at once, and synthesizing an answer grounded in what it found.

When a user submits a query, Chronicle does not run a search. It runs a pipeline. A local language model interprets the query, extracts structured filters expressed in plain English, and rewrites it for semantic retrieval — before the index is touched, and without any data leaving the environment.

Two retrieval legs then run simultaneously. The first applies the extracted filters as hard constraints and matches on structured metadata and keywords. The second embeds the query as a vector and retrieves by meaning, surfacing results that share no literal terms with the query but are semantically aligned. Reciprocal Rank Fusion combines both legs into a single ranking that reflects structural precision and semantic relevance together.

The retrieved passages are handed to a generation model. Chronicle synthesizes a narrative answer grounded in those passages, with every claim tied to a specific document and location.

Query understanding

interpreted locally, before retrieval

“indemnity clauses after 2021…”→Local LLM→

Structured filtersSemantic rewriteKeywords

on-prem

nothing leaves

Retrieval

two legs, run simultaneously

Retrieve · in parallel

Structured + keyword

filters as hard constraints · metadata

Vector semantic

embed query · retrieve by meaning

→

Fuse

Reciprocal Rank Fusion

one ranking · precision + relevance

→

ranked

Top passages

Generation

grounded in retrieved passages

Top passages

the evidence set

→

Then

Generation model

synthesize a narrative answer

→

output

Grounded answer

[1][2][3]every claim → a source

Fig. 2 — The read path. A query is interpreted locally into filters and a semantic rewrite; two retrieval legs run at once and are fused by Reciprocal Rank Fusion; the top passages are handed to a generation model that writes a grounded answer — every claim tied to a specific document and location.

Nothing is invented. Every statement is verifiable.

This is the precision bar regulated environments require — and the reason Chronicle is built on RAG, not on a model that answers from memory.

04 · Compliance & precision

Built for environments where getting it wrong is not an option.

Every model runs where your data lives. Every output is traceable to a version and a configuration. Every parameter is tunable without a redeployment.

04 · Compliance & precision

Built for environments where getting it wrong is not an option.

Every model runs where your data lives. Every output is traceable to a version and a configuration. Every parameter is tunable without a redeployment.

Air-gap deployable

No external API calls. No data leaves your environment. Every model — enrichment, embedding, query understanding, generation — runs on your own infrastructure. Built for environments where data sovereignty is a legal requirement, not a preference.

Auditable by design

Every enrichment output is a traceable result of a documented model and configuration. Topic classifications, entity extractions, and generated answers are structured outputs tied to specific model versions and runtime parameters — reviewable, reproducible, and defensible.

Configurable without redeployment

Prompts, confidence thresholds, topic taxonomies, and model parameters are tunable at runtime through a configuration interface. When your domain or regulatory environment changes, Chronicle adapts without an engineering engagement.

05 · What it handles

Any format in. A searchable dimension out.

Three numbers that signal completeness and sovereignty — independent of any single client or deployment.

05 · What it handles

Any format in. A searchable dimension out.

Three numbers that signal completeness and sovereignty — independent of any single client or deployment.

Any

Format ingested

pdf · scan · word · audio · video · image · sheet

10+

Enrichment dimensions

entities · topics · language · summaries · captions · embeddings

External API calls

the whole system runs on your infrastructure

Chronicle reasons across documents. Scout is its sibling — the same retrieval thinking, applied to profiles.See Scout, the profile-search sibling →

“Your organization has been generating the answer for years. Chronicle is the system that can finally reason through it.”

Every engagement that draws on Chronicle starts with the pipeline already running, the enrichment layer already configured, and the generation architecture already production-tested on Kubernetes. What changes is the document taxonomy, the entity types that matter in your domain, and the precision thresholds your environment requires. That is the work worth doing together.

Start somewhere

Have a repository your organization can't fully reason through?

Tell us about it — its scale, its formats, its languages, and where your current search breaks down. We'll tell you what Chronicle can do for it.

Brief us on what you're working with →or reach us at hello@coraltree.ai

Ask your documentsa question.

Enterprise knowledge doesn't disappear. It just becomes unreachable.

From raw file to searchable meaning.

From raw file to searchable meaning.

Extract

Parse

Enrich

Embed

Not a list of documents. An answer.

Not a list of documents. An answer.

Built for environments where getting it wrong is not an option.

Built for environments where getting it wrong is not an option.

Air-gap deployable

Auditable by design

Configurable without redeployment

Any format in. A searchable dimension out.

Any format in. A searchable dimension out.

Have a repository your organization can't fully reason through?

Ask your documents
a question.