ayrio logo Expand capability. Not cost.
Heroda — Document Provenance Engine

The Father of
Document History.

Inspired by Herodotus, Heroda brings Git-grade version control, AI-powered ingestion, and chain-of-custody attribution to every legal and business document you touch.

Request Access See Features
Git Source of Truth
Offline First Architecture
1 Binary No Dependencies
PDF · DOCX Format Agnostic
"contract_final_v3_REAL_final.pdf" is not a version control system. It is evidence that the problem has been unsolved for forty years.
The Heroda Premise
Capabilities

Every Format. Every Source. One History.

Heroda absorbs the chaos of external documents, converts everything to clean Markdown, and stores it with full authorship attribution in Git — the only audit trail that holds up.

Git-Native Version Control

Every document is a Markdown file in a Git repository. When a counterparty returns edits, their commit is attributed to their name and email — creating a legally sound chain of custody that answers exactly who changed what and when.

Authorship Attribution

Shadow Redlining

Counterparties often return documents with direct edits and no Track Changes enabled. Heroda diffs the inbound file against your last sent version and isolates every substantive change, ignoring font noise, smart quotes, and formatting drift.

AI-Powered Diff

Scanned PDF Ingestion

If a counterparty returns a scanned PDF or a file with custom fonts, Heroda falls back to Tesseract OCR, then runs LLM cleanup to fix artifacts while strictly preserving original meaning. The clean result is committed under their authorship.

OCR + LLM Cleanup

Track Changes Export

Generate a .docx where every change since the last sent version appears as native Word Track Changes (<w:ins> / <w:del>) — no manual redlining required. Your counterparty sees a clean redline in the tool they already use.

OOXML Native

Premium PDF Rendering

Export Markdown to professionally typeset PDFs using curated CSS paired with a high-fidelity HTML-to-PDF engine. Define per-organization themes to ensure every outbound document matches your corporate branding exactly.

Branded Output

Offline-First Auto-Sync

The full Heroda engine runs locally with no network required. A background daemon silently syncs commits to a central bare Git repository whenever connectivity is restored — no "push" discipline required from users.

Background Daemon
The CLI

Simple Commands. Serious Provenance.

Heroda ships as a single statically-linked binary for Windows, macOS, and Linux. No runtime, no containers, no setup wizard.

heroda — command reference
# Initialize a new document workspace heroda init client-acme/ # Export a contract to a branded PDF heroda export contract.md --to pdf # Export with Track Changes redline against last commit heroda export contract.md --to docx --track-changes # Ingest a counterparty's returned DOCX or scanned PDF heroda ingest from_client.docx --target contract.md # Interactively review changes — accept or reject hunks heroda review from_client.docx --target contract.md ✓ Converted DOCX via Pandoc ✓ Isolated 3 substantive changes (12 formatting artifacts ignored) › [1/3] "Pro Bono" → "At Cost" [a]ccept / [r]eject / [s]kip # Show who wrote every line of a document heroda blame contract.md # Show full version history heroda log contract.md # Run entirely offline using the embedded model heroda ingest scan.pdf --local
Usage Patterns

Three Workflows That Cover Most Situations.

01

The Clean Hand-off

Standard outbound negotiation. You draft, export a clean PDF, and reconcile whatever comes back — without leaving your terminal or touching a portal.

  • Draft contract in Markdown
  • heroda export --to pdf → send to client
  • Client returns redlined .docx
  • heroda review from_client.docx
  • Accept, commit, auto-sync
02

Shadow Redlining

The counterparty edited directly without enabling Track Changes — a common and frustrating scenario. Heroda finds every change instantly.

  • Receive unmarked edited .docx
  • heroda review --diff-only
  • View exact word-level changes
  • Generate redline for your legal team
  • heroda export --to docx --track-changes
03

Legacy Resurrection

You have a filing cabinet of scanned PDFs. Heroda OCRs, cleans, and commits them into a searchable, diffable Git history with original authorship.

  • heroda init archives/
  • heroda ingest old_scan.pdf --author-name "Original Signer"
  • LLM cleans OCR artifacts
  • Commit attributed to original signer
  • Repeat — entire archive becomes searchable
Distribution

Lite or Pro. Always a Single Binary.

Regardless of tier, Heroda is a single statically-linked Go executable. No runtime, no Docker, no dependencies to manage on the target machine.

Heroda Lite

Cloud-Powered

~15 MB download
  • Full CLI feature set
  • Gemini API for LLM cleanup
  • Fast, lightweight processing
  • Windows · macOS · Linux
  • Requires internet for AI features
  • Requires GEMINI_API_KEY

Every Document, Accounted For.

Get early access to Heroda or talk to us about a deployment for your legal or operations team.

Request Early Access