ECAI SBRM โ IPFS-Hosted Feature Execution Workflow
1. Executive Summary
ECAI SBRM search is a deterministic indexing and retrieval system for structured financial statements. Unlike text search, SQL-bound reporting, or probabilistic vector models, ECAI encodes financial concepts deterministically, writes them into reproducible disk indexes, and returns exact postings for every query. The same data and logic always produce the same indexed state and the same search result, making the system audit-ready, replayable, and infrastructure-grade. IPFS provides immutable transport, but the core value is deterministic financial concept retrieval with zero probabilistic AI.
2. What this system actually delivers
This document describes a deterministic financial indexing and search system built on ECAI.
The value proposition is simple:
- Deterministic indexing of structured financial statements (SBRM)
- Deterministic retrieval of financial concepts
- Reproducible execution
- Zero probabilistic AI
IPFS is only the transport layer.
The core value is the indexing model.
3. The Problem
Financial statement search today is typically:
- Text search (imprecise)
- SQL queries (schema-bound and brittle)
- Vector search (probabilistic and non-reproducible)
- Or manually curated reports
These approaches either:
- Approximate results,
- Depend on fragile schemas,
- Or rely on probabilistic ranking.
None of them provide cryptographically reproducible retrieval.
In regulated financial environments, that is a serious limitation.
4. The ECAI Difference
ECAI indexing is deterministic by design.
Each financial concept (e.g. mini:CashAndCashEquivalents):
- Is encoded deterministically
- Is written into disk segments deterministically
- Produces exact postings lists
- Returns exact document matches
There is:
- No ranking heuristic
- No embeddings
- No scoring model
- No probabilistic similarity
The same input always produces the same index. The same query always produces the same result.
5. Compact Value Proposition
ECAI SBRM search provides:
- Exact concept retrieval
- Replayable indexing
- Audit-ready determinism
- Infrastructure-level reproducibility
Query โ Posting List โ Document IDs โ Exact Retrieval
Not:
Query โ Model โ Guess โ Ranked approximation
6. Why This Matters
For financial systems, compliance, and audit:
- Results must be explainable.
- Results must be reproducible.
- Results must be verifiable.
If the same feature CID and data CID are replayed, the same index and the same search result are produced.
That is operational determinism.
7. The Role of IPFS
IPFS makes both:
- The indexing logic
- The financial data
immutable and content-addressed.
But the strategic value is not IPFS.
The strategic value is:
Deterministic ECAI indexing of structured financial data.
8. ECAI SBRM โ Feature-From-IPFS Execution Model
This workflow makes both:
- The SBRM JSONL data
- The DamageBDD feature file
content-addressed and immutable via IPFS.
Execution becomes:
Feature CID โ Loaded โ Executed Data CID โ Loaded โ Indexed Query โ Deterministic Retrieval
No mutable scripts. No local feature dependency. Everything is hash-addressed.
—
9. Architecture
SBRM JSONL โโ> ipfs add โโ> CID (data)
Feature file โโ> ipfs add โโ> CID (logic)
curl execute_feature(feature_cid, vars)
โ
DamageBDD:
ipfs cat(feature_cid)
parse feature
ipfs cat(data_cid)
ingest
query disk index
โ
Deterministic result
The logic itself is immutable.
This is executable intelligence stored as content.
—
10. Create the Feature File
Example: ecaisbrmquery.feature
Feature: Query SBRM index from IPFS-hosted feature
Scenario: Ingest JSONL and query disk index
Given I store "{{ipfs_jsonl_cid}}" in "ipfs_jsonl_cid"
When I ingest the SBRM financial JSONL file from IPFS hash in "ipfs_jsonl_cid"
Given I store "{{base_dir}}" in "ecai_base_dir"
Given I open the ECAI disk search at base dir "ecai_base_dir" and store it in "idx"
When I query the ECAI SBRM index in "idx" for "{{term}}" and store the results in "docs"
Then I set response status to 200
And I set response json to:
"""
{
"ok": true,
"term": "{{term}}",
"docs": {{docs}}
}
"""
—
11. Add the Feature to IPFS
ipfs add -Q ecai_sbrm_query.feature
Output:
bafybeifeaturecid...
This is your immutable executable intelligence.
—
12. Add the Data to IPFS
ipfs add -Q ./sbrm.jsonl
Output:
bafybeidatacid...
—
13. Execute via One-Line curl
Now both logic and data are content-addressed.
FEATURE="$(ipfs add -Q ecai_sbrm_query.feature)" && \ DATA="$(ipfs add -Q ./sbrm.jsonl)" && \ curl -sS -X POST http://127.0.0.1:8080/execute_feature_from_ipfs \ -H 'content-type: application/json' \ -d "{\"feature_cid\":\"$FEATURE\",\"vars\":{\"ipfs_jsonl_cid\":\"$DATA\",\"term\":\"mini:CashAndCashEquivalents\"}}"
Optional:
... | jq
—
14. What execute_feature_from_ipfs Does
- ipfs cat(featurecid)
- Parse feature text
- Execute BDD steps
- ipfs cat(datacid)
- Ingest via ecaisbrmfinancialstatementingestor
- Query via ecaidisksearch
- Return JSON
The logic is immutable. The data is immutable. The result is deterministic.
—
15. Why This Is Important
This model gives you:
- Portable executable intelligence
- Immutable logic
- Immutable data
- Deterministic indexing
- Deterministic retrieval
- Zero probabilistic AI
You are not calling a model.
You are executing a content-addressed intelligence program.
—
16. Operational Hardening
- Pin feature CIDs
- Pin data CIDs
- Restrict
execute_feature_from_ipfsendpoint - Enforce basedir whitelist
- Snapshot disk index regularly
—
17. The Bigger Picture
Traditional AI:
Mutable model Mutable weights Opaque inference
ECAI:
Immutable logic (IPFS feature) Immutable data (IPFS JSONL) Deterministic indexing Deterministic retrieval
This is structured intelligence.
Not stochastic generation.
—
18. Summary
Workflow:
Feature CID + Data CID โ Execute โ Retrieve โ Deterministic Result
ECAI does not guess.
It retrieves.
