Skip to content

h3rb3rn/moe-codex

Repository files navigation

MoE Codex

EU-sovereign data and audit platform for MoE Sovereign — the open-source alternative to Palantir Foundry / Gotham / AIP.

Latin codex = manuscript collection, lawbook — thematic sibling of moe-libris.

License Docs Comparison

Repository topology. moe-codex is an optional add-on to moe-sovereign. Operators who only need a sovereign multi-model LLM gateway run moe-sovereign alone. Operators who additionally need a data catalog, bundle versioning, lineage tracking, approval workflow, investigation Cypher explorer, and data-health drift detection (authorities, KritIS, pharma compliance, banking audit) deploy moe-codex alongside.


What MoE Codex provides

Module Path Palantir equivalent
Data Catalog /catalog Foundry Catalog
Knowledge Approvals /approval Foundry Branch Approval + Action Types
Object Explorer /explorer Gotham Object Drilldown
Data Health & Drift /health Foundry Health Checks
JupyterLite Notebook /notebook Foundry Code Workbook
Lineage (Marquez) /lineage Foundry Lineage
Versioning (lakeFS) /versioning Foundry Branching
ETL Fan-Out (NiFi) /etl Foundry Pipeline Builder
Timeline /timeline Gotham Temporal Analysis
Link Analysis (Cytoscape) /link-analysis Gotham Object Network
Pipeline Builder (Kestra) /kestra Foundry Pipeline Builder (visual)
Dynamic Forms /forms Foundry Forms
Federated Search (OpenSearch) /search Foundry Search
Charts & Pivot /charts Foundry Quiver (partial)
Analytics / BI (Superset) /superset Foundry Quiver / Palantir Workspace
Compliance Posture (Falco + SCAP) /compliance Apollo Compliance Posture
OPA Access Control /opa Foundry RBAC / Markings
ML Experiment Tracking (MLflow) /eval Foundry Model Workbench
Prompt Safety (NeMo Guardrails) /guardrails Palantir AIP Safety
SQL Federation (Trino) /trino Foundry SQL
Document Intelligence (DocLing) /documents Foundry Document Intelligence

Full feature-by-feature mapping: docs/system/palantir_comparison.md (50 modules across Foundry/Gotham/AIP/Apollo, 12 industry use-cases).


Quick Start

git clone https://github.com/h3rb3rn/moe-codex.git
cd moe-codex
cp .env.example .env
# Edit: CODEX_NODE_ID, CODEX_ADMIN_KEY, SOVEREIGN_URL (for GraphRAG attachment)

./install.sh          # installs Falco, configures host, starts the stack
Service URL
Codex API http://localhost:8090
Codex Docs http://localhost:8090/docs
Marquez UI http://localhost:3030
lakeFS UI http://localhost:8010
NiFi UI http://localhost:8181/nifi/

Host requirements for Tier 1 compliance (Falco):
Linux ≥ 5.8 with BTF (/sys/kernel/btf/vmlinux). On Proxmox/QEMU: set CPU type to host in VM settings before first start. The installer handles everything else (Falco apt package, perf_event_paranoid=2, systemd service).
See docs/admin/compliance.md for details.

Operators with an existing moe-sovereign installation that uses Phase 16-24 (formerly in moe-sovereign) should see docs/migration/from-sovereign.md for a clean-cut migration including data transfer.


Architecture

moe-sovereign  <- HTTP/REST ->  moe-codex  <-- Marquez (Lineage)
   (core)                       (codex)    <-- lakeFS (Versioning)
                                           <-- NiFi / Kestra (ETL / Pipelines)
                                           <-- Superset (BI / Quiver)
                                           <-- OpenSearch (Federated Search)
                                           <-- Falco + OpenSCAP (Compliance)
                                           <-- Trino (SQL Federation)
                                           <-- OPA (Access Control)
                                           <-- MLflow (Experiment Tracking)
                                           <-- Postgres (shared terra_checkpoints)
                                           <-- Valkey (shared terra_cache)

moe-sovereign runs autonomously without moe-codex. Deploying codex adds all data-platform modules. Communication uses clearly defined HTTP APIs (OpenAPI schemas in docs/api/).


EU sovereignty & compliance

moe-codex is built for authorities, KritIS, pharma compliance, banking audit and similar compliance-driven operators:

Hosting recommendation: Hetzner / IONOS / STACKIT / OVH / Scaleway — all under EU jurisdiction, no US CLOUD-Act exposure. Air-gap deployment documented.


Status (2026-05-17)

Phase Component Status
16 OpenLineage/Marquez
17 NiFi ETL submission
18 lakeFS bundle versioning
19 Enterprise-stack dashboard
20 Data Catalog
21 Branch-based Approval
22 Read-only Cypher Explorer
23 Data Health drift detection
24 JupyterLite Notebook
D1 OPA, MLflow, NeMo Guardrails, Trino, DocLing
D2 Timeline, Link Analysis, Kestra, Forms, Search, Charts
D3 Superset (BI), OpenSearch (Search), Falco + OpenSCAP (Compliance)
D4 Budibase (Workshop), TimescaleDB (Time Series), HedgeDoc (Notes), PostGIS+KeplerGL (Geo)
D5 Leaflet.js interactive maps, Dossier/Case File, Doc corrections, pinned Docker versions

Stack is feature-complete for all achievable Palantir equivalents. 92 % of Palantir modules covered. 246 tests passing.


License & license hygiene

Apache 2.0 (see LICENSE).

The stack uses exclusively OSI-approved open-source components and follows the Linux-Foundation-Fork principle — see docs/system/license_compliance.md and the audit script scripts/audit-licenses.sh. Concretely: OpenSearch instead of Elasticsearch, OpenTofu instead of Terraform, Apache Kafka instead of Confluent, Valkey instead of Redis ≥ 7.4, Meltano instead of Airbyte.


Related repositories

  • moe-sovereign — the MoE core (API gateway, 15 expert specialists, MCP precision tools, caching, GraphRAG basis)
  • moe-libris — federation hub for knowledge bundle exchange between sovereign instances
  • moe-codexthis repository — EU-Palantir alternative

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors