Future Work

Open questions and potential improvements for Causantic.

Recently Implemented

These items were previously listed as future work and have since been implemented:

Chain Walking (v0.3.0): Sequential linked-list edges walked by chain-walker.ts with cosine-similarity scoring. Replaces sum-product graph traversal. Provides episodic narrative ordering from search seeds.
Search Tool (v0.3.0): search MCP tool for pure semantic discovery via hybrid BM25 + vector search with optional chain context. Replaces explain.
Edge Simplification (v0.3.0): Sequential 1-to-1 edges replace m×n all-pairs topology. Removes decay.ts, traverser.ts, pruner.ts.
Web Dashboard (Lower Priority → Done): React + Vite frontend with timeline visualization, cluster browser, search interface, and project views. Launch with npx causantic dashboard.
Per-Project Isolation (Medium Priority → Done): Federated approach — all projects in one store, filtered at query time via projectFilter. Cross-project graph edges traversed freely.
Collection Benchmark Suite (New): Self-service benchmarks for health, retrieval quality, chain quality, and latency. Scoring with tuning recommendations and historical tracking.
Session Reconstruction (New): Pure chronological context rebuilding via reconstruct MCP tool — "what did I work on yesterday?"
Native HDBSCAN (New): Pure TypeScript rewrite — 130× speedup over hdbscan-ts.
Hybrid BM25 + Vector Search (New): FTS5 keyword search fused with vector search via RRF.
MMR Reranking (v0.5.4): Maximal Marginal Relevance reranking in the search pipeline. Single retrieval.mmrLambda knob (default 0.7) controls relevance vs diversity. Cluster boostFactor removed — redundant with MMR.
Multiple Embedding Models (v0.7.0): Configurable via embedding.model (jina-small, nomic-v1.5, jina-code, bge-small). Changing model requires npx causantic reindex to re-embed all chunks. Model-specific clustering thresholds handled via calibration.
Multi-Path Chain Walking (v0.8.0): DFS with backtracking explores all paths from seed chunks, emitting each as a candidate chain. selectBestChain() picks the winner by highest median per-node cosine similarity. Bounded by maxExpansionsPerSeed (200) and maxCandidatesPerSeed (10). For linear chains, behavior identical to previous greedy single-path.
Budget-Aware Retrieval (v0.9.2): Token budget enforcement at multiple pipeline stages — oversized chunk filtering (pre-MMR and chain passthrough), budget-aware MMR selection (greedy loop tracks remaining budget), and budget-aware chain formatting (drop chunks exceeding remaining budget, no partial chunks).

High Priority

Relevance Feedback

Goal: Learn from user interactions to improve retrieval quality.

Approach:

Track which retrieved chunks the user actually references in conversation
Use implicit feedback (chunks that appear in subsequent tool calls) as positive signal
Adjust chain walking scoring or seed selection based on feedback patterns

Token Usage Analytics

Goal: Demonstrate cost savings from smarter context retrieval.

Approach:

Track context window usage before/after memory augmentation
Measure compression ratio (raw session tokens vs retrieved context)
Dashboard showing memory efficiency metrics

Implementation Ideas:

Hook into Claude Code's token reporting
Log baseline vs augmented queries
Calculate effective compression ratio

Incremental Clustering

Goal: Update clusters without full re-run.

Current State: HDBSCAN runs on all embeddings daily.

Challenge: HDBSCAN is not inherently incremental.

Potential Approaches:

Approximate nearest cluster assignment for new points
Periodic full re-clustering with incremental updates between
Explore online clustering algorithms (DBSTREAM, DenStream)

Medium Priority

VSCode Extension

Goal: Direct integration without MCP.

Features:

Inline memory suggestions
Memory explorer panel
Query interface in editor

Lower Priority

Team Sharing

Goal: Share memory across team members.

Considerations:

Privacy (redact sensitive content)
Encryption (secure transport)
Conflict resolution (merge strategies)
Access control (who can see what)

Multi-Modal Memory

Goal: Handle images, diagrams, and other media.

Challenges:

Embedding non-text content
Storage format
Retrieval across modalities

Research Questions

Optimal Chunk Size

Question: What's the ideal chunk size for retrieval?

Current: Code-aware chunking based on structure.

To Explore:

Fixed token counts
Semantic boundaries
Adaptive sizing based on content

Cross-Session Linking Quality

Question: How well do cross-session links work?

Current: Based on structural cross-session edges. The collection benchmark suite can now measure cross-session bridging quality — run npx causantic benchmark-collection --full to evaluate.

To Measure:

Precision of cross-session edges
User validation of suggested links
A/B testing of linking strategies

Community Contributions Welcome

If you're interested in working on any of these areas:

Open an issue to discuss the approach
Reference this document in your PR
Include benchmarks/experiments to validate changes

See CONTRIBUTING.md for contribution guidelines.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Future Work

Recently Implemented

High Priority

Relevance Feedback

Token Usage Analytics

Incremental Clustering

Medium Priority

VSCode Extension

Lower Priority

Team Sharing

Multi-Modal Memory

Research Questions

Optimal Chunk Size

Cross-Session Linking Quality

Community Contributions Welcome

Uh oh!

FilesExpand file tree

future-work.md

Latest commit

History

future-work.md

File metadata and controls

Future Work

Recently Implemented

High Priority

Relevance Feedback

Token Usage Analytics

Incremental Clustering

Medium Priority

VSCode Extension

Lower Priority

Team Sharing

Multi-Modal Memory

Research Questions

Optimal Chunk Size

Cross-Session Linking Quality

Community Contributions Welcome