feat: simplify extraction pipeline and add batch entity summarization #1224

prasmussen15 merged 12 commits into main from
Conversation
Add TokenUsageTracker class to track input/output tokens by prompt type during LLM calls. This helps analyze token costs across different operations like extract_nodes, extract_edges, resolve_nodes, etc.

Changes:
- Add graphiti_core/llm_client/token_tracker.py with TokenUsageTracker
- Update LLMClient base class to include a token_tracker instance
- Update OpenAI base client to capture and record token usage
- Add token_tracker property on the Graphiti class for easy access
- Update podcast_runner.py to print a token usage summary after ingestion

Usage:

    client = Graphiti(...)
    # ... run ingestion ...
    client.token_tracker.print_summary(sort_by='prompt_name')

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
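The tracker's implementation is not shown in this conversation; a minimal sketch of what such a class might look like, using the `record` and `print_summary(sort_by=...)` names from the commit message (all internals here are assumptions):

```python
import threading
from collections import defaultdict


class TokenUsageTracker:
    """Sketch: accumulate input/output token counts per prompt name."""

    def __init__(self) -> None:
        self._lock = threading.Lock()
        # prompt_name -> [input_tokens, output_tokens]
        self._usage: dict[str, list[int]] = defaultdict(lambda: [0, 0])

    def record(self, prompt_name: str, input_tokens: int, output_tokens: int) -> None:
        with self._lock:
            self._usage[prompt_name][0] += input_tokens
            self._usage[prompt_name][1] += output_tokens

    def print_summary(self, sort_by: str = 'prompt_name') -> None:
        with self._lock:
            rows = sorted(
                self._usage.items(),
                key=(lambda kv: kv[0]) if sort_by == 'prompt_name' else (lambda kv: -sum(kv[1])),
            )
        for name, (inp, out) in rows:
            print(f'{name}: input={inp} output={out} total={inp + out}')
```

A `threading.Lock` (rather than `asyncio.Lock`) lets the tracker be called safely from both sync and async code paths, which comes up again in the review below.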
Disable the optimization that skips LLM calls when node summary + edge facts is under 2000 characters. This forces all summaries to be generated via LLM for token usage analysis. Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
This reverts the summary optimization changes. Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
- Remove chunking code for entity-dense episodes (node_operations.py)
  - Delete _extract_nodes_chunked, _extract_from_chunk, _merge_extracted_entities
  - Always use a single LLM call for entity extraction
- Remove chunking code for edge extraction (edge_operations.py)
  - Remove MAX_NODES constant and generate_covering_chunks usage
  - Process all nodes in a single LLM call instead of covering subsets
- Add batch entity summarization (node_operations.py, extract_nodes.py)
  - New SummarizedEntity and SummarizedEntities Pydantic models
  - New extract_summaries_batch prompt for batch processing
  - New _extract_entity_summaries_batch function
  - Nodes with short summaries get edge facts appended directly (no LLM)
  - Only nodes needing LLM summarization are batched together
- Simplify edge attribute extraction (extract_edges.py, edge_operations.py)
  - Remove episode_content from context (attributes come from the fact only)
  - Keep reference_time for temporal resolution
  - Add existing_attributes to preserve/update existing values
- Improve edge deduplication prompt (dedupe_edges.py, edge_operations.py)
  - Use continuous indexing across duplicate and invalidation candidates
  - Deduplicate invalidation candidates against duplicate candidates
  - Allow EXISTING FACTS to be both duplicates AND contradicted
  - Consolidate to a single contradicted_facts field
- Remove obsolete chunking tests (test_entity_extraction.py)

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
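The short-summary fast path described above (edge facts appended directly when the combined text stays small, LLM summarization only for the rest) could look roughly like this. The 2000-character threshold comes from an earlier commit in this PR; the function and field names are assumptions for illustration:

```python
def partition_for_summarization(nodes, edge_facts_by_node, max_chars=2000):
    """Sketch: append edge facts directly to nodes whose summary stays short;
    collect the remaining nodes for a batched LLM summarization call."""
    needs_llm = []
    for node in nodes:
        facts = edge_facts_by_node.get(node['name'], [])
        combined = (node.get('summary') or '') + ' '.join(facts)
        if len(combined) < max_chars:
            # Fast path: no LLM call, just fold the new facts into the summary.
            node['summary'] = combined
        else:
            needs_llm.append(node)
    return needs_llm
```

Only the `needs_llm` nodes would then be sent through the batch prompt, which is what keeps token usage down for entity-dense episodes.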
    total_output_tokens += output_tokens

    # Record token usage
    self.token_tracker.record(prompt_name, total_input_tokens, total_output_tokens)
Token usage is recorded even when an exception occurs during retry attempts. Because total_input_tokens and total_output_tokens are accumulated across retries, a retry that fails after a successful initial call will cause the tracker to record tokens from both the successful and failed attempts, double-counting usage.
Consider moving the token_tracker.record() call outside the retry loop, or recording only on the first successful response.
Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
- Implement token tracking in AnthropicClient._generate_response() and generate_response() using result.usage.input_tokens/output_tokens
- Implement token tracking in GeminiClient._generate_response() and generate_response() using response.usage_metadata
- Add comprehensive unit tests for the TokenUsageTracker class
- Add tests for _extract_entity_summaries_batch covering:
  - No nodes needing summarization
  - Short summaries with edge facts
  - Long summaries requiring LLM
  - Node filter (should_summarize_node)
  - Batching multiple nodes
  - Unknown entity handling
  - Missing episode and summary

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
- Remove import of extract_attributes_from_node (function was removed)
- Add import of _extract_entity_summaries_batch
- Update tests to use the new batch summarization API

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
- Add MAX_NODES = 30 constant
- Partition nodes needing summarization into flights of MAX_NODES
- Extract a _process_summary_flight helper for processing each flight
- Each flight makes a separate LLM call to avoid context overflow

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
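The flight partitioning described above can be sketched as follows; the MAX_NODES constant is from the commit, while the helper name is assumed:

```python
MAX_NODES = 30  # cap nodes per LLM call to avoid context overflow


def partition_into_flights(nodes: list, flight_size: int = MAX_NODES) -> list[list]:
    """Split nodes needing summarization into flights of at most flight_size."""
    return [nodes[i : i + flight_size] for i in range(0, len(nodes), flight_size)]
```

Each flight would then be handed to something like the PR's `_process_summary_flight` helper, and a later commit in this PR runs the flights in parallel under a semaphore.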
Update both DEFAULT_MODEL and DEFAULT_SMALL_MODEL to use gpt-5-mini. Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
    class TokenUsageTracker:
        """Thread-safe tracker for LLM token usage by prompt type."""
Minor: The docstring says "Thread-safe" but this is an async codebase. While threading.Lock works for protecting shared state in async code (since asyncio is single-threaded), the comments and design suggest this was written with threading in mind.
For clarity and potential future multi-threaded scenarios (e.g., if using ThreadPoolExecutor for blocking operations), this is fine. However, if you want to be more explicit about async-safe design, you could use asyncio.Lock instead, which is specifically designed for async contexts.
That said, threading.Lock is actually safer here if the tracker might be accessed from both sync and async contexts (like from the print_summary method which is sync).
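The point about mixed sync/async access can be illustrated with a hypothetical example (not from the PR): a `threading.Lock` works from both a plain method and a coroutine, provided the critical section never awaits while holding the lock.

```python
import asyncio
import threading


class Counter:
    """threading.Lock is usable from both sync and async code paths, as long
    as critical sections are short and never await while the lock is held."""

    def __init__(self) -> None:
        self._lock = threading.Lock()
        self.total = 0

    def add(self, n: int) -> None:  # callable from sync code (like print_summary)
        with self._lock:
            self.total += n

    async def add_async(self, n: int) -> None:  # and from coroutines
        with self._lock:  # never await inside this block
            self.total += n
```

An `asyncio.Lock`, by contrast, can only be acquired inside a coroutine, which would force the sync `print_summary` path to go through the event loop.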
Remove explicit model configuration to use the default gpt-5-mini models from OpenAIClient. Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
Restore the original default models instead of gpt-5-mini. Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
- Fix unreachable code in _handle_structured_response (check response.refusal)
- Process node summary flights in parallel using semaphore_gather
- Use case-insensitive name matching for LLM summary responses
- Handle duplicate node names by applying the summary to all matching nodes
- Fix an edge case when both edge lists are empty in contradiction processing
- Fix a potential AttributeError when episode is None in edge attributes
- Add tests for flight partitioning and case-insensitive name matching

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
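A sketch of the case-insensitive, duplicate-tolerant matching described in this commit (the data shapes are assumptions; the commit does not show the code):

```python
from collections import defaultdict


def apply_summaries(nodes, llm_summaries):
    """Apply each LLM-returned summary to every node whose name matches,
    ignoring case, so duplicate node names all receive the update."""
    by_name = defaultdict(list)
    for node in nodes:
        by_name[node['name'].lower()].append(node)
    for entry in llm_summaries:
        for node in by_name.get(entry['name'].lower(), []):
            node['summary'] = entry['summary']
```

Lower-casing on both sides guards against the LLM echoing entity names with different capitalization, and the one-to-many mapping handles duplicate node names.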
Summary
Changes
Node Operations
- Remove _extract_nodes_chunked, _extract_from_chunk, _merge_extracted_entities
- Add _extract_entity_summaries_batch for batch summarization

Edge Operations
- Remove MAX_NODES constant and generate_covering_chunks usage

Prompts
- Add extract_summaries_batch prompt with SummarizedEntity/SummarizedEntities models
- Simplify extract_attributes for edges (fact + reference_time + existing_attributes only)
- Improve resolve_edge with continuous indexing and a consolidated contradicted_facts field

Tests
- Remove obsolete chunking tests from test_entity_extraction.py

Test plan
🤖 Generated with Claude Code