Built on ByteDance's MineContext, extending the original open-source project into a glasses-first personal context platform.
🚀 Quick Start (2 minutes) • 中文文档 • 📖 Detailed Guide
# 1. Start server (no capture mode for glass backend)
uv run opencontext start --port 8000 --config config/config.yaml --no-capture
# 2. Process videos for November 12th
uv run glass start 12-11
# 3. View report
open persist/reports/12-11.md# 1. Start server (no capture mode for glass backend)
uv run opencontext start --port 8000 --config config/config.yaml --no-capture
# 2. Start WebUI
cd webui && npm run dev
# 3. Open browser to http://localhost:5174📦 [Setup Script → scripts/setup/quick_setup.sh] • ✅ [Validation → scripts/setup/validate_setup.py]
方式1:命令行 (适合高级用户)
# 1. 启动服务 (无录屏模式,适合glass后端)
uv run opencontext start --port 8000 --config config/config.yaml --no-capture
# 2. 处理11月12日的视频
uv run glass start 12-11
# 3. 查看报告
open persist/reports/12-11.md方式2:网页界面 (适合交互使用)
# 1. 启动服务 (无录屏模式,适合glass后端)
uv run opencontext start --port 8000 --config config/config.yaml --no-capture
# 2. 启动网页界面
cd webui && npm run dev
# 3. 打开浏览器访问 http://localhost:5174# 1. 获取 AUC Turbo 凭证
export AUC_APP_KEY=你的应用密钥
export AUC_ACCESS_KEY=你的访问密钥
# 2. 复制配置模板
cp config/config.yaml.example config/config.yaml
# 3. 验证安装
uv run glass --help🔗 完整中文文档 →
# 1. Run our automated setup script
./quick_setup.sh
# 2. Validate installation
uv run python validate_setup.py# Install uv package manager
curl -LsSf https://astral.sh/uv/install.sh | sh
# Install dependencies
uv sync
# Install FFmpeg
brew install ffmpeg # macOS
# sudo apt install ffmpeg # Ubuntu/Debian- 📹 Video Processing: Transform daily recordings into searchable context
- 🗣️ Speech Recognition: AUC Turbo transcription for audio content
- 🔍 Smart Search: Find moments across your visual memory
- 📊 Daily Reports: AI-generated summaries of your day
- 🌐 Web Interface: Drag-and-drop video processing
- ⚡ CLI Tools: Batch processing and automation
Smart Glasses → Video Capture → Frame Extraction → Speech Recognition → Context Storage → Search & Reports
Key Components:
- Video Pipeline: FFmpeg-based processing with adaptive sampling
- Speech Layer: Doubao AUC Turbo for transcription
- Context Engine: Unified storage with MineContext foundation
- WebUI: React frontend with real-time processing
- CLI Tools: Command-line interface for batch operations
- ✅ Video Ingestion: MP4/MOV/MKV support, up to 2GB per file
- ✅ Speech Recognition: AUC Turbo integration for audio transcription
- ✅ Frame Extraction: Adaptive sampling for optimal context density
- ✅ Timeline Generation: Chronological organization with highlights
- ✅ Report Generation: Markdown summaries with visual cards
- ✅ Web Interface: Interactive upload and browsing
- ✅ CLI Processing: Batch operations and automation
Following Linus Torvalds' code audit recommendations:
- ✅ Architecture Simplified: 4-layer → 2-layer design
- ✅ Race Conditions Fixed: Atomic state management
- ✅ Single Points Eliminated: Speech recognition fallback
- ✅ Configuration Unified: Single GlobalConfig system
- ✅ Reliability Enhanced: Graceful error handling
- Try it: Follow the Quick Start Guide
- Configure: Set up AUC Turbo credentials
- Process: Upload your first video
- Explore: Search through your visual context
- 🔧 Setup: Quick Start Guide
- 📖 Details: Full Documentation below
- 🇨🇳 中文: Chinese Documentation
- 🐛 Issues: GitHub Issues
- 💬 Community: GitHub Discussions
- Vision
- Prerequisites
- Installation
- Configuration
- Start the Pipeline
- Speech Recognition with AUC Turbo
- Generate Glass Reports
- Frame-Only Ingest (Legacy)
- Architecture
- Contributing
MineContext Glass reimagines personal context management around daily life. Using smart glasses, we capture day-long video streams and transform them into an organized, searchable knowledge base that bridges the physical and digital worlds. Every clip becomes part of a living memory system that powers summaries, reminders, and intelligent recommendations.
By standing on MineContext's mature context engineering foundations, we combine the existing cyberspace context (screen captures, documents, chats) with real-life visuals to create a full-spectrum, proactive assistant. The next milestone is speech recognition extracted from captured video audio, so conversations and spoken cues join the same context graph.
- macOS or Linux with Python 3.9+.
uvpackage manager (recommended) or a Python virtual environment.ffmpegandffprobeavailable onPATH(for examplebrew install ffmpegon macOS).- Optional: connected smart glasses with USB or Wi‑Fi file sync.
uv syncOr use a traditional virtual environment:
python -m venv .venv
source .venv/bin/activate
pip install -e .- Duplicate
config/config.yaml.example(or your existing MineContext config) toconfig/config.yaml. - Set API keys, embedding models, and storage paths as needed.
- Configure the Glass block so AUC Turbo credentials are available:
glass:
speech_to_text:
provider: auc_turbo
auc_turbo:
base_url: https://openspeech.bytedance.com/api/v3
resource_id: volc.bigasr.auc_turbo
app_key: "${AUC_APP_KEY:}"
access_key: "${AUC_ACCESS_KEY:}"
request_timeout: 120
max_file_size_mb: 100
max_duration_sec: 7200Credentials can live in the config file, environment variables, or be passed as CLI flags. See docs/api/GLASS_AUDIT_REPORT.md for quotas, error codes, and troubleshooting tips.
Run the context server with:
uv run opencontext start --port 8000 --config config/config.yaml --no-captureGlasses footage dropped into the configured import path will be processed automatically. Use the CLI or API endpoints to inspect timelines, digests, and retrieved clips.
-
Export the required credentials (or supply them via CLI flags):
export AUC_APP_KEY=your-app-key export AUC_ACCESS_KEY=your-access-key
-
Launch the production pipeline against a specific date folder. The new
glassentry point maps directly to the OpenContext CLI and processes everything undervideos/<dd-mm>/:uv run glass start dd-mm --config config/config.yaml
Results are written to the unified storage configured by
CONTEXT_PATH, and timeline reports are collected underpersist/reports/dd-mm/. Pass--report-outputto override the destination directory or--lookback-minutesto adjust the window used for report generation.
Once a timeline has been ingested, you can produce scoped summaries directly from the CLI:
uv run python -m opencontext.cli glass report \
--timeline-id <timeline> \
--lookback-minutes 120 \
--output persist/reports/<timeline>.mdThe report command reads the contexts persisted by the Glass timeline processor and writes Markdown files suitable for sharing.
If you only need frame extraction (for example, benchmarking embedding throughput), place raw .mp4 files under videos/<DATE>/ (such as videos/2025-02-27/12-13.mp4) and run:
uv run opencontext.tools.daily_vlog_ingestThis legacy tool extracts frames, updates the context store, and writes summaries to persist/reports/<date>.md. Speech recognition is no longer performed here; the Glass ingestion flow handles it through AUC Turbo when you process the resulting timelines.
MineContext Glass keeps the original context-flow of context_capture → context_processing → storage → server routes, expanding the capture stage with a dedicated video manager.
- Video Capture Manager (upcoming) pulls footage from smart glasses, handles deduplication, and writes raw assets to managed storage.
- Video Processing Pipeline extracts frames, runs embeddings, and forwards structured snippets into the context store.
- Speech Recognition Layer (AUC Turbo) transcribes audio tracks via Doubao AUC Turbo and attaches aligned text spans to the same timeline entries as their visual counterparts.
- Unified Retrieval API exposes both cyberspace and real-life context through a single search and recommendation surface.
Refer to opencontext/ for CLI entry points, managers, storage adapters, and utilities; configuration files live under config/, while runtime data persists in persist/ and logs/.
- English: 当前文档
- 中文: README_zh.md - 完整中文文档
- 视频处理: 支持MP4/MOV/MKV格式,最大2GB
- 语音识别: AUC Turbo集成,音频转录
- 智能搜索: 跨模态上下文检索
- 报告生成: Markdown格式日常总结
- 网页界面: 拖拽上传,实时处理
- CONTRIBUTING.md - 贡献指南
- AGENTS.md - 代理说明
- 技术债务清理 - 架构改进详情
We welcome issues and pull requests focused on expanding context capture, improving retrieval, or polishing the smart glasses workflow. Please review CONTRIBUTING.md and follow the repository's testing and Conventional Commit guidelines.
This project inherits the original MineContext license. See LICENSE for details.
