Obsidian-first research corpus that synthesizes Translation Studies theory with Large Language Models (LLMs). The repo keeps the vault clean, automates ConnectedPapers + Semantic Scholar graphs, and emits both white_paper.md (LaTeX-ready) and white_paper.html (embed-ready for GitHub Pages).
- Conda environment with Python 3.10+ and R (visNetwork + htmlwidgets). The shared stack uses the `BA_Gensim` env: `/home/ben/miniconda3/envs/BA_Gensim`.
- `connectedpapers-py` API token exported as `CONNECTEDPAPERS_TOKEN` (or pass `--allow-test-token` for demo data).
- System dependencies for Mermaid-ready HTML are already handled via CDN.
```bash
# 1. Refresh ConnectedPapers graphs (skips if token missing)
conda run -n BA_Gensim python script/white_paper_builder.py --skip-html

# 2. Full publish (widgets + markdown + HTML)
conda run -n BA_Gensim python script/white_paper_builder.py
```

The builder automatically:
- Invokes `script/connected_papers.py` when a token is available (or when `--allow-test-token` is supplied).
- Invokes `script/build_semantic_widgets.py` to regenerate all Semantic Scholar widgets and the combined manifest.
- Converts the ordered Obsidian notes (driven by `Obsidian_Project/outline.md`) into `white_paper.md` and `white_paper.html`, injects the contact block, transforms Mermaid code fences into live diagrams, and lists every generated widget for download.
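
To make the order of operations concrete, the steps above can be sketched as a small planning function. This is an illustration only, not the code in `script/white_paper_builder.py`: the function name `plan_steps`, its keyword arguments, and the idea of returning a list of step labels are all assumptions made for this example. Only the script names, the flags, and the `CONNECTEDPAPERS_TOKEN` variable come from this README.

```python
import os
from typing import List, Mapping, Optional


def plan_steps(
    skip_connected_refresh: bool = False,
    allow_test_token: bool = False,
    skip_widget_refresh: bool = False,
    skip_html: bool = False,
    env: Optional[Mapping[str, str]] = None,
) -> List[str]:
    """Return the steps a run would perform, in order (hypothetical helper)."""
    env = os.environ if env is None else env
    has_token = bool(env.get("CONNECTEDPAPERS_TOKEN"))

    steps: List[str] = []
    # ConnectedPapers refresh needs a real token or the demo-data flag.
    if not skip_connected_refresh and (has_token or allow_test_token):
        steps.append("script/connected_papers.py")
    # Semantic Scholar widgets are regenerated unless explicitly skipped.
    if not skip_widget_refresh:
        steps.append("script/build_semantic_widgets.py")
    # Markdown is always emitted; HTML only when not skipped.
    steps.append("white_paper.md")
    if not skip_html:
        steps.append("white_paper.html")
    return steps
```

Calling `plan_steps(skip_html=True)` with no token in the environment would, under these assumptions, leave out the ConnectedPapers refresh and the HTML output while still rebuilding widgets and the markdown file.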
| Flag | Purpose |
|---|---|
| `--skip-connected-refresh` | Use cached ConnectedPapers payloads. |
| `--allow-test-token` | Falls back to ConnectedPapers demo graphs when no API key is present. |
| `--skip-widget-refresh` | Keep the existing Semantic Scholar widgets. |
| `--force-widget-refresh` | Rebuild widgets even if cached JSON/HTML exist. |
| `--skip-html` | Only emit `white_paper.md` (for LaTeX workflows). |
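
For readers who want to see how flags like these are typically wired up, here is a minimal `argparse` sketch. It is not taken from the builder's source: the function name `build_parser` is hypothetical, and treating `--skip-widget-refresh` and `--force-widget-refresh` as mutually exclusive is an assumption of this example, not something the README states.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    """Hypothetical parser mirroring the flag table in this README."""
    parser = argparse.ArgumentParser(prog="white_paper_builder.py")
    parser.add_argument(
        "--skip-connected-refresh",
        action="store_true",
        help="Use cached ConnectedPapers payloads.",
    )
    parser.add_argument(
        "--allow-test-token",
        action="store_true",
        help="Fall back to ConnectedPapers demo graphs when no API key is present.",
    )
    # Assumption: skipping and forcing a widget refresh cannot be combined.
    widgets = parser.add_mutually_exclusive_group()
    widgets.add_argument(
        "--skip-widget-refresh",
        action="store_true",
        help="Keep the existing Semantic Scholar widgets.",
    )
    widgets.add_argument(
        "--force-widget-refresh",
        action="store_true",
        help="Rebuild widgets even if cached JSON/HTML exist.",
    )
    parser.add_argument(
        "--skip-html",
        action="store_true",
        help="Only emit white_paper.md (for LaTeX workflows).",
    )
    return parser
```

With this sketch, `build_parser().parse_args(["--skip-html"])` yields a namespace where `skip_html` is `True` and every other flag defaults to `False`.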
- Primary embed: `data/network_manifests/combined_network_widget.html` (auto-lazy-loaded in `white_paper.html`).
- Widget index: automatically pulled from `data/network_manifests/graph_manifest_combined.json` and shown in the HTML under Widget Library.
- Source scripts live in `/script/` (see `build_semantic_widgets.py`, `connected_papers.py`, `network_widgets.R`).
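
As a rough illustration of how a manifest can be turned into a Widget Library list, the sketch below renders JSON entries as download links. The manifest schema is not documented in this README, so the shape used here (a JSON array of objects with `title` and `path` keys) and the function name `render_widget_library` are purely hypothetical; check `graph_manifest_combined.json` for the real structure.

```python
import html
import json


def render_widget_library(manifest_json: str) -> str:
    """Render manifest entries as an HTML list (assumed schema: title + path)."""
    entries = json.loads(manifest_json)
    items = []
    for entry in entries:
        # Escape both fields so titles or paths cannot inject markup.
        title = html.escape(str(entry["title"]))
        path = html.escape(str(entry["path"]), quote=True)
        items.append(f'  <li><a href="{path}" download>{title}</a></li>')
    return "<ul>\n" + "\n".join(items) + "\n</ul>"
```

The function takes the manifest text rather than a file path so it can be tried without the repo's data directory.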
- `Obsidian_Project/` – cleaned notes ready for Obsidian + Mermaid.
- `script/` – automation helpers (always document new utilities here).
- `data/` – reference metadata, raw Semantic Scholar graphs, widget outputs, and ConnectedPapers payloads.
- `system_instruct/` – high-level mandates & publishing rules.
- `docs_AI/` – reserved for coordination transcripts (git-ignored).
- Mermaid not visible: `white_paper_builder.py` now rewrites `` ```mermaid `` blocks into `<div class="mermaid">` and pulls the Mermaid CDN automatically. Re-run the builder if diagrams still show as code blocks.
- Slow HTML load: The heaviest asset is the combined knowledge-graph iframe. It lazily loads, but you can temporarily hide the embed by replacing `loading="lazy"` with `loading="lazy" data-disabled` in `Obsidian_Project/Visual_Assets.md` during drafting.
- ConnectedPapers throttling: If the API token is absent, the builder skips the refresh and logs a warning. Export the token (or pass `--allow-test-token`) before publishing.
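
When debugging diagrams that still render as code, it can help to see what a Mermaid fence rewrite looks like in isolation. The sketch below is one plausible way to do it with a regular expression; it is not the builder's actual implementation. The names `MERMAID_FENCE` and `rewrite_mermaid` are hypothetical, and HTML-escaping the diagram source is a design choice assumed here so that characters such as `<` are not parsed as HTML tags.

```python
import html
import re

# Matches a fenced block that opens with a mermaid fence on its own line
# and closes with a bare fence on its own line.
MERMAID_FENCE = re.compile(
    r"^```mermaid[ \t]*\n(.*?)\n```[ \t]*$",
    re.DOTALL | re.MULTILINE,
)


def rewrite_mermaid(markdown: str) -> str:
    """Replace mermaid code fences with <div class="mermaid"> wrappers."""

    def to_div(match: re.Match) -> str:
        # Escape &, < and > in the diagram source; quotes are left alone.
        body = html.escape(match.group(1), quote=False)
        return '<div class="mermaid">\n' + body + "\n</div>"

    return MERMAID_FENCE.sub(to_div, markdown)
```

Fences in other languages are left untouched, so running the function over a note that mixes Python and Mermaid blocks only changes the Mermaid ones.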
- Lead Agent: Digital Humanities Systems Architect ([email protected])
- Toolchain: GPT-5.1-Codex (Preview), Gemini 3.0 Pro, GitHub Copilot (VS Code)