Skip to content

Release 4.1.1

Choose a tag to compare

@thiswillbeyourgithub thiswillbeyourgithub released this 27 Oct 18:11
· 63 commits to main since this release

What's new

This release focuses on integrating Chonkie for semantic chunking, improving test reliability, and code quality enhancements through comprehensive linting.

Features

  • Chonkie Semantic Chunking Integration
    • Implemented ChonkieSemanticSplitter using semantic chunking with memoization ([081e81a])
    • Added transform_documents method to ChonkieSemanticSplitter ([534cc90])
    • Replaced RecursiveCharacterTextSplitter with ChonkieSemanticSplitter in summarize.py ([77f1652])
    • Added chonkie to requirements ([7234f86])
    • Merged chonkie branch into dev ([f89390a])

Fixes

  • Logging & Display

    • Fixed colors not appearing in loguru ([99502e7])
    • Fixed wrong logic for stdout color ([83e7fb9])
  • Parsing & Type Hints

    • Allow LLM to mention "thinking" inside its thinking ([d2bca84])
    • Fixed error message when parsing thinking ([a50ec42])
    • Fixed typehint error for topk autoincrease ([615828a])

Refactor

  • Split batch file loader into two files ([a0420fd])
  • Comprehensive ruff linter run across codebase ([d9f7eac])
  • Switched from black to ruff ([2d8a51b])
  • Made ruff configuration less strict ([e04fc8d])

Tests

  • DDG Test Improvements
    • Finally fixed DDG error not capturing output ([e1b2a87])
    • Capture DDG output properly ([d9c5ae9])
    • Set max DDG results to 10 to reduce failures ([ccdffd1])
    • Print output before error message ([66bf47c])
    • Better way to print output ([96c5186])
    • Don't use alias of grep ([a02ffbc])

Chore

Commits details since the last release

bumpver.toml
docs/source/conf.py
setup.py
wdoc/wdoc.py

tests/test_cli.sh

tests/test_cli.sh

tests/test_cli.sh

README.md

tests/test_cli.sh

tests/test_cli.sh

tests/test_cli.sh
tests/test_wdoc.py

wdoc/utils/batch_file_loader.py
wdoc/utils/load_recursive.py

setup.py

wdoc/utils/misc.py

wdoc/utils/tasks/summarize.py

wdoc/utils/misc.py

wdoc/utils/tasks/query.py

wdoc/wdoc.py

wdoc/utils/logger.py

wdoc/utils/logger.py

wdoc/utils/logger.py

wdoc/utils/misc.py

wdoc/utils/misc.py

README.md

README.md

README.md

README.md

README.md

scripts/AnkiFiltered/AnkiFilteredDeckCreator.py
scripts/NtfySummarizer/NtfySummarizer.py
scripts/TheFiche/TheFiche.py
tests/test_parsing.py
tests/test_vectorstores.py
tests/test_wdoc.py
wdoc/main.py
wdoc/utils/batch_file_loader.py
wdoc/utils/customs/binary_faiss_vectorstore.py
wdoc/utils/customs/litellm_embeddings.py
wdoc/utils/embeddings.py
wdoc/utils/env.py
wdoc/utils/filters.py
wdoc/utils/interact.py
wdoc/utils/llm.py
wdoc/utils/loaders/init.py
wdoc/utils/loaders/anki.py
wdoc/utils/loaders/local_audio.py
wdoc/utils/loaders/local_html.py
wdoc/utils/loaders/local_video.py
wdoc/utils/loaders/logseq_markdown.py
wdoc/utils/loaders/online_media.py
wdoc/utils/loaders/pdf.py
wdoc/utils/loaders/shared_audio.py
wdoc/utils/loaders/youtube.py
wdoc/utils/misc.py
wdoc/utils/prompts.py
wdoc/utils/retrievers.py
wdoc/utils/tasks/parse.py
wdoc/utils/tasks/query.py
wdoc/utils/tasks/shared_query_search.py
wdoc/utils/tasks/types.py
wdoc/wdoc.py

.pre-commit-config.yaml

.pre-commit-config.yaml
setup.py