Skip to content

Implement code changes to enhance functionality and improve performance #161

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
wants to merge 1 commit into
base: main
Choose a base branch
from

Conversation

gafda
Copy link

@gafda gafda commented Jul 9, 2025

This pull request is a fix to the issue on Issue langchain-ai/langchain-community#187, and includes updates to the PDF parsing logic and dependency management, as well as minor configuration changes. The most important changes involve improving error handling and refining the extract_images_from_page method, adding new PDF-related dependencies, and simplifying configuration settings in pyproject.toml.

PDF Parsing Improvements:

Dependency Management:

  • libs/community/pyproject.toml: Added new dependencies (pdfminer-six, pdfplumber, pymupdf, pypdf, and unstructured) under a new [pdf] section to support PDF-related functionality.

Configuration Simplification:

@gafda
Copy link
Author

gafda commented Aug 1, 2025

Any news about my PR?

@gafda gafda force-pushed the fix-filter-error branch from ced3274 to 694e770 Compare August 1, 2025 11:24
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant