Modernize dependencies and CI infrastructure #216
Open
rom1504 wants to merge 26 commits into criteo:master from rom1504:modernize-dependencies
Commits
38fd660 Modernize dependencies and CI infrastructure (rom1504)
59e53f8 Fix pyarrow constraint compatibility with embedding_reader (rom1504)
fa57083 Fix PEX build numpy constraint compatibility (rom1504)
48e0df9 Disable fail-fast in CI to test all Python versions (rom1504)
4fb76f2 Update publish action to use Python 3.10 (rom1504)
c82f63b Add Java 17 setup to GitHub Actions workflows (rom1504)
f8348eb Update pyarrow and embedding_reader to latest versions (rom1504)
a9f5f43 Expand pyarrow version range for better compatibility (rom1504)
5998818 Fix NumPy 2.x JSON serialization issue (rom1504)
38624b6 Fix NumPy 2.x JSON serialization issue (rom1504)
244a9a1 Modernize Python support and upgrade PySpark to 4.x (rom1504)
7f40878 Fix CI lint job and PEX build issues (rom1504)
4273f3e Add defensive fixes for FAISS merge robustness in CI environments (rom1504)
2598410 Fix black formatting in distributed.py (rom1504)
5c2750e Update documentation generation to use Python 3.10 (rom1504)
4aeb38c Revert "Add defensive fixes for FAISS merge robustness in CI environm… (rom1504)
763b83b Fix FAISS distributed test by pinning numpy<2 (rom1504)
fbdbd5b Add comprehensive debugging to distributed test (rom1504)
4539fa2 Add detailed debugging to PySpark merge operations (rom1504)
cd9911a Replace print statements with proper logging in distributed.py (rom1504)
423250e Fix lint issues in distributed.py logging (rom1504)
c4c57f3 Test NumPy 2.x compatibility with existing debugging infrastructure (rom1504)
0d00b95 Revert to NumPy 1.x and remove debugging infrastructure (rom1504)
4822bbf Update setup.py (rom1504)
640108e Update Makefile (rom1504)
31ce429 Update setup.py (rom1504)
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| | | @@ -0,0 +1,16 @@ |
| | | +"""JSON encoder for numpy types""" |
| | | +import json |
| | | +import numpy as np |
| | | + |
| | | + |
| | | +class NumpyEncoder(json.JSONEncoder): |
| | | +    """Custom JSON encoder for numpy types""" |
| | | + |
| | | +    def default(self, o): |
| | | +        if isinstance(o, np.integer): |
| | | +            return int(o) |
| | | +        elif isinstance(o, np.floating): |
| | | +            return float(o) |
| | | +        elif isinstance(o, np.ndarray): |
| | | +            return o.tolist() |
| | | +        return super().default(o) |

Review comment (Contributor, Author) on `class NumpyEncoder(json.JSONEncoder):`: unnecessary since not using numpy >= 2
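
For context, here is a minimal sketch of how an encoder like the one in this diff could be passed to `json.dumps`. The class body repeats the diff's logic so the snippet runs on its own. The `stats` payload and its keys are hypothetical and not taken from the PR.

```python
import json

import numpy as np


class NumpyEncoder(json.JSONEncoder):
    """Same conversion logic as the encoder shown in the diff."""

    def default(self, o):
        if isinstance(o, np.integer):
            return int(o)
        if isinstance(o, np.floating):
            return float(o)
        if isinstance(o, np.ndarray):
            return o.tolist()
        return super().default(o)


# Hypothetical payload for illustration only; not taken from the PR.
stats = {
    "nb_vectors": np.int64(1000),
    "recall": np.float32(0.5),
    "ids": np.array([1, 2, 3]),
}

# Without cls=NumpyEncoder, json.dumps raises TypeError on np.int64,
# because NumPy integer scalars are not instances of Python's int.
encoded = json.dumps(stats, cls=NumpyEncoder)
decoded = json.loads(encoded)
print(decoded)
```

`default` is only called for objects the standard encoder cannot handle, so plain Python values in the same payload pass through unchanged.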
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| | | @@ -4,5 +4,4 @@ pylint==3.0.3 |
| | | pytest-cov==4.1.0 |
| | | pytest-xdist==3.5.0 |
| | | pytest==8.0.1 |
| | | -pyspark==3.2.2; python_version < "3.11" |
| | | -pyspark<3.6.0; python_version >= "3.11" |
| | | +pyspark>=4.0.0,<5.0.0 |

Review comment (Contributor, Author) on `pyspark>=4.0.0,<5.0.0`: maybe updating pyspark is the cause
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| | | @@ -1,10 +1,8 @@ |
| | | -dataclasses>=0.6,<1.0.0; python_version < "3.7" |
| | | -fire>=0.4.0,<0.6.0 |
| | | +fire>=0.4.0,<0.7.0 |
| | | numpy>=1.19.5,<2 |
| | | pandas>=1.1.5,<3 |
| | | -pyarrow>=6.0.1,<16 |
| | | +pyarrow>=6.0.1,<30 |
| | | tqdm>=4.62.3,<5 |
| | | -faiss-cpu<1.7.3; python_version < "3.7" |
| | | -faiss-cpu>=1,<2; python_version >= "3.7" |
| | | +faiss-cpu>=1,<2 |
| | | fsspec>=2022.1.0 |
| | | -embedding_reader>=1.5.1,<2 |
| | | +embedding_reader>=1.8.0,<2 |
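
To make the effect of the widened ranges concrete, the sketch below evaluates a few version strings against the old and new specifiers from this diff. It assumes the third-party `packaging` library is installed. The version strings being tested are illustrative choices, not values mentioned in the PR.

```python
from packaging.specifiers import SpecifierSet

# Constraints copied from the requirements diff.
old_pyarrow = SpecifierSet(">=6.0.1,<16")
new_pyarrow = SpecifierSet(">=6.0.1,<30")
numpy_pin = SpecifierSet(">=1.19.5,<2")

# Illustrative version strings (assumptions, not taken from the PR).
pyarrow_candidate = "17.0.0"

old_allows = old_pyarrow.contains(pyarrow_candidate)
new_allows = new_pyarrow.contains(pyarrow_candidate)
numpy1_allowed = numpy_pin.contains("1.26.4")
numpy2_allowed = numpy_pin.contains("2.0.0")

print(f"pyarrow {pyarrow_candidate}: old range {old_allows}, new range {new_allows}")
print(f"numpy 1.26.4 allowed: {numpy1_allowed}")
print(f"numpy 2.0.0 allowed: {numpy2_allowed}")
```

The unchanged `numpy>=1.19.5,<2` line still excludes NumPy 2.x, which is consistent with the review comment that the NumPy encoder is unnecessary.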