-
Notifications
You must be signed in to change notification settings - Fork 0
DVP image readers #1
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: main
Are you sure you want to change the base?
Conversation
Add reader for Stereo-seq files.
update codecov
Made Xenium table optional
Support for `binned_ouputs` folder for Visium HD
merscope reader update
updates: - [github.com/asottile/pyupgrade: v3.15.2 → v3.16.0](asottile/pyupgrade@v3.15.2...v3.16.0)
[pre-commit.ci] pre-commit autoupdate
Reader for seqFISH data
fix bin_size parsing visium hd
for more information, see https://pre-commit.ci
[pre-commit.ci] pre-commit autoupdate
for more information, see https://pre-commit.ci
…/spatialdata-io into improved-release-process
improved release process
|
Hi @lucas-diedrich, I have noticed this PR from you. By any chance did you intend to open it against |
merscope reader remove invalid polygons
|
Hi @LucaMarconato, thanks for your message! I intended to open this PR against my fork to receive initial internal feedback, apologies for the confusion! I am still waiting for some feedback on the readers so it might still take some time until I open a real pull request. We (the Mann Lab) are very interested in using
Parsers for various MS-based proteomics data formats will probably be available in the next few weeks. Comparing our requirements with the implemented readers for spatial omics technologies, I noted that our workflow might deviate slightly from the other technologies: Usually, we first analyze the imaging data to identify cells of interest and only select a small subset for subsequent Mass Spectrometry-based profiling. In other words, we would usually have a sequential workflow in which we
Then perform the initial computational analysis to identify cells or areas of interest, and subsequently This is in contrast to the existing readers that appear to be intended to load all data at once into the object. Overall, it is certainly true that the workflow and the generated output files, at this point, are far less streamlined than other spatial technologies and some outputs might change in the future. Therefore, my question would be what you would consider the best way to proceed:
Many thanks! |
…overflow. Construct dask arrays directly after reading the respective numpy array
… dvp_image_readers
|
Hi, thanks for the detailed explanation.
We have in the roadmap to have a robust way to parse various image formats. This was partially (only) started here scverse#234 and here https://github.com/scverse/spatialdata-io/blob/main/src/spatialdata_io/readers/generic.py, so the One would need to double check the support for
Smaller PRs are easier to build a review, so I would split the work whenever possible.
We support building, writing and reading |
Update xenium.py, fix problem with hidden files in morphology_foucs direcory
This PR implements readers for DVP imaging data. Imaging data represents 1 of 3 modalities (imaging, proteomics, cell segmentation/shape information) relevant to DVP experiments.
Currently implemented readers
spatialdata_io.experimental.czi.read_czi- Reader for Carl-Zeiss formatted imaging files. It's a wrapper for the pylibczirw library, the official CarlZeiss python IO package for imaging data. The reader supports a) reading in a single RGB channel (as typically used for pathology/HE stains), or b) an arbitrary number of grayscale-channelsspatialdata_io.experimental.openslide.read_openslide. Reader for various whole slide imaging data, with a focus on digital pathology dataformats. This is a wrapper for the openslide library, a widely used and supported library to parse pathology data. This reader is mainly implemented for its support of themiraxformat. The reader supports parsing of RGB/RGBA images and returns an RGBA image.General strategy
Both readers follow the same strategy - to mitigate issues with reading the very large imaging data into memory, they tile the image, read in these subsets in a parallelized manner, leveraging
dask.delayedand return an assembled xarray, parsed by thespatialdata.models.Image2DModel.Implementation
This strategy is implemented with general functions, implemented in the
_utils.pymodule and format-specific funtions, implemented in the respective modules. The implementation follows the functional programming scheme of the otherspatialdata_ioreaders.