Add reader for Stereo-seq files. #70

LLehner · 2023-07-18T17:21:37Z

Add reader for Stereo-seq files.

TODO:

LLehner · 2023-07-31T14:50:33Z

file format description @LucaMarconato

https://github.com/STOmics/SAW/tree/main/Documents/FileFormat

codecov-commenter · 2023-08-03T13:11:55Z

Codecov Report

Attention: Patch coverage is 45.51282%, with 85 lines in your changes missing coverage. Please review.

Project coverage is 36.98%. Comparing base (755d475) to head (10c267a).
Report is 199 commits behind head on main.

Additional details and impacted files

@@            Coverage Diff             @@
##             main      #70      +/-   ##
==========================================
- Coverage   41.92%   36.98%   -4.94%     
==========================================
  Files          16       17       +1     
  Lines         854     1352     +498     
==========================================
+ Hits          358      500     +142     
- Misses        496      852     +356

Files	Coverage Δ
src/spatialdata_io/__init__.py	`100.00% <100.00%> (ø)`
src/spatialdata_io/_constants/_constants.py	`100.00% <100.00%> (ø)`
src/spatialdata_io/readers/stereoseq.py	`16.66% <16.66%> (ø)`

... and 9 files with indirect coverage changes
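
The percentages in the report follow directly from the hit and line counts in the diff table. A minimal sketch of the arithmetic, assuming coverage is simply hits divided by tracked lines (the helper name `coverage_pct` is hypothetical and not part of Codecov):

```python
def coverage_pct(hits: int, lines: int) -> float:
    """Return coverage as a percentage, rounded to two decimals."""
    return round(100 * hits / lines, 2)


base = coverage_pct(358, 854)    # main: hits / lines from the diff table
head = coverage_pct(500, 1352)   # this PR
delta = round(head - base, 2)

# Patch coverage: 85 uncovered lines at 45.51282% implies about 156 changed
# lines in total. This is an inference from the two reported numbers, not a
# figure stated in the report.
patch_lines = round(85 / (1 - 0.4551282))

print(base, head, delta, patch_lines)
```

The drop in project coverage comes from the new lines outnumbering the new hits: 498 lines were added but only 142 of them are hit.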

giovp · 2024-04-24T08:35:07Z

@LLehner @LucaMarconato how is this looking? should we merge?

LLehner · 2024-04-24T20:20:55Z

@giovp tasks 3 and 4 from issue #97 still need to be fixed.

giovp · 2024-04-25T08:10:16Z

fantastic, what is the blocker?

LLehner · 2024-04-28T21:34:57Z

> fantastic, what is the blocker?

Aggregation of, e.g., image channels over segmentation masks doesn't work yet; perhaps the segmentation mask isn't properly linked to the table. Plotting with shape rendering doesn't work yet either.

timtreis · 2024-05-06T09:56:14Z

Is the plotting an issue in spatialdata-plot? Can you open an issue and tag me, referencing this convo?

LucaMarconato · 2024-05-10T14:34:42Z

I looked into points 3 and 4 from #97.

Point 3: the labels object has integer values in {0, 1}, with the background being 0 and every cell having label index 1. This makes the aggregation impossible to compute. Possible solutions are to introduce an arbitrary labeling by identifying the connected components, or to check whether a labeling already exists in the raw data. @florianingelfinger do you know if the raw data contains such information, or should we proceed with an arbitrary labeling? Also, Florian mentioned that some data is available in obsm; I will look into it to see if it can be helpful for this. Edit: I am now parsing the polygonal data from obsm. I am not converting the labels (explained in a comment below).
Point 4: there was a bug with the instance_key column getting set to None; I fixed it in ac34e78 (#70). This doesn't fix point 4 yet, but I now think it should be easy, probably just a string mismatch. I will look into it.
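
As an illustration of one way such a mismatch can arise, here is a minimal sketch with made-up data. The variable names and values are hypothetical and do not come from the reader, and this is not claimed to be the actual cause of point 4; it only shows that identifiers stored as strings never match integer identifiers until both sides are cast to a common type.

```python
# Hypothetical instance identifiers: strings in the table, integers on the shapes.
table_instance_ids = ["1", "2", "3"]
shape_index = [1, 2, 3]

# Direct comparison finds no common identifiers because "1" != 1.
matched_raw = set(table_instance_ids) & set(shape_index)

# Casting both sides to the same type restores the link.
matched_cast = {int(i) for i in table_instance_ids} & set(shape_index)

print(len(matched_raw), len(matched_cast))
```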

In addition to solving the above, I would also like to address the following points:

- Now that we support multiple annotation tables, I would create a table for each bin size (currently we only parse the table for the cell-level data).
- Currently in napari we silently subsample a points layer if it contains more than 100000 points. I will add a warning ~~icon next to the label~~ so that the user knows this happens (otherwise plotting bins of size 1 leads to an unintuitive visualization). The warning will explain how to remove this limit.
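
The subsample-and-warn behavior described in the second point could look roughly like the sketch below. This is not the napari-spatialdata implementation: the function name `subsample_points`, its parameters, and the warning text are assumptions made for illustration; only the 100000 threshold comes from the discussion.

```python
import random
import warnings

MAX_POINTS = 100_000  # threshold mentioned in the discussion


def subsample_points(points, max_points=MAX_POINTS, seed=0):
    """Return at most `max_points` points, warning when subsampling happens."""
    if len(points) <= max_points:
        return list(points)
    warnings.warn(
        f"Showing a random subset of {max_points} out of {len(points)} points; "
        "pass a larger max_points to display more.",
        UserWarning,
        stacklevel=2,
    )
    rng = random.Random(seed)  # fixed seed keeps the subset reproducible
    return rng.sample(list(points), max_points)
```

Emitting the warning at the point where the data is reduced means the message appears exactly once per affected layer and can state the limit that was applied.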

Finally, we will be working on a rasterization-based approach for rendering large collections of bins. This will come after this PR is merged, but when available, will improve the user experience around Stereo-seq data.
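
To make the general idea of rasterizing bins concrete, here is a minimal sketch that accumulates per-bin counts into a coarser 2D grid. It is not the planned implementation: the function `rasterize_bins`, the `(x, y, count)` record layout, and the toy data are all assumptions for illustration.

```python
def rasterize_bins(bins, bin_size, width, height):
    """Accumulate (x, y, count) records into a 2D grid indexed as grid[row][col]."""
    n_cols = -(-width // bin_size)   # ceiling division
    n_rows = -(-height // bin_size)
    grid = [[0] * n_cols for _ in range(n_rows)]
    for x, y, count in bins:
        grid[y // bin_size][x // bin_size] += count
    return grid


# Four bins of size 1 on a 4x4 area, aggregated into 2x2 pixels.
bins = [(0, 0, 2), (1, 1, 3), (2, 0, 1), (3, 3, 4)]
image = rasterize_bins(bins, bin_size=2, width=4, height=4)
print(image)  # [[5, 1], [0, 4]]
```

Rendering one pixel per aggregated bin keeps the cost proportional to the image size rather than to the number of bins.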

florianingelfinger · 2024-05-13T06:44:20Z

Many thanks for your work! To my knowledge there is no cell identifier associated with each cell in the raw data or at least we have not used one so far. I would proceed as suggested with arbitrary labeling!
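
The arbitrary labeling by connected components can be sketched as follows. This is a dependency-free illustration using a breadth-first search over a binary mask with 4-connectivity; the function name `label_components` and the toy mask are hypothetical, and the reader does not necessarily do this.

```python
from collections import deque


def label_components(mask):
    """Label 4-connected foreground regions of a binary mask (list of rows)."""
    n_rows, n_cols = len(mask), len(mask[0])
    labels = [[0] * n_cols for _ in range(n_rows)]
    current = 0
    for r in range(n_rows):
        for c in range(n_cols):
            if mask[r][c] == 0 or labels[r][c] != 0:
                continue  # background or already labeled
            current += 1
            labels[r][c] = current
            queue = deque([(r, c)])
            while queue:
                i, j = queue.popleft()
                for di, dj in ((1, 0), (-1, 0), (0, 1), (0, -1)):
                    ni, nj = i + di, j + dj
                    if (0 <= ni < n_rows and 0 <= nj < n_cols
                            and mask[ni][nj] and labels[ni][nj] == 0):
                        labels[ni][nj] = current
                        queue.append((ni, nj))
    return labels, current


mask = [
    [1, 1, 0, 0, 1],
    [0, 0, 0, 0, 1],
    [1, 0, 1, 1, 0],
]
labels, n_labels = label_components(mask)
print(n_labels)  # 4
```

On arrays, `skimage.measure.label` from scikit-image performs this kind of labeling. Cells whose masks touch end up in the same component, so the number of labels can be smaller than the number of cells, and the result also depends on the connectivity chosen.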

LucaMarconato · 2024-05-22T12:26:31Z

I fixed all the points above, with the exception of the parsing of the labels, which I now parse as an image with two colors instead of as a labels element, to avoid confusion. I tried using scikit-image to relabel the labels image, but the number of labels I obtain differs slightly from the number of cells. This is easily fixable, but I would rather skip this preprocessing within the reader and let the user choose to perform it if needed.

I will polish the code and make a short example notebook, after this we are good to merge.

LucaMarconato · 2024-05-24T14:29:26Z

I prepared and uploaded the notebook here; I removed the outputs because the data is not currently public.

The notebook is affected by two bugs in spatialdata-plot, which I tracked here and here. The visualization works with napari-spatialdata. @timtreis since you have the data locally, could you please have a look at them?

Anyway, since the bugs are not in spatialdata-io, now the PR is ready to merge! Thanks all for the work! 🚀

LLehner and others added 6 commits July 18, 2023 19:20

Add reader for Stereo-seq data

504bd9e

Fix init

4595f5d

Fix duplicate Key values

21733f8

[pre-commit.ci] auto fixes from pre-commit.com hooks

8c4e9b8

for more information, see https://pre-commit.ci

Add Key value

093f8c9

Merge branch 'Stereo-seq_reader' of https://github.com/LLehner/spatia…

442de55

…ldata-io into Stereo-seq_reader

Add shapes

a007eb2

LLehner added 3 commits August 3, 2023 15:27

Remove print

8a61442

Fix radius

790ac13

Fix constants

5c24bb2

LLehner marked this pull request as draft August 20, 2023 11:41

LLehner and others added 17 commits August 20, 2023 18:37

Update constants and reader

8f48b60

[pre-commit.ci] auto fixes from pre-commit.com hooks

0f3f9d6

for more information, see https://pre-commit.ci

Fix constants

8219aa7

Add dataset identifier

ca22756

Update reader

b7b1866

[pre-commit.ci] auto fixes from pre-commit.com hooks

a6df587

for more information, see https://pre-commit.ci

Fix parameters

3a78585

Merge branch 'Stereo-seq_reader' of https://github.com/LLehner/spatia…

e256b0b

…ldata-io into Stereo-seq_reader

Fix parameters

edca0d9

[pre-commit.ci] auto fixes from pre-commit.com hooks

fa1fe68

for more information, see https://pre-commit.ci

Fix input parameters

14b3b2c

Merge branch 'Stereo-seq_reader' of https://github.com/LLehner/spatia…

d46d4dd

…ldata-io into Stereo-seq_reader

Fix input parameters

ef50c98

Fix input parameters

70a1ab4

[pre-commit.ci] auto fixes from pre-commit.com hooks

1d54073

for more information, see https://pre-commit.ci

Fix mypy

62ced28

Merge branch 'Stereo-seq_reader' of https://github.com/LLehner/spatia…

57fcada

…ldata-io into Stereo-seq_reader

LucaMarconato and others added 3 commits May 7, 2024 16:34

Merge branch 'main' into Stereo-seq_reader

df00ec3

fix spelling

d68ae3b

fixed cellID being nan due to incorrect assigned name

ac34e78

LucaMarconato and others added 7 commits May 15, 2024 16:17

fix metadata

1872f11

wip bin data parsing

2b8f70f

[pre-commit.ci] auto fixes from pre-commit.com hooks

85fde20

for more information, see https://pre-commit.ci

renamed StereoseqKeys to SK

23af1bd

Merge branch 'Stereo-seq_reader' of https://github.com/LLehner/spatia…

8ee712d

…ldata-io into Stereo-seq_reader

comparing points vs shapes for bins

295e1b4

using points for bins

abf1471

LLehner mentioned this pull request May 22, 2024

interactive_visualization for stereo-seq scverse/squidpy#674

Closed

LucaMarconato added 4 commits May 22, 2024 14:28

experiment with relabeling; I will remove the code

65cddb7

cleanup

e9bfc75

fix spatialdata dependency

3c40d08

stereoseq fixes

579d972

LucaMarconato and others added 3 commits May 24, 2024 16:31

changelog

94112e1

add stereoseq to readme

e1ce1ac

cleanup

9cd012c

LucaMarconato approved these changes May 24, 2024

LucaMarconato merged commit 5c912d1 into scverse:main May 24, 2024
5 checks passed

LucaMarconato mentioned this pull request Jul 10, 2024

Stereoseq expected directory structure #161

Open
