Adding multiple "visits" (clinical data) to a single .obs #1617

Open
Rafael-Silva-Oliveira opened this issue Aug 28, 2024 · 0 comments
@Rafael-Silva-Oliveira
Hello

I'm trying to add clinical data to each of 75 individual samples (rows of .obs) from bulk RNA sequencing. However, each sample can have 3-5 visits (treatments, etc.), and I can't simply "expand" .obs from 75 rows to 190+ rows: doing so throws an error, since the original adata has shape (75, 42000).

For example:

Say we have an ID called R1_70, and this patient had 5 visits, each with new information (treatment type, visit date, etc.). How can I add this and associate it with R1_70 in my .obs without expanding beyond the original 75 samples? One-hot encoding wouldn't work here. I could "explode" each column into one column per visit ("col_A_visit_1", "col_B_visit_2", etc.), but that would make it quite awkward to run DEG analysis, functional analysis, etc. with tools like scanpy or decoupler.
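To make the "explode" idea concrete, here is a minimal sketch using only pandas. The table contents, the column names (`sample_id`, `visit`, `treatment`, `visit_date`) and the third sample `R1_72` are made up for illustration, and the plain `obs` DataFrame stands in for `adata.obs`. It pivots a long per-visit table into one row per sample and joins it on the sample index, so the row count stays the same:

```python
import pandas as pd

# Hypothetical long-format clinical table: one row per (sample, visit).
visits = pd.DataFrame(
    {
        "sample_id": ["R1_70", "R1_70", "R1_70", "R1_71", "R1_71"],
        "visit": [1, 2, 3, 1, 2],
        "treatment": ["chemo", "chemo", "immuno", "chemo", "none"],
        "visit_date": pd.to_datetime(
            ["2023-01-10", "2023-02-14", "2023-03-21", "2023-01-12", "2023-02-20"]
        ),
    }
)

# Pivot to wide format: one row per sample, one column per (field, visit).
wide = visits.pivot(
    index="sample_id", columns="visit", values=["treatment", "visit_date"]
)
# Flatten the (field, visit) column MultiIndex into names like "treatment_visit_1".
wide.columns = [f"{field}_visit_{visit}" for field, visit in wide.columns]

# Stand-in for adata.obs: indexed by sample ID, one row per sample.
obs = pd.DataFrame(index=pd.Index(["R1_70", "R1_71", "R1_72"], name="sample_id"))

# A left join on the index keeps exactly one row per sample;
# visits a sample never had are left as missing values.
obs = obs.join(wide)
print(obs.shape)
```

The number of rows is unchanged, but every visit field becomes its own column, which is the awkwardness I mean for downstream grouping.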

Any help on how to do this is welcome!

Thanks
