Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Outputs of CellRangerCount seem to contain duplicated files in an archive #11

Open
prioux opened this issue May 19, 2023 · 2 comments
Open
Labels
bug Something isn't working

Comments

@prioux
Copy link
Member

prioux commented May 19, 2023

CellRangerCount seems to create a folder with a bunch of files.

Inside that folder there is a .tgz archive that contain a duplication of the files already in that folder. That TAR file contains most, but not all of the files in the output folder.

This should be cleaned up. Either:

  1. the tar file is completely redundant and should be removed before saving the output folder or
  2. the tar file is supposed to be the final product (with a few less files) and we shouldn't even attempt to save the output folder.

How to check as a CBRAIN developer:

a) cd to the root of a finished CellRangerCount task
b) create the list of files in the output dir:

find cellranger_count_res -type f | sort >/tmp/listfiles  # adjust name of folder

c) create the list of files in the .tgz archive

tar -tzf cellranger_count_res/cellranger_count_res.mri.tgz | sort >/tmp/archfiles

d) Compare them with diff, or csdiff:

diff /tmp/listfiles /tmp/archfiles
@prioux prioux added the bug Something isn't working label May 19, 2023
@prioux
Copy link
Member Author

prioux commented Jun 12, 2023

Can we contact the CellRanger developers about that?

@natacha-beck
Copy link
Contributor

We do not have direct contact with the dev team for this tool, it was integrated for users.

When look at the cellranger documentation, section Explore the output of cellranger count I feel like the only ouput that need to be saved is the outs folder.

I will confirm with the user.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

No branches or pull requests

2 participants