Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Find solution for logging and results when multiple users are running workflows #25

Open
drewoldag opened this issue Jul 31, 2024 · 1 comment

Comments

@drewoldag
Copy link
Collaborator

drewoldag commented Jul 31, 2024

As it stands, all the logs go to .../<date>/<run_id> and all the output files end up in .../staging/ but when multiple people are running at the same time this won't work. People would end up clobbering each other's run logs, and the checkpointing system wouldn't know where to look for a given user's latest checkpoints - instead it would consider everyones checkpoints.

Additionally, now that there are multiple types of workflows, we need a way to distinguish which workflow produced a particular set of results along with some way to distinguish who is producing those results. This is also a problem for the checkpoint files. It would be bad for workflow A to use a checkpoint from workflow B just because the input files were named the same thing.

Probably need something like .../<person>/<run_id> or .../<workflow_id>/<person>/<run_id>

@drewoldag drewoldag changed the title Require a user to have a unique logging directory Find solution for logging and results when multiple users are running workflows Aug 15, 2024
@drewoldag
Copy link
Collaborator Author

@wilsonbb had some thoughts about a more substantial system for manifest handling when multiple people are processing from the same staging data.

#30 (comment)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant