Skip to content

Why RepeatDataset can reduce time of loading data between epochs #1193

Closed Answered by HAOCHENYE
MciaR asked this question in Q&A
Discussion options

You must be logged in to vote

Sorry for my late reply. If the persistent_workers is set to False in Dataloader, there could be additional overhead to relaunch the workers. However, if you enable it in Dataloader, it means all workers will run independently in multiple processes, and any modification to dataset or pipeline will only work in the main process, and the dataset and pipeline in subprocesses launched by Dataloader will not be influenced.

Replies: 1 comment 1 reply

Comment options

You must be logged in to vote
1 reply
@MciaR
Comment options

Answer selected by MciaR
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
2 participants