You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
First, it would be interesting to adapt the DataGenerator to work as a Python Generator. I'll read more about this.
Regarding your comment on data loading, it is clear that this process should be more efficient. This is an important issue to work on. We have to think how to do this, because the idea is to keep the library independent of the backend used to implement the machine learning model. Does it make sense to have different DataGenerators for each backend as we have with the ModelContainers? For instance, now we have KerasDataGenerator that converts a DataGenerator to be used for training keras models. Maybe we can implement this keras generator following the blog.
Thanks for publishing this interesting framework!
I have a question: Are you planning to support "real" python generators such as tf.Dataset as DataGenerator? Then the input pipeline could be much more efficient, especially when dealing with datasets that don't fit in memory.
Here is a nice blogpost about this: https://towardsdatascience.com/how-to-build-efficient-audio-data-pipelines-with-tensorflow-2-0-b3133474c3c1
Thanks and all the best!
The text was updated successfully, but these errors were encountered: