-
Notifications
You must be signed in to change notification settings - Fork 10
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Questions about the model performance #3
Comments
Hi @YiyuLuo, I'm not the author of the repository but I'm currently implementing a PyTorch version of the network described in the paper. Regarding the size of the model I think it is reasonable to decrease the size of some layers. An interesting part of the paper is the ablation study: I think that reducing the three FC layers to one FC layer of 100 units (as done by @mayurnewase) can not be sufficient enough to retain the whole complexity of the output masks, though. |
Thanks for your reply! |
What was the size of your three FC layers ? |
the same as the paper, 600 units each |
How much of the AVSpeech dataset did you use ? I don't have as good results as they do in the paper but they are quite satisfying and I use a lighter model with only 15% of their dataset. |
Due to some policy reasons, AVSpeech dataset is not available. I used GRID dataset instead, about 20,000 speech clips in total. |
I'm still able to download the AVSpeech dataset from this page. Is this website unavailable for you ? |
This website is available but China mainland can't access YouTube. |
Hey I am also using Pytorch but not able to get results, can you help me. Can I get your gmail so that I can discuss it with you. |
Hi! Thank you very much for your great work!
I'm also working on this paper these days, but it takes a long time to train the model exactly the same as in the paper. I haven't got good results so far.
I find you change some layers of the network. I'm also wondering whether it is possible to build a small model while keeping the performance. Could you please tell me how's the perfomance of your modified model?
The text was updated successfully, but these errors were encountered: