-
Notifications
You must be signed in to change notification settings - Fork 417
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Call for content #4
Comments
I have some notebook from the recent Kaggle Diabetic Retinopathy competition loading some trained model and using it to predict on some images, get some activations, etc. But it's quite quickly written and a little messy, so it probably needs to be cleaned up a bit. Nevertheless, something quite applied like that might be interesting for some people, maybe? |
Forgot to mention: it's built on top of an older version of Lasagne as well. At commit cf1a23c21666fc0225a05d284134b255e3613335. Same as my own fork. |
I can write a spatial transformer example, when the layer is added. I would need to clean up the transformer code and create a repeat layer similar to http://keras.io/layers/core/#repeatvector (which I think is quite usefull for encoder/decoder stuff) I have code for combining a GRU with a spatial transformer network. I could also just rewrite the spatial transformer network results. What do people prefer? |
btw I also have a Penntree example. The setup is similar to http://arxiv.org/abs/1409.2329 except that i would have to use a GRU because of a minor technical issue: craffel/nntools@f4d15ba#diff-48f9f9d93ed00587a6c49197b4d6e64eL992 Penntree example |
@JeffreyDF your notebook looks really interesting, but it seems like it's dependent on a lot other stuff in your repo, might be a quite a bit of work to move it. Perhaps we should add an "External Resources" section to @skaae I think the pentree example would be great, and spatial transformer as well once Lasagne/Lasagne#355 is resolved. |
We also still have this wiki page for links to things that don't fit into Recipes (or were not turned into Recipes yet): https://github.com/Lasagne/Lasagne/wiki/3rd-party-extensions-and-code |
@ebenolson Yes, you're right. But in essence, the same functionality can be achieved with just a few extra lines of code in the notebook, I think. It's mostly the DataLoader which only reads images and resizes them (at least for testing) and the custom metrics, iirc. I can try to clean that up a bit (a little busy at the moment but should be able to find some time in the week). But I'm totally fine if you would rather have it as a more "complete" project to maybe point to (or not!). It makes some sense to do that. :-) |
Yeah, I feel like not many people know about/visit the wiki though, it would be nice if we could increase visibility (although Recipes is in a similar state at the moment). |
Ah, yes, extra remark: for my notebook I'm using these 80-90MB model dumps to load first. So yes ... that probably makes it a little more unsuitable to have here! Forgot about that. |
If you've got time to package it, I think that would be great, I'm happy with either though. I would like to keep large binary files out of the repo (data, snapshots), so if you decide to rework it it would be good if you could host those on Dropbox/S3 or something similar. |
I do have an S3 bucket I made for Recipes stuff that I'm happy to put them in if you want. |
I'll try to clean up the Penntree example and make it use GRU units instead of LSTM. I don't have access to a GPU before i return to Denmark in ~10 days so i can not test the results before then. |
Great, thank you! |
@ebenolson Thank you! Will take that into account. Might do something with it in a little while. :-) |
I will submit PRs for the highway networks and hidden factors of variation notebooks when I find some time. It's Hopefully soon! I'll also have a think about what else I could share. Maybe some of the cyclic pooling/rolling stuff (with custom CUDA kernels) would be useful to have on here as well.
@skaae please do both :D That cluttered MNIST example looks really cool by the way! Is this published anywhere? |
We could also add links on http://gitxiv.com/ if the example reproduces a specific paper. |
I have the code from the LSTM benchmark, which needs a little updating, which is a noisy speech recognition experiment. It would be a little silly to host it in both places though. I also have code (also online) for my ISMIR paper, but it's a bit obscure for a simple example. One thing I've been asked for is examples for using a CNN on spectrograms. I could try to make up a simple example for this, or maybe @f0k should reproduce his ISMIR paper from last year with Lasagne ;) For any of the above, data is an issue as none of them are common datasets (like CIFAR or MNIST). How can we get stuff in the S3 bucket @ebenolson ? @skaae A number of people have asked me for a "char-rnn" example in Lasagne, which I think your penn treebank example is close to. Any chance you want to do that too/instead? |
Yes. I think its the same as the char-rnn except that i need a sample function which im not sure how to implement. As i understand it the model outputs a probability distribution over words which you sample and use as input in the next time step (?). We could implement sampling by compling a single step model and then run it in for-loop.
Here f is the recurrent model compiled to run a single step. |
I've actually already reproduced it, but it depends on other code of mine that I'd need to strip off. Also the training data is not available online. It'd be easier to reproduce the onset detection paper, for which at least part of the training data is public (but still not enough). These data issues are a bit annoying... Maybe this year's ISMIR paper would be suited best. I'll see what I can do. |
I've emailed access details to you and @skaae. If anyone else needs access let me know. |
I updated the language model to use GRU. It seems to run but i cannot test the performance until i have access to GPU. https://github.com/skaae/nntools/tree/penntree_lasagne/examples Todo:
Comments are velcome :) |
Re: data storage and S3, maybe a better solution would be to use Git LFS (or one of the equivalents)? That way it won't ever go anywhere and we won't need any downloader scripts. |
I'm not sure - it looks like Github LFS is still under construction (I applied for the early access though, we'll see what happens...). Also I don't know if asking people to install git-lfs is easier than a download script (with an ipython notebook you can do it inline with |
I have access. There are a bunch of other options, but I'm predicting git LFS to come out on top.
Yeah, I mostly mean for data provenance (what happens when you go broke and your S3 disappears?) |
Hi @skaae, I like the Penntree example very much. Glad that it will work with what's checked in now. I had it working before, but needed to change some parameters for LTSM to fit Colin's recurrent branch. What is the issue with data for that example? It's quite small. Or that's about a different paper. Sorry having a hard time following. |
Cool! Do you know if it's possible to get a download URL, like the zipball links for repos? Is there any more info on quota/pricing available? The 1GB storage limit I saw seems pretty small.
Yeah, free github hosting would be ideal - although I wonder how sustainable binary hosting will be for them. Anyway for now I think everyone should just use whatever they're comfortable with and we can always mirror/migrate in the future. S3 has cost me $0.52 so far, so I'm not too concerned about breaking the bank :) |
That's a good question. I'm pretty sure it doesn't, which limits its utility...
https://help.github.com/articles/billing-plans-for-git-large-file-storage/
Hah ok, I didn't realize it was so cheap! I think you're right that that's the way to go for now. |
@moscow25 can you move the question to the lasagne google group? Maybe be a bit more specific about the problem and i'll try to help. |
I was gonna sort out the notebooks I did before and submit PRs for them, but it looks like they depended on some functions provided by the old mnist example that are no longer there. So it'll take a bit longer than I anticipated as I don't have time to fix them right now. If anyone else wants to do it, be my guest. It just a question of getting rid of Otherwise, hopefully next weekend. |
@benanne @dnouri @craffel @f0k @skaae (and anyone else of course)
With first release imminent it would be nice to have a bit more here... I know a couple of you have stuff written up already, but I bet everyone has some suitable code lying around.
If you have anything you're willing to contribute, please open a PR... don't worry if it's not perfect, I can take care of making sure everything functions with the latest Lasagne.
The text was updated successfully, but these errors were encountered: