You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I'd like to do that in one pass at the end, since I want to ensure that the URLs submitted do not contain usernames/passwords in them, similarly for comments as well. This is why it's hard to provide access to this dataset "on the go".
Also, we will be labeling at least a portion of the dataset. I have yet to figure out the legal possibilities around distributing the labeled dataset as there may be copyright restrictions there which would prevent us from doing so. But if we do have the option I'd like to release the labeled dataset when it's finished as well.
Hello !
Will the dataset built with this be shared publicly? (Or is it already maybe?)
thanks!
The text was updated successfully, but these errors were encountered: