-
Notifications
You must be signed in to change notification settings - Fork 34
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Very high dimensional data and pyhsmm-SLDS? #3
Comments
Hi @hedgy123, thanks for your feedback! I've been doing a bunch of work that addresses various aspects of your problem. The latest versions of PySLDS support input driven dynamics and input driven observations, at least with Gibbs sampling. Hopefully the examples make it clear how to add inputs to the model.
Scott |
That's so awesome! Thanks so much! I tried it right away but couldn't compile the package + the dependencies: pylds has syntax errors under python 2.7; and pyslds, under 3.5. Specifically, resample_states((data, kwargs)) in pyslds.parallel - tuple unpacking was removed in python 3. |
Sorry about that @hedgy123 -- I've been doing all my testing in python3.5. Not sure why yours crashed on joblib though. Mine would only give a runtime error if |
Python 2.7 is best Python. We should set up a simple Travis CI test to catch compatibility issues. |
Hi again Matt and Scott,
I absolutely love your SLDS code and your bee example is awesome! I was wondering though, would this work for a very high-dimensional system?
To be specific, consider a dataset consisting of some large number of time series (of order 100) – for example, I have 100 sensors measuring different quantities for the same process. The process likely contains a number of distinct dynamical states. Can I feed this type of data into your pyslds.models.WeakLimitStickyHDPHMMSLDS model and let it pick out the distinct states? If so, could you suggest sensible parameters to start for such a high-dimensional system? I've played around with it for a little bit but it seems to choke, probably because my initial guesses weren't very good.
Assuming the above works, could you suggest the best way to extract in which state the system is at any particular time, or (even better) time boundaries of the distinct states?
Finally, I have a number of inputs that I know can potentially drive the sensor readings. Is there a way to input them into the model (so that it becomes a switching LDS with inputs)?
BTW, I have also been playing with your WeakLimitHDPHSMM, and while it does a fantastic job finding clusters in 2D distributions, I’ve been getting rather strange results with my 100-sensor data. Is it supposed to work for such a large number of dimensions or am I pushing it?
Thank you very much!!
The text was updated successfully, but these errors were encountered: