Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Loading data from .csv file #135

Open
GearFear opened this issue Nov 17, 2016 · 0 comments
Open

Loading data from .csv file #135

GearFear opened this issue Nov 17, 2016 · 0 comments

Comments

@GearFear
Copy link

I try to load data from mlsd datasets for k-means clustering on GPU.

I call:
val (mm, opts) = KMeans.learner("data/MLSD/mlsd1_4898430_38_20.csv")
opts.dim = 20
opts.batchSize = 20000
opts.nend=10

mm.train

And I get following error:

scala.MatchError: 959330605 (of class java.lang.Integer)
at BIDMat.HMat$.loadMat(HMat.scala:304)
at BIDMat.HMat$.loadMat(HMat.scala:323)
at BIDMach.datasources.FileSource$Prefetcher$$anonfun$run$1.apply$mcVI$sp(FileSource.scala:248)
at scala.collection.immutable.Range.foreach$mVc$sp(Range.scala:160)
at BIDMach.datasources.FileSource$Prefetcher.run(FileSource.scala:238)
at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511)
at java.util.concurrent.FutureTask.run(FutureTask.java:266)
at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142)
at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617)
at java.lang.Thread.run(Thread.java:745)

scala.MatchError: 959330605 (of class java.lang.Integer)
at BIDMat.HMat$.loadMat(HMat.scala:304)
at BIDMat.HMat$.loadMat(HMat.scala:323)
at BIDMach.datasources.FileSource$Prefetcher$$anonfun$run$1.apply$mcVI$sp(FileSource.scala:248)
at scala.collection.immutable.Range.foreach$mVc$sp(Range.scala:160)
at BIDMach.datasources.FileSource$Prefetcher.run(FileSource.scala:238)
at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511)
at java.util.concurrent.FutureTask.run(FutureTask.java:266)
at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142)
at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617)
at java.lang.Thread.run(Thread.java:745)

scala.MatchError: 959330605 (of class java.lang.Integer)
at BIDMat.HMat$.loadMat(HMat.scala:304)
at BIDMat.HMat$.loadMat(HMat.scala:321)
at BIDMach.datasources.FileSource$$anonfun$init$1.apply$mcVI$sp(FileSource.scala:128)
at scala.collection.immutable.Range.foreach$mVc$sp(Range.scala:160)
at BIDMach.datasources.FileSource.init(FileSource.scala:127)
at BIDMach.Learner.init(Learner.scala:56)
at BIDMach.Learner.firstPass(Learner.scala:93)
at BIDMach.Learner.retrain(Learner.scala:82)
at BIDMach.Learner.train(Learner.scala:71)
... 33 elided

Even if I change the extension of file to .txt, I still have this error.
I use scala interpreter using ./bidmach

Mat.checkCUDA:
2 CUDA devices found, CUDA version 8.0

nvcc --version
nvcc: NVIDIA (R) Cuda compiler driver
Copyright (c) 2005-2016 NVIDIA Corporation
Built on Sun_Sep__4_22:14:01_CDT_2016
Cuda compilation tools, release 8.0, V8.0.44

I have 2 Tesla K80.

If I load data from MNIST8M folder, I can use clustering with this data successfully.

But I'd like to get the power of BidMach to cluster my own datasets. How can I use it in BidMach?

Thanks in advance!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant