Running Tensorflow model via TVM on OpenCL #143
Comments
Can you give me more details about what you were trying to do?
Hi @doe300, My apologies, but I am new to OpenCL and the whole TVM stack. I will try to explain in more detail what I did.
I get
Thank you for your help. Best Regards Marcin Sielski PS. I managed to execute the entire example on Intel/beignet successfully.
Hi @doe300, Do you have any ideas on how to troubleshoot the issue? I wonder what the message 'Invalid local type for memory area: (p) i32* %tmp.326' means. Thank you for your help in advance. Best Regards Marcin Sielski
The error means that the OpenCL kernel accesses memory in a way that is not yet supported by the VC4C compiler. Without knowing the actual kernel code, there is not much that can be done here. If you compile the VC4CL host-side stack in debug mode (with the CMake variable
@doe300 vc4cl-source-1804289383.cl.tar.gz Thank you for your help in advance. Best Regards Marcin Sielski
Thanks for the source code. I do not get the error reported above, but I do get some register-association errors. I will have a look into them.
This fixup-step will try to group all scalar (including pointer) locals into group vectors to reduce the register pressure. Also removes the obsolete register-resolver round maximum.
Effects:
* fixes register association error for #143
* drastically increases number of instructions where applied
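To illustrate the trade-off described in the commit message above, here is a minimal sketch of the general idea of packing scalar locals into the lanes of shared vector registers. It is not the VC4C implementation: the function names `group_scalars` and `registers_needed`, the local names, and the lane count of 16 are all assumptions made for illustration only.

```python
from typing import Dict, List, Tuple

# Assumed number of elements one vector register can hold (illustrative only).
LANES = 16


def group_scalars(scalars: List[str], lanes: int = LANES) -> Dict[str, Tuple[int, int]]:
    """Assign each scalar local a (group, lane) slot in a shared vector.

    Hypothetical helper: locals are packed in order, `lanes` per group.
    """
    return {name: divmod(index, lanes) for index, name in enumerate(scalars)}


def registers_needed(scalar_count: int, lanes: int = LANES) -> int:
    """Number of vector registers needed once scalars are grouped (ceiling division)."""
    return -(-scalar_count // lanes)


if __name__ == "__main__":
    # 20 hypothetical scalar locals, each of which would otherwise occupy a full register.
    local_names = [f"%tmp.{i}" for i in range(20)]
    slots = group_scalars(local_names)
    print("registers without grouping:", len(local_names))
    print("registers with grouping:", registers_needed(len(local_names)))
    print("slot of %tmp.17 (group, lane):", slots["%tmp.17"])
```

In this simplified model the register count drops from one per scalar to one per group, but every read or write of a grouped scalar now needs extra work to extract or insert a single lane, which is consistent with the instruction-count increase noted in the commit message.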
I am experiencing the following error on a Raspberry Pi while trying to run a TensorFlow model via TVM on OpenCL.
terminate called after throwing an instance of 'vc4c::CompilationError'
what(): Normalizer: Invalid local type for memory area: (p) i32* %tmp.3262