
Some basic example with Kernel #21

Open
kolkov opened this issue Oct 17, 2017 · 9 comments

@kolkov

kolkov commented Oct 17, 2017

Hi!
Please provide a basic example of how to work with this library properly:
how to create a basic kernel and use it in a GPU computation.

Thanks!

@chewxy
Member

chewxy commented Oct 18, 2017

Yeah I'll get round to doing that.

@kolkov
Author

kolkov commented Oct 18, 2017

It would be better to give two examples: one quite simple, to cover the basics, and one more complex, to show where to go next.
Like in the Nvidia docs.
Thanks!

@cobnst

cobnst commented Dec 5, 2018

@chewxy @kolkov Well, how do we find the example? 0.0

@Icaro-Lima

I'm here 2 years later ... Any examples?

@malkhamis

Maybe this test file can serve as an example 🤔?

@Icaro-Lima

Icaro-Lima commented Aug 22, 2019

Thank you! This is exactly what I need!

@neurlang
Contributor

I got a working kernel here

tl;dr: the order of operations that seems to work is:
initCUDA()

device, err := cu.GetDevice(0)
ctx, err := device.MakeContext(cu.SchedAuto)
err = ctx.Lock()
// then do malloc etc. for long-lived (permanent) device memory
mod, err := cu.LoadData(kernel.PTXreduceCUDA)
fn, err := mod.Function("reduce")
stream, err := cu.MakeStream(cu.DefaultStream)

reduceCUDA()

err := cu.SetCurrentContext(ctx)
// allocate single-use memory / clear permanent memory (memset 0)
err = cu.MemcpyHtoD(d_input_nums, unsafe.Pointer(&input_numbers[0]), inputNumsSize) // copy input data to the device
err = fn.LaunchAndSync(x[1][0], x[1][1], x[1][2], x[0][0], x[0][1], x[0][2], 0, stream, args)
err = cu.MemcpyDtoH(unsafe.Pointer(&result[0]), d_result, resultSize) // copy results back from device to host

destroyCUDA()

cu.MemFree(memory) // free all device allocations
ctx.Unlock()
ctx.Destroy()

the kernel itself is in the kernel/ subdir, compiled to .ptx like this:

#!/bin/bash
nvcc -ptx reduceCUDA.cu -o reduceCUDA.ptx

the kernel does have a sort of main function:

extern "C" __global__ void reduce(uint8_t *d_set, uint32_t *d_nums, uint32_t *alphabet, uint32_t *out) {
// the code
}

Hope this serves as a solid example for the next person.

@chewxy
Member

chewxy commented May 23, 2024

hey @neurlang do you want to put this in a PR as an example?

@neurlang
Contributor

Pushing the hashtron kernel to the main cu repo doesn't feel right: it would shift the maintenance burden to you, I still need to tweak the kernel sometimes, etc. I think I will keep it in my repo instead; anyone can look there anyway.

Speaking of this repo (gorgonia/cu), I would prefer to add more test cases here that stress the code paths my kernel relies on, to make sure the hashtron behavior isn't accidentally broken by you or someone else, now that it works pretty stably.

On the other hand, now that we (or simply I) know how to make a CU kernel, it would be a good idea to write it up on the gorgonia.github.io site as a tutorial on making a CU kernel.

I'll open a separate issue there instead.
