cublas
Here are 88 public repositories matching this topic...
Framework, toolkit and ready-to-use applications for numerical linear algebra dependent machine learning algorithms.
-
Updated
Feb 7, 2021 - C++
Explore performance implications of various matrix multiplication approaches using GPU/CUDA compared to CPU side processing
-
Updated
Dec 9, 2020 - C++
Extreme Learning Machine for image classification implemented using Cuda C++ and cuBLAS
-
Updated
Dec 19, 2022 - Jupyter Notebook
custom Basic cuBLAS example from modifying NVIDIA cuBLAS Samples
-
Updated
Dec 30, 2019 - Makefile
Level 3 matrix multiplication using both cublas and mkl.
-
Updated
Jul 20, 2018 - Cuda
Лабораторные работы по курсу "Параллельное программирование"
-
Updated
Dec 6, 2023 - C++
Modular Image processing Library toolKit (milk)
-
Updated
Jan 19, 2021 - C
This project utilizes CUDA and cuBLAS to optimize matrix multiplication, achieving up to a 5x speedup on large matrices by leveraging GPU acceleration. It also improves memory efficiency and reduces data transfer times between CPU and GPU.
-
Updated
Nov 19, 2024 - C++
A CUDA approach for computing the multiplication of a transposed matrix with the initial one, using the cuBLAS library.
-
Updated
Sep 28, 2021 - Cuda
Improve this page
Add a description, image, and links to the cublas topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the cublas topic, visit your repo's landing page and select "manage topics."