CUDA
Links
- CUB - Flexible library of cooperative threadblock primitives and other utilities for CUDA kernel programming.
- Gunrock - CUDA library for graph-processing designed specifically for the GPU.
- Lantern - Implementation of a machine learning framework prototype in Scala.
- Jitify - Single-header C++ library for simplifying the use of CUDA Runtime Compilation (NVRTC).
- CUDA.jl - CUDA programming in Julia. (CUDA.jl 2.0)
- GPGPU Programming with CUDA
- k-NN algorithm on CUDA
- KBLAS CUDA - Subset of BLAS routines optimized for NVIDIA GPUs.
- CudaSift - SIFT features with CUDA.
- ZLUDA - CUDA on Intel GPUs.
- CUDA Library Samples
- PyCUDA - Access Nvidia's CUDA parallel computation API from Python.
- CUTLASS - CUDA Templates for Linear Algebra Subroutines.
- Reaction Diffusion modelling with Julia and CUDA (2020)
- Unifying the CUDA Python Ecosystem (2021) (HN)
- Using OpenGL instead of CUDA for machine learning
- Awesome GPGPU - Algorithms implemented in CUDA + resources about GPGPU.
- Accelerating Conway's Game of Life Using CUDA (2021)
- cu - Go package that provides an idiomatic interface to the CUDA Driver API.
- RustaCUDA - Rusty wrapper for the CUDA Driver API.
- ec-gpu - CUDA/OpenCL code generator for finite-field arithmetic over prime fields and elliptic curve arithmetic constructed with Rust.
- Python venv Setup for Deep Learning Resources (Tweet)
- Rust CUDA - Extremely fast GPU code fully in Rust. (HN) (Reddit)
- cuda-toolkit - GitHub Action to install CUDA.
- Koila - Prevent `CUDA error: out of memory` in just 1 line of code.
- ICPCUDA - Super fast implementation of ICP in CUDA for devices with compute capability 3.5 or higher.
- CUDPP - CUDA Data Parallel Primitives Library.
- RAFT - RAPIDS Analytics Framework Toolkit.
- TSNE-CUDA - GPU-accelerated t-SNE for CUDA with Python bindings.
- cuda-kat - CUDA kernel author's tools.
- CUDA Design Patterns - CUDA design patterns and a bit of template magic for CUDA.
- GPUMD - Graphics Processing Units Molecular Dynamics.
- Some CUDA programming fun with Julia (2022) (HN)
- CUDA Samples - Samples for CUDA developers that demonstrate features in the CUDA Toolkit.
- How CUDA Programming Works (2022) (HN)
- GPU Puzzles - Solve puzzles. Learn CUDA.
- cudarc - Safe Rust wrapper around the CUDA toolkit.
- CUDAMat - Python module for performing basic dense linear algebra computations on the GPU using CUDA.
- CV-CUDA - Open-source, graphics processing unit (GPU)-accelerated library for cloud-scale image processing and computer vision.
- Rust cuBLAS - Safe CUDA cuBLAS wrapper for the Rust language.
- CUDA rules for Bazel
- How to Optimize a CUDA Matmul Kernel for cuBLAS-like Performance: a Worklog (2022) (HN)
- cutex - PyCUDA-based PyTorch extension made easy.
- Neural Network CUDA Example - Several simple examples for neural network toolkits (PyTorch, TensorFlow, etc.) calling custom CUDA operators.
- Nvidia's CUDA Monopoly (2023) (HN)
- async-cuda - Asynchronous CUDA, NPP and TensorRT.