GPU
Banana has nice cloud GPU pricing.
Makepad is incredibly exciting for writing fast cross-platform UI on the GPU.
Gentle introduction to GPUs inner workings is a great read.
Links
- cuDF - GPU DataFrame Library.
- ROCm - Platform for GPU-Enabled HPC and Ultrascale Computing.
- W3C GPU for the Web Community Group
- RAPIDS - Open GPU Data Science.
- GPU hardware effects - Demonstration of various hardware effects on CUDA GPUs.
- Basis Universal GPU Texture Codec (Basis Universal Transcoders)
- NVIDIA GPU Monitoring Tools
- A Taste of GPU Compute (2020) (Slides) (HN)
- KeOps - Kernel Operations on the GPU, with autodiff, without memory overflows.
- 3D Fluid - GPU Fluid Simulation with Volumetric Rendering.
- Understanding Latency Hiding on GPUs (2016) (HN)
- Harlan - Domain-specific language for programming GPUs.
- Region-based memory management for expressive GPU programming (2015)
- Fast 2D rendering on GPU (2020) (HN)
- Nyuzi Processor - Experimental GPGPU processor hardware design focused on compute-intensive tasks. It is optimized for use cases like deep learning and image processing.
- Reyes Rendering on the GPU (2015) (HN)
- Compressed GPU texture formats – a review and compute shader decoders (2020)
- GPU Compute Basic Algorithms - MLIR
- Which GPUs to get for deep learning (2020) (HN)
- KodeLife - Real-time GPU shader editor.
- Getting Started with GPU Computing in Anaconda (2017)
- gpu-distance-field - TypeScript library for generating distance fields from 2d images on the GPU.
- Gradient Free Cloud GPUs
- Object Detection from 9 FPS to 650 FPS in 6 Steps (2020) (HN)
- Object Detection at 1840 FPS with TorchScript, TensorRT and DeepStream (2020) (HN)
- GPU-Accelerated ODE Solving in R with Julia, the Language of Libraries (2020)
- Detect GPU - Classify GPUs based on their benchmark score in order to provide an adaptive experience.
- Moana Motunui Renderer on GPU (2020) (HN) (Code)
- GPUs on a Raspberry Pi Compute Module 4 (2020)
- Installing the world’s worst video card (2020) (Lobsters)
- Let’s build a video card - Video series on how VGA signals work and building a circuit that provides the correct timing of sync signals so that a monitor recognizes the signal and displays an image stored on an EEPROM.
- NVIDIA Research (GitHub)
- Umpire - Application-focused API for memory management on NUMA & GPU architectures.
- GPUVis - GPU Trace Visualizer.
- Cloud GPU Vendors - Compares on-demand GPU vendors to help Machine Learning practitioners pick their preferred platform.
- GPGPU-Sim - Provides a detailed simulation model of contemporary NVIDIA GPUs running CUDA and/or OpenCL workloads.
- ArrayFire - General purpose GPU library. (Web)
- AMD ROCm Platform Docs - Open-source software development platform for HPC/Hyperscale-class GPU computing. (Code)
- Dissecting the Apple M1 GPU (2021) (HN)
- Dissecting the Apple M1 GPU: Part 2 (2021) (HN)
- Asahi GPU - Research for an open source graphics stack for Apple M1.
- Understanding GPU caches (2021)
- Security of the Intel Graphics Stack (2021)
- LambdaCube 3D - Haskell-like purely functional domain-specific language for programming the GPU. (Web)
- Apple GPU architecture docs and tools
- GPA - GPU Performance Advisor.
- Vgpu_unlock - Unlock vGPU functionality for consumer-grade GPUs. (HN)
- Patterns and behaviors for GPU computing
- Faster Rendering Using Hardware Acceleration (2021)
- NVTOP - htop-like monitoring tool for NVIDIA GPUs.
- gpustat - Simple command-line utility for querying and monitoring GPU status.
- The stack monoid revisited (2021)
- Hash Functions for GPU Rendering (2021) (HN)
- NVIDIA Merlin - Open source library providing end-to-end GPU-accelerated recommender systems.
- Inferrd - GPU-Accelerated Inference Hosting. (HN)
- GPU architecture types explained (2021)
- Simulating worlds on the GPU
- Triton - Open-Source GPU Programming for Neural Networks. (Article) (HN)
- How do GPUs avoid false sharing? (2021)
- GDRCopy - Low-latency GPU memory copy library based on NVIDIA GPUDirect RDMA technology.
- RealTime 3D Strange Attractors scout on GPU
- NCCL - Optimized primitives for collective multi-GPU communication.
- How to train large models on many GPUs? (2021) (HN)
- Gentle introduction to GPUs inner workings (2021) (HN)
- Heterogeneous Systems Course: Meeting 2: SIMD processors and GPU architecture (2021)
- Implementing a GPU-driven debug line renderer (2021)
- Rendering on the Apple M1 Max Chip (2021) (HN) (Reddit) (Tweet)
- Ask HN: Will there ever be a vendor agnostic GPU interface? (2021)
- NVIDIA Omniverse Platform - Scalable, multi-GPU real-time reference development platform for 3D simulation and design collaboration. (GitHub)
- AmgX - Distributed multigrid linear solver library on GPU.
- GPU Accelerated Non-rigid ICP for surface registration
- Ricks-Lab GPU Utilities - Set of utilities for monitoring GPU performance and modifying control settings.
- Watchmen - Simple and easy-to-use toolkit for GPU scheduling.
- GPU-Accelerated Lossless Compression Survey
- RMM - RAPIDS Memory Manager.
- Easy Scalable Text Rendering on the GPU (2016)
- Capuchin: Tensor-based GPU Memory Management for Deep Learning (Review)
- Nvidia GPUs Enable Simulation of a Living Cell (2022) (HN)
- Writing an open source GPU driver – without the hardware (2022) (HN)
- Human GPU Exercises
- NVIDIA GPU Operator - NVIDIA GPU Operator creates/configures/manages GPUs atop Kubernetes.
- DietGPU - GPU-based lossless compression for numerical data.
- BGPQ: A Heap-Based Priority Queue Design for GPUs (2021) (Code)
- NVIDIA Container Toolkit - Build and run containers leveraging NVIDIA GPUs.
- BabelStream - Measure memory transfer rates to/from global device memory on GPUs.
- Tool for messing around with Apple GPU assembly
- Introduction to GPUs with OpenGL (2022) (HN)
- Rasterization in One Weekend (2018)
- Encase - Provides a mechanism to lay out data into GPU buffers according to WGSL's memory layout rules.
- RadeonTop - View your GPU utilization, both for the total activity percent and individual blocks.
- Vast.ai - Rent GPU Servers for Deep Learning and AI. (FAQ) (HN)
- nvitop - Interactive NVIDIA-GPU process viewer, the one-stop solution for GPU process management.
- Nvidia Hopper GPU Architecture and H100 Accelerator (2022) (HN)
- Nvidia Grace CPU (HN)
- NVIDIA Warp - Python framework for writing high-performance simulation and graphics code.
- nvblox - Signed Distance Functions (SDFs) on NVIDIA GPUs.
- Sparse Voxel Octree (Vulkan Version) - GPU SVO Builder using rasterization pipeline, an efficient SVO ray marcher and a simple SVO path tracer.
- Ask HN: How does a CPU communicate with a GPU? (2022)
- GPU Top - GPU profiling tool.
- Jetson Nano Developer Kit | Getting Started
- GPU Performance API for AMD GPUs
- A Driver on the GPU (2022)
- wgpu-py - Next generation GPU API for Python.
- NVIDIA Linux Open GPU Kernel Module Code (HN) (Article)
- The Apple GPU and the impossible bug (2022) (HN)
- Voronoi Diagrams on the GPU (2016)
- How GPU Computing Works (HN)
- DeepOps - Tools for building GPU clusters.
- TensorDock Core GPU Cloud - GPU servers from $0.29/hr. (HN)
- Vector graphics on GPU
- Inlyne - GPU powered, browserless, markdown + HTML viewer.
- The GPU Banana Stand (2022)
- Clip control on the Apple GPU (2022) (HN)
- Running Stable Diffusion on Your GPU with Less Than 10Gb of VRAM (2022) (HN)
- What are Tensor Cores?
- GeForce RTX 40 Series (HN)
- Nvidia GPU exporter for Prometheus
- CoreWeave - GPU Cloud.
- Banana - Machine Learning Model Deployment on Serverless GPUs. (Twitter) (GitHub)
- Reverse Engineering a GPU from 2009 (2022)
- Crusoe Cloud GPU Pricing (Tweet)
- LibVF.IO - Vendor neutral GPU multiplexing tool driven by YAML & VFIO.
- NVIDIA PhysX SDK (HN)
- Beyond GPU unified memory: fully unified address spaces (2022)
- Tales of the M1 GPU (2022) (HN) (Reddit)
- GPU-IO - GPU-accelerated computing library for physics simulations and other mathematical calculations. (Examples)
- Apple GPU drivers now in Asahi Linux (2022) (HN)
- Dream Factory - Multi-threaded GUI manager for mass creation of AI-generated art with support for multiple GPUs.
- ArchProbe - Profiler to disclose and quantify hardware features on GPUs.
- MIG Profiler - Multi-Instance-GPU profiling tool.
- EnvyControl - Easy GPU switching for Nvidia Optimus laptops under Linux.
- Awesome GPU
- Use.GPU Goes Trad (2023) (HN)
- GPU Caching Compared Among AMD, Intel UHD, Apple M1 (2022) (HN)
- GPUs for Deep Learning in 2023 – An In-depth Analysis (HN)
- GPU-Backed User Interfaces (2023) (HN)
- Apple GPU micro architecture
- GPU Tester - Detects broken and slow GPUs in a cluster.
- NVIDIA Cloud Native Stack - Run cloud native workloads on NVIDIA GPUs.
- GPU-Initiated On-Demand High-Throughput Storage Access in the BaM System Architecture
- Branching on a GPU (2021)
- Fuser - Fusion Code Generator for NVIDIA GPUs.
- The State of Serverless GPUs (HN)
- Oblivus - Affordable and scalable GPU servers from $0.29/hr. (HN)
- Cloud GPU Resources and Pricing (HN)
- AMDGPU_TOP - Tool to display AMDGPU usage.
- Nvidia DGX GH200: 100 Terabyte GPU Memory System (2023) (HN)
- taichi.js - Modern GPU Compute and Rendering in JavaScript.
- Nvidia H100 and A100 GPUs – comparing available capacity at GPU cloud providers (2023) (HN)
- GPU Programming: When, Why and How? (HN)
- Which GPU(s) to Get for Deep Learning (2023) (HN)
- Nvidia H100 GPUs: Supply and Demand (2023) (HN)
- Ask HN: Cheapest hardware to run Llama 2 70B (2023)
- Optimization Techniques for GPU Programming (HN)
- Udacity CS344: Intro to Parallel Programming | NVIDIA Developer
- Making AMD GPUs competitive for LLM inference (2023) (HN)
- GPU-Accelerated LLM on a $100 Orange Pi (2023)
- The first conformant M1 GPU driver (2023) (Lobsters) (HN)
- Get Lambda Instances - Use GitHub Actions to get the Lambda Cloud instances you want.
- Multi-GPU Training with PyTorch and TensorFlow
- GPU Benchmarks on LLM Inference