Parallel Algorithms

  • Parallel Prefix Sum (Scan) with CUDA (2007)

    A classic reference on implementing a work-efficient parallel prefix sum algorithm

    • CUDA
    • paper
    • classic