CUDPP 2.0
CUDA Data-Parallel Primitives Library
Variables
/mnt/hgfs/VMWare/cudpp/src/cudpp/cudpp_globals.h File Reference

Global declarations defining machine characteristics of GPU target. These are currently set for best performance on G8X GPUs. The optimal parameters may change on future GPUs. In the future, we hope to make CUDPP a self-tuning library. More...

Variables

const int SORT_CTA_SIZE = 256
const int SCAN_CTA_SIZE = 128
const int REDUCE_CTA_SIZE = 256
const int LOG_SCAN_CTA_SIZE = 7
const int WARP_SIZE = 32
const int LOG_WARP_SIZE = 5
const int LOG_SIZEOF_FLOAT = 2
const int SCAN_ELTS_PER_THREAD = 8
const int SEGSCAN_ELTS_PER_THREAD = 8

Detailed Description

Global declarations defining machine characteristics of GPU target. These are currently set for best performance on G8X GPUs. The optimal parameters may change on future GPUs. In the future, we hope to make CUDPP a self-tuning library.

cudpp_globals.h


Variable Documentation

const int SORT_CTA_SIZE = 256

Number of threads per CTA for radix sort. Must equal 16 * number of radices

const int SCAN_CTA_SIZE = 128

Number of threads in a CTA

const int REDUCE_CTA_SIZE = 256

Number of threads in a CTA

const int LOG_SCAN_CTA_SIZE = 7

log_2(CTA_SIZE)

const int WARP_SIZE = 32

Number of threads in a warp

const int LOG_WARP_SIZE = 5

log_2(WARP_SIZE)

const int LOG_SIZEOF_FLOAT = 2

log_2(sizeof(float))

const int SCAN_ELTS_PER_THREAD = 8

Number of elements per scan thread

const int SEGSCAN_ELTS_PER_THREAD = 8

Number of elements per segmented scan thread

 All Classes Namespaces Files Functions Variables Typedefs Enumerations Enumerator Defines