Issues
- 0
- 1
Let's remove `-Wfatal-errors` from the flags
#45 opened by cwpearson - 1
Remove `-Werror`
#46 opened by cwpearson - 1
Add a CUDA 10 docker image
#12 opened by cwpearson - 1
- 0
- 0
ThetaGPU: error in src/cudaMemcpyPeerAsync_Duplex_GPUGPUPeer: a PTX JIT compilation failed
#42 opened by cwpearson - 2
NUMA node boundary benchmarks?
#41 opened by rlerdorf - 0
use cudaEvent to measure empty kernel time
#40 opened by cwpearson - 1
- 0
add NVSHMEM benchmarks
#39 opened by cwpearson - 0
Break benchmarks out into latency and bandwidth
#38 opened by cwpearson - 5
- 0
Undefined when `USE_NUMA != 1`
#36 opened by cwpearson - 0
Undefined when `USE_NUMA != 1`
#35 opened by cwpearson - 1
- 1
stack-smashing error during do_after_inits
#14 opened by cwpearson - 0
Make sure data is random to foil compression
#34 opened by cwpearson - 4
Unknown CMake command "sugar_include".
#32 opened by Yiltan - 2
- 1
Any lessons to be learned from EasyPerf?
#29 opened by cwpearson - 0
Investigate using cudaLaunchHostFunc for getting wall time when a stream operation ends
#28 opened by cwpearson - 1
mfence should clobber memory
#26 opened by cwpearson - 1
sync should clobber memory
#27 opened by cwpearson - 1
dcbf should clobber memory
#11 opened by cwpearson - 1
rename UM_Coherence to UM_Demand
#20 opened by cwpearson - 1
Error if numa is not found
#18 opened by cwpearson - 1
- 1
create zero-copy H2D
#24 opened by cwpearson - 3
- 0
prefetch-duplex GPU/GPU may be able to associate both streams with a single device
#22 opened by cwpearson - 0
- 0
- 1
add multi-threaded explicit transfer benchmarks
#16 opened by cwpearson - 0
Flush caches in unified memory host-to-gpu
#15 opened by cwpearson - 0
- 0
Use clflush with "+m" output operand?
#10 opened by cwpearson - 0
- 0
This numa/wr.cpp argument no longer exists
#9 opened by cwpearson - 0
- 0
- 1
Clean up memcpy/host_to_gpu
#3 opened by cwpearson - 1
Clean up memcpy/pinned_to_gpu
#4 opened by cwpearson - 1
Clean up memcpy/gpu_to_gpu_nopeer
#5 opened by cwpearson - 1
- 1
Graceful handling on non-NUMA systems
#1 opened by cwpearson