To-Do List
12/4/2012
...
Questions for Nvidia
- constants memory
- texture memory
- optimization tricks: pre-fetch etc.
- what does a queued warp do? (does it pre-fetch the memory)
- reducing number of registers in kernel (does compiler typically do this optimally?)
- how to learn with nvvp if we're memory/flops limited
- understanding the nvvp columns
- ask about zher speedup numbers
12/4/2012
- understand nvidia zher speedup plot (jun/cpo)
- libxc on gpu (lin)
- use CUDA5
- use common functional file for CPU/GPU
- use common work file for CPU/GPU
- read samuli old talk
- run 3x4x3 pt system
- RPA timing measurements (lin)
- multi-alpha zher at a lower priority(jun)
- reduce registers? prefetch?
- explore the parameter space: tile-size
- try multiple surfaces with jacapo/gpaw-pw (aj)
- paper (jun)
- try calling dacapo density mixing from GPAW (cpo)
- install GPAW on Keeneland (cpo)
- make sure all libxc self-tests run
- move suncatgpu01 to CUDA5 (cpo)
...
- come up with list of items to ask about at nvidia mtgs
- constants memory
- texture memory
- optimization tricks: pre-fetch etc.
- reducing number of registers in kernel (does compiler typically do this optimally?) how to learn with nvvp if we're memory/flops limited
- libxc on gpu (lin)
- read samuli old talk
- run 3x4x3 pt system
- run PBE0
- fix linking undefined symbol
- make sure all self-tests run
- put paramsize fix in for mgga and lda
- test libxc 2.0.0
- RPA timing measurements (lin)
- multi-alpha zher at a lower priority(jun)
- reduce registers? prefetch?
- explore the parameter space: tile-size
- try multiple surfaces with jacapo/gpaw-pw (aj)
- paper (jun)
- try calling dacapo density mixing from GPAW (cpo)
- install GPAW on Keeneland (cpo)
- "patch" file for libxc (only the memsets?) (cpo)
- move suncatgpu01 to CUDA5 (cpo)
- figure out how to softlink lda_c_pw.cuh (cpo)
...