To-Do List

5/9/2012

rpbe kernel (lin)
- try cudaMallocHost with cudaMemcpyAsync
- fix stream behavior and try with 1,2,4,8,16 streams
- understand stream behaviour with nvvp
zher streams (jun)
- in benchmark, make nstream and nw separately variable
- can we see whether we have 4 or 16 streams?
- understand stream behaviour with nvvp
work on libxc (cpo)
separate stream and n-omega parameters (jun)

5/2/2012

looking at EXX bottleneck (rewriting) (jun)
use cuda streams for small RPA systems (jun)
libxc integration (cpo)
understand MKL benchmark (jun/cpo)
pycuda (cpo)
understand RPBE kernel: (lin)
- understand "double" problem
- vary np, block_size, nstreams
- loop testfunc many times
- longer term: look at jussi/samuli kernel for ideas

...