To-Do List
5/9/2012
- rpbe kernel
- try cudamallochost with memcpyasync
- fix stream behavior and try with 1,2,4,8,16 streams
- separate stream and n-omega parameters (jun)
5/2/2012
- looking at EXX bottleneck (rewriting) (jun)
- use cuda streams for small RPA systems (jun)
- libxc integration (cpo)
- understand MKL benchmark (jun/cpo)
- pycuda (cpo)
- understand RPBE kernel: (lin)
- understand "double" problem
- vary np, block_size, nstreams
- loop testfunc many times
- longer term: look at jussi/samuli kernel for ideas
...