...
- libxc on GPU (lin)
- remove print statements
- test spin-polarized
- understand why H numbers differ from gpugpaw_v2
- merge libxc-gpu and libxc
- copy less of the scratch data to GPU
- run the self-tests
- do the memsets for lda/mgga
- see if performance is better/worse
- multi-alpha zher at a lower priority (jun)
- reduce register usage? prefetch?
- explore the parameter space: tile-size
- paper (jun)
- try calling dacapo density mixing from GPAW (cpo)
- install GPAW on Keeneland (cpo)
...