To-Do List

12/4/2012

...

Questions for Nvidia
  • constant memory
  • texture memory
  • optimization tricks: pre-fetch etc.
  • what does a queued warp do? (does it pre-fetch memory?)
  • reducing number of registers in kernel (does compiler typically do this optimally?)
  • how to tell with nvvp whether we're memory- or flops-limited
  • understanding the nvvp columns
  • ask about zher speedup numbers
12/4/2012
  • understand Nvidia zher speedup plot (jun/cpo)
  • libxc on gpu (lin)
    • use CUDA5
    • use common functional file for CPU/GPU
    • use common work file for CPU/GPU
    • read Samuli's old talk
    • run 3x4x3 Pt system
  • RPA timing measurements (lin)
  • multi-alpha zher at a lower priority (jun)
    • reduce registers? prefetch?
    • explore the parameter space: tile-size
  • try multiple surfaces with jacapo/gpaw-pw (aj)
  • paper (jun)
  • try calling dacapo density mixing from GPAW (cpo)
  • install GPAW on Keeneland (cpo)
  • make sure all libxc self-tests run
  • move suncatgpu01 to CUDA5 (cpo)

...

  • come up with a list of items to ask about at Nvidia meetings
  • constant memory
  • texture memory
  • optimization tricks: pre-fetch etc.
  • reducing number of registers in kernel (does compiler typically do this optimally?)
  • how to tell with nvvp whether we're memory- or flops-limited
  • libxc on gpu (lin)
    • read Samuli's old talk
    • run 3x4x3 Pt system
    • run PBE0
    • fix linking undefined symbol
    • make sure all self-tests run
    • put paramsize fix in for mgga and lda
    • test libxc 2.0.0
  • RPA timing measurements (lin)
  • multi-alpha zher at a lower priority (jun)
    • reduce registers? prefetch?
    • explore the parameter space: tile-size
  • try multiple surfaces with jacapo/gpaw-pw (aj)
  • paper (jun)
  • try calling dacapo density mixing from GPAW (cpo)
  • install GPAW on Keeneland (cpo)
  • "patch" file for libxc (only the memsets?) (cpo)
  • move suncatgpu01 to CUDA5 (cpo)
  • figure out how to softlink lda_c_pw.cuh (cpo)

...