Discussions with Taran, Mona on Feb. 9, 2023

  • We can limit ourselves to 8000 hsd samples (~1 us)
  • At 1 MHz, 8000 samples is 16 GB/s, which is too much for one drp node with its 4-6 GB/s limit (especially with 2 hsds per node)
    • fex can reduce this by at least a factor of 8, which brings it within 4 GB/s per drp node
    • at 100 kHz, 8000 samples would work from a data-volume perspective
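
The rates above can be reproduced with a short calculation. This is only a sketch: the 2 bytes per sample is an assumption (it is what makes 8000 samples at 1 MHz come out to 16 GB/s), and the function and parameter names are made up for illustration.

```python
# Assumption: each hsd sample is 2 bytes, which reproduces the 16 GB/s quoted above.
BYTES_PER_SAMPLE = 2


def data_rate_gb_per_s(n_samples, rate_hz, n_hsd=1, fex_reduction=1):
    """Data rate in GB/s for n_hsd waveforms of n_samples each at rate_hz."""
    bytes_per_s = n_samples * BYTES_PER_SAMPLE * rate_hz * n_hsd / fex_reduction
    return bytes_per_s / 1e9


# One hsd, raw waveforms at 1 MHz
raw_1mhz = data_rate_gb_per_s(8000, 1_000_000)
# Two hsds per drp node at 1 MHz, with fex reducing the volume by a factor of 8
fex_1mhz_two_hsd = data_rate_gb_per_s(8000, 1_000_000, n_hsd=2, fex_reduction=8)
# Two hsds per drp node, raw waveforms at 100 kHz
raw_100khz_two_hsd = data_rate_gb_per_s(8000, 100_000, n_hsd=2)

print(raw_1mhz, fex_1mhz_two_hsd, raw_100khz_two_hsd)
```

With these assumptions, the fex case with 2 hsds lands right at the lower end of the 4-6 GB/s node limit, so a reduction of only 8 leaves little headroom.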

We want these outer-products:

  • hsd-hsd outer product (same hsd, so the matrix is symmetric and storing only half of it saves a factor of 2)
  • hsd-fzp outer product (the fzp detector is a piranha with 2048 elements)
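
A minimal NumPy sketch of the two accumulations, assuming the products are summed over shots. The array sizes are small stand-ins for 8000 and 2048, the data is random, and all names are hypothetical. It is not the actual analysis code.

```python
import numpy as np

rng = np.random.default_rng(0)
n_hsd, n_fzp, n_shots = 16, 8, 5  # small stand-ins for 8000 and 2048

# Fake per-shot data, one row per shot
hsd_shots = rng.standard_normal((n_shots, n_hsd))
fzp_shots = rng.standard_normal((n_shots, n_fzp))

# Indices of the upper triangle (including the diagonal) of the hsd-hsd matrix
rows, cols = np.triu_indices(n_hsd)

upper = np.zeros(rows.size)        # packed upper triangle of the hsd-hsd sum
cross = np.zeros((n_hsd, n_fzp))   # full hsd-fzp sum (not symmetric)

for hsd, fzp in zip(hsd_shots, fzp_shots):
    upper += hsd[rows] * hsd[cols]  # only n(n+1)/2 products per shot
    cross += np.outer(hsd, fzp)


def unpack_symmetric(packed, n):
    """Rebuild the full symmetric matrix from its packed upper triangle."""
    full = np.zeros((n, n))
    full[np.triu_indices(n)] = packed
    return full + full.T - np.diag(np.diag(full))


print(upper.size, n_hsd * n_hsd)  # stored elements vs. full matrix elements
```

Packing the upper triangle stores n(n+1)/2 elements instead of n², which is where the factor of about 2 comes from.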

Mona should test:

  • the "sparse" outer product from fex
  • full 8000x8000 (plus 8000x2048 for fzp):
    • from a data-volume perspective we can do 8000 samples from 2 hsd channels per drp node at 100 kHz (3.2 GB/s), but we still need to check whether the CPU can keep up.

Full-matrix memory usage with MPI looks like 8 GB for 60 cores at 8000 samples.
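
For comparison with that number, here is a rough size calculator for the matrices. The notes do not say which element type was used or whether every core holds its own copy, so the byte widths below are assumptions and the helper name is hypothetical.

```python
def matrix_bytes(n_rows, n_cols, bytes_per_element, symmetric=False):
    """Storage for one matrix; symmetric=True keeps only the upper triangle."""
    if symmetric:
        if n_rows != n_cols:
            raise ValueError("a symmetric matrix must be square")
        n_elements = n_rows * (n_rows + 1) // 2
    else:
        n_elements = n_rows * n_cols
    return n_elements * bytes_per_element


# Assumed element widths: 8 bytes (float64) and 4 bytes (float32)
hsd_hsd_f64 = matrix_bytes(8000, 8000, 8)
hsd_hsd_f32_half = matrix_bytes(8000, 8000, 4, symmetric=True)
hsd_fzp_f32 = matrix_bytes(8000, 2048, 4)

# Per-core share of the figure quoted in the notes (8 GB over 60 cores)
reported_per_core = 8e9 / 60

for label, nbytes in [
    ("hsd-hsd full, float64", hsd_hsd_f64),
    ("hsd-hsd upper triangle, float32", hsd_hsd_f32_half),
    ("hsd-fzp, float32", hsd_fzp_f32),
    ("reported, per core", reported_per_core),
]:
    print(f"{label}: {nbytes / 1e6:.0f} MB")
```

This only puts the candidate layouts next to the reported per-core figure. It does not establish which layout the 8 GB measurement corresponds to.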

In future, for 1 MHz running, we will hopefully have a better fzp detector that also runs at 1 MHz.
