...

We tested the 6 outer products outlined above on drp-srcf-eb002. The best rate is 1.25 kHz for all 6 operations and 4 kHz for the most important ones.

...


...

and accumulate the results back to the full-size matrices (3 of 8000 x 8000, 2 of 8000 x 2048, and 1 of 2048 x 2048) on S3DF. The performance per core is around 400 Hz. We scale this up to 1 MHz with 2048 cores (18 Milano nodes).
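As a rough illustration of the accumulation step, here is a minimal sketch, not the actual test script. It assumes each event supplies two hypothetical vectors, `a` and `b`, whose outer products are summed into running matrices; the function names and the small demo sizes are made up for illustration, and the real job uses lengths 8000 and 2048 and more products per shape. The last function is only the ideal linear-scaling arithmetic: 400 Hz per core times 2048 cores is about 0.82 MHz, the same order as the 1 MHz quoted above.

```python
import numpy as np


def accumulate_outer(events_a, events_b):
    """Sum per-event outer products into three running matrices.

    events_a: array of shape (n_events, n_a)  (hypothetical input)
    events_b: array of shape (n_events, n_b)  (hypothetical input)
    """
    n_a = events_a.shape[1]
    n_b = events_b.shape[1]
    acc_aa = np.zeros((n_a, n_a))  # plays the role of an 8000 x 8000 matrix
    acc_ab = np.zeros((n_a, n_b))  # plays the role of an 8000 x 2048 matrix
    acc_bb = np.zeros((n_b, n_b))  # plays the role of a 2048 x 2048 matrix
    for a, b in zip(events_a, events_b):
        # In-place adds avoid allocating a new accumulator per event.
        acc_aa += np.outer(a, a)
        acc_ab += np.outer(a, b)
        acc_bb += np.outer(b, b)
    return acc_aa, acc_ab, acc_bb


def ideal_aggregate_rate_hz(per_core_hz, n_cores):
    """Aggregate rate assuming perfectly linear scaling (an idealization)."""
    return per_core_hz * n_cores


# Small demo sizes so the sketch runs quickly on any machine.
rng = np.random.default_rng(0)
demo_a = rng.standard_normal((5, 8))
demo_b = rng.standard_normal((5, 4))
aa, ab, bb = accumulate_outer(demo_a, demo_b)
rate = ideal_aggregate_rate_hz(400, 2048)
```

Summing the per-event outer products is equivalent to a single matrix product over the event axis (`demo_a.T @ demo_a` and so on), which gives a convenient check of the accumulated result.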


...

The script for this performance test is

test_fast_outer_filling.py

...

Python script for the results above:

...

test_fast_outer.py

...

and was submitted with submit_s3df.sh

Performance with reduced full data

...