...
We tested the 6 outer products outlined above, accumulating the results back into the full-size matrices (3 of 8000 x 8000, 2 of 8000 x 2048, and 1 of 2048 x 2048) on s3df. The performance per core is around 400 Hz. We scale this up to 1 MHz with 2500 2048 cores (20 18 milano nodes).
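As an illustration of the accumulation step only, here is a minimal NumPy sketch. It is not taken from test_fast_outer_filling.py: the function name `accumulate_outer`, the vector names `a` and `b`, and the small sizes `n_a` and `n_b` (stand-ins for 8000 and 2048) are assumptions, and only one accumulator per matrix shape is shown rather than all six products.

```python
import numpy as np


def accumulate_outer(acc, u, v):
    """Add the outer product of u and v to the accumulator acc, in place."""
    acc += np.outer(u, v)
    return acc


rng = np.random.default_rng(0)

# Small stand-ins for the real dimensions (8000 and 2048) so the sketch runs quickly.
n_a, n_b = 8, 4

# One accumulator per matrix shape mentioned in the text (hypothetical names).
acc_aa = np.zeros((n_a, n_a))
acc_ab = np.zeros((n_a, n_b))
acc_bb = np.zeros((n_b, n_b))

n_events = 10
for _ in range(n_events):
    # Random vectors stand in for the per-event data.
    a = rng.standard_normal(n_a)
    b = rng.standard_normal(n_b)
    accumulate_outer(acc_aa, a, a)
    accumulate_outer(acc_ab, a, b)
    accumulate_outer(acc_bb, b, b)
```

In this sketch the per-core rate would be the number of events processed per second by one such loop; timing it at the full sizes is what the test above measures.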
...
...
The script for this performance test is test_fast_outer_filling.py; it was submitted with submit_slacs3df.sh.
Performance with reduced full data
...