Discussion on March 18, 2022 with Monarin Uervirojnangkoorn, Elliott Slaughter, Iris Chang, Johannes Blaschke, Chuck Yoon
Goals:
- Legion and MPI support for spinifel/mtip
- Do we need to continue callback support in psana for legion?
- Is the collective "Reduce" necessary and can it be guaranteed that all cores will be able to call it the same number of times? (e.g. at the ragged end-of-run)
Discussed 4 approaches to do this:
- A native-psana approach where the Reduce/Broadcast of image results would be done in bigdata (BD) or server (SRV) nodes
- Changes to the current get_data() approach, turning that into a callback inside the psana event loop
- Implementing a low level interface for legion to use ("psiter") providing access to smd and the ability to convert it into bigdata. Legion would then implement its own parallelization on top of that (Elliott did this previously for an earlier version of psana).
- Decoupling psana and mtip with some buffering
Conclusions:
Overview
Content Tools