Page History

...

By replacing Send with Isend. We allow Smd0 to move on after initiating send command to an eventbuilder core. With this overlap, we see that the total wall time improves from 7.4 to 4.4 seconds with 16 eventbuilder cores.

	eb=1		eb=2		eb=4		eb=8		eb=16
TASK	total(ms)	#occurs	total(ms)	#occurs	total(ms)	#occurs	total(ms)	#occurs	total(ms)	#occurs
SMD0GOTCHUNK	19641995.3707	1086	20351993.8369	1086	20151975.744	1086	19921964.946	1086	20041983.7913	1086
SMD0GOTEB	56955841.4978	10872800	2779.48	1087	17481964.1136	1087	16761916.0166	1087	16191832.1413	1087
SMD0GOTREPACK	244297.8586	1087	212258.733	1087	235295.582	1087	198306.5737	1087	186345.9558	1087
SMD0DONEWITHEB	4857.976	1087	5061.0478	1087	5262.7437	1087	5361.2395	1087	5160.6476	1087
SMD0GOTSTEPHIST	7678.2734	1087	7985.6866	1087	8387.6596	1087	8380.2787	1087	8281.9809	1087
SMD0GOTSTEP	8786.3738	1087	8689.95	1087	9091.3487	1087	9289.2696	1087	88.6243	1087
total:	81178357.2618	81178357.2618	52655268.1544	52655268.1544	42264477.0777	42264477.0777	40964420.2842	40964420.2842	40344391.1213	40344391.1213
rate (MHz)	1.2320		1.90		2.3723		2.4426		2.4828

Conclusions/ Known Issues

We gain some performance by overlapping Send with other computation tasks. However, this code with (Isend/ Irecv) crashes with the current real experiment data (tmoc00118, run=463). We need to investigate this issue before continuing this work.

In additional to overlapping send, we can also perform computational tasks while Smd0 wait for an eventbuilder core to come back (Irecv). This implementation should be explored after the issue mentioned above is solved.

Page tree

Versions Compared

Old Version 2

New Version Current

Key

Conclusions/ Known Issues