Overview

Our first large area detector (2Mpx, 5kHz). Total data volume 2Mpx*5kHz*2bytes/px=20GB/s. 5 tiles in each quad. Each tile is 4 asics. Each ASICs is 144*192*4 (4 is number of asics).

Issues

possible fiber configurations:

4 nodes: 1 quad per node (5GB/s)
8 nodes: 2 or 3 tiles per node
10 nodes: 2 tiles per node
20 nodes: 1 tile per node

Is a segment a tile? or a set of tiles (e.g. 2 or 3)?

Two ideas: segments, and "unit cell". Should those ideas be identical?

Would make my brain hurt the least if we made a tile a segment (that's been our previous approach)

Perhaps we should mock-up the epixhr segment idea in advance using the kcuSim detector.

The configuration is only done one node. But we can make sure that the configuration object is present on all the relevant nodes (and in the xtc).

Schedule

(on Dec. 19, 2022) Sometime in February/March a prototype with 3 tiles will be released (could generate fake data for missing panels for 1 quad). Full detector: summer?

Calibration Time

Running this script on a psffb node (twice so we get all the data in cache so it behaves more like the daq):

(ana-4.0.48-py3) drp-srcf-eb004:~$ more junk.py
import time
from psana import *
ds = DataSource('exp=mfxx1005021:run=340')
det = Detector('epix10k2M')
for nevt,evt in enumerate(ds.events()):
    img = det.image(evt)
    #print(img.shape)
    if nevt>=20: break
    if nevt==0: tstart=time.time() # start the timer after first event (which is slow)
print('time per evt:',(time.time()-tstart)/nevt,'nevt:',nevt)

(ana-4.0.48-py3) drp-srcf-eb004:~$ python junk.py
time per evt: 0.17038806676864623 nevt: 20
(ana-4.0.48-py3) drp-srcf-eb004:~$ python junk.py
time per evt: 0.17048853635787964 nevt: 20
(ana-4.0.48-py3) drp-srcf-eb004:~$

So one core can do about 5Hz. That implies for 5kHz we need 16 nodes * 60 cores/node. The algorithms Mikhail runs are listed here: Method det.calib algorithms#Detectordependentalgorithms

This code implies that we may be able to reduce this time significantly with a "single pass" calibration (I believe Mikhail currently does two passes through the data with numpy: one for pedestals and one for gains). Hopefully don't need common mode in the DRP?

(ps-4.5.24) drp-neh-cmp015:lcls2$ more junk.cc
#include <stdint.h>
#include <stdio.h>
#include <string.h>

#define RAW_SIZE 2000000
#define NIMG 100
uint16_t raw[NIMG][RAW_SIZE];
float result[RAW_SIZE];
uint16_t peds[RAW_SIZE];
uint8_t mask[RAW_SIZE];
float gains[7][RAW_SIZE];

int main() {
    memset(mask,0,RAW_SIZE);

    for (unsigned count=0; count<NIMG; count++) {
        for (unsigned i=0; i<RAW_SIZE; i++) {
            unsigned val = raw[count][i];
            unsigned range = val&0x7000;
            result[i] = mask[i] ? 0 : (val-peds[i])*gains[range][i];
        }
    }
}
(ps-4.5.24) drp-neh-cmp015:lcls2$

With these results (about 10ms per image):

(ps-4.5.24) drp-neh-cmp015:lcls2$ g++ -o junk junk.cc
(ps-4.5.24) drp-neh-cmp015:lcls2$ time ./junk

real	0m0.955s
user	0m0.949s
sys	0m0.005s
(ps-4.5.24) drp-neh-cmp015:lcls2$

Fiber Connections

A slack conversation with Dionisio and Matt:

Hi Dionisio, I believe TXI is getting an epixhr 2Mpx and we have to purchase daq equipment for it now.. Can you tell me how many fibers that detector will have? Thanks!

10:36 AM

That detector will use the 12 lane transceiver
10:36
and there will be 4 fiber bundles per 2M camera
10:36
one fiber per quad similar to the analog 10k camera

10:50 AM

I assume you mean “one fiber bundle per quad” above? And 12 fibers per bundle, so a total of 48?

10:51 AM

exactly

10:51 AM

10:55
Sorry, one more question: each fiber will be 6Gbps or something higher? (edited)

10:56 AM

48 fiber pairs, I assume.

10:57 AM

all lanes are bi-dir
10:57
so yes, 48 pairs

11:01 AM

6gbps?

11:02 AM

these are the fibers I am planning to use here for testing
image.png

11:03
most likely 12gbps

11:03 AM

We need single-mode.
11:03
Those look like multi-mode.

11:10 AM

these are the fibers that can connect to the transceiver. MM to SM conversion will be needed for this camera and we need to plan of which option we will use for that (same as it was done for ePix HR? or something else)

11:35 AM

Perhaps we discussed before and I forgot. Doing the MM to SM conversion for 96 fibers feels unsustainable.

11:43 AM

We discussed before. The transceiver on the detector end only exists for mm now.

12:15 PM

Do I understand correctly we would need 48/8=6 kcu cards to to convert all of them? That will require a larger node.
12:18
I’m also worried because I had significant problems with lane-locking with the converter-card for the epix100. I had to mix-and-match lanes to find one that locked. Optical powers looked good. If we really need to do this conversion we should probably use that converter card to understand the lane locking problems. Maybe I needed to clean all the fibers somehow, even though the optical powers looked good?

Fiber Lane Assignments

(Tentative)

Lane[0].VC[0] = Data[0]
Lane[1].VC[0] = Data[1]
Lane[2].VC[0] = Data[2]
Lane[3].VC[0] = Data[3]
Lane[4].VC[0] = Data[4]
Lane[5].VC[0] = SRPv3
Lane[5].VC[1] = XVC
Lane[6].VC[3:0] = slow monitoring[3:0]
Lane[7].VC[3:0] = o-scope[3:0]
Lane[10:8] = Reserved for edgeML
Lane[11] = LCLS-II Timing

Can split the fibers onto different nodes.

Data is not currently in a natural order in the fiber, but plan on doing this in the tile electronics.

Page tree

EpixHR