Page History

\This page provides an overview of the status and inner workings of the TES Readout development and eventual implementation.

...

TESMap.pdf

PDF

name	TESMap.pdf

PseudoCode

This is a proposed algorithm to handle the data produced by the TES detectors for LCLS-II.

The data is assumed to consist of several time streams per FPGA (250) sampled at 1 MHz. The beam repetition rate is 10 kHz, for a total of 100 samples minimum between valid (photon generated) events. The samples are assumed to be 16 bits.

Parfor : Parallel For

NumberOfChannels : number of channels to be handles by the FPGA

Samples per Window : Number of samples in a single pulse window

TotalNumberOfWindows : Maximum number of windows to be kept in memory for the configured filters. Currently only 3 for a total of four cases. The active window, the preceding one and the following one. Possibly more windows to be added before/after for more precision in the energy measurement.

Active Window : Window on which we are performing the filter calculation

Current Window : Window that is currently loading in buffers.

EventMatrix : Matrix containgin information about the presence of events in each window of each channel.

...

language	py
title	TES Algorithm
linenumbers	true
collapse	true

...

Configuration Steps

The following are an overview of the main configuration steps for the TES-SMURF detector to go from turn the system on and start collecting data.

To be completed as more information becomes available.

Configure the main registers of the SMURF board (to be detailed)
Set the bias of the cryogenic amplifiers. The bias are provided by the manufacturer.
Find the resonator frequencies -
1. Each multiplexed line must be scanned in frequency. The resonators are situated where the amplitude of the signal drops and there is a phase discontinuity. The phase discontinuity is used to determine the most precise frequency of the resonators. If 2 resonators are too close, one needs to be turned off to avoid crosstalk.
SetupNotches - Find the transformation η(I,Q) = (I', Q') where η is a complex transformation. We want to move the operation point so that we are at the (I', Q') = (1,0) point and that all variations maximized their projection on Q'. Image Added
Setup the flux ramp. The Squids are connected to the flux generators in large groups. Each group can have a different flux ramp.
1. Flux amplitude
2. Ramp amplitude
At this point it is possible to track the phase of the carrier frequency and collect data but the TES still need to be configured.
The voltage for the TES is selected by tracing the IV curve, from which we get the RTES curve. We wish to be as close as possible to the bottom part of the transition without hitting instability effects.

IV curve and RI curve of TS when a current ramp is applied Image Added

Benchmarks

This table lists the latest benchmarks.

Currently, it evaluates the average time rate (over 1e6 iterations) to complete the number of products indicated as well as the sum of the resulting vector. The length of the vector varies according to the number of samples.The code currently uses a simple decision tree which directly indicates how many filters to use. Once we know how many filters are necessary this will become a fixed value. Meanwhile a new decision tree will determine the case to use depending on which windows have a hit, currently in progressone linear transformation and two inner products. Each pixel has its own ring buffer of 10 windows, each window has 100 data samples of uint_16. The buffers are refilled as the reader nears the empty flag from Random data. The buffers are implemented using Eigen Matrix library.

These benchmarks were obtained os psanagpu116, using O3 and vectored optimization. As a reference, 10 kHz operation gives a time window of 100 µs.

Code Block

language	bash
title	Compile flags

g++ -std=c++11 -O3 -DNDEBUG -march=native -IEigen main.cpp

All values are in μs.

Number of products Number of samples	1	2	3	4
100	0.061	0.074	0.091	0.108
200	0.113	0.145	0.176	0.210
300	0.150	0.190	0.241	0.296
400	0.200	0.266	0.329	0.397
500	0.239	0.321	0.402	0.486
600	0.279	0.378	0.476	0.576
700	0.318	0.435	0.549	0.667

The following graph shows the calculation rate per pixel as a function of the density of hits. While the data generation is not in the scope of the timer, it may still affect the start and stop of said timer. Clemens algorithm pre generates all data and thus times one continuous loop. Thus if there are no events, the loop is very very efficient. However, the Eigen library used in the most recent version of the code does accelerate the calculation of the inner products, as shown by the better performance at higher hit rates.

Image Added

Latest code :

Latest integration iteration (Working copy)

Original codes before combinations.

Code with Eigen (Audrey)

Code with switch case (Clemens)

References

Ullom and Bennet. Review of superconduction superconducting transition sensors for X-ray and gamma-ray spectroscopy

Irwin and Hilton. Transition-edge sensors.

Tasks

Task

Status

Comment

Create fake data vectors

Status

title	Not started

Using random number vectors

Implement event detection algorithm

Status

title	Not started

Program arguments for benchmarking

Status


colour	YellowGreen
title	In progressDone

Arguments for the size of the vectors and the number of array products used

Preprocessor arguments

Decision tree for event cases

Status


colour	YellowGreen
title	In progressDone

Switch case tree for given number of filters - reusable for different cases

Input data from simulation

Status

title	Not started

Create circular buffer

Status


colour	Green
title	Done

Optimize loop behavior

Status


colour	Yellow
title	in progress

Page tree

Versions Compared

Old Version 16

New Version Current

Key

PseudoCode

Configuration Steps

Benchmarks

References

Tasks