To-Do List
- ensure that Matt's latest xpm firmware fixes the xpm link-glitch storms
- reproduce/fix TxLinkReset workaround
- reproduce/fix xpmmini-to-lcls2timing workaround
- reproduce/fix link-lock failure on timing system KCUs
- reproduce/fix timing nodes assigning wrong timestamp to configure transition by 1 or 2 buckets
- manufacture new boards
- check/fix loopback fiber problem in production xpms in room 208
- work on high-speed-digitizer timing robustness
- check wave8 timing robustness
Summary Of Testing
These are the results of the tests that have been conducted in the FEE alcove to determine if the XPM glitch can be reproduced.
Every test is run from a starting behavior where the DAQ can allocate, configure, run, and disable.
Whenever the DAQ does not follow the starting behavior remedies are applied to recover it.
XPM firmware 3.6.0
Opal config does not have xpm mini -timing2 hack
cnf file uses -D fakecam for additional timing nodes
in 2023/11/14
action | issue found | stat | remedy |
---|---|---|---|
stop pyxpm 10 and 11 fru-deactivate and activate xpm 11 restart pyxpm 10 and 11 start DAQ | bucket issue | 1/10 | rebooting timing node cmp001 |
rebooting timing node cmp001 | no issue | 0/5 |
in 2023/10/30
action | issue found | stat | remedy |
---|---|---|---|
stop pyxpm 10 and 11 | 9/20 | --- | |
stop pyxpm 10 and 11 | Opal fails in configuration | 5/20 | reboot timing nodes |
stop pyxpm 10 and 11 | groupca and xpmpva are shutdown at startup | 4/20 | ctrl-x in the terminal successfully restart them |
example of the timing shift in the timing nodes (before -D fakecam).
XPM firmware 3.5.4
Opal_config.py still has xpm mini – timing2 hack
In 2023/10/26:
action | issue found | error stat | remedy |
---|---|---|---|
stop pyxpm 10 and 11 | no issue has been detected | 0/10 | --- |
stop pyxpm 10 and 11 | at first xpmpva DAQ:NEH:XPM:11 does not come up Then Opal shutsdown | 3/20 | stop pyxpm 10 and 11 |
Observation Of Front-Panel XPM Link Glitch With Version 3.5.4
Perhaps fixed by Matt in later firmware version?
XPM11 glitches between 4pm and after 6pm and also around 10:10 am the next day
2023/10/27: updating firmware
xpm11 to xpm_noRTM-0x030601000-20231011111938-weaver-645bee8.mcs
xpm10 to xpm-0x030601000-20231011111954-weaver-645bee8.mcs
Testing Details
In 2023/10/24 :
action | result | remedy | result |
---|---|---|---|
Remove XPM10 fiber timing in the back while DAQ running | *** XpmDetector: timing link ID is ffffffff = 4294967295^M | TxlinkReset of cmp015 in XPM11 | DAQ recovers |
Repeat XPM10 fiber timing removal removal | DAQ cannot disable | --- | DAQ recovers by itself at restart |
Repeat XPM10 fiber timing removal removal | --- | --- | no issue |
Repeat XPM10 fiber timing removal removal | DAQ cannot disable | --- | DAQ recovers by itself at restart |
Remove XPM10 fiber timing in the back while DAQ stopped | --- | --- | DAQ starts with no issue |
Repeat XPM10 fiber timing removal removal while DAQ stopped | --- | --- | DAQ starts with no issue |
Remove transceiver from XPM10 in the back (DAQ stopped) | --- | --- | DAQ starts with no issue |
Remove transceiver from XPM10 in the back (DAQ started) | --- | --- | DAQ starts with no issue |
timing 1 shutsdown by itself | TXlinkReset on XPM10 for XPM11 | DAQ recovers | |
Remove fiber on XPM10 to XPM11 | --- | --- | DAQ starts with no issue |
Remove transceiver on XPM10 to XPM11 | --- | --- | DAQ starts with no issue |
Remove fiber on XPM11 AMC0 port 0 | --- | --- | DAQ starts with no issue |
Remove transceiver on XPM11 AMC 0 port0 | --- | --- | DAQ starts with no issue |
opal disappears from the list f detectors | restart DAQ | DAQ starts with no issue | |
power cycle xpm10 via switch only AMC0 | XPM 11 looses timing node | Restart pyxpm 10 and 11 restart pyxpm 11 | DAQ restarts but opal shutsdown |
opal still shutdown | devGui xpmmini timing v2 | no avail | |
Stop pyxpm 10 and 11 | DAQ starts with no issue |
Conclusion:
It appears that yanking the timing fiber can cause disturbances in the system, but they are not repeatable 100% of the time.
XPMs Power spikes can set the DAQ in a behavior similar to the XPM glitch, but only if pyxpms are running. To be repeated.