Page History

...

In these log files saw that only one half of the hsd didn't get phase2: /reg/neh/home/cpo/2020/06/15_18:39:03_drp-tst-dev021:tmohsd_7.log /reg/neh/home/cpo/2020/06/15_18:39:03_drp-tst-dev021:tmohsd_6.log.

UNSOLVED: Saw this error in /reg/neh/home/cpo/2020/06/16_15:49:13_drp-tst-dev020:tmohsd_4.log:

Code Block
ERROR:root:Web server error: HTTPSConnectionPool(host='pswww.slac.stanford.edu', port=443): Read timed out. (read timeout=3.05)

UNSOLVED: Saw this error in /reg/neh/home/cpo/2020/06/15_14:39:26_daq-tst-dev07:hsdioc_89.log when starting up processes after a power-cycle (for new hsd firmware for the interrupt-holdoff):

...

PROPOSED SOLUTION: running 1 hsd (datadev_0) and after a while the deadtime goes to 100% from that hsd, but timepausecnt is zero for both hsd’s. I think it's caused by ben's headercntof latching to 1 on the “A” hsd, even though msgdelay is set to 99. Do we need to increase msgdelay? Matt says msgdelay is max 100 (corresponding to 1us). This could be caused by slow updates ignoring dead time if we get long dead time due to the soft-lockup issue above (now solved by Matt, we believe). So could be the same issue. Matt will have SlowUpdates pay attention to dead time to avoid this.

Not Critical

(spurious error) UNSOLVED: early procmgr telnet timeout?

...

Page tree

Versions Compared

Old Version 44

New Version 45

Key

Not Critical