...
No Format |
---|
http://glastlnx24:5441: org.apache.xmlrpc.XmlRpcException: Failed to create input stream: Connection reset or Connection refused |
This is a problem with the XML-RPC python server. This problem should be brought to the attention of the FO shifter.
19:20 (Richard, for Tony)
Batch jobs were taking a long time, apparently being slow, but were in fact failed with no log files produced. Was tracked down to DNS failures on the balis. It has been reset (reported by Neal Adama at 18:15).
7:10 pm I restarted tomcat12 since the monitoring programs were complaining and ServerMonitoring showed it missing – - Tony
Anchor | ||||
---|---|---|---|---|
|
6:00 pm Old DQM ingestion script put back into production. The new script worked fine for some 24 hours and then we started having "idle" sessions locking out all the following ones. There were some 60 of them waiting. Killing the first one did not solve the problem as the next one went in "idle" state. We decided to kill all the waiting sessions and put the old script back in production. The failed ingest scripts are being rolled back.
Panel | |||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
|
|
|
|
| |||||||||||||||||||||||||||||||||||||
01:00 Restarted tomcat07 due to Data Quality Monitoring Unresponsive. 01:00 PM New DQM ingestion script put into production to avoid ORACLE slowdowns. If any problems, please contact Max.
02:55 Restarted tomcat07 due to Data Quality Monitoring Unresponsive.
19:50 - Data Processing page went unresponsive for 2.5 hours. See GDP-26@JIRA and SSC-84@JIRA
3:38 pm Restarted glast-tomcat07. Data Quality Monitoring Unresponsive RunQuality ExceptionCannot set the run quality flag due to GRQ-4@JIRA
11:55am Restarted glast-tomcat07. Data Quality Monitoring Unresponsive 12:25am: OpsLog and Monitoring/Trending web-apps interfering with each other
Outstanding Issues: David Decotigny requests we get calibration trending working again. 10:18pm Web severs are working again:
The root problem was a software crash on rtr-slb1. |
...