...
Area | Problem | Comments | Resolution |
---|---|---|---|
Oracle | GLASTTREND space full |
| The space was expanded |
LSF | Only 100-200 jobs running, when 500+ in queue. | LSF reported 467 jobs in "RSV" status. Neal reports that this is a problem that they have seen before and are investigating with Platform. He requests we contact him if we see it again, but it has not reoccurred since 14:20 on Saturday | ? |
Xrootd | Xrootd slow | Wilko has postulated that the problem may have been that the scratch disk on the batch machine was too busy. He will ask Yemi to add monitoring of the batch scratch disks to ganglia | |
Pipeline | Some DEV jobs failing in strange way | A race condition was discovered where the mail message from the batch job could be received before the stream had been transitioned to "QUEUED" state. | Work around installed in DEV PII-319@JIRA |
...
Area | Problem | Comments | Resolution |
---|---|---|---|
Oracle | GLASTTREND space full again | ? | Ian added 32 GB of space and changed the critical threshold to 90% |
Pipeline | 2 Stream on DEV are waiting, even though all their PIs are finished | Dan is investigating, probably a result of the patch he put into DEV on Saturyday Saturday | ? |
Note at 17:31 new stored procedures were installed into the PROD pipeline
...