Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

Image Added Computing Newsletter

Disk Space

We continue to add about 5TB of new data a week bringing the total size of LAT data at SLAC to xxxTB (xxx L1 output, xxx MC and xxx reprocessed data). 4 new file yyyTB servers arrived at SLAC on Monday and hopefully by the time you are reading this the first one will have been installed averting the need to store older data only on tape. Although the LAT data is all kept on raid arrays with multiple redundant drives we have also acquired an addition 250TB of tapes so that we can continue to keep all data backed up in case of unanticipated disk problems.

Gallery
excludeabacus.jpg,title=titleNew file servers arrive unpacked at SLAC , (Monday 7 Dec 2009)

New user disk space (too successful?)

The new user disk space, /afs/slac.stanford.edu/g/glast/users, is gaining users and usage. Currently, 3 TB of disk space is allocated (of which about 1/2 is actually used) amongst 167 users. In addition, there are nine science groups with allocations totaling 345 GB. From this perspective, the new user space is a success.

Wiki Markup
However, a usage pattern for user space has emerged that is stressing the server.  Submitting 100s of simultaneous batch jobs can cause the server to become non-responsive which, in turn, causes batch jobs to stall and eventually fail.  In addition, interactive users attempting to access this space will be unsuccessful.  The SLAC Computing Division has been alerted of this issue with the hope that a solution can be worked out.  In the meantime, please be aware of the possibility that one can overload that server and affect other users.  Batch jobs should be limited to prevent such overloading conditions.  This can be done by dribbling in batch jobs a few at a time while \[monitoring the server\|[http://ganglia01.slac.stanford.edu:8080/ganglia/glast/?m=load_one&r=hour&s=descending&c=nfs-glast&h=sulky55.slac.stanford.edu&sh=1&hc=4
Image Removed
]\].   When the CPU utilization exceed \~50%, you are entering the danger zone.

Pass 7.2 reprocessing

The first 14 months of survey data was reprocessed in October and November with the new Pass 7.2 event classification. The data sample extends from run 239557414 (2008-08-04 15:43:34 UT) through run 277596392 (2009-10-18 22:06:32 UT), spanning 6581 runs and including over 14 billion events. The C&A group is currently evaluating this reprocessed data and, depending on their findings, there may be another reprocessing cycle early next year.

...