Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

  • The initial audiences were:
  • The hourly data was chosen rather than the raw data since it is cleaner having been through several filters.
    • The hourly data is in: /nfs/slac/g/net/pinger/pingerreports/hep/<metric>/ directory with the file name <metric><size><by><yyyy><mm>-<dd>.txt.gz
  • The raw (daily gathered data from all the monitoring hosts) is close to 400GBytes. This includes:

    • /nfs/slac/g/net/pinger/pinger_mon_data/ping-<YYYY>-<MM>.txt  (15GB) SLAC measurements
    • /nfs/slac/g/net/pinger/pinger2/data/ping-<YYYY>-<MM>.txt (8GB) SLAC measurements
    • /nfs/slac/g/net/pinger/pingerdata/ (375GB) data gathered at SLAC
    • We ignore these data in the rest of this web page.

...

Uncompressed Volume of files per Year for all metricsUncompressed Volume for 3 metricsVolume of uncompressed data by metric
Image Added

Compression

If  I multiply the file size times the frequency to get the bytes in each bin, and then sum I get 11,566,219,714 Bytes from compressed and  58,434,764,384 from uncompressed. This is ~ a compression ratio of 5:1.  The graph below also shows that Uncompressed files are larger than compressed.

Volume of compressed data by metricFrequency of files by size from compressed & uncompressed dataCumulative & yearly compressed data volumes
Image Added

Missing data

There are two types of missing data:

...

Hourly file data analysis from Renan and Christiane, see also 

https://docs.google.com/spreadsheets/d/1357xGkpYFeW0DcnDB-i7ZER2RBhjdQHEbmMDOvdc7bA/edit?usp=sharing

Pinger Data Volume from Les

...