Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

  • The initial audiences were:
  • This The hourly data was chosen rather than the raw data since it is cleaner having been through several filters.
    • The hourly data is in: /nfs/slac/g/net/pinger/pingerreports/hep/<metric>/ directory with the file name <metric><size><by><yyyy><mm>-<dd>.txt.gz
  • The raw (daily gathered data from all the monitoring hosts) is close to 400GBytes. This includes:

    • /nfs/slac/g/net/pinger/pinger_mon_data/ping-<YYYY>-<MM>.txt  (15GB) SLAC measurements
    • /nfs/slac/g/net/pinger/pinger2/data/ping-<YYYY>-<MM>.txt (8GB) SLAC measurements
    • /nfs/slac/g/net/pinger/pingerdata/ (375GB) data gathered at SLAC
    • We ignore these data in the rest of this web page.

Volume of data

There are roughly 100,000 files. The volumes of the files are shown below.

Uncompressed Volume of files per Year for all metricsUncompressed Volume for 3 metricsVolume of uncompressed data by metric
Image Added

Compression

If  I multiply the file size times the frequency to get the bytes in each bin, and then sum I get 11,566,219,714 Bytes from compressed and  58,434,764,384 from uncompressed. This is ~ a compression ratio of 5:1.  The graph below also shows that Uncompressed files are larger than compressed.

Volume of compressed data by metricFrequency of files by size from compressed & uncompressed dataCumulative & yearly compressed data volumes
Image Added

Missing data

There are two types of missing data:

...

Missing files/year by metricTotal number of dots per yearDots per metric
Image Added

Spreadsheets

Hourly file data analysis from Renan and Christiane, see also 

https://docs.google.com/spreadsheets/d/1357xGkpYFeW0DcnDB-i7ZER2RBhjdQHEbmMDOvdc7bA/edit?usp=sharing

Pinger Data Volume from Les