Time & date 

12:00 noon 7/9/2018 at SLAC

Attendees:

Bebo White, Umar Kalim, Les Cottrell

Discussion

See also Feasability of a Blockchain model for the collection and distribution of PingER data, and  https://blockgeeks.com/guides/what-is-blockchain-technology/ for an introduction.

The idea is to reduce/eliminate the dependence on SLAC. 

We identified two possibilities for PingER:

  1. The PingER Oracle meta database of host coordinates (NODEDETAILS)
  2. The actual raw measurement data. Typically found on the PingER Measurement Agent (MA) cached under /usr/local/share/pinger/data.  This data is gathered on a daily basis (by ping_data.pl) from each active MA by SLAC and archived as /nfs/slac/g/net/pinger/pinger2/data/ping-<YYYY>-<MM>.txt. The data flow is described in PingER data flow at SLAC.  It is already publicly available via anonymous FTP.

1).  NODEDETAILS

Enabling MAs to update NODEDETAILS independently would enable a richer sharing of both Beacons (kept in <Beacons> and target hosts (kept in <HostList>.  Currently, only Beacons are shared.  This sharing could be a big advantage to MAs such as SLAC, GZHU and UBRU which have large local <HostList>s. The amount of data in the database relatively small. There are about 3500 hosts in the database including active: ~ 127  Beacons, 40 MAs;  2200 Disabled (no longer active hosts). Each host has about 20 columns of information,  each of which is up to 100Bytes long. As envisioned each snapshot would be a complete representation of the database.  The database is only updated occasionally, e.g. say once a week on average so the number of snapshots is not large.

Maybe while one is learning about about Blockchains this might be a place to start.  Once this is better understood then move on to item 2.

2). Raw measurement data

For the current data storage, just from 2016-01 thru 2018-06 there are about 32 GBytes or  ~0.4GBytes/month or  ~0.012GBytes/day. The data is updated on a daily basis.

If we take the model that each blockchain ledger entry is the result of a 30-minute measurement by an MA, and each MA make its own entry, then we have:

Questions