...
Ridzuan has put together a rough proposal to use Hadoop to store PingER data and make it available. He is evaluating several vendors' distributions of Hadoop 2.0 to decide which one to adopt. He previously installed and tried Hadoop 1.0 successfully, but it has many drawbacks, such as no real-time stream processing. He was awaiting details from Johari about the MYREN Cloud, and Johari has now sent them. The web site is https://cloud.myren.net.my/. It describes the service and how to apply, and a user guide is available there. Ridzuan should be eligible to register for the service, but he needs to check the UM (University of Malaya) procedures for applying.
Ibrahim Abaker is planning to work on a topic initially entitled "leveraging pingER big data with a modified pingtable for event-correlation and clustering". Ibrahim has a proposal, see https://confluence.slac.stanford.edu/download/attachments/17162/leveraging%20pingER%20big%20data%20with%20a%20modified%20pingtable%20for%20event-correlation%20and%20clustering.docx. He is currently reading about RDF storage and retrieval, and he is also trying to set up Hadoop clusters for the experiment. He is in email discussion with Renan on which part to work on first. He plans to collaborate with Renan to make PingER data storage and processing more efficient. Les has sent him PingER documentation, which is very helpful. Ridzuan's work is more to do with hosting the data and stream data analysis. Ibrahim is looking more at applying MapReduce (a programming model for processing large data sets with a parallel, distributed algorithm on a cluster), reducing the storage needs and providing querying of the data.
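As a rough illustration of the MapReduce model mentioned above, the following is a minimal in-process Python sketch, not Hadoop code and not Ibrahim's design. It assumes a hypothetical record layout of (monitoring host, remote host, round-trip time in ms); the host names, values and function names are invented for the example and do not reflect the real PingER data format. The map step emits one key/value pair per measurement, a shuffle step groups values by key, and the reduce step averages each group.

```python
from collections import defaultdict

# Hypothetical records: (monitor_host, remote_host, rtt_ms).
# This layout is an assumption for illustration, not the PingER format.
RECORDS = [
    ("monitor-a", "remote-x", 120.0),
    ("monitor-a", "remote-x", 130.0),
    ("monitor-a", "remote-y", 80.0),
    ("monitor-b", "remote-x", 200.0),
]


def map_phase(record):
    """Emit ((monitor, remote), rtt_ms) for a single measurement."""
    monitor, remote, rtt_ms = record
    yield (monitor, remote), rtt_ms


def shuffle(pairs):
    """Group mapped values by key, as the framework would between phases."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(key, values):
    """Collapse all RTTs for one host pair into their mean."""
    return key, sum(values) / len(values)


def run_job(records):
    """Run map, shuffle and reduce sequentially in one process."""
    mapped = (pair for record in records for pair in map_phase(record))
    grouped = shuffle(mapped)
    return dict(reduce_phase(key, values) for key, values in grouped.items())


if __name__ == "__main__":
    for (monitor, remote), avg in sorted(run_job(RECORDS).items()):
        print(f"{monitor} -> {remote}: {avg:.1f} ms")
```

On a real cluster the map and reduce functions would run in parallel on different nodes and the framework would perform the grouping; here everything runs in a single process only to show the data flow.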
...