Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

1. Introduction

Wiki Markup
PingER has a huge amount of data and, until the conclusion of this project, the easiest way to retrieve it is through Pingtable \[ref\]. Pingtable provides a friendly web interface to retrieve PingER raw data (millions of text files) and load it into a human readable HTML page. However, this is not a web standard and crossing PingER data to generate very specific information may not be possible or extremely difficult using the existing way to retrieve PingER data. This project attempts to provide a standard semantic web format to data retrievable in Pingtable.

...

Wiki Markup
Finally, the existing APIs to handle RDF provide well-known publishing formats such as JSON \[ref\], CSV \[ref\], and XML \[ref\]. We can conveniently get the results of a query and put it into visualization libraries to come up with very interesting visualizations of the data.

2. Ontology

a. Goal

To define the vocabulary used by PingER as well as its terms, concepts, taxonomy, and relations with each other.

...

b. Progress

...

Wiki Markup
We are basing our ontology on the one proposed by the project MOMENT\[1\] described by the article _Measurement Ontology for IP traffic (MOI); IP traffic measurement ontologies architecture_, by the European Telecommunications Standards Institute (ETSI) \[2\].

...

  • Wiki Markup
    Freebase \[ref\], an open large graph database.
  • Wiki Markup
    DBPedia \[6\], the ontology and the resources, will be used to provide more information about any geographic location or any other thing Wikipedia has that can be connected to the PingER. So it should be possible to make very specific queries.

...

To do

...

Wiki Markup
After doing very complex _mashups_ \[ref\] using the ontology, we need to verify if the ontology needs any adjustments.

3. RDF Repository

...

Goal

...

To establish a good environment for the RDF repository.b.

...

Progress

...

Wiki Markup
We analyzed the existing technologies to make it possible. There are well-known Triple Storages such as Jena, Sesame, and Virtuoso \[7\].  According to \[7\], Virtuoso struggles to load large datasets (>1M triples). Hence, we first decided to try Jena SDB (with MySQL) and then Sesame Native.

...

Therefore, we decided to migrate the project to use Open RDF Sesame Native as RDF Repository.

...

c. To do

...

Run very complex queries to test the performance.

...

Wiki Markup
\[3\] PingER Node Details. Retrieved from [https://confluence.slac.stanford.edu/display/IEPM/PingER+NODEDETAILS|../../../../../../../../../display/IEPM/PingER+NODEDETAILS||\||] on June 5, 2013.

...