Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

  1. The first step requires us to create multiple CSV files. Each CSV file will correspond to an independent geolocation technique. Open a spreadsheet already created and copy target IPs and error distance columns into a new spreadsheet and save it as a CSV (.csv) file. Name it against the geolocation technique such as apollonius.csv for Apollonius. Table 2 below shows geolocation technique against its file name.
    1. It is a possible that for some geolocation techniques, we might not have IP addresses, instead we might have hostnames. To handle such a case we have created a shell script GetIPFromHostName.sh to convert a list of hostnames into IP addresses. To do this, copy the hostnames to HOSTS variable inside GetIPFromHostName.sh. These must be separated by white-space or new line character. Run the script to get the print out of IP address list at the terminal.
  2. Put these under a csv directory. Put the csv directory and Node_info.txt file alongside CreateCSVForComparison.pl script. Table 3 below provides links to these files.
  3. Execute CreateCSVForComparison.pl script. This will generate all-analysis.csv file containing data in the following format. This will contain all data including null value for those targets for which a geolocation technique didn't find any estimate results. The name of each technique represents column of error distance values.
  4. Open this all-analysis.csv file and convert this to a spreadsheet for analysis.

...

Table 3 below provides links to the files above in csv directory, Node_info.txt file, all-analysis.csv file and CreateCSVForComparison.pl script.

File

Description

GetIPFromHostName.sh

Script that takes hostnames and converts those to IP addresses. The hostnames list must be copied to HOSTS variable inside the script, each value separated by white-space or new line character. This script outputs a list of IP addresses in the same order as that of hostnames.

csv directory

Contains all the csv files.

Node_info.txt

Contains information such as hostname, IP addresses, Regions, Lat/Longs, etc. for 182 targets.

all-analysis.csv

An amalgamation of all the geolocation techniques and their error distances against the IP addresses and other information. The format is shown above in the box titled "data format of all-analysis.csv".

CreateCSVForComparison.pl

Script that takes csv files from csv directory and Node_info.txt file as inputs and processes out all-analysis.csv file as output.

...

Once all-analysis.csv is generated, don't directly copy it to Windows since there are some formatting issues in such a case. It won't open correctly in Microsoft Excel. So in order to make this right. Open all-analysis.csv via vim (on Linux) and copy paste the text into Windows notepad (later save it as all-analysis.csv). Once done press CTRL+H to find and replace ^M characters that are read by Linux vim but not my by Microsoft Excel and probably therefore causes all sorts of formatting issues.

...