FEDEARATION- SMOKEPING, PINGER INTEGRATION

 

INTRODUCTION

 

SLAC (Stanford Linear Accelerator Center), Internet2, and other scientific institutes submitted a proposal under the name of “Measurement and Analysis for the Global Grid and Internet End-to-end Performance (MAGGIE)” to the United States Department of Energy (DOE). The main goals of this proposal are to build a federation of measurement infrastructures so that latency measurements (historical and recent) can be shared, best practices among several measurement infrastructures can be adopted, the need for sites to deploy multiple infrastructures to gain all the available benefits be reduced and different analysis and visualization techniques be applied to yield more ubiquitous coverage and new insights. This project “Federation-SmokePing, PingER Integration” serves the main ‘MAGGIE Project’ and deals with the integration of two network latency measurement tools namely SmokePing and PingER.

The objectives of the project include:

1.      Study and evaluate SmokePing and PingER to understand how they work, and their pros and cons.

2.      Extend the capability of SmokePing to derive parameters/metrics not currently supported such as ping unreachability, jitter, derived TCP throughput, etc.

3.      Provide the ability to select and visualize these new metrics. Feed the modifications to SmokePing back into the development base.

4.      Modify PingER to enable it to utilize the SmokePing time series visualization tool.

5.      Evaluate and provide the ability to feed SmokePing data to the Pinger
analysis/visualization toolkit, so we can use the PingER aggregation
techniques to characterize performance by regions, or affinity groups.

The work that has been done with respect to each objective and the results yielded are as follows:

1. Results Obtained from Objective- Working of SmokePing and PingER and the Pros and Cons of Each

 

Benefits of PingER over SmokePing

 

PingER

SmokePing

Parameters Measured

Measures thirteen parameters

Measure eight parameters

Data Visualization

Has an elaborate textual and graphical output

Smoke Ping only provides with graphical results

Data Analysis

Provides analysis for multiple locations using    aggregation techniques to characterize performance by regions, or affinity groups

Provides analysis for installation location and each remote site separately

Data Storage

PingER provides detailed analysis on historical data dating back to greater than 11 years

SmokePing data is not kept in a centralized manner

Development

And Improvement

PingER is being constantly improved through a focused team

There is no official team to upgrade it. Never the less it has a active archive team of developers and users constantly removing problems and improving the software

Error Reporting

PingER has error reporting module

Does not have error reporting module

Data Maintenance

Data keeps on getting appended at the end of the flat files

Once the room in the database finishes, every new data entry causes and overwrite on an existing database entry in a round robin fashion

 

 

Benefits of Smokeping over PingER

 

SmokePing

PingER

Uptime Visualization

Uptime is depicted using colors displayed at the background of SmokePing graphs

No visualization support for dynamic hosts

Alarms

Generates alarms if performance goes below a certain point

Does not need nor generate any alarms

Data Resolution

Stores data in consolidated form after specified time. By default after twelve data sets are filled in the Round Robin Archive, they are combined to form one single data set in another Round Robin Archive. After hundred and forty-four data sets are filled, they are accumulated to form a data set in the third Round Robin Archive of the Round Robin Database.

Keeps data dating back to 11 plus years on high resolution, therefore it has high data storage requirements ~0.6 MB/pair/month

 

 

 

Smart Alarms

Offers plug able alarm modules called 'matchers' which are small Perl modules which examine the past latency measurements and decide if an alarm should be triggered after applying whatever operations necessary to come to a conclusive answer

Does not need nor provide alarms

 

Probes

Offers plug able modules to write application level probes

Does not work with application level probes

 

 

                   The objective “Working of SmokePing and PingER and the Pros and Cons of Each Tool” reveals the follow main facts:

§         SmokePing shows network performance by the help of data collected and displayed at near real time. PingER on the other hand visualizes data of a day prior to date, at the analysis sites.

§         SmokePing uses less memory for the same step interval as PingER by consolidating the data as time passes. This happens by default to the data beyond last three hours.

§         PingER has the ability to show output based on hourly, daily, monthly, last 60 days and last 120 days and all monthly aggregation.

§         Using PingER one can characterize performance based on affinity groups and regions. SmokePing only shows data for one link from source to destination in which the source is always the site where SmokePing is deployed.

§         PingER shows thirteen parameters where as SmokePing shows eight. Never the less SmokePing shows at a glance output of the network performance, with important network latency parameters such as packet loss and RTT together in one graph and Smoke depicting the RTT delay variation.

§         SmokePing generates ‘smart’ alarms based on patterns that the administrator of the software specifies. It also mails the admin if the performance goes significantly low. PingER does not generate alarms nor mails and does not need to do so either as its data is not near real time but of a day prior to date.

§         SmokePing shows uptime for dynamic hosts by displaying colors at the background of the graphs. This tells the amount of time a dynamic host was able to keep its IP address. PingER does not display this feature however it does make sure that it is monitoring the correct host using a script that runs automatically on a daily bases.

§         Both the tools are being constantly improved.

Both SmokePing and PingER have their pros and cons. Depending upon what factors one wants to optimize one can choose between the two. For an in-depth and detailed analysis, and to characterize performance based on affinity groups and regions, a large (13) number of network latency parameters, PingER is the correct choice. If one wants at a glance near real time visualization of network performance, smart alarms, application level probes, with few important latency parameters, with monitoring possibility of only links between the installation system and the targets sites, SmokePing is a better choice.

2. Results Obtained from Objective- Enhancement of SmokePing Network Latency Parameters

 

The results of the objective “Enhancement of SmokePing Network Latency Parameters” are that SmokePing functionality has been enhanced with the following 11 parameters:

1.      RTT Overall Average

§         The total average RTT for all the ping data in the time interval displayed by the graph.

2.      RTT Maximum Average

§         The maximum average RTT that any probe ever reached during the time period displayed by the graph.

3.      RTT Current Average

§         The average RTT for the last probe received.

4.      Derived TCP Throughput

§         This is the bounded TCP throughput in Kilobits per second for the link between the monitoring host and the remote host.

5.      Average Packet Success

§         This is the percentage average success of the ping packets. These are the ones that out of the total packets sent came back.

6.      Maximum Packet Success

§         This is the percentage maximum success of the ping packets. These are the ones that out of the total packets sent for any probe during the time range of graph, came back.

7.      Current Packet Success

§         This is the percentage current success of the ping packets. These are the ones that out of the total packets sent for the last probe came back.

8.      RTT Delay Variation Graph

§         This graph displays the RTT variation for a given range of time. It is the difference between the maximum and minimum ping RTT for a probe.

9.      Average RTT Delay Variation

§         The overall average RTT variation for the RTT Delay Variation Graph.

10.  Packet Success Graph

§         This graph displays the packet success percentage against the axis time.

11.  The Minimum Packet Loss:

§         This is the minimum packet loss that occurred for any probe in the range of the graph.

More parameters of PingER still need to be integrated into SmokePing. I am looking into the possibility of the integrating all latency measurement parameters as “plug-ins”. For this I had posted my question at the Smokeping Archives a couple of times. I have yet to get a reply. 

SmokePing Enhanced Mode Visualization

3. Results Obtained from Objective- Data Conversion from PingER Format to SmokePing Format and its Visualization

 

It is possible to convert PingER data into SmokePing format. The data is converted as a cron job on a daily basis. The data is maintained for the past 90 days. Converted data is visualized using RRDTool and developing graph codes. The converted data is displayed in the form of SmokePing Graphs to a user when he visits the “Selection Web page” and clicks on the desired monitoring host, remote host, packet size, pings per probe combination. The user can also plot the converted data by specifying any start and end date and/or time within the past 90 days.

Eventually the user will be able to plot the graphs by clicking on the corresponding monitoring site link present at the website of PingER whose URL is as follows:

 http://www.slac.stanford.edu/cgi-wrap/pingtable.pl

This will eliminate the need for the “Selection Web page”.

Right now work is being done to fill the data files using ‘update’ function of RRDTool. This will make the conversion code simpler and more efficient.

 

Selection Web Page of ‘PingERtoSmokePing Application’ to Generate Corresponding Graphs

 

 

SmokePing Network Performance Graph of ‘PingERtoSmokePing Application’ Displaying Complete Last 90 Days PingER Data from Source Site “monitor.niit.edu.pk” to Destination Site “monitor.niit.edu.pk”

(Please note that current date is considered as 2004-02-30 00:00)

 

 

 

SmokePing Network Performance Graph of ‘PingERtoSmokePing Application’ Displaying PingER Data Selected from 2004-02-27 to 2004-02-29 from Source Site “monitor.niit.edu.pk” to Destination Site “monitor.niit.edu.pk”