• Management Pack:  HPC Server
  • MP Version:  3.1.3266.0 for HPC Server 2008 R2
  • Released:  2/14/2011
  • Publisher:  Microsoft

MPI Ping-Pong Lightweight Throughput Test Result Monitor

  • ID:  Microsoft.HPC.2008R2.Monitor.HeadNode.Performance.MpiPingPongTest
  • Description:  MPI Ping-Pong Lightweight Throughput test result based performance monitor for HPC 2008 R2 cluster
  • Target:  HPC 2008 R2 Active Head Node
  • Enabled:  No

Operational States

Name State Description
Success Success  
Failure Warning  

Overridable Parameters

Parameter Name Default Value Description Override
Timeout Seconds 600  
Interval Seconds 86400  
Sync Time    

Alert Details

Monitor State Message Priority Severity Auto Resolution
Failure (Warning) MPI Ping-Pong Lightweight Throughput test failed Medium Match Monitor Health Yes

Run As Profiles

Name
HPC Server Admin Action Acount

Monitor Knowledgebase

Summary

This monitor tests the bandwidth and the latency of node-to-node communication between different nodes by invoking the MPI Ping-Pong: Lightweight Throughput diagnostic test (mpipingpong.exe) that is included in Windows HPC Server 2008 R2. The monitor is set to run daily by default.

This monitor will enter the Warning state if the MPI Ping-Pong diagnostic test fails on the cluster.

Causes

The Warning state is caused by either network congestion or network connectivity issues in a cluster. A Warning state indicates that at least one node is performing poorly relative to the other nodes in the cluster. A poorly performing node meets both of the following criteria:

  • The average latency/throughput over all network links for the node is at least one standard deviation away from the mean value for the cluster. AND

  • The latency is at least 20% higher or the throughput is at least 20% lower than the cluster mean. This avoids unwarranted warnings on highly uniform cluster networks.

Resolutions

To troubleshoot and fix this problem:

  • Check the network connectivity for all the nodes in the cluster.

  • Check the diagnostic test results in HPC Cluster Manager (in Diagnostics), where detailed information can be obtained about the failure.

External References
This monitor does not contain any external references.

See Also for HPC Server Management Pack


Downloads for HPC Server Management Pack

AZURE OPTIMIZATION ASSESSMENT GET STARTED
MIGRATION TO AZURE GET STARTED
SYSTEM CENTER MIGRATION TO AZURE GET STARTED
MIGRATION TO AZURE FOR SQL AND WINDOWS 2008 GET STARTED