This monitor tracks the status of the HPC MPI Service, which is automatically installed on each cluster node to allow Message Passing Interface (MPI) executable files to be run as tasks on the HPC cluster. When this service is stopped on the head node, the head node cannot communicate with the HPC MPI services that are running on other cluster nodes. When this service is stopped on another cluster node, the node cannot run MPI executable files.
This error can be caused by any of the following:
The HPC MPI Service encountered an error and had to stop running.
The HPC MPI Service is disabled.
Group Policy does not allow this service to start.
To troubleshoot and fix this problem:
Restart the HPC MPI Service on the target node
Start Event Viewer on the target node and check for any system events from the Service Control Manager or application events from the HPC MPI Service. Resolve any errors that are reported by these events.
If the service still cannot be restarted, contact the domain administrator to make sure that this service is not disabled by the domain Group Policy.
If the preceding steps do not resolve the problem, uninstall and reinstall Microsoft HPC Pack on the node.