On one server (same configuration as the rest) I am regularly getting the following RMA error returned intermittently (as the result for the test) across a number of tests for the server:
"RMA: Cannot send data. An existing connection was forcibly closed by the remote host."
I have checked the timeout settings for both the RMA manager configuration on the HostMonitor server as well as the RMA service on the remote server (both set to 120 seconds).
If I refresh any of the tests that return the above error they return a normal response.
Across the 30 server fleet the hardware/software builds are consistent (built from a standard OS "image") and the range of tests are also identical (with some additional "server specific" tests added per system).
The servers are high spec and the affected server does not appear to be working excessively hard (it's a centralised tape backup server that is typically operating overnight and fairly dormant during the day - when errors are still occuring).
Any idea what may be the cause of this error (or what this error means in more detail so that I can troubleshoot it further)?
Cheers,
Frilby
