Some e-mail alerts are not being sent + other questions

All questions related to installations, configurations and maintenance of Advanced Host Monitor (including additional tools such as RMA for Windows, RMA Manager, Web Servie, RCC).
Post Reply
myxiplx
Posts: 21
Joined: Tue Apr 13, 2004 1:53 am

Some e-mail alerts are not being sent + other questions

Post by myxiplx »

Since implementing SMS alerts for a few events, we've spotted that e-mails are not always sent when an alert is logged in Hostmonitor.

This morning we've received half a dozen SMS notifications but no e-mail alerts. The action for all these events has included an e-mail and a SMS alert.

We're still getting some alerts through, but checking the log we seem to be missing more than we're getting.

This morning, I received the following alerts via e-mail (all set to send e-mail alert only):
11:29 Ping alert for ROB-010
11:31 Ping alert all clear
11:32 SQL Server alert
11:37 SQL Server all clear
11:37 Web site alert
11:39 Web site all clear

Checking the log file reveals the extra events, both of which are set to send e-mail and SMS alerts:
11:29 Power failure in server room
11:29 Power restored to server room

I know the SMS and e-mail action is set up correctly because we use the same profile when monitoring the server room temperature and always receive both SMS and e-mail alerts.

The only unusual thing about these alerts is that they were fired off because the server had a problem with it's network adaptor and stopped responding on the network. I've changed the tests now so the ping test is more frequent and made the event log tests dependent on the results of the ping, but I'd still have expected to receive e-mails for these alerts.

Any ideas?

------------------------------------------------------------------------------------

[17/08/2005 11:28:52] ping rob-010 No answer ping (timeout - 2000 ms)
[17/08/2005 11:29:01] Power Failure in Server Room Unknown Win32 Error. Code: 1722.<cr> The RPC server is unavailable check NT Event Log
[17/08/2005 11:29:01] Power Restored to Server Room Unknown Win32 Error. Code: 1722.<cr> The RPC server is unavailable check NT Event Log
[17/08/2005 11:29:03] CPU rob-010 critical Unknown CPU Usage
[17/08/2005 11:30:02] Power Failure in Server Room Ok 0 ms check NT Event Log
[17/08/2005 11:30:02] Power Restored to Server Room Ok 0 ms check NT Event Log
[17/08/2005 11:30:53] ping rob-010 Host is alive 0 ms ping (timeout - 2000 ms)
[17/08/2005 11:31:05] CPU rob-010 critical Ok 2 % CPU Usage
[17/08/2005 11:31:49] Sys Monitor started
[17/08/2005 11:31:52] Sys Error: Cannot open script file "C:\Program Files\HostMonitor4\reset puremessage count.hms"
[17/08/2005 11:31:54] MSSQL rob-021 Dema Unknown Access violation at address 00000000. Read of address 00000000 check MS SQL server
[17/08/2005 11:32:58] Sys Error: Cannot open script file "C:\Program Files\HostMonitor4\reset puremessage count.hms"
[17/08/2005 11:35:04] www.robinsons.com No answer URL request
[17/08/2005 11:36:55] MSSQL rob-021 Dema Host is alive 156 ms check MS SQL server
[17/08/2005 11:37:00] www.robinsons.com No answer URL request
[17/08/2005 11:41:53] Sys Error: Cannot open script file "C:\Program Files\HostMonitor4\reset puremessage count.hms"
[17/08/2005 11:42:58] Sys Error: Cannot open script file "C:\Program Files\HostMonitor4\reset puremessage count.hms"
KS-Soft
Posts: 13012
Joined: Wed Apr 03, 2002 6:00 pm
Location: USA
Contact:

Post by KS-Soft »

The only unusual thing about these alerts is that they were fired off because the server had a problem with it's network adaptor and stopped responding on the network
Mail server installed on this server? In this case mail was not send because HostMonitor could not access server...
Could you check system log (log that is specified on Advanced Logs page in the Options dialog)? When HostMonitor cannot execute action, it records information about problem into this log file

Regards
Alex
Post Reply