Grouping of tests for SLA reporting?

All questions related to installations, configurations and maintenance of Advanced Host Monitor (including additional tools such as RMA for Windows, RMA Manager, Web Service, RCC).
Snucke
Posts: 9
Joined: Tue Sep 27, 2005 12:28 am

Post by Snucke »

Hello there,

First of all, thanks for a very useful and versatile product!

I'm wondering whether it is possible to group several tests together to measure the availability of a service for customers.

For example, if I have 10 load-balanced Citrix MetaFrame servers and one of them fails, I still have a 100% working service for my customers to use.

However, I need to show my customers that I fulfill the SLA, and if I monitor the 10 servers individually, the report will contain 10 separate sets of statistics, with some numbers in red and others in black. That would not make any sense to a customer who just wants to know whether the service has been available or not.

The dream scenario would be to make the folder itself a reportable object that assumes a certain status based on the percentage of tests inside the folder that have a specific status.

Regards

/Rob
KS-Soft
Posts: 13012
Joined: Wed Apr 03, 2002 6:00 pm
Location: USA

Post by KS-Soft »

HostMonitor provides some folder-level statistical counters:
http://www.ks-soft.net/hostmon.eng/mfra ... ldersMacro
So you may create a report like this: http://www.ks-soft.net/hostmon.eng/examples/custom1.htm

But it will not help in your case. You need to decrease the "bad" counter only when ALL tests fail, right?
I think you should create an additional dependent test item using the "This test depends on expression" option. The expression may look like "('%::Test1::SimpleStatus%'=='DOWN') and ('%::Test2::SimpleStatus%'=='DOWN') and ('%::Test3::SimpleStatus%'=='DOWN') and ... and ('%::Test10::SimpleStatus%'=='DOWN')", OtherwiseStatus: Host is alive
Then use this test for reports.
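
Typing ten nearly identical clauses by hand is error-prone, so here is a minimal Python sketch that only builds the expression text for pasting into the test settings. It assumes the macro form shown above ('%::TestName::SimpleStatus%'); the names Test1 to Test10 and the helper build_all_down_expression are hypothetical, and the script does not talk to HostMonitor in any way.

```python
def build_all_down_expression(test_names):
    """Join one SimpleStatus comparison per test with 'and'.

    The clause format is copied from the expression quoted in the post;
    the test names passed in are placeholders for your own test items.
    """
    clauses = [f"('%::{name}::SimpleStatus%'=='DOWN')" for name in test_names]
    return " and ".join(clauses)


# Hypothetical test names Test1 .. Test10
names = [f"Test{i}" for i in range(1, 11)]
expression = build_all_down_expression(names)
print(expression)
```

The resulting string is true only when every listed test is DOWN, which matches the "all servers failed" case described above.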

On the other hand, why don't you create a test that checks the service that is important to your customer? E.g. if you are monitoring a web service, create a URL test to monitor that web service...

Regards
Alex
Snucke
Posts: 9
Joined: Tue Sep 27, 2005 12:28 am

Post by Snucke »

Hi Alex,

A specific test for a particular service is of course the best solution, but the problem arises with services that HM cannot cover with a single test. Citrix servers are one example: there we instead have to check that all the different servers are accepting connections.

The expression solution is not suitable, I'm afraid, because we need to monitor and report when a critical number of tests/servers are failing (like 25%). If we have 1000 users connecting to our servers and all but one server go down, we would still be at 100% uptime using expressions, but 99% of the users would not be able to log in...
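
To make the rule I mean concrete, here is a rough Python sketch of the calculation, done outside HostMonitor on status samples collected at regular intervals. Everything in it is an assumption for illustration: the function names, the "UP"/"DOWN" strings, the sample data, and the choice to count the service as down once 25% or more of the servers fail. It is not an existing HostMonitor feature.

```python
def service_is_up(statuses, max_failed_fraction=0.25):
    """Return True while fewer than max_failed_fraction of servers are DOWN."""
    failed = sum(1 for status in statuses if status == "DOWN")
    return failed / len(statuses) < max_failed_fraction


def availability(samples, max_failed_fraction=0.25):
    """Percentage of sampling intervals in which the service counted as up."""
    up = sum(1 for statuses in samples
             if service_is_up(statuses, max_failed_fraction))
    return 100.0 * up / len(samples)


# Hypothetical samples: one list of per-server statuses per interval
samples = [
    ["UP"] * 10,                # all servers healthy
    ["DOWN"] + ["UP"] * 9,      # 10% failed: still counted as up
    ["DOWN"] * 3 + ["UP"] * 7,  # 30% failed: counted as down
    ["DOWN"] * 9 + ["UP"],      # one server left: counted as down
]
print(availability(samples))
```

With the "all tests DOWN" expression, every one of these four intervals would count as up; with the threshold rule only the first two do.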

/Rob
KS-Soft
Posts: 13012
Joined: Wed Apr 03, 2002 6:00 pm
Location: USA

Post by KS-Soft »

"The expression solution is not suitable, I'm afraid, because we need to monitor and report when a critical number of tests/servers are failing (like 25%)."
Sorry, this is impossible in the current version.
Sounds like we need a completely new report manager...

Regards
Alex