Hostmon Out of System Resources

All questions related to installations, configurations and maintenance of Advanced Host Monitor (including additional tools such as RMA for Windows, RMA Manager, Web Servie, RCC).
sadek76
Posts: 71
Joined: Tue Apr 27, 2004 3:28 pm
Location: New Jersey

Hostmon Out of System Resources

Post by sadek76 »

I'm running version 5.12 on Windows 2003 server, with 4-3.20GHz cpu's and 4Gig's of memory, with 1100 tests, mostly PING and UNC tests, running as a service. Each time an alert is generated I use an external Trapgen program to send a SNMP trap to a Unix trap server. After a few days I try to edit a test or profile or perform any task which opens a sub-window and receive an error message saying that the process can not be completed, out of system resources. Once I stop and restart the service it works fine for the next few days. This sounds like you still have the memory leak bug that was fixed in ver. 5.10.
KS-Soft
Posts: 13012
Joined: Wed Apr 03, 2002 6:00 pm
Location: USA
Contact:

Post by KS-Soft »

Cannot reproduce this problem :(
What other tests/actions do you use? Do you use ODBC logging? If yes, what ODBC driver do you use?
BTW: HostMonitor may send SNMP Trap by itself, use SNMP Trap action.

Regards
Alex
KS-Soft
Posts: 13012
Joined: Wed Apr 03, 2002 6:00 pm
Location: USA
Contact:

Post by KS-Soft »

Could you check what kind of resources is used by HostMonitor? GDI objects? User objects? Memory? Handles?
You may use standard Task Manager for this purpose

Also, do you have installed some antivirus monitor like Norton Antivirus or McAfee?

Regards
Alex
sadek76
Posts: 71
Joined: Tue Apr 27, 2004 3:28 pm
Location: New Jersey

Post by sadek76 »

I have a few URL tests and one service test. I don't use any ODBC logging.
System Resources show: GDI Objects=391, User Objects=211, Memory=19K, Handles=200, Threads=11.
No anti-virus software installed.
I can take a snapshot of the window the next time it happens, but how do I send you a .jpg or .doc file?
KS-Soft
Posts: 13012
Joined: Wed Apr 03, 2002 6:00 pm
Location: USA
Contact:

Post by KS-Soft »

System Resources show: GDI Objects=391, User Objects=211, Memory=19K, Handles=200, Threads=11
Looks Ok.
But as I understand everything works fine now? Could you take resources snapshot when problem occurs?
I prefer .jpg, send it to support@ks-soft.net

Regards
Alex
sadek76
Posts: 71
Joined: Tue Apr 27, 2004 3:28 pm
Location: New Jersey

Post by sadek76 »

Everything is working fine now. When it happens again, I will take snapshots and send them to you.

BTW - Your snmp trap alerts only let me send one varbind per trap. With the trapgen program that I use, I'm able to send multiple varbinds in one trap. Example: "c:\program files\trapgen\trapgen" -d 14.17.70.214 -c S3CCCT -s 108025 -o 1.3.6.1.4.1.3471 -v 1.3.6.1.4.1.11.2.17.2.4 STRING "%TestName%" -v 1.3.6.1.4.1.11.2.17.2.4 STRING "%Status%" -v 1.3.6.1.4.1.11.2.17.2.4 STRING "%TaskComment%" -v 1.3.6.1.4.1.11.2.17.2.4 STRING "%TestMethod%"
User avatar
Marcus
Posts: 367
Joined: Mon Nov 18, 2002 6:00 pm

Out of memory

Post by Marcus »

We also see more and more 'out of memory' errors. This happens only when a lot of reports needs to be generated (we don't use external programs, so I can't confirm that this is the same / similar problem but it does look like it). It does hower not seem to be a lack of memory on the system part, since a few hundreds MB real memory are still available when we see this problem occur and the pagefile is far from being full.

Since the message we get are HostMonitor messages (no system messages indicating a lack of memory), it seems to be a HostMonitor problem. Also HostMonitor does not crash or stop, it just runs on generating more and more out of memory errors when a report needs to be created. It looks like the report engine is sometimes a bit confused :wink:
sadek76
Posts: 71
Joined: Tue Apr 27, 2004 3:28 pm
Location: New Jersey

Post by sadek76 »

I do run alot of reports. Whenever a tests fails, I write it to a report. Also I have reports setup that run every minute. I have trace route tests running and I create reports for them every 5 minutes. I have URL tests which check response time and site availability, which I also write the results to reports.
timn
Posts: 184
Joined: Thu Nov 20, 2003 9:57 am
Location: United States

Post by timn »

Alex: Just FYI, like Marcus, I've also had the "Out Of Memory" errors occur --- on 3 occassions -- typically after the HM machine has been running constantly for two weeks or more. This could indicate a very small/slow mem leak but because it's not been a huge issue, I haven't spent much time pursuing it. I've bigger fish to fry at the moment.

We don't have a lot of reports, but the ones we do have could be where this error is coming from. It appears Marcus has a better feel for this than I do.
KS-Soft
Posts: 13012
Joined: Wed Apr 03, 2002 6:00 pm
Location: USA
Contact:

Post by KS-Soft »

Today I tested HostMonitor on Windows 2000 SP4, generated a lot of reports but do not see any resource leakage.
Probably there is error in some system DLL... Will try on Windows Server 2003.

Timn, Marcus: what Windows do you use? Windows 2003 as well?

IMHO Windows 2000 is the best Windows, Windows 2003 has some bugs, Windows XP has more bugs (I do not count NT 4.0 - its too old and Windows 9x that is not real multitasking OS).

Regards
Alex
User avatar
Marcus
Posts: 367
Joined: Mon Nov 18, 2002 6:00 pm

Post by Marcus »

We use Windows 2000 SP4 with a few security fixes.
timn
Posts: 184
Joined: Thu Nov 20, 2003 9:57 am
Location: United States

Post by timn »

We also use Win2K SP4 - same as Marcos. In our environment, HM is running on it's own dedicated box.

These events are very rare for us. When they do happen, the "Out of Memory" messages do seem to be coming from HM rather than from the OS

Next time it happens, I'll try to gather some additional info.
KS-Soft
Posts: 13012
Joined: Wed Apr 03, 2002 6:00 pm
Location: USA
Contact:

Post by KS-Soft »

I tested on Windows 2000 SP4 + security fixes... I tried all kind of reports - works fine.
May be it appears under some very specific circumstances. Could you send hostmon.ini, Palettes.lst and RepPList.lst files to support@ks-soft.net

Regards
Alex
User avatar
Marcus
Posts: 367
Joined: Mon Nov 18, 2002 6:00 pm

Post by Marcus »

I don't think it is the type of report, but the ammount of reports that needs to be generated.

If we are unlucky, it can be possible that hundreds of reports needs to be generated (actually only a few, but hundreds in a row)
timn
Posts: 184
Joined: Thu Nov 20, 2003 9:57 am
Location: United States

Post by timn »

For us, it only seems to occur after HM has been running continuously for at least two weeks - but even then the problem is rare. For example, HM has currently been running for nearly 4 weeks with no problems.

We have roughly 3,000 tests with an estimated load of 19 tests/sec. We have only 4 different reports.

The problem is so infrequent for us that it is really 'below our radar'. We have other applications we wished ran as well as HM.
Post Reply