ctmfw jobs are DISAPPEARing

Post Reply
snrru0

ctmfw jobs are DISAPPEARing

Post by snrru0 » 18 Mar 2010 11:46

on our AIX server sapdp53 I have noticed the following :

% pwd
/local/config/incontrol/cmaprd/ctm/locks
% cd ../pid
% ls -al
total 16
drwxr-xr-x 2 cmaprd controlm 4096 Mar 17 16:43 .
drwxr-xr-x 19 cmaprd controlm 4096 Mar 18 06:45 ..
-rw-r--r-- 1 root system 0 Mar 10 01:24 ag_6463530
-rw-r--r-- 1 cmaprd controlm 0 Mar 17 16:43 at_5259316
% shagent
root 6463530 - 0:09 ./cmaprd/ctm/exe/p_ctmag
root 5259316 - 5:29 ./cmaprd/ctm/exe/p_ctmat
% cd ../proclog
% ls -ltr AT*
-rw-r--r-- 1 cmaprd controlm 54830 Mar 17 01:27 AT_720944.log.0317
-rw-r--r-- 1 cmaprd controlm 53714 Mar 18 01:21 AT_720944.log.0318
-rw-r--r-- 1 cmaprd controlm 1777852 Mar 18 09:33 AT_5259316.log
-rw-r--r-- 1 cmaprd controlm 17148 Mar 18 09:44 AT_720944.log

As far as I know there should be only one AT log active !? That should be the one that corresponds to p_ctmat process 5259316.
So why are there 2 active : AT_5259316.log and AT_720944.log ?

am I right here ?
and could that be the cause why FW jobs are DISAPPEARing ?

a job log :

17/03/10 17:12:02 SL5208 QUANTITATIVE RESOURCE SERPECC QUANTITY 1 ALLOCATED

17/03/10 17:12:02 SL5105 SUBMITTED TO sapdp53

17/03/10 17:12:18 TR5101 STARTED AT 20100317171202 ON sapdp53

17/03/10 17:12:18 TR5120 JOB STATE CHANGED TO Executing

17/03/10 17:12:38 TR5102 DISAPPEARED AT 20100317171237. RUNCNT 60

17/03/10 17:12:38 TR5134 ENDED NOTOK

17/03/10 17:12:38 TR5120 JOB STATE CHANGED TO Analyzed

17/03/10 17:12:38 SL5214 QUANTITATIVE RESOURCES RELEASED

17/03/10 17:12:38 SL5120 JOB STATE CHANGED TO Wait Rerun

17/03/10 17:12:39 SL5120 JOB STATE CHANGED TO Wait Time Window

sapdp53 config:

Agent User Name : cmaprd
Agent Directory : /local/bin/incontrol/cmaprd/ctm
Agent Platform Architecture : AIX
Agent Version : 6.3.01.400
Agent Host Name : am1hc046
Logical Agent Name : sapdp53


rgds
Ruud van Ruler

User avatar
rahulsehgal
Nouveau
Nouveau
Posts: 148
Joined: 19 Mar 2009 12:00
Location: Delhi
Contact:

Post by rahulsehgal » 22 Mar 2010 2:48

Hi Ruud van Ruler,

Check your agent communication with your Control-M Server as this issue only occurs when agent is in DISABLED state on Control-M Server.

CONTROL-M/Server will not attempt to communicate with a
disabled Agent computer until its status is manually changed to
Available. When the status of an Agent computer is changed to Disabled, any jobs that were running on the computer at that moment are assigned Disappeared status.

so try this command and check if job is able to run properly again or not:-

ctm_agstat -UPDATE <agentname> AVAILABLE

If its still going in to the disappear status, then try restarting the agent services and check if it is communicating properly.

Regards,
Rahul Sehgal

snrru0

Post by snrru0 » 25 Mar 2010 12:51

thanks Rahul

but this is not the case here for other jobs are running fine.
this is an intermittent problem

rgds
Ruud

Post Reply