Monday, February 22, 2010

Process time out while waiting for a managed process to start

I installed ESB_DT using an Ant script and then restarted my SOA Suite, but it did not start properly. All the services started except the oc4j_soa container.

I was getting exceptions such as:

channel closed, exception is ChannelClosedException



The reason is that service failover is not configured.

Setting service-failover="1" indicates that I want only one of these OC4J instances running in my cluster. In active-passive mode there should be only one active ESB_DT.

As per the documentation, if you have configured a cluster you need to make the changes on both nodes.

You also need to remove the parameter numprocs="1" for the ESB_DT group.

This is done as follows:

Open the opmn.xml file under SOA_HOME\opmn\conf.

There you will find an entry:

<process-set id="ESBDT_GROUP" numprocs="1"/>

Remove numprocs="1" so that the entry now looks like:

<process-set id="ESBDT_GROUP" />

Next, find the entry

<process-type id="OC4J_ESBDT" module-id="OC4J" status="enabled" >

and add the attribute service-failover="1" to enable failover.

The entry should now look like:

<process-type id="OC4J_ESBDT" module-id="OC4J" status="enabled" service-failover="1">

Save the changes.
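Taken together, the two edits above would leave the ESB_DT entry looking roughly like the fragment below. This is only a sketch: it assumes the process-set is nested inside the process-type, as is usual in opmn.xml, and it leaves out whatever other child elements your file already has.

```xml
<!-- Sketch only: existing child elements (module-data, ports, etc.) are omitted and stay unchanged -->
<process-type id="OC4J_ESBDT" module-id="OC4J" status="enabled" service-failover="1">
   <!-- numprocs="1" removed, because service-failover does not support numprocs -->
   <process-set id="ESBDT_GROUP"/>
</process-type>
```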

Run opmnctl reload to load the new configuration. Then stop the server and start it again; you should now be able to start oc4j_soa.



I removed numprocs="1" from the process-set because service-failover does not support numprocs (i.e. multi-JVM).


In general, this error occurs when OC4J takes longer to start than the start timeout configured in opmn.xml.
There can be many reasons for that, such as memory issues or too many processes, but you can work around it with the following changes.

Open the opmn.xml file under SOA_HOME\opmn\conf

and increase the start timeout parameter for the container in which you are facing the issue. By default it is 600 seconds.

Similarly, if you get the error "Process time out while waiting for a managed process to stop",

raise the stop timeout parameter from its default of 120 to a higher value.
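As a rough illustration, the timeouts are set on the start and stop elements inside the container's process-type. The fragment below assumes the affected container is oc4j_soa; the values 1200 and 300 are arbitrary examples, not recommendations, and the other child elements and attributes already in your file are omitted.

```xml
<!-- Sketch only: keep the existing attributes and child elements of your own process-type -->
<process-type id="oc4j_soa" module-id="OC4J" status="enabled">
   <!-- start timeout raised from the default 600 (example value) -->
   <start timeout="1200"/>
   <!-- stop timeout raised from the default 120 (example value) -->
   <stop timeout="300"/>
</process-type>
```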

Once these changes are done, save the file and run opmnctl reload to reload the configuration. Then restart the server.
