rexecd and NIS Clash?

From: Gale, Gary (Factiva)
Date: Mon, 04 Feb 2002 07:14:26 -0500

Morning Tru64-ites,

We're coming across a problem with one of our applications, which makes
use of the rexec() library call.

One of our processes acts as a "watchdog", starting, restarting and stopping
a series of other processes. These processes are fired up using rexec(), and
specifying that rexec() should set up an auxiliary channel to the created
process (i.e. the err_file_desc argument to rexec() is non-null).

On our test boxes this code works perfectly, but when we transfer the code
onto our integration test boxes it hangs in rexec(). A bit of debugging
shows that rexec() is waiting in accept(), presumably for the remote server
to connect back to the client in order to set up the auxiliary channel.
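
For anyone unfamiliar with why rexec() would sit in accept(): in the
classic BSD rexec protocol the client opens a listening socket, sends its
port number to rexecd as an ASCII decimal string terminated by a NUL byte,
and then waits for rexecd to connect back on that port. The sketch below
shows that client side under the assumption that Tru64 follows the BSD
behaviour; the helper names are hypothetical and this is not the Tru64
library source. The timeout wrapper is an addition for diagnosis, not
something rexec() itself does.

```c
#include <arpa/inet.h>
#include <netinet/in.h>
#include <stdio.h>
#include <string.h>
#include <sys/select.h>
#include <sys/socket.h>
#include <sys/time.h>
#include <unistd.h>

/* Hypothetical helper: listen on a kernel-chosen port for the server's
 * call-back connection. Returns the listening fd, or -1 on error. */
int open_callback_listener(int *port_out)
{
    struct sockaddr_in addr;
    socklen_t len = sizeof(addr);
    int fd = socket(AF_INET, SOCK_STREAM, 0);

    if (fd < 0)
        return -1;

    memset(&addr, 0, sizeof(addr));
    addr.sin_family = AF_INET;
    addr.sin_addr.s_addr = htonl(INADDR_ANY);
    addr.sin_port = 0;                  /* let the kernel pick a free port */

    if (bind(fd, (struct sockaddr *)&addr, sizeof(addr)) < 0 ||
        listen(fd, 1) < 0 ||
        getsockname(fd, (struct sockaddr *)&addr, &len) < 0) {
        close(fd);
        return -1;
    }
    *port_out = ntohs(addr.sin_port);
    return fd;
}

/* Hypothetical helper: build the first protocol message, the port as
 * ASCII decimal followed by a NUL. Returns the number of bytes to send
 * (including the NUL), or -1 if the buffer is too small. */
int format_port_message(int port, char *buf, size_t buflen)
{
    int n = snprintf(buf, buflen, "%d", port);

    if (n < 0 || (size_t)n >= buflen)
        return -1;
    return n + 1;
}

/* Hypothetical helper: like accept(), but gives up after `seconds` so a
 * missing call-back shows up as a failure instead of a hang.
 * Returns the connected fd, or -1 on timeout or error. */
int accept_with_timeout(int listen_fd, int seconds)
{
    fd_set rfds;
    struct timeval tv;

    FD_ZERO(&rfds);
    FD_SET(listen_fd, &rfds);
    tv.tv_sec = seconds;
    tv.tv_usec = 0;

    if (select(listen_fd + 1, &rfds, NULL, NULL, &tv) <= 0)
        return -1;
    return accept(listen_fd, NULL, NULL);
}
```

If the server side never makes the second connection, for example because
it is itself blocked before reaching that step, the client stays in
accept() exactly as described above.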

Both boxes are GS40s running 4.0F; the only difference seems to be that the
integration boxes are running NIS and the development boxes aren't.

What's stranger is the pattern of failures. Running rexec() against an app
on the test box works, whether the watchdog is on the test box or on the
integration box. Running rexec() against an app on the integration box
never works, whether the watchdog is local (on the integration box) or
remote (on the test box).

Because of this "hang", the remote process never gets started up; I'm
wondering whether there's some reverse lookup occurring on the remote side
which is failing due to a difference between non-NIS and NIS host
resolution.

Any ideas?

All the best

Gary

--
Gary Gale                               Mail: gary.gale_at_factiva.com
UK Server Group                         Phone: +44 (0) 207 542 8814
Factiva, A Dow Jones & Reuters Company  Web: www.factiva.com
Received on Mon Feb 04 2002 - 12:11:34 NZDT
