ipc resource configuration on a unix system
the following sections describe the interprocess communication (ipc) parameters on a unix system and provide guidelines for configuring them:
parameter sets controlling ipc resources
shared memory
semaphores
message queues and messages
other kernel tunables
parameter sets controlling ipc resources
on a unix system, the bea tuxedo system uses the ipc resources provided by the unix operating system, which are controlled by the following three sets of tunable parameters.
prefix    tunable parameters starting with this prefix control the
shm       amount of shared memory
sem       number of semaphores
msg       size of message queues and messages
the settings for these parameters are application-dependent. most unix systems are shipped with default values that are too low for a bea tuxedo application.
because the ipc parameters vary across different versions of the unix system, the descriptions provided in the following sections are generic. refer to the bea tuxedo 8.0 platform data sheets for the exact parameter names and defaults for each platform and for information on how to change parameter values. if you change a parameter value, you will need to rebuild the kernel and reboot the operating system, using standard administrative tools. consult your operating system administrator or the system administrator's guide for your platform for details.
if your bea tuxedo application is distributed, the minimum ipc resources must be available on every unix platform participating in the application.
shared memory
in the bea tuxedo environment, shared memory is used for the bulletin board and the control table of the workstation listener (wsl) and the iiop listener (isl) processes. an application may also use shared memory for its own purposes.
the following shared memory parameters may need to be adjusted:
shmmax
maximum size, in bytes, of a shared memory segment. this number represents the largest shared memory segment that can be allocated. a process can, however, attach to more than one segment of size shmmax.
shmseg
maximum number of shared memory segments per process. for a given configuration, the maximum amount of shared memory to which a process can attach is the product shmmax * shmseg, in bytes. a value between 6 and 15 for shmseg should be adequate.
shmmni
maximum number of shared memory identifiers in the system. the bea tuxedo system requires one identifier per bulletin board and an additional identifier for each workstation listener (wsl) and iiop listener (isl) that is running.
shmmin
minimum size, in bytes, of a shared memory segment. this parameter should always be set to 1.
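the arithmetic behind shmseg and shmmni can be sketched as follows. this is only an illustration of the two rules stated above; the function names and the sample values are assumptions, and the shmmni figure covers the bea tuxedo system alone, not other software on the machine.

```python
def max_attachable_bytes(shmmax: int, shmseg: int) -> int:
    """Upper bound on shared memory one process can attach: shmmax * shmseg."""
    return shmmax * shmseg


def min_shmmni(bulletin_boards: int, wsl_count: int, isl_count: int) -> int:
    """One identifier per bulletin board plus one per running wsl and isl."""
    return bulletin_boards + wsl_count + isl_count


# hypothetical values, for illustration only
print(max_attachable_bytes(shmmax=32 * 1024 * 1024, shmseg=6))  # 201326592
print(min_shmmni(bulletin_boards=1, wsl_count=2, isl_count=1))  # 4
```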
semaphores
every process that participates in a bea tuxedo application requires a semaphore. a semaphore is a hardware or software flag used to prevent processes from accessing the same shared memory space at the same time. when a process has control of a shared memory resource, all other processes are locked out of the shared memory resource until the process releases the resource.
when the bea tuxedo application is booted, the underlying bea tuxedo system checks the number of semaphores configured in the operating system. if the configured number is not high enough, the boot fails.
the following semaphore parameters may need to be adjusted:
semmns
maximum number of semaphores in the system. the minimum requirement for semmns is
maxaccessers - maxwsclients + 13
where maxaccessers is the maximum number of bea tuxedo system processes on a particular machine (including servers and native clients) and maxwsclients is the maximum number of bea tuxedo remote clients. both of these parameters are specified in the ubbconfig file for the application. for more information about ubbconfig, see “creating the configuration file” in setting up a bea tuxedo application or ubbconfig(5) in the file formats, data descriptions, mibs, and system processes reference.
semmni
maximum number of active semaphore sets.
semmsl
maximum number of semaphores per semaphore set. semmni and semmsl are commonly chosen so that their product equals semmns. the bea tuxedo system does not perform semaphore operations on semaphore sets; however, it attempts to allocate as many semaphores per semaphore set as possible.
semmap
size of the control map used to manage semaphore sets. semmap should be equal to semmni.
semmnu
number of undo structures in the system. because an undo structure is needed for each process that can access the bulletin board, semmnu must be at least as large as semmns. (the unix operating system uses undo structures to unlock semaphores held by processes that die unexpectedly.)
semume
maximum number of undo entries per undo structure. the value 1 suffices.
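the relationships among the semaphore parameters above can be collected into one small calculation. this is a sketch under stated assumptions: semmsl is treated as an input you have already chosen, and semmni is rounded up so that semmni * semmsl is at least semmns (the text only says the product is commonly chosen to equal semmns). the function name and sample values are hypothetical.

```python
import math


def semaphore_minimums(maxaccessers: int, maxwsclients: int, semmsl: int) -> dict:
    """Derive minimum semaphore settings from the ubbconfig values."""
    semmns = maxaccessers - maxwsclients + 13   # formula given for semmns
    semmni = math.ceil(semmns / semmsl)         # assumption: round up to cover semmns
    return {
        "semmns": semmns,
        "semmni": semmni,
        "semmsl": semmsl,
        "semmap": semmni,   # semmap should be equal to semmni
        "semmnu": semmns,   # semmnu must be at least as large as semmns
        "semume": 1,        # the value 1 suffices
    }


# hypothetical application: 100 accessers, 40 of them workstation clients
for name, value in semaphore_minimums(100, 40, semmsl=25).items():
    print(name, value)
```

with these sample inputs semmns is 73 and semmni is 3, so the three sets provide 75 semaphores, slightly more than the minimum.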
message queues and messages
the bea tuxedo system uses unix system messages and message queues for client/server communication. examples of such messages are service requests, service replies, conversational messages, unsolicited notification messages, administrative messages, and transaction control messages.
every multiple servers, single queue (mssq) set of servers and every individual server has a message queue for receiving requests. every client has its own queue for receiving replies. servers that specify the replyq parameter also get individual reply queues.
the adjustment of kernel message parameters is important to the proper tuning of an application. inappropriate values can lead to an inability to boot, or to severe performance degradation.
several message queue parameters are available to define various characteristics of the queue space, as indicated in the following table.
parameter   specifies the
msgtql      total number of outstanding messages that can be stored by the kernel
msgmnb      total number of bytes that can be stored on one queue
msgmax      maximum size of an individual message
msgseg      total number of message segments that can be outstanding at one time
msgssz      size of each segment
if the limit specified by any of these parameters is exceeded, then a blocking condition occurs. there is one exception to this rule: msgmax. messages that exceed 75 percent of msgmnb, or that are larger than msgmax, are placed in a unix file. a very small message containing the filename is then sent to the recipient. because this mode of operation results in a severe reduction in performance, we strongly recommend that you avoid it.
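the file fallback rule can be expressed as a simple predicate, which is handy when checking whether a planned message size would trigger it. this is a sketch of the rule as stated above, not of the bea tuxedo implementation; the function name and sample sizes are assumptions, and "exceed" is read as strictly greater than.

```python
def sent_via_file(message_size: int, msgmax: int, msgmnb: int) -> bool:
    """True if a message would be placed in a unix file instead of a queue.

    The rule: larger than msgmax, or more than 75 percent of msgmnb.
    Integer arithmetic avoids floating point rounding at the boundary.
    """
    return message_size > msgmax or 4 * message_size > 3 * msgmnb


# hypothetical kernel settings: msgmax = 8192, msgmnb = 16384
print(sent_via_file(8000, msgmax=8192, msgmnb=16384))     # False
print(sent_via_file(9000, msgmax=8192, msgmnb=16384))     # True (over msgmax)
print(sent_via_file(12289, msgmax=16384, msgmnb=16384))   # True (over 75% of msgmnb)
```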
what is application deadlock?
an application deadlock can result if every process is blocked while trying to send a message. for example, when clients fill up the message space with requests, servers that are trying to send replies are blocked. therefore, no server can read a message and a deadlock results. occasionally, timeouts can break a deadlock, but no useful work will have been done.
a client that sends its requests with the tpnoreply flag is especially troublesome. this practice can fill either individual queues or the system message space, depending on the size of the messages. such applications may have to implement their own flow control to limit the number of outstanding messages.
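one possible flow control scheme, sketched here purely as an illustration, is to make every nth request a synchronous one, so the client periodically waits for a reply before sending more. the two callables are hypothetical stand-ins for whatever the client uses to send a request without a reply and to send a request and wait for its reply; nothing here is a bea tuxedo api. on a single first-in, first-out request queue a completed round trip implies that earlier requests have been dequeued; with several servers or message priorities the bound on the backlog is only approximate.

```python
from typing import Callable, Iterable


def send_with_checkpoints(
    requests: Iterable,
    send_noreply: Callable[[object], None],
    send_and_wait: Callable[[object], None],
    window: int,
) -> None:
    """Send requests, forcing a synchronous round trip every `window` messages."""
    if window < 1:
        raise ValueError("window must be at least 1")
    for count, request in enumerate(requests, start=1):
        if count % window == 0:
            send_and_wait(request)   # blocks until the server has replied
        else:
            send_noreply(request)    # fire and forget


# demonstration with stand-in senders that just record what happened
log = []
send_with_checkpoints(
    range(1, 8),
    send_noreply=lambda r: log.append(("noreply", r)),
    send_and_wait=lambda r: log.append(("wait", r)),
    window=3,
)
print(log)
```

a smaller window keeps fewer messages outstanding at the cost of more waiting; the right value depends on message sizes and queue limits.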
to summarize, if clients or servers are blocking on their send operations (requesting services or sending replies), there is potential for trouble. it is usually no problem, though, for a single server request queue to remain full, as long as there is space in the system for more messages on other queues.
performance implications of blocking conditions
there are performance implications to queue blocking conditions, both on the sending side and the receiving side. when waking up blocked processes, the unix operating system wakes up all the processes blocked on a particular event, even if only one can proceed. the other processes go back to sleep. this process scheduling overhead can be expensive.
for example, on an empty server request queue on which more than one server (mssq) resides, an arriving message wakes up all the idle (blocked) servers on that queue. in the case of a full server request queue, as each request is read by a server, the system wakes up all the blocked clients. depending on the size of the messages, zero or more clients can place messages on the queue. the rest go back to sleep. because there may be hundreds of clients in the system, the mass wakeup of all of these clients every time a service request is processed can severely degrade performance.
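to get a feel for the cost of mass wakeups, the following deliberately simplified model counts how many times blocked clients are woken while a full request queue drains. it assumes that each server read wakes every client still blocked and that a fixed number of them (one by default) then succeed in sending; real behavior depends on message sizes and the scheduler, so treat the numbers as an illustration only.

```python
def total_wakeups(blocked_clients: int, admitted_per_read: int = 1) -> int:
    """Count client wakeups until every blocked client has sent its message."""
    if admitted_per_read < 1:
        raise ValueError("admitted_per_read must be at least 1")
    wakeups = 0
    remaining = blocked_clients
    while remaining > 0:
        wakeups += remaining                            # every blocked client is woken
        remaining -= min(admitted_per_read, remaining)  # only some can proceed
    return wakeups


print(total_wakeups(200))  # 20100
```

under this model 200 blocked clients cause 20100 wakeups to deliver 200 messages, which is why keeping queues from filling matters more than any single parameter value.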
tunable message parameters
a properly tuned system rarely fills its queues. enough slack should be left in the queues to handle the natural variability of the message flow. no exact settings can be recommended because tuning is highly application dependent. the unix ipcs(1) command provides a snapshot of the queues so you can determine whether they are full. you can also try setting the tpnoblock flag when sending requests; clients can then detect that a queue is full and slow down. it might also help to increase the scheduling priority of servers with full request queues.
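a snapshot from ipcs can be turned into a fill ratio per queue with a few lines of code. the sketch below assumes the column layout printed by `ipcs -q` on linux (key, msqid, owner, perms, used-bytes, messages); other unix platforms format the output differently, so the parsing would need adjusting. the sample text, owner name, and msgmnb value are made up for illustration.

```python
SAMPLE = """\
------ Message Queues --------
key        msqid      owner      perms      used-bytes   messages
0x0000c1a5 0          tuxadm     660        15872        31
0x0000c1a6 1          tuxadm     660        0            0
"""


def queue_fill(ipcs_text: str, msgmnb: int) -> dict:
    """Map each msqid to used-bytes / msgmnb, skipping banner and header lines."""
    result = {}
    for line in ipcs_text.splitlines():
        fields = line.split()
        # data rows have six columns and a numeric msqid in the second one
        if len(fields) != 6 or not fields[1].isdigit():
            continue
        msqid, used_bytes = int(fields[1]), int(fields[4])
        result[msqid] = used_bytes / msgmnb
    return result


for msqid, ratio in queue_fill(SAMPLE, msgmnb=16384).items():
    print(f"queue {msqid}: {ratio:.0%} full")
```

live output could be fed in through `subprocess.run(["ipcs", "-q"], capture_output=True, text=True).stdout`; a queue that is repeatedly close to 100 percent is a candidate for more servers or a larger msgmnb.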
the following message parameters may need to be adjusted:
msgmni
number of unique message queue identifiers. each process participating in a bea tuxedo application on a particular machine typically needs at least one message queue. this number is reduced if mssq sets are used, which means that multiple server processes share a single queue. for transaction processing, count an additional queue per server group for transaction manager server (tms) processes. thus, the minimum requirement for msgmni can be determined by the following formula:
msgmni = maxaccessers + 7
         + (number of servers with replyq)
         + (number of mssq sets)
         - (number of servers in mssq sets)
msgmax
maximum message size in bytes. msgmax must be large enough to accommodate the messages sent by any bea tuxedo application running on this machine.
msgmnb
maximum message queue length in bytes. this number must accommodate the total size of all messages that are on a queue and have not been taken off by the associated processes. the minimum value for msgmnb is the value of msgmax. messages longer than 75% of msgmnb are sent to a file instead of a message queue—a situation that should be avoided because it severely degrades performance.
msgmap
number of entries in the control map used to manage message segments. the value of msgmap should be the number of message segments (specified in msgseg).
msgssz
size, in bytes, of a message segment. a message can consist of several such segments. the value of msgssz should be such that a multiple of msgssz is equal to the size (including the bea tuxedo system header) of the most commonly sent message. by dividing messages into segments in this way, you can avoid wasting space.
msgseg
number of message segments in the system.
msgtql
total number of outstanding messages that can be stored by the kernel. this is the maximum number of unread messages at any given time.
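two of the rules above lend themselves to a quick calculation: the msgmni formula and the effect of msgssz on wasted space. the sketch below is illustrative only; the function names and sample figures are assumptions, and the message size passed in is taken to already include the bea tuxedo system header.

```python
def min_msgmni(maxaccessers: int, replyq_servers: int,
               mssq_sets: int, servers_in_mssq_sets: int) -> int:
    """Minimum msgmni according to the formula given for that parameter."""
    return (maxaccessers + 7
            + replyq_servers
            + mssq_sets
            - servers_in_mssq_sets)


def segments_used(message_size: int, msgssz: int) -> int:
    """Whole segments consumed by one message (ceiling division)."""
    return -(-message_size // msgssz)


def wasted_bytes(message_size: int, msgssz: int) -> int:
    """Bytes left unused in the last segment of one message."""
    return segments_used(message_size, msgssz) * msgssz - message_size


# hypothetical application: 100 accessers, 5 servers with replyq,
# 4 mssq sets containing 12 servers in total
print(min_msgmni(100, 5, 4, 12))   # 104

# a 1000-byte message: 64-byte segments waste space, 8-byte segments do not
print(segments_used(1000, 64), wasted_bytes(1000, 64))   # 16 24
print(segments_used(1000, 8), wasted_bytes(1000, 8))     # 125 0
```

smaller segments waste less space per message but require a larger msgseg (and therefore msgmap) to hold the same number of bytes.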
other kernel tunables
experience with the bea tuxedo system has shown that some other unix system tunables may need to be set to higher values. these parameters are very application dependent and do not apply to all applications. the section bea tuxedo 8.0 platform data sheets includes information on the defaults for each platform and instructions for changing them.
ulimit
maximum file size. ulimit needs to be large enough so that you can install the bea tuxedo system and build servers. we recommend 4 mb.
nofiles
maximum number of open files per process. a bea tuxedo server requires a minimum of four file descriptors.
maxup
maximum number of processes per non-superuser. the bea tuxedo system processes (servers and administrative processes) run with the uid specified in the application's ubbconfig file. maxup needs to be large enough to allow all of these processes to run.
nproc
maximum number of processes (system wide).
nregion
number of region table entries to allocate. most processes have three regions: text, data, and stack. additional regions are needed for each shared memory segment and each shared library (including text and data) that is attached. however, the region table entry for the text of a “shared text” program is shared by all processes executing that program. each shared memory segment attached to one or more processes uses another region table entry.
numtim
maximum number of streams modules that can be pushed by the transport layer interface (tli). a typical default value is 16; we recommend setting this parameter to at least 256.
numtrw
the number of tli read/write structures to allocate in kernel data space. a typical default value is 16; we recommend setting this parameter to at least 256.
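some of these limits can be inspected from a running process before you change anything. the sketch below uses python's standard resource module to read the per-process limits that most closely correspond to ulimit, nofiles, and maxup; the correspondence is an assumption, since the kernel tunable names differ by platform, and the remaining parameters (nproc, nregion, numtim, numtrw) have no equivalent here.

```python
import resource

# assumed mapping from the tunables above to posix resource limits
LIMIT_NAMES = {
    "ulimit": "RLIMIT_FSIZE",    # maximum file size, in bytes
    "nofiles": "RLIMIT_NOFILE",  # maximum open files per process
    "maxup": "RLIMIT_NPROC",     # maximum processes per user (not on every platform)
}


def current_limits() -> dict:
    """Return the soft limit for each tunable, or None when it is unlimited."""
    limits = {}
    for tunable, attr in LIMIT_NAMES.items():
        constant = getattr(resource, attr, None)
        if constant is None:
            continue  # this platform does not expose the limit
        soft, _hard = resource.getrlimit(constant)
        limits[tunable] = None if soft == resource.RLIM_INFINITY else soft
    return limits


for tunable, value in current_limits().items():
    print(tunable, "unlimited" if value is None else value)
```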