C H A P T E R  10

Lights Out Management

This chapter introduces the Lights Out Management (LOMlite 2) facilities available for the Netra 20 server and explains how to use the /usr/sbin/lom utility, which provides a user interface to the device.

The chapter contains the following sections:


Introducing Lights Out Management

Management facilities that enable you to detect and respond quickly to problems are implemented by the LOMlite2 board, which is located as shown in FIGURE 10-1. The facilities are independent of Solaris--that is, they are available even when the system is in standby mode--and are particularly useful for managing servers that are deployed in a "lights out" environment. You can also use them to perform quick onsite management tasks at a locally-connected terminal.

You use the LOMlite facilities, remotely or from a local connection, to:

 FIGURE 10-1 The Location of the Lights Out Management Device

Location of LOM device.

LOM Console Port

LOMlite2 events are reported at the LOM console port, which it shares by default with the console. The LOM console port can be configured to display only LOMlite2 events, with normal console activity directed to ttya or ttyb. For further information, see Separating LOMlite2 From the Console on the LOM Console Port.


Powering On or Resetting the Server From the LOMlite2 Shell

To use the Lights Out Management facilities either remotely or locally, you need a terminal connection to the LOM console port on the Netra 20 server.

There are two ways of interrogating the LOMlite2 device or of sending it commands to perform:

The remainder of the current chapter explains this method.

This facility is described in Chapter 11.

When a Netra 20 server is connected to a live electrical supply, it is always either fully powered or in standby power mode. The lom> prompt poweron and poweroff commands power on the system and return it to standby mode, respectively. To remove power entirely from the server you must disconnect its power cable or open all external circuit breakers.



Note - All commands that involve powering on or off, resetting, or specifying the boot mode of the server require you or the named user to have r-level LOMlite2 user permission. If you have not set up any named users of the LOMlite2 device, you have r-level permission by default. If you have set up one or more named users, you must explicitly give them r-level permission to execute these commands. For more information about user privileges, see Setting Up LOMlite2 Privileges for Named Users.



Powering the Server On or Down to Standby Mode

The lom> prompt is accessible when the system is in standby mode. Remember that the LOMlite2 uses standby power and so remains active even when the Netra 20 server is not powered on.

single-step bulletTo power on the server, type:

lom> poweron

single-step bulletTo power the server down to standby mode, type:

lom> poweroff

Displaying the lom> Prompt

When you connect a terminal to the LOM console port, the lom> prompt is displayed only if the server is powered off. If the server is powered on and has not already been put into LOM mode, it will display the console prompt.

single-step bulletTo display the lom> prompt, type the following LOMlite2 escape sequence:

# #.

When you type the LOMlite2 escape sequence, the LOMlite2 device takes control of the LOM console port and the lom> prompt appears on your terminal screen. You can type the LOMlite2 escape sequence at any time.



Note - If you are at the console and you type the first character of the LOM escape sequence (by default this is #), there is a second's delay before the character appears on the screen. This is because the server waits to see if you type the dot (.) character next. If you do, the lom> prompt is displayed. If you do not, the # character appears on the screen.



See also FIGURE 7-1 for a flow chart of the system prompts.

Exiting From the lom> Prompt

single-step bulletTo return to the Solaris console prompt from the lom> prompt, type:

lom> console



Note - If you have the LOM console port dedicated to the LOMlite2 device (instead of shared between the LOMlite2 and the console), this command will have no effect. For information about dedicating the LOM console port to the LOMlite2 device, see Separating LOMlite2 From the Console on the LOM Console Port.





Note - If you have set up named users for the LOMlite2 device, they must have
c-level permission to execute the console command. Without it, the command will not work. For more information about setting up user permissions, see Setting Up LOMlite2 Privileges for Named Users.



Resetting the Server

single-step bulletTo reset the server, type:

lom> reset

Displaying the ok or kadb Prompt

To display the ok or kadb prompt, type the following at the lom> prompt:

lom> break



Note - If you have the LOM console port dedicated to the LOMlite2 device, this command has no effect. For information about dedicating the LOM console port to the LOMlite2 device, see Separating LOMlite2 From the Console on the LOM Console Port. To use the break command, you must have c-level LOMlite2 user privileges. For more information, see Setting Up LOMlite2 Privileges for Named Users.



Controlling the Server's Boot Behavior

The LOMlite2 shell includes a bootmode command:

bootmode [-u][normal|forth|reset_nvram|diag|skip_diag]

This command enables you to dictate the behavior of the server after a reset. Its functionality is identical to the functionality available on Sun keyboards via the L1 key combinations. (However, the bootmode command is provided because the L1 key combinations are not available for the Netra 20 server: you cannot use them from a keyboard that is connected to the server via a serial link.)

You must have r-level LOMlite2 permission to use the bootmode command. For information about user privileges, see Setting Up LOMlite2 Privileges for Named Users.

Boot Modes

If you use the bootmode command without arguments, the LOMlite2 device reports just the current boot mode. The boot modes available are listed in TABLE 10-1.

TABLE 10-1 Boot Modes

Mode

Description

normal

In this mode, the server boots using your OpenBootTM PROM settings. To cause this parameter to take effect, you must reset the server after executing the bootmode command at the lom> prompt.

forth

In this mode, the server does not boot to Solaris but stops the boot cycle at the ok prompt. The command is equivalent to the L1-F key combination for Sun keyboards. To cause the parameter to take effect, you must execute the bootmode forth command at the lom> prompt and then reset the server. (It is only when the server resets that it reads the new forth parameter from the LOMlite2 device.)

reset_nvram

In this mode, the server returns all NVRAM data to its factory-default setting. The command is equivalent to the L1-N key combination for Sun keyboards. To cause the parameter to take effect, you must reset the server after executing the bootmode command at the lom> prompt.

diag

In this mode, the server performs full self-diagnostics as part of the boot process. The command is equivalent to the L1-D key combination for Sun keyboards. To cause the parameter to take effect, you must power off and then power on the server within 10 minutes after executing the bootmode command at the lom> prompt.

-u

This option does not represent a boot mode. However, if you have previously dedicated the LOM console port to the LOMlite2 device and now want to share the port between the console and the LOMlite2 device, you can use the -u option followed by a reset to achieve this. It is a quick alternative to the procedure described for sharing the LOM console port in the section called Separating LOMlite2 From the Console on the LOM Console Port. To cause the parameter to take effect, you must power off and then power on the server within 10 minutes after executing the bootmode command at the lom> prompt.



Monitoring the Server From the LOMlite2 Shell

This section describes the commands that enable you to check the status of the components monitored by the LOMlite2 device. It also describes how to view the events stored in the LOMlite2 device's event log.

Checking the Current Status of All Components

To see the current status of all the components monitored by the LOMlite2 device, type:

lom> environment

CODE EXAMPLE 10-1 shows a sample output from the environment command. In this example, all the components are running normally and none is faulty.

If the fans, PSU output, or supply rails fail on your Netra 20 server, you may need to replace components. For information about ordering and installing field-replaceable components, see Appendix A.

CODE EXAMPLE 10-1 Sample Output from the environment Command
lom> environment
LEDs:
1 Power            ON
2 Fault            OFF
3 Supply A         OFF
4 Supply B         OFF
5 PSU ok           ON
6 PSU fail         OFF
 
Alarms:
1 -                OFF
2 -                OFF
3 -                OFF
 
PSUs:
1 AC               OK
 
Fans:
1 PSU1             OK speed 88%
2 PSU2             OK speed 84%
3 CPU              OK speed 83%
 
Temperature sensors:
1 Ambient          OK 22degC 
2 CPU0 enclosure   OK 22degC 
3 CPU0 die         OK 56degC 
4 CPU1 enclosure   OK 22degC 
5 CPU1 die         OK 55degC 
 
Overheat sensors:
1 PSU              OK
 
Circuit breakers:
1 SCC              OK
2 PSU              OK
 
lom>

Fan Identification

Event reports about the fans inside the server include the number of the fan to which they relate:

LOMlite2 Event Log

Viewing the Last Ten Events in the Event Log

Type the following:

lom> show eventlog

The example shows a sample event log display. The eventlog command reports the last 10 events. Note that the first event is the oldest and that each event has a time-stamp indicating the hours and minutes (and, if applicable, days) since:



Note - If the fans, PSU output, or supply rails fail on your Netra 20 server, you may need to replace components. For information about ordering and installing field-replaceable components, see Appendix A.



Viewing the Entire Event Log

single-step bulletType:

lom> loghistory [pause x] [level y]

where x is the number of lines you want the screen to display before pausing, and y is the severity level of the events you want to see. By default, the loghistory command displays all the events in the log and does not pause the display.

If you specify a severity level, the reports for the level you specify and above are displayed. For example, if you specify level 2, you will see reports of level 2 and level 1 events. If you specify level 3, you will see reports of level 3, level 2, and level 1 events (see TABLE 10-2.

TABLE 10-2 LOMlite2 Severity Levels

Level

Meaning

1

Fatal event

2

Warning event

3

Information event

4

User event


The default security level is 3.

For more information about severity levels, see Viewing the syslogd File.

Each entry in the log includes the time of the event, the server's host name, a unique identifier for the event and a user-friendly text message describing the event.



Note - The LOMlite2 device's Event Log is never cleared and can contain several hundred entries. Events are stored from the first time the server boots. Eventually the buffer can become full, but when this happens the LOMlite2 device will start again at the beginning of the buffer, over-writing the earliest events.



Viewing All Events

single-step bulletTo view all events from the first to the nth logged event, type:

lom> loghistory index +n [pause x] [level y]

where

By default, the loghistory command does not pause the display.

Viewing the Last and Previous n Logged Events

single-step bulletTo view the last event and the previous n events, type:

lom> loghistory index -n [pause x] [level y]

where

By default, the loghistory command does not pause the display.

single-step bulletTo see the last five events, type:

lom> loghistory index -5
+0h39m34s Alarm 1 ON
+0h39m40s Alarm 3 ON
+0h39m54s Alarm 3 OFF
+0h40m0s Alarm 1 OFF
+0h40m58s Fault LED ON

Checking That a Component Has Been Fixed

If a monitored component has failed, the LOMlite2 device does not continue to report the failure.

single-step bulletTo check the status of the component, for example, after attempting to fix it, type:

lom> check

This causes the LOMlite2 device to update the status of all the components it monitors.


Setting Up LOMlite2 Privileges for Named Users

You can specify up to four named users of the LOMlite2 device on a Netra 20 server. By default, no users are set up and therefore no user login prompt appears when you use the LOM escape sequence.

However, if you set up one or more users, each time you use the LOM escape sequence to display the lom> prompt you are prompted for a user name and password. Therefore, one of the user accounts you set up, must be for yourself.

Four areas of authorization are available for named users. By default, all four are open to them. However, if you specify any (using the lom> userperm command), only those that you specify will be available.

Permissions Available for LOMlite2 Users

The four areas of authorization available by default are:

For information about how to specify one or more of these areas of authorization for a named user, see Specifying Permissions for a Named User.

Creating a LOMlite2 User Account

single-step bulletTo create a user account, type:

lom> useradd user name

where the user name is up to eight characters long, begins with an alphabetic character and contains at least one lower-case alphabetic character. You can use any of the following characters in the user name:



Note - You must have User Administration (u-level) authorization to add a user (see Permissions Available for LOMlite2 Users). If you have not added any users, you have a-level and all other levels of authorization by default.



Specifying the Password for a LOMlite2 User Account

single-step bulletTo specify a user account password, type:

lom> userpassword user name

where the user name is the name of a LOMlite2 user account that already exists.



Note - You must have User Administration (u-level) authorization to set a password for a user (see Permissions Available for LOMlite2 Users). If you have not added any users, you have a-level and all other levels of authorization by default.



Viewing Details of a LOMlite2 User Account

single-step bulletTo view the details of a user account, type:

lom> usershow user name

where the user name is the name of an existing LOMlite2 user account.



Note - You must have User Administration (u-level) authorization to view the details of a LOMlite2 user account (see Permissions Available for LOMlite2 Users).



Changing Your Own User Password

1. To change the password for the account you are currently logged into, type:

lom> password

2. When prompted, type the current password.

3. When prompted, type the new password you want to use.

4. Type the new password again to confirm it.

Deleting a LOMlite2 User Account

single-step bulletTo delete a user account, type:

lom> userdel user name

where the user name is the name of an existing LOMlite2 user account.



Note - You must have User Administration (u-level) authorization to delete a user account (see Permissions Available for LOMlite2 Users).



If you delete all the users you have set up, you will no longer see the login prompt when you go to the lom> prompt.

Specifying Permissions for a Named User

By default, all four areas of authorization are available to each named user you set up.

single-step bulletTo limit users to a particular area or areas of authorization, type:

lom> userperm username [c][u][a][r]

You can specify:



Note - If no parameters are specified, a command usage error is displayed.



The parameters are:

This stands for "console permission". It enables the named user to exit from the lom> prompt to the Solaris prompt (as long as the LOM console port is shared between the LOMlite2 and the console).

This stands for "user administration permission". It enables the named user to add and delete users and alter their areas of authorization by using the userperm command.

This stands for "administration permission". It enables the named user to change the LOMlite2 device's configuration variables (see Configuring the LOMlite2 Device).

This stands for "reset permission". It enables the named user to reset the server and to power it on and off using the LOMlite2 device.

Quitting a LOMlite2 Named User Session

single-step bulletTo quit a named user session, type:

lom> logout

This returns you to the LOMlite2 login prompt.


Configuring the LOMlite2 Device

The LOMlite2 set command can take the following variables as arguments:

The first four duplicate the functions of the commands having the same name. The last enables you to stop the LOMlite2 device from sending event reports to the LOM console port.



Note - To execute the commands described in this section, a named user must have a-level permission. For more information, see Setting Up LOMlite2 Privileges for Named Users.



Turning the Fault LED on

single-step bulletTo turn the Fault LED on by setting the faulton variable, type:

lom> faulton

Turning the Fault LED off

single-step bulletTo turn the Fault LED off by setting the faultoff variable, type:

lom> faultoff

Setting the alarmon Software Flag

single-step bulletSet the alarmon n variable by typing:

lom> alarmon n 

where n is the number of the alarm you want to turn on: 1, 2, or 3.

Setting the alarmoff Software Flag

single-step bulletTo set the alarmoff configuration variable, type:

lom> alarmoff n 

where n is the number of the alarm you want to turn off: 1, 2, or 3.

Stopping LOMlite2 Sending Event Reports to the LOM Console Port

single-step bulletTo set the event-reporting variable on, type:

lom> set event-reporting on



Note - This is the equivalent of using the lom -E on command from the Solaris shell.



single-step bulletTo set the event-reporting variable off, type:

lom> set event-reporting off



Note - This is the equivalent of using the lom -E off command from the Solaris shell.




Separating LOMlite2 From the Console on the LOM Console Port

By default, the LOMlite2 device shares the LOM console port with the console, and when it has an event report to send it takes control of the LOM console port itself, interrupting any console activity you are performing. To prevent the LOMlite2 device from interrupting the console, either turn serial event reporting off, or dedicate the LOM console port to the LOMlite2 device and use one of the serial ports for console activity (see below).

The advantages of dedicating the LOM console port to the LOMlite2 device and using a serial port as your console port include:

Note, however, that if you dedicate the LOM console port to the LOMlite2 device, you cannot use the console command to quit the LOMlite2 shell. Instead, for access to Solaris, you must connect to the Netra 20 server using the serial port.

If you dedicate the serial port to the console, you cannot bring up the LOMlite2 shell, which means that you cannot interrogate or reconfigure the LOMlite2 device.

To perform anything other than ASCII transfers, you must use the serial port.

Dedicating the LOM console to LOMlite2

1. Set up console connections to both the LOM console port and the serial A port.

2. At the Solaris prompt, type:

# eeprom input-device=ttya
# eeprom output-device=ttya
# reboot 

The serial A port (ttya) is now your console port. The LOM console port remains under the control of the LOMlite2 device.



Note - You can also configure serial port B as the console by substituting ttyb for ttya in the above.



Sharing the LOM Console Port Between LOMlite2 and the Console

By default, the LOM console port is shared by the LOMlite2 device and the console. Therefore, you need only to follow the instructions in this section if you have configured the Netra 20 server using the instructions in the previous section (Dedicating the LOM console to LOMlite2) and you now want to share the LOM console port between the LOMlite2 and the console.

To share the port between the LOMlite2 device and the console:

1. Set up console connections to both the LOM console port and the serial port.

2. At the Solaris prompt, type:

# eeprom input-device=keyboard
# eeprom output-device=screen
# reboot 

In the absence of a screen and keyboard, input-device and output-device default to lom-console, which is now shared between the LOMlite2 device and the console.


Viewing the syslogd File

The LOMlite2 device monitors the status of the fans, supply rails, temperature, and power supply even when the server is powered down (the LOMlite2 device operates on standby power). If it detects a fault, it turns on the Fault LED on the server's front and back panels and stores a report in an event log, which resides in memory on the LOMlite2 device. When Solaris is running, the LOMlite2 device also sends event reports to syslogd, which handles these in the way it has been configured to handle event reports. This means that, by default, it sends them to the console and stores them in the following file:

/var/adm/messages

In this file the reports are displayed with a label identifying them as lom reports and indicating their severity. The different levels of severity are, in descending order:

1. Fatal

These events might concern, for example, exceeded temperature thresholds or supply rail failures. Fatal events may cause the server to shut itself down.

2. Warning

These events might concern fans turning too slowly, the Fault LED having been turned on, or the System Configuration Card having been removed. They are not events that cause the server to shut down, but they do require immediate attention.

3. Info

These events are most likely to inform you that some problem is now OK. For example, if you re-insert the System Configuration Card, the LOMlite2 device will generate an event to say that the System Configuration Card has been restored.

4. User

User level events concern the activity of named users whom you have authorized to access the LOMlite2 device. For example, an event is generated when a user logs in or out.


The LOMlite2 Shell Command List

The commands you can execute from the lom> prompt are listed in TABLE 10-3.

TABLE 10-3 LOM Commands

Command

Effect

alarmoffn

Sets alarm n off (where n is 1, 2, or 3.)
These three alarms are software flags. They are associated with no specific conditions but are available to be set by your own processes.

alarmonn

Sets alarm n on
See the description for the alarmoff command.

break

Takes the server down to the ok prompt

bootmode

Determines the behavior of the server during the boot process

check

Resets monitoring to report all failures
If a monitored component has failed, the LOMlite2 device will not continue to report the same failure. To check the status of the component, for example, after attempting to fix it, issue the check command. This updates the status of all monitored components.

console

This command takes you out of the LOMlite2 shell and back to the # prompt or ok prompt, depending on how you invoked the LOMlite2 shell. It returns control of the serial connection to the console.

environment

Displays the temperature of the server and the status of the fans, power supply, over-temperature monitors, alarms, and fault LED

faulton

Sets the Fault LED to On

faultoff

Sets the Fault LED to Off

help

Displays the list of LOM commands

loghistory

Displays all the events in the LOMlite2 device's Event Log

logout

This command is for use by named users you have set up with password access to the LOMlite2 device. It returns them to the LOM user login prompt.

poweron

Powers the server on

poweroff

Powers the server down to standby power mode

reset

Resets the Netra 20 server

show model

Displays the server model

show hostname

Displays the server name (this command is equivalent to the Solaris uname -n command)

show eventlog

Displays the LOMlite2 device's event log
The event log is the list of the last 10 events to have been stored in the LOMlite2 device. The most recent event is the one at the bottom of the list.

show escape

Displays the current LOMlite2 escape sequence

show

Displays all the information available with the show command

useradd

Adds a user to the LOMlite2 device's list of permitted users

userdel

Deletes a user from the LOMlite2 device's list of permitted users

usershow

Displays the details of a named user's LOMlite2 account

userpassword

Sets or changes a user's password

userperm

Sets the permission levels for a named user

version

Displays the version number of the LOMlite2 device