07 Oracle Clusterware Postinstallation Procedures

References

Oracle® Clusterware Installation Guide 11g Release 1 (11.1) for Linux

Procedure

Followed the procedure in Oracle Clusterware Postinstallation Procedures .

Required Postinstallation Tasks

Reviewed Required Postinstallation Tasks .

Back Up the Voting Disk After Installation

Following Back Up the Voting Disk After Installation ,

[root@penrith1 ~]# cd /u01/app/crs
[root@penrith1 crs]# ls
product
[root@penrith1 crs]# mkdir backups
[root@penrith1 crs]# cd backups
[root@penrith1 backups]# dd if=/u04/voting of=voting_20120103.bak
41024+0 records in
41024+0 records out
21004288 bytes (21 MB) copied, 39.7536 seconds, 528 kB/s
[root@penrith1 backups]# ls -l
total 20540
-rw-r--r-- 1 root root 21004288 Jan  3 10:58 voting_20120103.bak
Configure Input/Output Fencing

Following Configure Input/Output Fencing , I find that the hangcheck configuration does not exist in /etc/modprobe.conf . The configuration is entered as follows:

[root@penrith1 ~]# echo "options hangcheck-timer hangcheck_tick=1 \
hangcheck_margin=10 hangcheck_reboot=1" >>/etc/modprobe.conf
[root@penrith1 ~]# grep hang /etc/modprobe.conf
options hangcheck-timer hangcheck_tick=1 hangcheck_margin=10 hangcheck_reboot=1
[root@penrith1 ~]# /sbin/modprobe -v hangcheck-timer
insmod /lib/modules/2.6.18-164.el5/kernel/drivers/char/hangcheck-timer.ko \
hangcheck_tick=1 hangcheck_margin=10 hangcheck_reboot=1
[root@penrith1 ~]# /sbin/lsmod | grep hang
hangcheck_timer         8025  0 
[root@penrith1 ~]# echo "/sbin/modprobe -v hangcheck-timer" >>/etc/rc.d/rc.local
[root@penrith1 ~]# cat /etc/rc.d/rc.local
#!/bin/sh
#
# This script will be executed *after* all the other init scripts.
# You can put your own initialization stuff in here if you don't
# want to do the full Sys V style init stuff.

touch /var/lock/subsys/local
/sbin/modprobe -v hangcheck-timer

Setting the diagwait parameter to 13 for oprocd . First, on PENRITH1 ,

[root@penrith1 ~]# cd /u01/app/crs/product/11.1.0/crs/bin/
[root@penrith1 bin]# ./crsctl stop crs
Stopping resources. 
This could take several minutes.
Successfully stopped Oracle Clusterware resources 
Stopping Cluster Synchronization Services. 
Shutting down the Cluster Synchronization Services daemon. 
Shutdown request successfully issued.

Repeated this on PENRITH2 . Then, on PENRITH1 ,

[root@penrith1 bin]# ./oprocd stop
Jan 03 11:23:31.209 | ERR | failed to connect to daemon, errno(111)

Repeated this on PENRITH2 and got the same result. Then, on PENRITH1 ,

[root@penrith1 bin]# ./crsctl set css diagwait 13 -force
Configuration parameter diagwait is now set to 13.

On PENRITH1 ,

[root@penrith1 bin]# ./crsctl start crs
Attempting to start Oracle Clusterware stack 
The CRS stack will be started shortly

Repeated this on PENRITH2 .

And both hosts started rebooting. The problem appears to be that I did not wait for one node to start up crs completely before starting crs on the other node.

Download and Install Patch Updates

Skipped Download and Install Patch Updates for the time being as I want to complete the 11.1.0.6 installation first before doing the 11.1.0.7 patch.

Recommended Postinstallation Tasks

Followed Recommended Postinstallation Tasks and ran CVU Postinstallation Check as follows:

su - crs
cd /u00/product/11.1.0.6/clusterware
./runcluvfy.sh stage -post crsinst -n penrith1,penrith2 -verbose \
>/tmp/cluvfy_post_crsinst.lst

The log file, cluvfy_post_crsinst.lst , has been uploaded as an attachment to this page.

And the root.sh script was backed up as follows on both PENRITH1 and PENRITH2 :

[crs@penrith1 ~]$ mkdir -p backup/11.1.0.6
[crs@penrith1 ~]$ cp /u01/app/crs/product/11.1.0/crs/root.sh backup/11.1.0.6/