虚拟机Linux环境下,安装11G r2的RAC,使用的是grid用户,共两个节点,第二个节点 root时遇到Timed out waiting for the CRS stack to start.
全部完全卸载、磁盘dd操作后重新安装多次,问题依旧。如下:

[root@dbserver2 ~]# /apps/oracle/11.2.0/grid/root.sh
Running Oracle 11g root.sh script...

The following environment variables are set as:
    ORACLE_OWNER= grid
    ORACLE_HOME=  /apps/oracle/11.2.0/grid

Enter the full pathname of the local bin directory: [/usr/local/bin]:
   Copying dbhome to /usr/local/bin ...
   Copying oraenv to /usr/local/bin ...
   Copying coraenv to /usr/local/bin ...


Creating /etc/oratab file...
Entries will be added to the /etc/oratab file as needed by
Database Configuration Assistant when a database is created
Finished running generic part of root.sh script.
Now product-specific root actions will be performed.
2012-06-04 03:04:12: Parsing the host name
2012-06-04 03:04:12: Checking for super user privileges
2012-06-04 03:04:12: User has super user privileges
Using configuration parameter file: /apps/oracle/11.2.0/grid/crs/install/crsconfig_params
Creating trace directory
LOCAL ADD MODE
Creating OCR keys for user 'root', privgrp 'root'..
Operation successful.
Adding daemon to inittab
CRS-4123: Oracle High Availability Services has been started.
ohasd is starting
ADVM/ACFS is not supported on centos-release-4-8

 

CRS-4402: The CSS daemon was started in exclusive mode but found an active CSS daemon on node dbserver1, number 1, and is terminating
CRS-2673: Attempting to stop 'ora.cssdmonitor' on 'dbserver2'
CRS-2677: Stop of 'ora.cssdmonitor' on 'dbserver2' succeeded
An active cluster was found during exclusive startup, restarting to join the cluster
CRS-2672: Attempting to start 'ora.mdnsd' on 'dbserver2'
CRS-2676: Start of 'ora.mdnsd' on 'dbserver2' succeeded
CRS-2672: Attempting to start 'ora.gipcd' on 'dbserver2'
CRS-2676: Start of 'ora.gipcd' on 'dbserver2' succeeded
CRS-2672: Attempting to start 'ora.gpnpd' on 'dbserver2'
CRS-2676: Start of 'ora.gpnpd' on 'dbserver2' succeeded
CRS-2672: Attempting to start 'ora.cssdmonitor' on 'dbserver2'
CRS-2676: Start of 'ora.cssdmonitor' on 'dbserver2' succeeded
CRS-2672: Attempting to start 'ora.cssd' on 'dbserver2'
CRS-2672: Attempting to start 'ora.diskmon' on 'dbserver2'
CRS-2676: Start of 'ora.diskmon' on 'dbserver2' succeeded
CRS-2676: Start of 'ora.cssd' on 'dbserver2' succeeded
CRS-2672: Attempting to start 'ora.ctssd' on 'dbserver2'
CRS-2676: Start of 'ora.ctssd' on 'dbserver2' succeeded
CRS-2672: Attempting to start 'ora.asm' on 'dbserver2'
CRS-2676: Start of 'ora.asm' on 'dbserver2' succeeded
CRS-2672: Attempting to start 'ora.crsd' on 'dbserver2'
CRS-2676: Start of 'ora.crsd' on 'dbserver2' succeeded
CRS-2672: Attempting to start 'ora.evmd' on 'dbserver2'
CRS-2676: Start of 'ora.evmd' on 'dbserver2' succeeded
Timed out waiting for the CRS stack to start.

重新执行root还是不行,如下:

[root@dbserver2 ~]# cd /apps/oracle/11.2.0/grid/crs/install/
[root@dbserver2 install]# /apps/oracle/11.2.0/grid/crs/install/rootcrs.pl -verbose -deconfig -force
……
error: package cvuqdisk is not installed
Successfully deconfigured Oracle clusterware stack on this node

[root@dbserver2 ~]# /apps/oracle/11.2.0/grid/root.sh
……
CRS-2676: Start of 'ora.evmd' on 'dbserver2' succeeded
Timed out waiting for the CRS stack to start.

查看日志:


[root@dbserver2 dbserver2]# pwd
/apps/oracle/11.2.0/grid/log/dbserver2

[root@dbserver2 dbserver2]# tail -f /apps/oracle/11.2.0/grid/log/dbserver2/alertdbserver2.log

2012-06-01 20:03:44.976
[crsd(11217)]CRS-1013:The OCR location in an ASM disk group is inaccessible. Details in /apps/oracle/11.2.0/grid/log/dbserver2/crsd/crsd.log.
2012-06-01 20:03:45.315
[ohasd(10460)]CRS-2765:Resource 'ora.crsd' has failed on server 'dbserver2'.
2012-06-01 20:03:47.083
[crsd(11235)]CRS-1013:The OCR location in an ASM disk group is inaccessible. Details in /apps/oracle/11.2.0/grid/log/dbserver2/crsd/crsd.log.
2012-06-01 20:03:47.631
[ohasd(10460)]CRS-2765:Resource 'ora.crsd' has failed on server 'dbserver2'.
2012-06-01 20:03:49.501
[crsd(11250)]CRS-1013:The OCR location in an ASM disk group is inaccessible. Details in /apps/oracle/11.2.0/grid/log/dbserver2/crsd/crsd.log.
2012-06-01 20:03:49.986
[ohasd(10460)]CRS-2765:Resource 'ora.crsd' has failed on server 'dbserver2'.
2012-06-01 20:03:49.988
[ohasd(10460)]CRS-2771:Maximum restart attempts reached for resource 'ora.crsd'; will not restart.


继续查看: /apps/oracle/11.2.0/grid/log/dbserver2/crsd/crsd.log.

[root@dbserver2 crsd]# tail -200 /apps/oracle/11.2.0/grid/log/dbserver2/crsd/crsd.log

gipcretSuccess (0)
2012-06-01 20:16:12.859: [  OCRASM][3086657216]proprasmo: Error in open/create file in dg [crs]
[  OCRASM][3086657216]SLOS : SLOS: cat=8, opn=kgfoOpenFile01, dep=15056, loc=kgfokge
ORA-17503: ksfdopn:DGOpenFile05 Failed to open file +CRS.255.4294967295
ORA-17503: ksfdopn:2 Failed to open file +CRS.255.4294967295
ORA-15001: diskgroup "CRS"
2012-06-01 20:03:49.499: [  OCRASM][3086726848]proprasmo: kgfoCheckMount returned [6]
2012-06-01 20:03:49.499: [  OCRASM][3086726848]proprasmo: The ASM disk group crs is not found or not mounted
2012-06-01 20:03:49.501: [  OCRRAW][3086726848]proprioo: Failed to open [+crs]. Returned proprasmo() with [26]. Marking location as UNAVAILABLE.
2012-06-01 20:03:49.502: [  OCRRAW][3086726848]proprioo: No OCR/OLR devices are usable
2012-06-01 20:03:49.502: [  OCRASM][3086726848]proprasmcl: asmhandle is NULL
2012-06-01 20:03:49.502: [  OCRRAW][3086726848]proprinit: Could not open raw device
2012-06-01 20:03:49.502: [  OCRASM][3086726848]proprasmcl: asmhandle is NULL
2012-06-01 20:03:49.505: [  OCRAPI][3086726848]a_init:16!: Backend init unsuccessful : [26]
2012-06-01 20:03:49.508: [  CRSOCR][3086726848] OCR context init failure.  Error: PROC-26: Error while accessing the physical storage ASM error[SLOS: cat=8, opn=kgfoOpenFile01, dep=15056, loc=kgfokge
ORA-17503: ksfdopn:DGOpenFile05 Failed to open file +CRS.255.4294967295
ORA-17503: ksfdopn:2 Failed to open file +CRS.255.4294967295
ORA-15001: diskgroup "CRS"
] [8]
2012-06-01 20:03:49.508: [    CRSD][3086726848][PANIC] CRSD exiting: Could not init OCR, code: 26
2012-06-01 20:03:49.509: [    CRSD][3086726848] Done.

检查如下:

[grid@dbserver2 ~]$ asmcmd
ASMCMD> ls
没有任何内容

检查磁盘:
[root@dbserver2 dbserver2]# fdisk -l
……
Disk /dev/sdh: 2147 MB, 2147483648 bytes
255 heads, 63 sectors/track, 261 cylinders
Units = cylinders of 16065 * 512 = 8225280 bytes

   Device Boot      Start         End      Blocks   Id  System
/dev/sdh1               1         261     2096451   83  Linux

Disk /dev/sdi: 2147 MB, 2147483648 bytes
255 heads, 63 sectors/track, 261 cylinders
Units = cylinders of 16065 * 512 = 8225280 bytes

   Device Boot      Start         End      Blocks   Id  System
/dev/sdi1               1         261     2096451   83  Linux

Disk /dev/sdj: 2147 MB, 2147483648 bytes
255 heads, 63 sectors/track, 261 cylinders
Units = cylinders of 16065 * 512 = 8225280 bytes

   Device Boot      Start         End      Blocks   Id  System
/dev/sdj1               1         261     2096451   83  Linux


[root@dbserver2 dbserver2]# /etc/init.d/oracleasm listdisks

[root@dbserver2 dbserver2]# /etc/init.d/oracleasm listdisks
CRS1
CRS2
CRS3
[root@dbserver2 dbserver2]#

[root@dbserver2 dbserver2]# cd /dev/oracleasm/disks/
[root@dbserver2 disks]# ll
total 0
brw-rw----  1 grid asmadmin 8, 113 Jun  2 10:44 CRS1
brw-rw----  1 grid asmadmin 8, 129 Jun  2 10:44 CRS2
brw-rw----  1 grid asmadmin 8, 145 Jun  2 10:44 CRS3
[root@dbserver2 disks]#

下面的检查:
[root@dbserver2 oracle]# cd /apps/oracle/11.2.0/grid/bin/
[root@dbserver2 bin]# ./crsctl query css votedisk
##  STATE    File Universal Id                File Name Disk group
--  -----    -----------------                --------- ---------
 1. ONLINE   6e49b9f535744ffcbf42cf5c1315be5c (/dev/oracleasm/disks/CRS1) [CRS]
 2. ONLINE   8319759558834fc8bf89f783a8177b5b (/dev/oracleasm/disks/CRS2) [CRS]
 3. ONLINE   94e522f261334f18bfc59d576f112016 (/dev/oracleasm/disks/CRS3) [CRS]
Located 3 voting disk(s).
[root@dbserver2 bin]#


重新删除后再安装一次,问题依旧。


dd if=/dev/zero of=/dev/sdh1 bs=1024 count=300
rm -rf /apps/*
chown -R grid:oinstall /apps/
rm -rf /tmp/*
rm -rf /etc/ora*


又未成功


继续检查:
[root@dbserver2 bin]# ./ocrcheck
PROT-602: Failed to retrieve data from the cluster registry
PROC-26: Error while accessing the physical storage ASM error [SLOS: cat=6, opn=kgfo, dep=0, loc=kgfoCkMt03
diskgroup CRS not mounted ()
] [6]
[root@dbserver2 bin]#
在另一个节点 :
[root@dbserver1 bin]# ./ocrcheck
Status of Oracle Cluster Registry is as follows :
         Version                  :          3
         Total space (kbytes)     :     262120
         Used space (kbytes)      :       2280
         Available space (kbytes) :     259840
         ID                       : 1856240294
         Device/File Name         :       +CRS
                                    Device/File integrity check succeeded

                                    Device/File not configured

                                    Device/File not configured

                                    Device/File not configured

                                    Device/File not configured

         Cluster registry integrity check succeeded

         Logical corruption check succeeded
SQL> select name from V$asm_diskgroup
  2  ;

NAME
------------------------------
CRS

SQL> alter diskgroup CRS mount;
alter diskgroup CRS mount
*
ERROR at line 1:
ORA-15032: not all alterations performed
ORA-15017: diskgroup "CRS" cannot be mounted
ORA-15063: ASM discovered an insufficient number of disks for diskgroup "CRS"
SQL> select group_number,disk_number,mode_status,name from V$asm_disk;

GROUP_NUMBER DISK_NUMBER MODE_ST NAME
------------ ----------- ------- ------------------------------
           0           0 ONLINE
           0           2 ONLINE
           0           1 ONLINE

 

重新安装ASMLIB

[root@dbserver1 bin]# rpm -qa|grep oracleasm
oracleasm-support-2.1.7-1.el4
oracleasmlib-2.0.4-1.el4
oracleasm-2.6.9-89.EL-2.0.5-1.el4
oracleasm-2.6.9-89.ELsmp-2.0.5-1.el4

[root@dbserver1 bin]# rpm -e oracleasm-2.6.9-89.EL-2.0.5-1.el4
[root@dbserver1 bin]# rpm -e oracleasmlib-2.0.4-1.el4
[root@dbserver1 bin]# rpm -e oracleasm-2.6.9-89.ELsmp-2.0.5-1.el4
[root@dbserver1 bin]# rpm -e oracleasm-support-2.1.7-1.el4
warning: /etc/sysconfig/oracleasm saved as /etc/sysconfig/oracleasm.rpmsave
[root@dbserver1 soft]# rpm -ivh oracleasm*

 

SQL> show parameter asm

NAME                                 TYPE        VALUE
------------------------------------ ----------- ------------------------------
asm_diskgroups                       string
asm_diskstring                       string
asm_power_limit                      integer     1
asm_preferred_read_failure_groups    string


 


SQL> select group_number,disk_number,mount_status,header_status,path from V$asm_disk;

GROUP_NUMBER DISK_NUMBER MOUNT_S HEADER_STATU PATH
------------ ----------- ------- ------------ ----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
           0           0 CLOSED  MEMBER       /dev/oracleasm/disks/CRS1
           0           2 CLOSED  MEMBER       /dev/oracleasm/disks/CRS3
           0           1 CLOSED  MEMBER       /dev/oracleasm/disks/CRS2

SQL>

最后偿试的解决办法


在虚拟机的配置中加入了:
scsi1:0.deviceType = "disk"
scsi1:1.deviceType = "disk"
scsi1:2.deviceType = "disk"
scsi1:3.deviceType = "disk"
scsi1:4.deviceType = "disk"
scsi1:5.deviceType = "disk"
scsi1:6.deviceType = "disk"
scsi1:7.deviceType = "disk"
scsi1:8.deviceType = "disk"
scsi1:9.deviceType = "disk"
scsi1:10.deviceType = "disk"
scsi1:11.deviceType = "disk"
再次重试


dd if=/dev/zero of=/dev/sdh1 bs=1024 count=300
dd if=/dev/zero of=/dev/sdi1 bs=1024 count=300
dd if=/dev/zero of=/dev/sdj1 bs=1024 count=300
rm -rf /apps/*
chown -R grid:oinstall /apps/
rm -rf /tmp/*
rm -rf /etc/ora*
rm -f /usr/local/bin/dbhome
rm -f /usr/local/bin/oraenv
rm -f /usr/local/bin/coraenv
chown -R grid:oinstall /soft/

重新安装,居然成功,如下所示:

oracleasm scandisks
oracleasm listdisks
[root@dbserver1 ~]# oracleasm createdisk vol1 /dev/sdh1
[root@dbserver1 ~]# oracleasm createdisk vol2 /dev/sdi1
[root@dbserver1 ~]# oracleasm createdisk vol3 /dev/sdj1
[root@dbserver1 ~]# oracleasm listdisks
……
[root@dbserver2 ~]# /apps/oracle/11.2.0/grid/root.sh
Running Oracle 11g root.sh script...

The following environment variables are set as:
    ORACLE_OWNER= grid
    ORACLE_HOME=  /apps/oracle/11.2.0/grid

Enter the full pathname of the local bin directory: [/usr/local/bin]:
   Copying dbhome to /usr/local/bin ...
   Copying oraenv to /usr/local/bin ...
   Copying coraenv to /usr/local/bin ...


Creating /etc/oratab file...
Entries will be added to the /etc/oratab file as needed by
Database Configuration Assistant when a database is created
Finished running generic part of root.sh script.
Now product-specific root actions will be performed.
2012-06-05 15:52:57: Parsing the host name
2012-06-05 15:52:57: Checking for super user privileges
2012-06-05 15:52:57: User has super user privileges
Using configuration parameter file: /apps/oracle/11.2.0/grid/crs/install/crsconfig_params
Creating trace directory
LOCAL ADD MODE
Creating OCR keys for user 'root', privgrp 'root'..
Operation successful.
Adding daemon to inittab
CRS-4123: Oracle High Availability Services has been started.
ohasd is starting
ADVM/ACFS is not supported on centos-release-4-8

 

CRS-4402: The CSS daemon was started in exclusive mode but found an active CSS daemon on node dbserver1, number 1, and is terminating
CRS-2673: Attempting to stop 'ora.cssdmonitor' on 'dbserver2'
CRS-2677: Stop of 'ora.cssdmonitor' on 'dbserver2' succeeded
An active cluster was found during exclusive startup, restarting to join the cluster
CRS-2672: Attempting to start 'ora.mdnsd' on 'dbserver2'
CRS-2676: Start of 'ora.mdnsd' on 'dbserver2' succeeded
CRS-2672: Attempting to start 'ora.gipcd' on 'dbserver2'
CRS-2676: Start of 'ora.gipcd' on 'dbserver2' succeeded
CRS-2672: Attempting to start 'ora.gpnpd' on 'dbserver2'
CRS-2676: Start of 'ora.gpnpd' on 'dbserver2' succeeded
CRS-2672: Attempting to start 'ora.cssdmonitor' on 'dbserver2'
CRS-2676: Start of 'ora.cssdmonitor' on 'dbserver2' succeeded
CRS-2672: Attempting to start 'ora.cssd' on 'dbserver2'
CRS-2672: Attempting to start 'ora.diskmon' on 'dbserver2'
CRS-2676: Start of 'ora.diskmon' on 'dbserver2' succeeded
CRS-2676: Start of 'ora.cssd' on 'dbserver2' succeeded
CRS-2672: Attempting to start 'ora.ctssd' on 'dbserver2'
CRS-2676: Start of 'ora.ctssd' on 'dbserver2' succeeded
CRS-2672: Attempting to start 'ora.asm' on 'dbserver2'
CRS-2676: Start of 'ora.asm' on 'dbserver2' succeeded
CRS-2672: Attempting to start 'ora.crsd' on 'dbserver2'
CRS-2676: Start of 'ora.crsd' on 'dbserver2' succeeded
CRS-2672: Attempting to start 'ora.evmd' on 'dbserver2'
CRS-2676: Start of 'ora.evmd' on 'dbserver2' succeeded
/apps/oracle/11.2.0/grid/bin/srvctl start vip -i dbserver2 ... failed
Preparing packages for installation...
cvuqdisk-1.0.7-1
Configure Oracle Grid Infrastructure for a Cluster ... failed
Updating inventory properties for clusterware
Starting Oracle Universal Installer...

Checking swap space: must be greater than 500 MB.   Actual 3602 MB    Passed
The inventory pointer is located at /etc/oraInst.loc
The inventory is located at /apps/oraInventory
'UpdateNodeList' was successful.

看来问题的原因还就是虚拟机的设置问题。不过此次安装两个节点都有另外的问题,
/apps/oracle/11.2.0/grid/bin/srvctl start nodeapps -n dbserver1 ... failed
Configure Oracle Grid Infrastructure for a Cluster ... failed
这个问题再通过另的办法解决。

Logo

更多推荐