Wednesday, May 23, 2012
swinstall U320 error on Superdome running HP-UX 11.11
Problem description/symptoms/errors:
EAT BEGIN install AGENT SESSION (pid=28264)
(jobid=NMKT1-0109)
*** Agent session started for user "root@NMKT1". (pid=28264)
*** Beginning Analysis Phase.
*** Source:
NMKT1:/tmp/20120510/scsiU320-00_B.11.11.0911_HP-UX_B.11.11_64.depot
*** Target: NMKT1:/
*** Target logfile: NMKT1:/var/adm/sw/swagent.log
*** Reading source for product information.
NOTE: The filesystems in the filesystem table will not be checked
against those currently mounted because the
"mount_all_filesystems" option is set to "false".
NOTE: The fileset "scsiU320.SCSIU320-KRN,r=B.11.11.0911" will be
reinstalled because the "reinstall" option is set to "true".
NOTE: The fileset "scsiU320.SCSIU320-MAN,r=B.11.11.0911" will be
reinstalled because the "reinstall" option is set to "true".
NOTE: The fileset "scsiU320.SCSIU320-RUN,r=B.11.11.0911" will be
reinstalled because the "reinstall" option is set to "true".
*** Reading source for file information.
*** Executing preDSA command.
NOTE: The used disk space on filesystem "/" is estimated to remain
unchanged.
This will leave 871088 Kbytes of available user disk space
after the installation.
NOTE: The used disk space on filesystem "/stand" is estimated to
increase by 6840 Kbytes.
This will leave 690008 Kbytes of available user disk space
after the installation.
NOTE: The used disk space on filesystem "/usr" is estimated to
remain unchanged.
This will leave 2038968 Kbytes of available user disk space
after the installation.
NOTE: The used disk space on filesystem "/var" is estimated to
remain unchanged.
This will leave 7406408 Kbytes of available user disk space
after the installation.
NOTE: The used disk space on filesystem "/opt" is estimated to
remain unchanged.
This will leave 8652552 Kbytes of available user disk space
after the installation.
*** Summary of Analysis Phase:
*** 3 of 3 filesets had no Errors or Warnings.
*** The Analysis Phase succeeded.
*** Beginning the Install Execution Phase.
*** Filesets: 3
*** Files: 12
*** Kbytes: 763
*** Installing bundle "scsiU320-00,r=B.11.11.0911" .
NOTE: Saving the current system file at "/stand/system" to
"/stand/system.prev"
NOTE: Dynamic tunable values (if applicable) have been added to
"/stand/system".
NOTE: The template file has been extracted from "/stand/vmunix"
It has been placed in "/stand/system" where it will be used
to build a new kernel.
*** Installing fileset "scsiU320.SCSIU320-KRN,r=B.11.11.0911" (1
of 3).
*** Installing fileset "scsiU320.SCSIU320-MAN,r=B.11.11.0911" (2
of 3).
*** Installing fileset "scsiU320.SCSIU320-RUN,r=B.11.11.0911" (3
of 3).
NOTE: Building a new kernel based on template file "/stand/system"
Generating module: krm...
Generating module: SEOS...
Compiling /stand/build/conf.c...
Loading the kernel...
Generating kernel symbol table...
Usage: bstab input_data symbol_cnt strtbl_size kernel_file output_file
*** Error exit code 1
Stop.
make failure.
ERROR: The command "/usr/sbin/mk_kernel", which is used to rebuild
the kernel, has failed. Because kernel-related filesets were
installed, this command must be executed by "swinstall",
without failures, before the load can continue. Check the
above output for details about the failure.
NOTE: The Install Phase has suspended. Check the above output for
reasons.
*** Aborting the Install Phase.
Asked to run /usr/contrib/bin/check_patches
Solution:
Since the kernel build is failing, first test building the kernel manually:
#[/stand]mk_kernel
Generating module: krm...
Generating module: SEOS...
/usr/bin/mkdir -p /stand/build
Compiling /stand/build/conf.c...
Loading the kernel...
Generating kernel symbol table...
Usage: bstab input_data symbol_cnt strtbl_size kernel_file output_file
*** Error exit code 1
Stop.
config: make did an exit(1)
Since it failed again, trace mk_kernel with tusc:
# tusc -E -A -v -o /tmp/tusc.log -a -e -t -l -u -R mk_kernel
After checking with tusc, the linker patch PHSS_33035 was found to be failing verification, and it affects mk_kernel.
Other patches were in question too:
PHKL_32647.C-INC
PHKL_32647.CORE-KRN
PHKL_32647.CORE2-KRN
PHKL_33369.C-INC
PHKL_33369.CORE-KRN
PHKL_33369.CORE2-KRN
PHKL_33369.KERN2-RUN
Are they related? Checking each one:
PHKL_33369 is an AGP cumulative graphics multicard patch, so it should not be an issue.
PHKL_32647 is a patch that extends physical I/O addressing; I do not think it would have any effect.
After looking at the swagent.log from the swverify run by check_patches, we see the following issues:
Verified PHSS_33035.C-ENG-A-MAN,l=/,r=1.0
Verified PHSS_33035.C-INC,l=/,r=1.0
ERROR: Verify failed PHSS_33035.C-KRN,l=/,r=1.0
ERROR: Verify failed PHSS_33035.C-MIN,l=/,r=1.0
ERROR: Verify failed PHSS_33035.C-MIN-64ALIB,l=/,r=1.0
Verified PHSS_33035.CAUX-ENG-A-MAN,l=/,r=1.0
ERROR: Verify failed PHSS_33035.CMDS-AUX,l=/,r=1.0
Verified PHSS_33035.CORE-64SLIB,l=/,r=1.0
Verified PHSS_33035.CORE-SHLIBS,l=/,r=1.0
Verified PHSS_33035.LANG-MIN,l=/,r=1.0
Verified PHSS_33035.LINKER-HELP,l=/,r=1.0
Verified PHSS_33035.PAUX-ENG-A-MAN,l=/,r=1.0
ERROR: Verify failed PHSS_33035.PROG-AUX,l=/,r=1.0
Verified PHSS_33035.PROG-AX-64ALIB,l=/,r=1.0
PHSS_33035 is the ld(1) and linker tools cumulative patch. Corruption of this patch can cause problems when building a new kernel.
Either reinstalling this patch with the reinstall option set to true, or installing the latest linker patch, PHSS_42253, should resolve the mk_kernel failure and allow the install of the scsiU320 driver to complete with no errors. Neither patch requires a reboot.
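The swverify results above can also be filtered mechanically instead of read by eye. The following is a minimal sketch, assuming the log lines have exactly the shape shown above ("ERROR: Verify failed <fileset>,l=/,r=1.0"). The function names failed_filesets and failed_patches are hypothetical helpers, not part of SD-UX.

```shell
# failed_filesets: read swverify log text on stdin and print the name of
# every fileset that failed verification, one per line.
# Assumes lines shaped like:
#   ERROR: Verify failed PHSS_33035.C-KRN,l=/,r=1.0
failed_filesets() {
    awk '/Verify failed/ { sub(/,.*/, "", $NF); print $NF }'
}

# failed_patches: reduce the fileset list to the distinct patch names
# (the part before the first dot).
failed_patches() {
    failed_filesets | awk -F. '{ print $1 }' | sort -u
}
```

For example, failed_patches < /var/adm/sw/swagent.log would list the patches that have at least one failed fileset. A reinstall would then be along the lines of swinstall -x reinstall=true -s <depot> PHSS_33035, where <depot> stands for wherever the patch depot is stored (not given in this post).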
Happy troubleshooting!!
rx8640/11.31 Process Resource Manager: customer using SMH to set up an FSS PRM group gets an error
Problem Description: Unable to add a new PRM group definition. I am trying to create a new PRM group for one database instance and want to use the FSS group type; however, during testing I was not able to add anything other than an "OTHERS" group.
Total Cores: -2, Available Cores: -3
(1 core reserved for FSS groups)
error states
Add PRM Groups
Error: You will exceed your core capacity. Percentages are inaccurate until you adjust your core allocation.
PHKL_39606 1.0 PRM/FSS cumulative patch with LVM enhancement
PRM-Sw-Gui C.03.05 Process Resource Manager PRM-Sw-Gui product
PRM-Sw-Krn C.01.05 Process Resource Manager PRM-Sw-Krn product
PRM-Sw-Lib C.03.05 Process Resource Manager PRM-Sw-Lib product
http://bizsupport2.austin.hp.com/bc/docs/support/SupportManual/c01911854/c01911854.pdf
The customer has one rx8640 split into 2 vPars and is using SMH to try to create the FSS PRM group.
Request sent to the customer: could you collect PRM-specific data and
send it to me? I attached the wlmprm.sh script. Save it to a directory and run it with the -p option:
# chmod 755 wlmprm.sh
# ./wlmprm.sh -p
Command line: ./wlmprm.sh
./wlmprm.sh:
wlmprm.sh 1.08 Mar 9, 2010
Saving data to /tmp/WLMPRM
...
Ready. Check the /tmp/WLMPRM/wlmprm.scan and provide the file /tmp/WLMPRM/wlmprm.tar .
You need to cleanup /tmp/WLMPRM manually.
Observations after receiving the logs:
The source of the error in SMH seems to be that the GUI does not properly recognize the number of cores.
On a test server, I got the same error despite 4 cores being available:
Total Cores: -2, Available Cores: -3
So I updated to PRM version 3.06 from the March 2012 application DVDs:
B3835DA C.03.06 HP Process Resource Manager
Then I ran the config script again (the Java path needed adjusting):
# /opt/prm/bin/prmsmhconfig -c
Creating /etc/opt/prm/conf directory.
Copying files to the SMH directory.
/opt/prm/bin/prmsmhconfig[59]: /opt/java/bin/jar: not found.
The PRM GUI for SMH was successfully configured.
# vi /opt/prm/bin/prmsmhconfig
...
#
# JAR points to the location where Java/jar is installed
#
JAR=/opt/java6/bin/jar
# /opt/prm/bin/prmsmhconfig -c
Creating /etc/opt/prm/conf directory.
Copying files to the SMH directory.
The PRM GUI for SMH was successfully configured.
Now the SMH PRM GUI recognizes the cores and can add FSS groups:
Total Cores: 4, Available Cores: 3
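The "jar: not found" message above is easy to miss because the script still reports success. The following is a minimal sketch of a check that could be run before prmsmhconfig -c, assuming the script contains a plain JAR=/some/path line as in the edit shown above. The function names jar_path_of and check_jar are hypothetical.

```shell
# jar_path_of: print the value assigned to JAR= in the given script.
# Assumes a simple, unquoted line of the form JAR=/some/path.
jar_path_of() {
    sed -n 's/^[[:space:]]*JAR=//p' "$1" | head -n 1
}

# check_jar: succeed only if the configured jar path is an executable file.
check_jar() {
    jar=$(jar_path_of "$1")
    if [ -n "$jar" ] && [ -x "$jar" ]; then
        echo "jar found: $jar"
    else
        echo "jar missing: ${jar:-<unset>}" >&2
        return 1
    fi
}
```

For example, check_jar /opt/prm/bin/prmsmhconfig would have failed while the script still pointed at /opt/java/bin/jar on this system.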
A number of defects like this one are fixed in 3.06. To resolve the issue, please update
to PRM 3.06. Note that the update does require a reboot.
First we should configure and test PRM
on the test server, then copy the PRM configuration file over to the production server,
adjust it, and use the manual method to activate PRM.
The manual configuration of PRM as described in the User Guide still works.
Attached is the latest version of the HP-UX recovery handbook (PRM chapter), FYI.
Monday, March 19, 2012
Mounting of NFS File System on HP-UX 11.31 Server.
Source Server : Server whose file system you want to share
Destination Server : Server where you want to mount.
From the HP-UX 11.31 Source Server :
For example, to mount /stage/StageR12 on the HP-UX Destination Server:
# share -F nfs -o rw /stage/StageR12/
# share        (with no arguments, shows which file systems are currently shared)
- /stage/StageR12 rw " "
#
# shareall
From the HP-UX 11.31 Destination Server :
# mount hostname:/stage/StageR12/ /backup/stage/
or
# mount ipaddress:/stage/StageR12/ /backup/stage/
Unmounting of NFS File System
Destination Server:
# umount /backup/stage/
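If the unmount fails with an error such as:
# umount /mountpoint        (e.g. # umount /backup/stage/)
umount: cannot unmount /dev/disk/disk23 : Device busy
umount: return error 1.
then run the following (note that this kills every process using the mount point):
# fuser -kuc /mountpoint    (e.g. # fuser -kuc /backup/stage/)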
# umount /backup/stage/
Source Server :
# unshare /stage/StageR12/
# unshareall        (if shareall was used on the source server)
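The unmount steps on the destination server can be wrapped so that processes are killed only when the plain unmount fails. This is a minimal sketch using the same commands and flags as above. The function name force_umount is hypothetical.

```shell
# force_umount: try a normal unmount first; only if that fails,
# kill the processes holding the mount point and retry once.
# Returns the exit status of the last umount attempt.
force_umount() {
    mnt=$1
    if umount "$mnt"; then
        return 0
    fi
    echo "umount failed; killing processes using $mnt" >&2
    fuser -kuc "$mnt"
    umount "$mnt"
}
```

Usage would be force_umount /backup/stage/ as root on the destination server. Trying the plain umount first avoids killing anything when the file system is already idle.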
Monday, January 23, 2012
Appreciation mail
From: Miranda, Noel
Sent: Monday, January 23, 2012 7:34 PM
To: K, Krishna Murthy (HP-UX-SWD)
Cc: GSCB-AMS-HPUX
Subject: RE:- GBDHPH14 and himhhqa02 - login issue
Good work, Krish!!
Best regards,
Noel
From: K, Krishna Murthy (HP-UX-SWD)
Sent: Sunday, January 22, 2012 8:36 PM
To: Miranda, Noel
Subject: FW:- GBDHPH14 and himhhqa02 - login issue
fyi
From: Nirmal Rana [mailto:nirmal.rana@in.ibm.com]
Sent: Sunday, January 22, 2012 8:30 PM
To: K, Krishna Murthy (HP-UX-SWD)
Cc: Hartford Unix-ITD GD-HYD-IN; K, Krishna Murthy (HP-UX-SWD); Nirmal.Rana@thehartford.com
Subject: RE:- GBDHPH14 and himhhqa02 - login issue
Hi Krishna,
It was nice working with you. issue seems to be fixed now.
Its working fine for our your IDs. We are just waiting for DBA users to confirm .
You can keep the case under pending for now. I will update for closer.
I appreciate your effort and approach on this issue. Thanks for providing quick resolution.
Regards,
NIRMAL RANA
SME -UNIX, TEAM LEAD- The Hartford-UNIX BAU, GTS Service Delivery
________________________________________
Phone: 1 877-283-8066 Option2 | Mobile: 91-7702548000
E-mail: nirmal.rana@in.ibm.com
"Choose a job you love, and you will never have to work a day in your life -Confucius"
Tuesday, January 3, 2012
REPLACING QUORUM SERVER
http://thomasvogt.wordpress.com/2009/08/05/mcserviceguard-cluster-replace-quorum-server-online/
CFS
http://thomasvogt.wordpress.com/2010/04/27/hp-ux-increase-veritas-cluster-filesystem-cfs-online/
Serviceguard installation steps
http://thomasvogt.wordpress.com/2008/08/26/mcserviceguard-cluster-installation-on-hp-ux-1131/