
Friday, March 18, 2011

How to check for hardware errors in AIX?

If the system has crashed or automatically rebooted, check the following:
1) sysdumpdev -L => This shows information about the most recent system dump, if any. If a dump is present, it needs to be sent to the IBM engineer.

2) errpt => Use this to check the error log.

3) errpt -a -j identifier => Once errpt shows which entry is causing the error message, use this command with that entry's identifier. The detailed report tells you whether the error is permanent or temporary and lists its probable causes.

4) diag => This needs to be run ONLY when required. It is not recommended to run diagnostics unnecessarily, especially if the system has not crashed or rebooted automatically.

5) snap => This command gathers system configuration information and compresses it into a tar file.

6) snap -r => This deletes the output of any previous snap runs.

7) snap -gcb => This gathers general system information (-g, which includes the output of lslpp -hBc, required to recreate the exact OS environment) and SSA information (-b), then creates the compressed file (-c).
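
Steps 2 and 3 can be chained together with a small script. The following is only a rough sketch, not an official IBM procedure. It assumes the usual errpt summary layout (IDENTIFIER, TIMESTAMP, T, C, RESOURCE_NAME, DESCRIPTION), where T = P means a permanent error and C = H means a hardware error. The function name hw_error_ids is made up for this example.

```shell
#!/bin/sh
# Sketch: pull the identifiers of permanent hardware errors out of an
# errpt summary and show the detailed report for each one.
# hw_error_ids is a hypothetical helper name, not an AIX command.

# Reads errpt summary lines on stdin, assumed to look like:
#   IDENTIFIER TIMESTAMP  T C RESOURCE_NAME  DESCRIPTION
hw_error_ids() {
    # Column 3 is the error type (P = permanent),
    # column 4 is the error class (H = hardware).
    # Skip the header line and print each identifier only once.
    awk '$1 != "IDENTIFIER" && $3 == "P" && $4 == "H" && !seen[$1]++ { print $1 }'
}

# Only read the real error log when errpt exists (that is, on AIX).
if command -v errpt >/dev/null 2>&1; then
    for id in $(errpt | hw_error_ids); do
        errpt -a -j "$id"
    done
fi
```

On a system with a long error log the detailed output can be large, so it may be worth redirecting it to a file. errpt can also filter by class and type itself (errpt -d H -T PERM); the awk version is used here only so that the filtering logic is visible.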

 
