Hello to all in the Forum,
I have a customer who has a recurring problem where the server has to be rebooted. At first it was once a month
on average and now the server requires a reboot about every 2 weeks. The problem presents itself as a major slow
down of I/O on the TSM server not only to disk but to tape devices as well. If a backup completes, it will often
run 50% (or more) longer than normal.
There are no errors reported in the TSM act log. There are no errors shown by the errpt command. All normal TSM
backup processes and sessions just slow down to a crawl. The server itself is still responsive when you login. We
have thought it was TSM but in the past we have tried to just restart the TSM service to see if that would resolve
the problem but it did not seem to help and a full reboot of the server is needed to restore performance.
Has any of you seen this problem? Any suggestions? We are hoping to get some advice from this forum before we get
IBM involved since we are still not sure if the problem is with TSM or with AIX (or even possibly storage-
related).
Hardware and software environment:
AIX 7.1 TL1 SP5
P740 32GB of memory 4 procs
single LPAR for TSM only
TS3310 Tape library with 4 LTO SAN-attached tape drives
TSM Server V6.3.1.0
DB and logs on storage presented by SVC
30TB of primary storage (deduped) on a non-shared SAN-attached Hitachi AMS2100 disk array
SAN attachment uses separate HBA ports for tape, SVC, and Hitachi connections
Best Regards,
Bob...
I have a customer who has a recurring problem where the server has to be rebooted. At first it was once a month
on average and now the server requires a reboot about every 2 weeks. The problem presents itself as a major slow
down of I/O on the TSM server not only to disk but to tape devices as well. If a backup completes, it will often
run 50% (or more) longer than normal.
There are no errors reported in the TSM act log. There are no errors shown by the errpt command. All normal TSM
backup processes and sessions just slow down to a crawl. The server itself is still responsive when you login. We
have thought it was TSM but in the past we have tried to just restart the TSM service to see if that would resolve
the problem but it did not seem to help and a full reboot of the server is needed to restore performance.
Has any of you seen this problem? Any suggestions? We are hoping to get some advice from this forum before we get
IBM involved since we are still not sure if the problem is with TSM or with AIX (or even possibly storage-
related).
Hardware and software environment:
AIX 7.1 TL1 SP5
P740 32GB of memory 4 procs
single LPAR for TSM only
TS3310 Tape library with 4 LTO SAN-attached tape drives
TSM Server V6.3.1.0
DB and logs on storage presented by SVC
30TB of primary storage (deduped) on a non-shared SAN-attached Hitachi AMS2100 disk array
SAN attachment uses separate HBA ports for tape, SVC, and Hitachi connections
Best Regards,
Bob...