Netbackup Restore priority problem over duplications
We are using SYMC Netbackup 7.5.0.4 and EMC Data Domain. Data from the EMC Data Domain gets duplicated to the Tape library for offsite purposes.
With just 10 drives, only 10 duplications jobs runs parallel and others remain in queue. During this time when we get a restore request, it just remains in teh queue and never gets the require priority. Even if I cancel one of the duplication job to release the drive, it's the another queued job goes active but restore remains in queue again. For some reason Restore is always given less priority than the queued duplicaiton jobs which shouldn't be the case.
Any ideas? I tried suspending the SLP (nbstlutil -lifecycle inactive) and but don't won't to kill/cancel all the duplications as that's not a practical solution.
Regards
Nithin
before canceling the duplication job, reduce the number of concurent Drives in the tape storage unit to 9 or less, so that once you cancle, they will not get assigned to another SLP job, and avaliable for restore.
see the netbackup admin guide 2 to understand how EMM allocates the Resources to the jobs.
There is always a little confusion when high priority jobs queue when lower priority jobs that are queued appear to get precedence over resource allocation - an issue we've covered a few times on these pages.
If resources are already in use (e.g. specific media & therefore tape drives) and there are jobs queued that can utilise those resources, then they will take precedence irrespective of the priority of any other queued job that requires some of those same resources (e.g. tape drive).
NetBackup works this way so that it does not unnecessarily keep loading & unloading media - if queued jobs of a lower priority can utilise the loaded media then they will take precedence over a higher priority queued job that would require different media.
Understanding the Job Priority setting on Windows
http://www.symantec.com/business/support/index?page=content&id=HOWTO34237The only time I've personally seen a higher priority (restore) job 'jump in', as it were, is when the lower priority jobs required a media change at which point the higher priority job took control of the drive & the lower priority jobs waited until the drive (or another) became free.
Do you try & mitigate this by wastefully always having one drive free just in case you get a restore request when all drives are being utilised, or do you deal with it as and when it occurs?