Introduction to PBS
What is the PBS?
Portable Batch System (PBS) is the name of computer software that performs job scheduling. Its primary task is to allocate computational tasks, i.e., batch jobs, among the available computing resources. The following versions of PBS are currently available:
- PBS Professional (PBS Pro)
A brief history of the PBS
Before the emergence of clusters, the Unix-based Network Queuing System (
NQS) from NASA Ames Research Center was a commonly used batch-queuing system. Then with the emergence of
parallel distributed system,
NQS began to show its limitations. Consequently, Ames then led an
effort to develop requirements and specifications for a newer, cluster-compatible system. With NASA’s funding this efforts resulted into PBS in the early 1990s. In 2003, PBS was acquired by Altair Engineering and is now marketed as PBS Pro by Altair Grid Technologies, a subsidiary of Altair Engineering.
PBS Pro’s key features
Scalability: supports millions of cores with fast job dispatch and minimal latency; tested beyond 50,000 nodes
Policy-Driven Scheduling: meets unique site goals and SLAs by balancing job turnaround time and utilization with optimal job placement
Resiliency: includes automatic fail-over architecture with no single point of failure – jobs are never lost, and jobs continue to run despite failures
Flexible Plugin Framework: simplifies administration with enhanced visibility and extensibility; customize implementations to meet complex requirements
Health Checks: monitors and automatically mitigates faults with a comprehensive health check framework