Auto Scaling allows your application to always have the compute capacity needed and reduces the need to manually monitor server capacity. You can autoscale based on incoming requests (front-end) or number of jobs in the queue and how long jobs have been in the queue (back-end).


The benefits of Auto Scaling:

    • Responsive server usage based on traffic load
    • Handle unexpected traffic spikes and avoid downtime
    • Plan scaling activities if you have predictable load changes
    • Uncover and replace faulty instances and unhealthy applications automatically