Scaling App Engine Applications
Instances are created to execute an application on an App Engine managed server. App Engine can automatically add or remove instances as needed based on load. When instances are scaled based on load, they are called dynamic instances. These dynamic instances help optimize your costs by shutting down when demand is low.
Alternatively, you can configure your instances to be resident, that is, running all the time. Resident instances are optimized for performance: because they are already running, users do not have to wait for an instance to start.
Your configuration determines whether an instance is resident or dynamic. If you configure automatic scaling or basic scaling, your instances will be dynamic. If you configure manual scaling, your instances will be resident.
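As a minimal sketch of how the scaling type is chosen, the app.yaml fragment below configures manual scaling, which yields resident instances. The runtime and the instance count are illustrative assumptions, not recommendations. The commented-out basic_scaling section shows the dynamic alternative; an app.yaml file should contain only one scaling section.

```yaml
# app.yaml (sketch): manual scaling keeps a fixed number of resident instances.
runtime: python312     # assumed runtime, substitute your own
manual_scaling:
  instances: 2         # illustrative value

# Dynamic alternative (use instead of manual_scaling, not alongside it):
# basic_scaling:
#   max_instances: 5   # illustrative value
#   idle_timeout: 10m  # illustrative value; idle instances shut down after this
```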
To specify automatic scaling, add an automatic_scaling section to app.yaml, followed by key-value pairs of configuration options. These include the following:
- target_cpu_utilization
- target_throughput_utilization
- max_concurrent_requests...
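The options listed above might appear in app.yaml as in the following sketch. The runtime and all numeric values are assumptions chosen for illustration; check the allowed ranges and defaults for your environment before using them.

```yaml
# app.yaml (sketch): automatic scaling with the options named above.
runtime: python312                      # assumed runtime, substitute your own
automatic_scaling:
  target_cpu_utilization: 0.65          # illustrative: add instances when average CPU use exceeds this
  target_throughput_utilization: 0.6    # illustrative: add instances at this fraction of max_concurrent_requests
  max_concurrent_requests: 10           # illustrative: requests one instance handles at once
```

The two utilization targets act as thresholds. Lower values tend to start new instances sooner, which favors performance over cost, while higher values do the opposite.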