4.3.9 Elasticity


Cloud elasticity, also known simply as elasticity, “is the degree to which a system is able to adapt to workload changes by provisioning and de-provisioning resources in an autonomic manner, such that at each point in time the available resources match the current demand as closely as possible.”1) A primary motivation behind elasticity is to save money by not paying for Infrastructure-as-a-Service (IaaS) resources that are unused or underused. It also conserves natural resources, since heating and air conditioning are not expended on resources that sit on standby.2)
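
As a rough illustration of the definition above, the following sketch computes how many instances to run so that provisioned capacity tracks demand at each point in time. The function name `desired_instances`, the per-instance capacity, the headroom factor, and the workload trace are all hypothetical; they are assumptions made for this example and are not part of the DIDO RA or of any provider's API.

```python
import math


def desired_instances(demand_rps: float,
                      capacity_per_instance_rps: float,
                      headroom: float = 0.25,
                      min_instances: int = 1) -> int:
    """Return the instance count whose capacity covers demand plus headroom.

    All parameters are illustrative assumptions:
      demand_rps                 current workload in requests per second
      capacity_per_instance_rps  requests per second one instance can serve
      headroom                   spare fraction kept above current demand
      min_instances              floor so the service never scales to zero
    """
    if capacity_per_instance_rps <= 0:
        raise ValueError("capacity_per_instance_rps must be positive")
    needed = demand_rps * (1.0 + headroom) / capacity_per_instance_rps
    return max(min_instances, math.ceil(needed))


# Replay a hypothetical workload trace: instances are provisioned as demand
# rises and de-provisioned as it falls.
trace = [50, 400, 1200, 300, 0]
plan = [desired_instances(d, capacity_per_instance_rps=100) for d in trace]
print(plan)  # [1, 5, 15, 4, 1]
```

A real autoscaler would typically also smooth the demand signal and apply cool-down periods to avoid oscillation; those concerns are left out here to keep the sketch short.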

The following strategies are used to achieve elasticity:

  • Cost-aware criteria: By default, IaaS providers are assumed to charge a firm fixed price. However, some providers (e.g., Amazon) offer spot pricing schemes that let users tap into excess IaaS capacity. This excess capacity exists so that the IaaS provider can meet the Service Level Agreements (SLAs) guaranteed to all customers.
  • Power-aware cost function: Use only the power required to meet the application's needs and little more, e.g., by relying on off-peak power consumption only.
  • Multiple classes of requests: Allow applications to be segmented into categories based on the need for service. For example, customers' requests for service from the application can be divided into three categories: High Priority for performing financial transactions; Medium Priority for making product inquiries; Low Priority for simple browsing.
  • Scaling multiple applications: Allow an application to be broken up into smaller applications whose functionality and services are orchestrated.
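
The "multiple classes of requests" strategy can be sketched as a priority queue that serves the most important class first when capacity is limited. This is a minimal sketch under stated assumptions: the `Priority` enum, the `RequestQueue` class, and the sample requests are hypothetical names introduced for illustration and do not come from the DIDO RA.

```python
import heapq
import itertools
from enum import IntEnum


class Priority(IntEnum):
    """Hypothetical request classes; a lower value is served first."""
    HIGH = 0    # financial transactions
    MEDIUM = 1  # product inquiries
    LOW = 2     # simple browsing


class RequestQueue:
    """Illustrative queue that serves requests by class, FIFO within a class."""

    def __init__(self):
        self._heap = []
        # Monotonic counter used as a tie-breaker so that requests of the
        # same class are served in arrival order.
        self._order = itertools.count()

    def submit(self, priority, request):
        heapq.heappush(self._heap, (priority, next(self._order), request))

    def serve(self, capacity):
        """Pop up to `capacity` requests, highest priority first."""
        served = []
        while self._heap and len(served) < capacity:
            _, _, request = heapq.heappop(self._heap)
            served.append(request)
        return served

    def __len__(self):
        return len(self._heap)


queue = RequestQueue()
queue.submit(Priority.LOW, "browse catalog")
queue.submit(Priority.HIGH, "transfer funds")
queue.submit(Priority.MEDIUM, "ask about product")
queue.submit(Priority.HIGH, "pay invoice")

# With capacity for only three requests, the low-priority request waits.
served = queue.serve(capacity=3)
print(served)  # ['transfer funds', 'pay invoice', 'ask about product']
```

The number of requests left waiting in each class could then feed a scaling decision, for example adding capacity only when high-priority requests start to queue.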

DIDO Specifics

To be added/expanded in future revisions of the DIDO RA.

1) Nikolas Roman Herbst, Samuel Kounev and Ralf Reussner, Elasticity in Cloud Computing: What It Is, and What It Is Not, Accessed 11 August 2020.
2) Rui Han, Investigations into Elasticity in Cloud Computing, November 2013, Accessed 12 August 2020.
