Cloud computing - Benefits

Fault Tolerance

Availability: the ability of a system to be accessible and usable by users when they need it.

A conscious effort to avoid the obvious sources of downtime

Also called “uptime”.
The ability of a system to remain operational for users during planned or unplanned outages.
No major service provider offers 100% availability.
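
Availability targets are usually quoted as percentages, so it can help to see how much downtime a given percentage actually permits. The following is a minimal sketch of that arithmetic; the function name and the 365-day default period are assumptions made for illustration and are not taken from any provider's SLA.

```python
def allowed_downtime_minutes(availability_percent: float, period_days: float = 365.0) -> float:
    """Return the downtime budget, in minutes, implied by an availability target.

    availability_percent: target such as 99.9 (hypothetical input, not an SLA quote)
    period_days: length of the measurement period; 365 days is an assumption
    """
    if not 0.0 <= availability_percent <= 100.0:
        raise ValueError("availability_percent must be between 0 and 100")
    total_minutes = period_days * 24 * 60
    # The downtime budget is the fraction of the period NOT covered by the target.
    return total_minutes * (1.0 - availability_percent / 100.0)


if __name__ == "__main__":
    for target in (99.0, 99.9, 99.99):
        minutes = allowed_downtime_minutes(target)
        print(f"{target}% availability allows about {minutes:.1f} minutes of downtime per year")
```

Under these assumptions, a 99.9% target leaves roughly 525.6 minutes (about 8.8 hours) of downtime per year, which shows why each extra "nine" is hard to earn.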

Mitigating sources of downtime:

Every single core component has redundancy
Availability
1. Availability sets
2. Availability zones
3. Cross region load balancing
Constant health monitoring
Automation
Strong security practices
Be geographically distributed
Have a disaster recovery plan
Test that disaster recovery plan / fire drills
Load testing
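
To illustrate why redundancy of core components appears on the list above, here is a rough sketch of how redundant copies raise overall availability. It assumes that copies fail independently of one another, which is a simplification (a shared power feed or a bad deployment can take out several copies at once); the function name is hypothetical.

```python
def combined_availability(single: float, replicas: int) -> float:
    """Availability of a group of redundant copies of one component.

    single: availability of one copy as a fraction, e.g. 0.99
    replicas: number of redundant copies
    Assumption: failures are independent, so the group is down only
    when every copy happens to be down at the same time.
    """
    if not 0.0 <= single <= 1.0:
        raise ValueError("single must be a fraction between 0 and 1")
    if replicas < 1:
        raise ValueError("replicas must be at least 1")
    probability_all_down = (1.0 - single) ** replicas
    return 1.0 - probability_all_down


if __name__ == "__main__":
    for copies in (1, 2, 3):
        print(copies, "copies ->", f"{combined_availability(0.99, copies):.6f}")
```

With these assumptions, two copies that are each 99% available give roughly 99.99% for the pair. Real figures will be lower whenever failures are correlated, which is one reason to spread copies across zones and regions.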

Scalability: the ability of a system to accommodate increasing or varying demand by adding or removing resources as needed.

Allows a system to adapt to changing usage patterns and handle increased traffic without requiring changes to the application code or system design.

A scalable system can be sized to match its actual load. This reduces cost by cutting wasted compute resources.

The ability of a system to quickly and easily scale the resources it uses up or down in response to changing demand.

Has to involve some sort of automation
Often called “autoscaling” in cloud computing
The system monitors some metric (e.g. CPU utilization) to determine how busy a system is
Adds resources when the metric exceeds an upper threshold
Removes resources when the metric falls below a lower threshold
More efficient and cost-effective use of resources
Minimizing computing “waste”: resources paid for and not used
Self-hosted systems tend to have a large percentage of “over-provisioned” resources for anticipated future growth
Can reach a peak capacity higher than you could afford to provision statically
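
The metric-driven behaviour described above can be sketched as a single decision function. This is a simplified illustration, not how any particular cloud provider implements autoscaling: the thresholds, the one-instance step size, and the names `ScalingRule` and `desired_instances` are all assumptions.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ScalingRule:
    """Hypothetical autoscaling rule; all values are illustrative defaults."""
    scale_out_above: float = 75.0  # add capacity when CPU % exceeds this
    scale_in_below: float = 25.0   # remove capacity when CPU % falls below this
    min_instances: int = 1         # never scale in below this floor
    max_instances: int = 10        # never scale out above this ceiling


def desired_instances(current: int, cpu_percent: float, rule: ScalingRule = ScalingRule()) -> int:
    """Return the instance count the system should move to, one step at a time."""
    if cpu_percent > rule.scale_out_above:
        target = current + 1   # busy: add one instance
    elif cpu_percent < rule.scale_in_below:
        target = current - 1   # idle: remove one instance
    else:
        target = current       # within the comfortable band: do nothing
    # Clamp to the configured floor and ceiling.
    return max(rule.min_instances, min(rule.max_instances, target))


if __name__ == "__main__":
    print(desired_instances(current=2, cpu_percent=90.0))  # busy system
    print(desired_instances(current=2, cpu_percent=10.0))  # idle system
```

The gap between the two thresholds is deliberate in this sketch: without it, a system hovering near a single limit would add and remove instances on every check.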

Reliability: how dependable a system is
The ability of a system to perform its intended function without interruption and with a high degree of accuracy
You have to trust that your cloud provider is doing everything it can to make its platform reliable
This includes transparency during service issues
How is it implemented?
1. Autoscaling
2. Multiple regions
3. Data backup and replication
4. Health checks and self healing
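
As an illustration of the health-check and self-healing item, the sketch below flags instances for replacement after a run of failed checks. The data shape (a list of booleans per instance), the three-failure threshold, and the function name are assumptions; a real platform would also perform the replacement itself.

```python
from typing import Dict, List


def instances_to_replace(check_history: Dict[str, List[bool]], failure_threshold: int = 3) -> List[str]:
    """Return the names of instances whose most recent checks all failed.

    check_history maps an instance name to its health-check results,
    oldest first, where True means the check passed (hypothetical format).
    """
    unhealthy = []
    for name, results in check_history.items():
        recent = results[-failure_threshold:]
        # Require a full window of failures so one blip does not trigger a replacement.
        if len(recent) == failure_threshold and not any(recent):
            unhealthy.append(name)
    return sorted(unhealthy)


if __name__ == "__main__":
    history = {
        "vm-a": [True, False, False, False],  # three failures in a row
        "vm-b": [False, True, False],         # recovered in between
    }
    print(instances_to_replace(history))
```

Waiting for several consecutive failures trades a slightly slower reaction for fewer unnecessary replacements.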

A system can be highly available to users, in that it responds instantly to every request, and still be highly unreliable behind the curtain. Examples: a calculator that always responds but gives wrong answers, or an app that occasionally loses your data at random.
Availability is an appearance to the end users
Reliability is the underlying truth

The ability to forecast and control the performance and behavior of a system
Includes the ability to predict future costs
Why?
1. Gives us the confidence that the system will continue to perform at the expected level in the future
2. We will not be surprised by an unexpectedly large bill
How?
1. Autoscaling
2. Load balancing
3. Different instance types, sizes, pricing tiers
4. Cost management tools
5. APIs for billing
6. Pricing calculators
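
One way to see how autoscaling limits support cost predictability is to compute the range a monthly bill can fall in. The sketch below is an illustration only: the hourly rate is a made-up input, 730 hours per month is an averaging assumption, and the function name is hypothetical. Real bills also include storage, network, and other charges that are ignored here.

```python
from typing import Tuple

HOURS_PER_MONTH = 730.0  # assumption: 8760 hours per year divided by 12


def monthly_cost_bounds(hourly_rate: float, min_instances: int, max_instances: int,
                        hours: float = HOURS_PER_MONTH) -> Tuple[float, float]:
    """Return (lowest, highest) compute cost for a month of autoscaled instances.

    The lowest figure assumes the system sits at min_instances all month;
    the highest assumes it sits at max_instances all month.
    """
    if hourly_rate < 0:
        raise ValueError("hourly_rate must not be negative")
    if not 0 <= min_instances <= max_instances:
        raise ValueError("need 0 <= min_instances <= max_instances")
    lowest = hourly_rate * min_instances * hours
    highest = hourly_rate * max_instances * hours
    return lowest, highest


if __name__ == "__main__":
    low, high = monthly_cost_bounds(hourly_rate=0.10, min_instances=2, max_instances=5)
    print(f"Expected monthly compute cost between {low:.2f} and {high:.2f}")
```

Setting a maximum instance count therefore caps the compute portion of the bill, at the price of capping capacity as well.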

Security is a full-time job

Cloud providers are obviously massive targets for hackers, and so they rightly spend a lot of time, money and effort on platform security
Cloud providers go through security audits and compliance certifications
They provide customers with the tools they need to enable and monitor security for their own applications and data
Why?
1. Fundamental challenge in IT
2. We want confidence that our cloud provider cannot easily be defeated by hackers and those with malicious intent
How?
1. Industry standard compliance certifications
2. Always-on DDoS protection
3. Microsoft Security Response Center (MSRC)
4. Azure Policy and Azure Blueprints
5. Role-based access control (RBAC)
6. Azure Active Directory
7. Always up-to-date platform services
8. Update management
9. Encryption by default
10. Dozens of security services like firewall
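
To make the role-based access control item concrete, here is a minimal sketch of the idea: permissions attach to roles, and users receive roles rather than individual permissions. The role names echo common built-in roles, but the permission strings, the data layout, and the function name are assumptions for illustration and do not reflect the actual Azure RBAC API.

```python
from typing import Dict, Set

# Hypothetical role definitions: each role maps to the actions it permits.
ROLE_PERMISSIONS: Dict[str, Set[str]] = {
    "reader": {"read"},
    "contributor": {"read", "write"},
    "owner": {"read", "write", "assign_roles"},
}


def is_allowed(assignments: Dict[str, Set[str]], user: str, action: str) -> bool:
    """Return True if any role assigned to the user permits the action.

    assignments maps a user name to the set of role names granted to them.
    Unknown users and unknown roles grant nothing (deny by default).
    """
    roles = assignments.get(user, set())
    return any(action in ROLE_PERMISSIONS.get(role, set()) for role in roles)


if __name__ == "__main__":
    assignments = {"alice": {"reader"}, "bob": {"owner"}}
    print(is_allowed(assignments, "alice", "write"))
    print(is_allowed(assignments, "bob", "assign_roles"))
```

Denying by default means a missing or misspelled assignment results in no access rather than accidental access.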

How your organization chooses to do business
Could be executive governance, IT governance, business governance
The process of defining, implementing, and monitoring a framework of policies that guides an organization’s cloud operations
Why?
1. The company wants to ensure its policies are followed in the cloud
2. Includes basic auditing and reporting, as well as enforcement
3. The company wants to be compliant with industry standards and regulations such as HIPAA, PCI DSS, or GDPR
How?
1. Azure Policy and Azure Blueprints
2. Management groups
3. Custom roles
4. Soft delete
5. Guides and best practices such as Cloud Adoption Framework
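
The auditing side of governance can be sketched as a check of resources against a set of rules. The example below is a simplified illustration and not the Azure Policy engine: the two rules (allowed locations and required tags), the dictionary layout of a resource, and the function name are all assumptions.

```python
from typing import Dict, List, Set, Tuple


def find_violations(resources: List[Dict], allowed_locations: Set[str],
                    required_tags: Set[str]) -> List[Tuple[str, str]]:
    """Return (resource name, reason) pairs for every rule a resource breaks.

    Each resource is a dict with "name", "location", and an optional
    "tags" dict (hypothetical format chosen for this sketch).
    """
    violations = []
    for resource in resources:
        name = resource["name"]
        location = resource["location"]
        if location not in allowed_locations:
            violations.append((name, f"location {location!r} is not allowed"))
        # Compare the required tag names with the tag names actually present.
        missing = required_tags - set(resource.get("tags", {}))
        for tag in sorted(missing):
            violations.append((name, f"missing required tag {tag!r}"))
    return violations


if __name__ == "__main__":
    sample = [
        {"name": "vm1", "location": "westeurope", "tags": {"owner": "ops"}},
        {"name": "vm2", "location": "eastasia", "tags": {}},
    ]
    for name, reason in find_violations(sample, {"westeurope"}, {"owner"}):
        print(name, "->", reason)
```

This sketch only reports violations; enforcement would additionally block or correct a non-compliant resource at creation time.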

Management of the cloud
1. Templates
2. Automation
3. Scaling
4. Monitoring and alerts
5. Self-healing
Management in the cloud
1. Web portal
2. Command line interface and scripts
3. APIs
4. PowerShell
Why?
1. How easy it is to work with your applications in the cloud impacts cost, performance, security and other priorities
2. Some cloud vendors are easier to work with than others
How?
1. Azure Portal, CLI, PowerShell, Cloud Shell, REST APIs, and other programmatic methods
2. Consolidated monitoring and alerting system
3. Ability to use ARM templates, Bicep, Terraform, etc.
4. Autoscaling of most types of compute resources