The nature of my job forces me to build quick simple sample applications that are easy to understand and demonstrate a single concept. After all I am trying to enable developers, and making something overly complex generally turns developers off when they are trying to learn something new. However I often hear questions about how to build real world production grade applications that run in the cloud. This is something that is very hard to demonstrate simply because, lets face it, no production quality application has ever been considered simple to understand. However, I do think it is an important topic to address because if you are serious about building an application in the cloud you should understand how cloud applications are different from your everyday app that everyone is familiar with building.
In my opinion, the cloud was originally marketed to developers as “the same thing you have today but a vendor is taking care of the infrastructure”. In some ways this is true, take any cloud vendor that allows you to request a VM from them. It feels just like the VM you had running in your own data center for the most part. The only difference is that its NOT IN YOUR DATA CENTER. The fact that it is not running in your datacenter is exactly the reason why cloud applications need to be architected differently. The lack of absolute control and predicability of the cloud is what makes it necessary to think differently when running applications in the cloud. Everything from the hardware to the networking is all someone elses responsibility, and you can’t even get that person on the phone if you think something is wrong. On top of that, if you are serving millions upon millions of people every day (i.e. Netflix, Amazon, Google) by the time your realize something is wrong it is too late, you have already lost a huge chunk of money.
So at the end of the day if you are serious about building and running an application in the cloud you need to first understand that you should not be building a cloud application like it is running in your own data center. You need to build it with every intention of it running in a very volatile place where much of what lies underneath your application is completely our of your control. A term has recently emerged for applications that have built to be run in the cloud, we refer to them as “cloud native applications”. So what makes an application a cloud native application? In Matt Stine’s book “Migrating To Cloud-Native Application Architectures” he describes cloud native applications as having the following characteristics
- Is a 12-factor application
- Follows a microservices architecture
- Uses self-service agile infrastructure – AKA a Platform-as-a-service
- Uses API-based collaboration
- Is antifragile
Lets look at each of these in more detail.
12-Factor Applications
When Heroku was first introduced it was such a new concept that the engineers there came up with a set of guidelines for developers trying to use the platform. There ended up being 12 common practices that all successful applications deployed to Heroku had in common, they called these applications 12-Factor Applications. All of the applications had the following in common:
- Once codebase tracked in revision control, multiple deploys
- Explicitly declare and isolate dependencies
- Store config in the environment
- Treat backing resources as attaches resources
- Sticky separate build and run stages
- Execute the app as one or more stateless resources
- Export services via port binding
- Scale out via the process model
- Maximize robustness with fast startup and graceful shutdown
- Keep development, staging, and production as similar as possible
- Treat logs as events streams
- Run admin/management tasks as one-off processes
I won’t describe the details behind each one of these guidelines, instead I will let you read more about them on 12factor.net.
Microservices Architecture
If you have not heard of the term microservices yet than you have not been looking into building cloud applications for too long You cannot read a blog post or attend a conference on cloud today without hearing something about it. What is it? I think Martin Fowler and James Lewis describe it best:
In short, the microservice architectural style is an approach to developing a single application as a suite of small services, each running in its own process and communicating with lightweight mechanisms, often an HTTP resource API.
In essence we are all used to building large monolithic applications. These applications typically package everything into a single artifact that needs to be built, tested, and deployed to an application server and typically has a 3 tier architecture. So why do monolithic apps not work in the cloud? One reason is they are slow. Building them is slow, testing them is slow, deploying them is slow. With cloud native applications we want as much speed as possible. They are also very fragile, if one component in the monolith crashes everything goes down with it. This is not acceptable in the cloud. They are also not easy to scale. If I need to scale a single component of my application I end up scaling the entire monolith. This requires me to consume more infrastructure and therefore pay more money to my cloud provider.
When you build an application using microservices all of these issues go away and your application is much more suited for the cloud. Much of what it means to build a cloud native application builds off of using microservices including the rest of the characteristics mentioned in this blog post. I will look at microservices in detail in future blog posts (Part 1).
Self-Service Agile Infrastructure
Another term for Self-Service Agile Infrastructure is platform-as-a-service. In essence when deploying cloud native applications you do not want to go through the process of requesting a VM, configuring that VM, securing that VM, installing dependencies, deploying your application, configuring your application, and setting up tools to monitor your application, you want my platform to do that for you. Remember, if our cloud native application is using microservices it can potentially have 100s of services that need to be deployed in order for the application to work. Going through the above process for each microservice is not going to scale, no matter how many people work on the team.
Contrast the above process to one where you use a platform-as-a-service. All you need to do is hand the platform your application code and it takes care of the rest. Of course this can also be automated via tools like Jenkins and other build and deploy tools (perhaps your platform has these tools built in, i.e. IBM Bluemix and DevOps Services).
The self-service agile infrastructure should also be responsible for providing your application with the backing services is needs to run. Things like databases, message queues, and caching services should be able to requested on demand and provisioned rapidly when an application is deployed.
API Based Collaboration
When you are following the microservice design pattern you are going to need a new way for the components in your application to communicate. You cannot rely on just making method calls between the components of your application anymore. Having versioned REST APIs is the most common way for your services to communicate in cloud native applications. These may not be public APIs you let external consumers use, but you should consider them like they are. Other microservices will rely on these APIs and changes to them can drastically effect how the other microservices work so thinking carefully about your APIs is very important in a cloud native application.
Using REST APIs will also be helpful in supporting various types of clients. The most robust cloud native applications today support desktops, browsers, phones, tablets, watches, TVs, cars, entertainment systems, game consoles, and may other types of clients. The protocol that works best for all these types of devices is REST.
With that said, in certain types of situations other protocols may be preferred. This is fine, if it makes sense for your use case, but it typically is for internal communication between services and not external communication with clients.
Antifragile
Todays cloud native applications are expected to have 0 down time. If Facebook goes down it is on the national news. Will your application be subject to the same level of scrutiny or publicity? Probably not, but I guarantee your application is just as important to your customer as Facebook is to the entire world so you can be sure you will hear for them if your application is not available. The good news is that by following a microservices architecture you are already making sure your application is more robust. A crash in one component should still leave the majority of your application available to your users. If you take advantage of different data centers and regions from your cloud provider you have enough redundancy to withstand an entire outage of your application in one region without anyone even noticing! There are other tools and patterns you can implement in your application that helps your application be even more robust and I will talk about some of these in future posts.