Cloud computing is getting popular because it’s mysterious, complex and only just a few people have a clue about it. Looking deep into this world we will face many challenges and solutions. It is like an adventure park for an engineer.
On this playground the job is to build up reliable and highly available systems. Handle incredible amount of users with dramatically increasing amount of data. Services like Twitter or Facebook are part of our life and it is easy to forget the work behind it.
The thesis will give the reader an overview about cloud computing principles showing the common tips and tricks for general problems and giving real life examples. As part of the cloud computing, there will be a deeper introduction to Big-Data processing. How to handle and analyze terabytes of data in a fast and distributed way with the help of MapReduce technologies.
It wasn’t more than 10 years ago when the engineers from Google made some early publication about the map-reduce technologies. Since then many things have happened. Under the Apache incubator the open source Apache Hadoop grew up and is now widely used by the industry. This is the fundament of big data processing. Based on this fundament, there are tons of frameworks for different scenarios. With the help of these, the thesis will demonstrate how to analyze big amount of data without studying research papers about what is under the hood.