Running Hadoop in the Cloud
With the growing popularity of cloud computing, enterprises are seriously looking at moving workloads to the cloud. Issues around multi-tenancy, data security, software licensing, data integration, etc. have to be considered before enterprises can make this shift. Even then, not all workloads can be easily moved to the cloud. In recent years, Hadoop has gained a lot of interest as a big data technology that can help enterprises cost-effectively store and analyze massive amounts of data. As enterprises start evaluating Hadoop, one of the questions frequently asked is “Can we run Hadoop in the cloud?”
To answer this, it is important to understand the following key aspects of the Hadoop infrastructure:
1. Hadoop runs best on physical servers. A Hadoop cluster comprises a master node called the NameNode and multiple child nodes called DataNodes. These data nodes…