Microsoft has released a Linux version of its Hadoop distribution, HDInsight, used to process streams of unstructured data through its cloud computing service, Azure. Here are the details.
The Azure HDInsight service lets you easily run managed Hadoop clusters in the cloud to analyse and report on large data sets. It is a Hadoop distribution that can handle any amount of data. You used to only be able to configure those clusters using Windows Server OS but Microsoft has now added support for an Ubuntu based Linux OS.
HDInsight on Linux allows for broader support for Hadoop ecosystem partners which means more tools and applications for running Hadoop workloads on. There is now also additional capabilities such as cluster scaling, virtual network integration, and script action support.
As well as Hadoop clusters, you can create HBase and Storm clusters on Linux for NoSQL and real-time processing requirements that is useful for building an internet of things (IoT) application.
All HDInsight Linux clusters can be managed by Apache Ambari, which allows for customised configuration settings of Hadoop clusters and actively monitors them. You can find out more about Azure’s HDInsight service here.
[Via ScottGu’s Blog]