Big Data & NoSQL Support Blog

This is the team blog for the Big Data and NoSQL Support team at Microsoft. We support Azure Data Factory, Azure DocumentDB, Azure HDInsight (Hadoop running in the cloud), Azure Stream Analytics, and more!

Tagged Content List
  • Blog Post: Understanding HDInsight Custom Node VM Sizes

    With the 02/18/2015 update to HDInsight and Azure PowerShell 0.8.14, we introduced many more options for configuring custom Head Node VM size, as well as Data Node VM size and ZooKeeper VM size. Some workloads can benefit from increased CPU performance, increased local storage throughput, or larger memory...
  • Blog Post: Azure PowerShell 0.8.14 Released, fixes problems with pipelining HDInsight configuration cmdlets

    We recently pushed out the 0.8.14 release of Azure PowerShell. This release includes updates to the following cmdlets to ensure that values passed in via the PowerShell pipeline, or via the -Config parameter, are maintained: Set-AzureHDInsightDefaultStorage, Add-AzureHDInsightStorage ...
  • Blog Post: How to use parameter substitution with Pig Latin and PowerShell

    When running Pig in a production environment, you'll likely have one or more Pig Latin scripts that run on a recurring basis (daily, weekly, monthly, etc.) that need to locate their input data based on when or where they are run. For example, you may have a Pig job that performs daily log ingestion by...
  • Blog Post: Querying HDInsight Job Status with WebHCat via Native PowerShell or Node.js

    One of the great things about HDInsight is that under the covers, it has the same capabilities as other Hadoop installations. This means that you can use regular Hadoop endpoints like Ambari and WebHCat (formerly known as Templeton) to interact with an HDInsight Cluster. In this blog post, I’ll...
  • Blog Post: Customizing HDInsight Cluster provisioning

    In my last blog, I discussed how we can specify Hadoop configurations for a job on an HDInsight cluster. At the end of that blog, I also discussed the alternative approach, where you may want to change certain Hadoop configurations from their default values and would like to preserve the changes throughout...
  • Blog Post: How to pass Hadoop configuration values for a job on HDInsight

    I came across this question a few times recently from several customers: "How do we pass Hadoop configurations at runtime for a MapReduce job or Hive query via HDInsight PowerShell or the .NET SDK?" I thought I would share the answer here with others who may run into the same question. It is pretty common...
  • Blog Post: Getting started with Sqoop in HDInsight

    My name is Farooq and I am with the HDInsight support team here at Microsoft. In this blog I will give a brief overview of Sqoop in HDInsight and then use an example of importing data from a Windows Azure SQL Database table to an HDInsight cluster to demonstrate how you can get started with Sqoop in...
  • Blog Post: Getting started with the HDInsight PowerShell tools and SDK

    Hi, my name is Azim and I work on the Big Data Support Team at Microsoft. If you have had a chance to read an earlier post by Dharshana, you may have seen how we can submit a Hive query using the HDInsight PowerShell tools. In this blog, we will cover some basics of the HDInsight PowerShell tools and SDK...
  • Blog Post: Get Started with Hive on HDInsight

    Hi, my name is Dharshana and I work on the Big Data Support Team at Microsoft. As covered in the earlier post by Dan from our team, HDInsight provides a very easy-to-use interface to provision a Hadoop cluster with a few clicks and interact with the cluster programmatically. In this blog post, we will...