Big Data & NoSQL Support Blog
This is the team blog for the Big Data and NoSQL Support team at Microsoft. We support Azure Data Factory, Azure DocumentDB, Azure HDInsight (Hadoop running in the cloud), Azure Stream Analytics, and more!
Tagged Content List
Loading data in HBase Tables on HDInsight using built-in ImportTsv utility
Apache HBase can give random access to very large tables: billions of rows by millions of columns. But the question is, how do you upload that kind of data into the HBase tables in the first place? HBase includes several methods of loading data into tables. The most straightforward method is to either use...
12 Dec 2014
How to use parameter substitution with Pig Latin and PowerShell
When running Pig in a production environment, you'll likely have one or more Pig Latin scripts that run on a recurring basis (daily, weekly, monthly, etc.) that need to locate their input data based on when or where they are run. For example, you may have a Pig job that performs daily log ingestion by...
12 Aug 2014
Customizing HDInsight Cluster provisioning
In my last blog, I discussed how we can specify Hadoop configurations for a job on an HDInsight cluster. At the end of that blog, I also discussed the alternative approach where you may want to change certain Hadoop configurations from default values and would like to preserve the changes throughout...
15 Apr 2014
How to pass Hadoop configuration values for a job on HDInsight
I came across this question a few times recently from several customers: "How do we pass Hadoop configurations at runtime for a MapReduce job or Hive query via HDInsight PowerShell or .NET SDK?" I thought of sharing the answer here with others who may run into the same question. It is pretty common...
13 Feb 2014
How to add custom Hive UDFs to HDInsight
I recently had a need to add a UDF to Hive on HDInsight. I thought that it would be good to share that experience on a blog post. Hive provides a library of built-in functions to achieve the most common needs. The cool thing is that it also provides the framework to create your own UDF. I had a recent...
14 Jan 2014
Mount Azure Blob Storage as Local Drive
Gregory Suarez - MSFT
Gregory Suarez, 01/09/2014. I was recently working with a colleague of mine who submitted a MapReduce job via an HDInsight PowerShell script and he needed a quick way to visually inspect the last several lines of the output after it had completed. He was looking for an easy and flexible way to...
9 Jan 2014
Getting started with Sqoop in HDInsight
My name is Farooq and I am with the HDInsight support team here at Microsoft. In this blog I will try to give a brief overview of Sqoop in HDInsight and then use an example of importing data from a Windows Azure SQL Database table to an HDInsight cluster to demonstrate how you can get started with Sqoop in...
7 Jan 2014
Getting started with the HDInsight PowerShell tools and SDK
Hi, my name is Azim and I work on the Big Data Support Team at Microsoft. If you have had a chance to read an earlier post by Dharshana, you may have seen how we can submit a Hive query using the HDInsight PowerShell tools. In this blog, we will cover some basics of the HDInsight PowerShell tools and SDK...
21 Nov 2013
Get Started with Hive on HDInsight
Hi, my name is Dharshana and I work on the Big Data Support Team at Microsoft. As covered in the earlier post by Dan from our team, HDInsight provides an easy-to-use interface to provision a Hadoop cluster with a few clicks and interact with the cluster programmatically. In this blog post, we will...
11 Nov 2013
© 2015 Microsoft Corporation.