Big Data & NoSQL Support Blog

This is the team blog for the Big Data and NoSQL Support team at Microsoft. We support Azure Data Factory, Azure DocumentDB, Azure HDInsight (Hadoop running in the cloud), Azure Stream Analytics, and more!

Browse by Tags

Tagged Content List
  • Blog Post: Encoding 101 - Exporting from SQL Server into flat files, to create a Hive external table

    Today in Microsoft Big Data Support we faced the issue of how to correctly move Unicode data from SQL Server into Hive via flat text files. The main issue faced was encoding special Unicode characters from the source database, such as the degree sign (Unicode 00B0) and other complex Unicode characters...
  • Blog Post: Encoding the Hive query file in Azure HDInsight

    Today at Microsoft we were using Azure Data Factory to run Hive Activities in Azure HDInsight on a schedule. Things were working fine for a while, but then we got an error that was hard to understand. I've simplified the scenario to illustrate the key points. The key is that Hive did not like the Byte...
  • Blog Post: Troubleshooting Hive query performance in HDInsight Hadoop cluster

    One of the common support requests we get from customers using Apache Hive is –my Hive query is running slow and I would like the job/query to complete much faster – or in more quantifiable terms, my Hive query is taking 8 hours to complete and my SLA is 2 hours. Improving or tuning hive...
  • Blog Post: How to access Hive using JDBC on HDInsight

    While following up on a customer question recently on this topic, I realized that we have seen the same question coming up from other users a few times and thought I would share a simple example here on how to connect to HiveServer2 on Azure HDInsight using JDBC. For background, please review the apache...
  • Blog Post: How to use a Custom JSON Serde with Microsoft Azure HDInsight

    I had a recent need to parse JSON files using Hive. There were a couple of options that I could use. One is using native Hive JSON function such as get_json_object and the other is to use a JSON Serde to parse JSON objects containing nested elements with lesser code. I decided to go with the second approach...
  • Blog Post: Some Frequently Asked Questions on Microsoft Azure HDInsight

    We have seen some common questions on HDInsight when interacting with customers and partners. On this blog post, we are going to help answer some of those common questions. 1. What is Microsoft Azure HDInsight? HDInsight is a Hadoop-based service from Microsoft that brings a 100 percent Apache...
  • Blog Post: HDInsight: - backup and restore hive table

    Introduction My name is Sudhir Rawat and I work on the Microsoft HDInsight support team. In this blog I am going to explain the options for backing up and restoring a Hive table on HDInsight. The general recommendation is to store hive metadata on SQL Azure during provisioning the cluster. Sometimes...
  • Blog Post: Sliding Window Data Partitioning on Microsoft Azure HDInsight

    HCatalog is a table and storage management layer for Hadoop that enables users with different data processing tools like Pig, Mapreduce, Hive, and Oozie to read and write data. HCatalog's table abstraction presents these tools and users with a relational view of data in the cluster. HCatalog Integration...
  • Blog Post: How to add custom Hive UDFs to HDInsight

    I recently had a need to add a UDF to Hive on HDInsight. I thought that it would be good to share that experience on a blog post. Hive provides a library of built-in functions to achieve the most common needs. The cool thing is that it also provides the framework to create your own UDF. I had a recent...
  • Blog Post: Get Started with Hive on HDInsight

    Hi, my name is Dharshana and I work on the Big Data Support Team at Microsoft. As covered in the earlier post by Dan from our team, HDInsight provides a very easy to use interface to provision a Hadoop cluster with a few clicks and interact with the cluster programmatically. In this blog post, we will...
Page 1 of 1 (10 items)