Cindy Gross: Small Bites of Big Data, Small Data, All Data

Small bites of Big Data, Small Data, All Data for Hadoop, SQL Server, Hive, Distributed Systems, Scale Out....

  • Blog Post: Azure Maximums and Resource Usage from PowerShell

    Have you ever struggled to find out how many VM cores, HDInsight cores, storage accounts, or other Azure resources your subscription is set to allow or how many you actually use? Maybe you want to use this information in your automation scripts to avoid trying to create...
  • Blog Post: Get HDInsight Properties with PowerShell

    Small Bites of Big Data from AzureCAT You’ve created your HDInsight Hadoop clusters and now you want to know exactly what you have out there in Azure. Maybe you want to pull the key information into a repository periodically as a reference for future troubleshooting, comparisons, or billing. Maybe you...
  • Blog Post: Use Additional Storage Accounts with HDInsight Hive

    When you create an HDInsight Hadoop cluster you pass in one or more storage accounts and their associated keys. This allows you to access the files on all associated storage accounts from the cluster. If you want to use public storage that isn’t passed in at create time that’s easy –...
  • Blog Post: Sample PowerShell Script: HDInsight Custom Create

    This is a working script I use to create various HDInsight clusters. For a really reproducible, automated environment you would want to put this into a .ps1 script that accepts parameters (see here for an example). However, you may find the method below good for learning and experimenting. Replace all...
  • Blog Post: Your First HDInsight Cluster–Step by Step

    Small Bites of Big Data from AZURECAT Big Data Tech Training Series #1 Cindy Gross | Murshed Zaman Sometimes it is just hard to get started. Have you been putting off your first foray into Hadoop? Are you not sure where to begin? Let’s get really basic. Prerequisites: Azure subscription...
  • Blog Post: PowerShell for Azure cmdlets: Subscription was all Wacky

    I was working on some HDInsight scripts in PowerShell and doing lots of experimenting. I’m not sure what exactly I did, but all of a sudden everything stopped working. With lots of interruptions from meetings and chats and lunch, I couldn’t retrace my steps. Everything seemed to fail on the Azure subscription...
  • Blog Post: HDInsight Big Data Talks from #SQLPASS

    SQL PASS Summit 2013 was another great data geek week! I chatted with many of you about Big Data, Hadoop, HDInsight, architecting solutions, SQL Server, data, BI, analytics, and general geekiness - great fun! This time around I delivered two talks on Hadoop and HDInsight - the slides from both are attached...
  • Blog Post: Big Data Twitter Demo

    Real-time. Social Sentiment Analysis. Twitter. Cloud. Insights. We have your Big Data buzzwords here! Everyone seems to want to incorporate social sentiment into their business analysis. Well, we have the demo for you! Use it for a quick demonstration of what can be done and when the excitement goes...
  • Blog Post: Access Azure Blob Stores from HDInsight

    Small Bites of Big Data Edit Mar 6, 2014: This is no longer necessary for HDInsight - you specify the storage accounts when you create the cluster and the rest happens auto-magically. See http://blogs.msdn.com/b/cindygross/archive/2013/11/25/your-first-hdinsight-cluster-step-by-step.aspx or http:...
  • Blog Post: HDInsight: Jiving about Hadoop and Hive with CAT

    Tomorrow I will be talking about Hive as part of Pragmatic Works' Women in Technology (WIT) month of webcasts. I am proud to be part of this lineup with all these stellar WITs! I encourage my fellow WITs to get more involved in your data community and, if you don't already do so, start tweeting, blogging...
  • Blog Post: HDInsight: Hive Internal and External Tables Intro

    Small Bites of Big Data Cindy Gross, SQLCAT PM HDInsight is Microsoft's distribution, in partnership with Hortonworks, of Hadoop. Hive is the component of the Hadoop ecosystem that imposes structure on Hadoop data in a way that makes it usable from BI tools that expect rows and columns with defined...