Cindy Gross: Small Bites of Big Data, Small Data, All Data

Small Bites of Big Data, Small Data, All Data for Hadoop, SQL Server, Hive, Distributed Systems, Scale Out....

Browse by Tags

Tagged Content List
  • Blog Post: Create HDInsight Cluster in Azure Portal

    Creating an HDInsight cluster from the Azure portal is very easy. However, sometimes you want all the choices and best practices explained as well as the "how to". I have created a series of slides with audio recordings to walk you through the process and choices. They are available as sessions...
  • Blog Post: Master Choosing the Right Project for Hadoop

    Hadoop is the hot buzzword of the Big Data world, and many IT people are being told "go create a Hadoop cluster and do some magic". It's hard to know where to start or which projects are a good fit. The information available online is sparse, often conflicting, and usually focused on how to...
  • Blog Post: AzureCopy to the Rescue for an S3 to Azure Blob Copy!

    This week I helped a client move files from AWS S3 to Azure Storage blobs. Sounds simple, right? Here's the tricky part... While there are both Azure and AWS cmdlets for PowerShell, they don't cooperate. Neither has a cmdlet that accepts credentials from the other and neither accepts arbitrary URLs from...
  • Blog Post: Understanding WASB and Hadoop Storage in Azure

    Yesterday we learned Why WASB Makes Hadoop on Azure So Very Cool . Now let's dive deeper into Windows Azure storage and WASB. I'll answer some of the common questions I get when people first try to understand how WASB is the same as and different from HDFS. What is HDFS? The Hadoop Distributed File System...
  • Blog Post: Why WASB Makes Hadoop on Azure So Very Cool

    Data. It’s all about the data. We want to make more data driven decisions. We want to keep more data so we can make better decisions. We want that data stored cheaply, easily accessible, and quickly ingested. Hadoop promises to help with all those things. However, when you deal with Hadoop on-premises...
  • Blog Post: Azure Maximums and Resource Usage from PowerShell

    Technorati Tags: Azure , PowerShell Have you ever struggled to find out how many VM cores, HDInsight cores, storage accounts, or other Azure resources your subscription is set to allow or how many you actually use? Maybe you want to use this information in your automation scripts to avoid trying to create...
  • Blog Post: Get HDInsight Properties with PowerShell

    Small Bites of Big Data from AzureCAT You’ve created your HDInsight Hadoop clusters and now you want to know exactly what you have out there in Azure. Maybe you want to pull the key information into a repository periodically as a reference for future troubleshooting, comparisons, or billing. Maybe you...
  • Blog Post: Use Additional Storage Accounts with HDInsight Hive

    When you create an HDInsight Hadoop cluster you pass in one or more storage accounts and their associated keys. This allows you to access the files on all associated storage accounts from the cluster. If you want to use public storage that isn’t passed in at create time that’s easy –...
  • Blog Post: Sample PowerShell Script: HDInsight Custom Create

    This is a working script I use to create various HDInsight clusters. For a really reproducible, automated environment you would want to put this into a .ps1 script that accepts parameters (see here for an example). However, you may find the method below good for learning and experimenting. Replace all...
Page 1 of 1 (9 items)