Cindy Gross: Small Bites of Big Data, Small Data, All Data
Small Bites of Big Data, Small Data, All Data for Hadoop, SQL Server, Hive, Distributed Systems, Scale Out....
Azure Blob Store
instant file initialization
sql pass conference
sql server 2005
sql server 2008
sql server 2008 R2
sql server 2012
Browse by Tags
Cindy Gross: Small Bites of Big Data, Small Data, All Data
Tagged Content List
Hadoop Likes Big Files
One of the frequently overlooked yet essential best practices for Hadoop is to prefer fewer, bigger files over more, smaller files. How small is too small and how many is too many? How do you stitch together all those small Internet of Things files into files "big enough" for Hadoop to process...
4 May 2015
Azure Data Factory: Hub Not Found
You can use the new Azure portal to create or edit Azure Data Factory components. Once you are done you may automate the process of creating future Data Factory components from PowerShell. In that case you can use the JSON files you edited in the portal GUI as configuration files for the PowerShell cmdlets...
1 Apr 2015
Create HDInsight Cluster in Azure Portal
Creating an HDInsight cluster from the Azure portal is very easy. However, sometimes you want all the choices and best practices explained as well as the "how to". I have created a series of slides with audio recordings to walk you through the process and choices. They are available as sessions...
26 Feb 2015
Master Choosing the Right Project for Hadoop
Hadoop is the hot buzzword of the Big Data world, and many IT people are being told "go create a Hadoop cluster and do some magic". It's hard to know where to start or which projects are a good fit. The information available online is sparse, often conflicting, and usually focused on how to...
25 Feb 2015
AzureCopy to the Rescue for an S3 to Azure Blob Copy!
This week I helped a client move files from AWS S3 to Azure Storage blobs. Sounds simple, right? Here's the tricky part... While there are both Azure and AWS cmdlets for PowerShell, they don't cooperate. Neither has a cmdlet that accepts credentials from the other and neither accepts arbitrary URLs from...
21 Feb 2015
Understanding WASB and Hadoop Storage in Azure
Yesterday we learned Why WASB Makes Hadoop on Azure So Very Cool . Now let's dive deeper into Windows Azure storage and WASB. I'll answer some of the common questions I get when people first try to understand how WASB is the same as and different from HDFS. What is HDFS? The Hadoop Distributed File System...
4 Feb 2015
Why WASB Makes Hadoop on Azure So Very Cool
Data. It’s all about the data. We want to make more data driven decisions. We want to keep more data so we can make better decisions. We want that data stored cheaply, easily accessible, and quickly ingested. Hadoop promises to help with all those things. However, when you deal with Hadoop on-premises...
3 Feb 2015
Taking Flight a.k.a. The Data Dragon’s Life After Microsoft
Cross-posted (with slightly worse formatting) from http://befriendingdragons.com/2014/07/23/taking-flight-a-k-a-the-data-dragons-life-after-microsoft/ Life is a journey – we can choose to fly through it with our wings spread to catch and channel the winds, or we can let the winds pummel us to...
23 Jul 2014
Azure Maximums and Resource Usage from PowerShell
Technorati Tags: Azure , PowerShell Have you ever struggled to find out how many VM cores, HDInsight cores, storage accounts, or other Azure resources your subscription is set to allow or how many you actually use? Maybe you want to use this information in your automation scripts to avoid trying to create...
9 Jul 2014
Get HDInsight Properties with PowerShell
Small Bites of Big Data from AzureCAT You’ve created your HDInsight Hadoop clusters and now you want to know exactly what you have out there in Azure. Maybe you want to pull the key information into a repository periodically as a reference for future troubleshooting, comparisons, or billing. Maybe you...
23 May 2014
Use Additional Storage Accounts with HDInsight Hive
When you create an HDInsight Hadoop cluster you pass in one or more storage accounts and their associated keys. This allows you to access the files on all associated storage accounts from the cluster. If you want to use public storage that isn’t passed in at create time that’s easy –...
5 May 2014
Sample PowerShell Script: HDInsight Custom Create
This is a working script I use to create various HDInsight clusters. For a really reproducible, automated environment you would want to put this into a .ps1 script that accepts parameters (see here for an example). However, you may find the method below good for learning and experimenting. Replace all...
6 Dec 2013
Your First HDInsight Cluster–Step by Step
Small Bites of Big Data from AZURECAT Big Data Tech Training Series #1 Cindy Gross | Murshed Zaman Sometimes it is just hard to get started. Have you been putting off your first foray into Hadoop? Are you not sure where to begin? Let’s get really basic. Prerequisites: Azure subscription...
25 Nov 2013
PowerShell for Azure cmdlets: Subscription was all Wacky
I was working on some HDInsight scripts in PowerShell and doing lots of experimenting. I’m not sure what exactly I did but all of a sudden everything stopped working. With lots of interruptions from meetings and chats and lunch…. I couldn’t retrace my steps. Everything seemed to fail on the Azure subscription...
22 Nov 2013
HDInsight Big Data Talks from #SQLPASS
SQL PASS Summit 2013 was another great data geek week! I chatted with many of you about Big Data, Hadoop, HDInsight, architecting solutions, SQL Server, data, BI, analytics, and general geekiness - great fun! This time around I delivered two talks on Hadoop and HDInsight - the slides from both are attached...
20 Oct 2013
Big Data Twitter Demo
Real-time. Social Sentiment Analysis. Twitter. Cloud. Insights. We have your Big Data buzzwords here! Everyone seems to want to incorporate social sentiment into their business analysis. Well we have the demo for you! Use it for a quick demonstration of what can be done and when the excitement goes...
17 May 2013
Access Azure Blob Stores from HDInsight
Small Bites of Big Data Edit Mar 6, 2014: This is no longer necessary for HDInsight - you specify the storage accounts when you create the cluster and the rest happens auto-magically. See http://blogs.msdn.com/b/cindygross/archive/2013/11/25/your-first-hdinsight-cluster-step-by-step.aspx or http:...
25 Apr 2013
HDInsight: Jiving about Hadoop and Hive with CAT
Tomorrow I will be talking about Hive as part of Pragmatic Work's Women in Technology (WIT) month of webcasts. I am proud to be part of this lineup with all these stellar WITs! I encourage my fellow WITs to get more involved in your data community and if you don't already do so start tweeting, blogging...
20 Mar 2013
PASS BAC PREVIEW SERIES: SQL Professionals and the World of Self-service BI and Big Data
Are you excited about the upcoming PASS Business Analytics Conference? You should be! This conference will offer a wide range of sessions about Microsoft's End to End Business Intelligence (including Self-Service BI), Analytics, Big Data, Architecture, Reporting, Information Delivery, Data Management...
15 Feb 2013
HDInsight: Hive Internal and External Tables Intro
Small Bites of Big Data Cindy Gross, SQLCAT PM HDInsight is Microsoft's distribution, in partnership with Hortonworks , of Hadoop. Hive is the component of the Hadoop ecosystem that imposes structure on Hadoop data in a way that makes it usable from BI tools that expect rows and columns with defined...
5 Feb 2013
Hurricane Sandy Mash-Up: Hive, SQL Server, PowerPivot & Power View
Small Bites of Big Data Authors : Cindy Gross Microsoft SQLCAT PM, Ed Katibah Microsoft SQLCAT PM Tech Reviewers: Bob Beauchemin Developer Skills Partner at SQLSkills, Jeannine Nelson-Takaki Microsoft Technical Writer, John Sirmon Microsoft SQLCAT PM, Lara Rubbelke Microsoft Technical Architect...
31 Jan 2013
Big Data – All Abuzz About Hive at #SQLPASS Summit 2012
Big Data – All Abuzz About Hive Small Bites of Big Data Cindy Gross, SQLCAT PM I hope to see you at the #SQLPASS Summit 2012 this week! There are many reasons people come to the PASS Summit – SQL friends, SQL family , networking, great content in 190 sessions, the SQL clinic,...
6 Nov 2012
Load SQL Server BCP Data to Hive
Load SQL Server BCP Data to Hive Small Bites of Big Data Cindy Gross, SQLCAT PM As you start learning more about Hadoop you may want to take a look at how the same data and queries work for SQL Server and for Hadoop. There are various ways to do this. For now I’ll show you something...
28 Sep 2012
What’s all the Buzz about Hadoop and Hive?
What’s all the Buzz about Hadoop and Hive? Why it Matters for SQL Server Peeps Small Bites of Big Data Cindy Gross, SQLCAT PM On September 20, 2012 we have another 24 Hours of PASS event! This PASS Summit Preview will give you a taste of what is coming at this year’s PASS...
19 Sep 2012
How to Install the PowerShell Cmdlets for Apache™ Hadoop™-based Services for Windows
How to Install the PowerShell Cmdlets for Apache™ Hadoop™-based Services for Windows Small Bites of Big Data Cindy Gross, SQLCAT PM UPDATED JUNE 2013 - The very early version of PowerShell cmdlets I discussed below have been replaced - see Managing Your HDInsight Cluster with...
23 Aug 2012
Page 1 of 2 (30 items)
© 2015 Microsoft Corporation.
Privacy & Cookies