Carl Nolan's Blog
Carl Nolan’s ramblings on development
About Carl Nolan
I work for Microsoft Consulting Services in the UK. My primary focus is enterprise applications using C# and SQL Server. However, I also have an interest in other programming paradigms, hence the interest in F#. A summary of my previous publications can be found here. A list of posted code can be found here.
Tagged Content List: hadoop streaming
Blog Post: Implementing a MapReduce Join with Hadoop and the .Net Framework
Carl Nolan, 13 Nov 2012
I have often been asked how one implements a Join whilst writing MapReduce code. As such, I thought it would be useful to add an additional sample demonstrating how this is achieved. There are multiple mechanisms one can employ to perform a Join operation, and the one to be discussed will be a Reduce...
Blog Post: Framework for .Net Hadoop MapReduce Job Submission V1.0 Release
Carl Nolan, 10 Oct 2012
It has been a few months since I have made a change to the “Generics based Framework for .Net Hadoop MapReduce Job Submission” code. However, I was going to put together a sample for a Reduce side join and came across an issue around the usage of partitioners. As such I decided to add support...
Blog Post: Framework for .Net Hadoop MapReduce Job Submission TextOutput Type
Carl Nolan, 10 Jul 2012
Some recent changes made to the “Generics based Framework for .Net Hadoop MapReduce Job Submission” code were to support Json and Binary Serialization from Mapper, in and out of Combiners, and out from the Reducer. However, this precluded one from controlling the format of the Text output. Say one...
Blog Post: C# MapReduce Based Co-occurrence Item Based Recommender
Carl Nolan, 9 Jul 2012
As promised, to conclude the Co-occurrence Approach to an Item Based Recommender posts, I wanted to port the MapReduce code to C#; just for kicks and to prove the code is also easy to write in C#. For an explanation of the MapReduce post, review the previous article: http://blogs.msdn.com/b/carlnol/archive...
Blog Post: MapReduce Based Co-occurrence Approach to an Item Based Recommender
Carl Nolan, 7 Jul 2012
In a previous post I covered the basics for a Co-occurrence Approach to an Item Based Recommender. As promised, here is the continuation of this work, an implementation of the same algorithm using MapReduce. Before reading this post it will be worth reading the Local version as it covers the sample...
Blog Post: Framework for .Net Hadoop MapReduce Job Submission Json Serialization
Carl Nolan, 1 Jul 2012
A while back one of the changes made to the “Generics based Framework for .Net Hadoop MapReduce Job Submission” code was to support Binary Serialization from Mapper, in and out of Combiners, and out from the Reducer. Whereas this change was needed to support the Generic interfaces there...
Blog Post: Framework for .Net Hadoop MapReduce Job Submission configuration update
Carl Nolan, 25 Jun 2012
To better support configuring the Stream environment whilst running .Net Streaming jobs I have made a change to the “Generics based Framework for .Net Hadoop MapReduce Job Submission” code. I have fixed a few bugs around setting job configuration options which were being controlled by...
Blog Post: Framework for .Net Hadoop MapReduce Job Submission Binary Output
Carl Nolan, 15 Jun 2012
To end the week I decided to make a minor change to the “Generics based Framework for .Net Hadoop MapReduce Job Submission”. I have been doing some work on creating a co-occurrence matrix for item recommendations. I was going to map the process to a MapReduce job(s), then came across...
Blog Post: Framework for .Net Hadoop MapReduce Job Submission libjars update
Carl Nolan, 11 Jun 2012
If you have been using the “Generics based Framework for .Net Hadoop MapReduce Job Submission” you may want to download the latest version of the code. The previous version of the code, when processing XML and Binary files, was dependent on a custom streaming JAR that contained the necessary...
Blog Post: Generics based Framework for .Net Hadoop MapReduce Job Submission
Carl Nolan, 29 Apr 2012
Over the past month I have been working on a framework to allow composition and submission of MapReduce jobs using .Net. I have put together two previous blog posts on this, so rather than put together a third on the latest change I thought I would create a final composite post. To understand why let's...
Blog Post: .Net Hadoop MapReduce Job Framework - Revisited (Archived)
Carl Nolan, 25 Apr 2012
An updated version of this post can be found at: http://blogs.msdn.com/b/carlnol/archive/2012/04/29/generic-based-framework-for-net-hadoop-mapreduce-job-submission.aspx
If you have been using the Framework for Composing and Submitting .Net Hadoop MapReduce Jobs you may want to download an updated...
Blog Post: Framework for Composing and Submitting .Net Hadoop MapReduce Jobs (Archived)
Carl Nolan, 10 Apr 2012
An updated version of this post can be found at: http://blogs.msdn.com/b/carlnol/archive/2012/04/29/generic-based-framework-for-net-hadoop-mapreduce-job-submission.aspx
If you have been following my blog you will see that I have been putting together samples for writing .Net Hadoop MapReduce jobs;...
Blog Post: Hadoop Streaming in F# and MapReduce (summary)
Carl Nolan, 22 Feb 2012
With all my recent posts around Hadoop Streaming I thought it would be useful to summarize them into a single post. The main objective of these posts was to put together a codebase to enable F# developers to write Map/Reduce libraries through a simple API. The full code posting can be found here: http...
Blog Post: Hadoop XML Streaming and F# MapReduce
Carl Nolan, 21 Jan 2012
So, to round out the Hadoop Streaming samples I thought I would put together an XML Streaming sample. As always the code can be found here: http://code.msdn.microsoft.com/Hadoop-Streaming-and-F-f2e76850
XML Streaming Reader
So how does one stream in XML? If you read the Hadoop Streaming documentation...
Blog Post: Hadoop Streaming and Windows Azure Blob Storage
Carl Nolan, 7 Jan 2012
One of the cool features of the Microsoft Distribution of Hadoop (MDH) is the native support for Windows Azure Blob Storage. When performing HDFS operations one can, by default, omit the scheme, such that:
hadoop fs -lsr /mobile
is equivalent to:
hadoop fs -lsr hdfs:///mobile
The commands are defaulting...
Blog Post: Hadoop Streaming and Reporting
Carl Nolan, 6 Jan 2012
If, like me, you are a .Net developer and have written some Streaming jobs, it is not immediately obvious how one can do any reporting. However, if you dig through the Streaming Documentation you will come across this in the FAQs:
How do I update counters in streaming applications?
A streaming process can...
Blog Post: Hadoop Binary Streaming and PDF File Inclusion
Carl Nolan, 1 Jan 2012
In a previous post I talked about Hadoop Binary Streaming for the processing of Microsoft Office Word documents. However, due to their popularity, I thought including support for Adobe PDF documents would be beneficial. To this end I have updated the source code to support processing of both...
Blog Post: Hadoop Binary Streaming and F# MapReduce
Carl Nolan, 30 Dec 2011
As mentioned in my previous post, Hadoop Streaming not only supports text streaming, but it also supports Binary Streaming. As such I wanted to put together a sample that supports processing Office documents; more on support for PDF in a later post. As always the code can be downloaded from: http:...
Blog Post: MapReduce Tester: A Quick Word
Carl Nolan, 29 Dec 2011
In my previous post I talked a little about testing the Hadoop Streaming F# MapReduce code; but it is worth saying a few words about the tester application. The complete code for this blog post and the F# MapReduce code can be found at: http://code.msdn.microsoft.com/Hadoop-Streaming-and-F-f2e76850
As...
Blog Post: Hadoop Streaming and F# MapReduce
Carl Nolan, 16 Dec 2011
And now for something completely different. As you may know, Microsoft has recently announced plans for a Hadoop adoption for both Windows Server and Windows Azure. You can find out more about Hadoop and Windows Azure at Apache Hadoop-based Services for Windows Azure and Availability of Community...