Blog - Title

January, 2012

  • Carl's Blog

    Hadoop XML Streaming and F# MapReduce

    • 0 Comments
    So, to round out the Hadoop Streaming samples I thought I would put together an XML Streaming sample. As always the code can be found here: http://code.msdn.microsoft.com/Hadoop-Streaming-and-F-f2e76850 XML Streaming Reader So how does one stream in XML...
  • Carl's Blog

    Hadoop Streaming and Windows Azure Blob Storage

    • 0 Comments
    One of the cool features of the Microsoft Distribution of Hadoop (MDH) is the native support for Windows Azure Blob Storage. When performing HDFS operations by default one can omit the scheme such that: hadoop fs -lsr /mobile Is equivalent to: hadoop...
  • Carl's Blog

    Hadoop Streaming and Reporting

    • 0 Comments
    If like me you are a .Net developer and have written some Streaming jobs it is not immediately obvious how one can do any reporting. However if you dig through the Streaming Documentation you will come across this in the FAQs: How do I update counters...
  • Carl's Blog

    A lazy evaluation of F# Seq.groupBy for sorted sequences

    • 0 Comments
    In doing some recent work with Hadoop I needed to process a sequence which was grouped by a projected key. Whereas the Seq.groupBy can perform this operation, the Seq.groupBy function makes no assumption on the ordering of the original sequence. As a...
  • Carl's Blog

    Hadoop Binary Streaming and PDF File Inclusion

    • 2 Comments
    In a previous post I talked about Hadoop Binary Streaming for the processing of Microsoft Office Word documents. However, due to there popularity, I thought inclusion for support of Adobe PDF documents would  be beneficial. To this end I have updated...
Page 1 of 1 (5 items)