Tagged Content List
  • Blog Post: Try F# 3.0 Released

    From the F# Team Blog: If you're familiar with the site Try F#, which lets you run F# right in your browser, you'll be happy to hear that the Try F# team has launched Try F# 3.0 today! You will find updated content and tutorials, plus content that is tailored to particular areas--such as data science...
  • Blog Post: Co-occurrence Approach to an Item Based Recommender Update

    In a previous post I talked about a Co-occurrence Approach to an Item Based Recommender that utilized the Math.Net Numerics library. Recently the library was updated to version 2.3.0. With this version I was able to update the code to more efficiently read the Sparse...
  • Blog Post: Implementing a MapReduce Join with Hadoop and the .Net Framework

    I have often been asked how one implements a Join when writing MapReduce code. As such, I thought it would be useful to add a sample demonstrating how this is achieved. There are multiple mechanisms one can employ to perform a Join operation, and the one to be discussed will be a Reduce...
  • Blog Post: MapReduce Based Co-occurrence Approach to an Item Based Recommender

    In a previous post I covered the basics of a Co-occurrence Approach to an Item Based Recommender. As promised, here is the continuation of this work: an implementation of the same algorithm using MapReduce. Before reading this post it will be worth reading the Local version as it covers the sample...
  • Blog Post: Framework for .Net Hadoop MapReduce Job Submission Json Serialization

    A while back, one of the changes made to the “Generics based Framework for .Net Hadoop MapReduce Job Submission” code was to support Binary Serialization out of the Mapper, in and out of Combiners, and out of the Reducer. Whereas this change was needed to support the Generic interfaces there...
  • Blog Post: Co-occurrence Approach to an Item Based Recommender

    For a while I thought I would tackle the problem of creating an item-based recommender. Firstly I will start with a local variant before moving onto a MapReduce version. The current version of the code can be found at: http://code.msdn.microsoft.com/Co-occurrence-Approach-to-57027db7 The approach taken...
  • Blog Post: Framework for .Net Hadoop MapReduce Job Submission Binary Output

    To end the week I decided to make a minor change to the “Generics based Framework for .Net Hadoop MapReduce Job Submission”. I have been doing some work on creating a co-occurrence matrix for item recommendations. I was going to map the process to a MapReduce job(s), then came across...
  • Blog Post: .Net Implementation of a Priority Queue (aka Heap)

    I thought I would take a break from Hadoop for a while and put together an F# .Net implementation of a Priority Queue, implemented using a heap data structure. Conceptually we can think of a heap as a balanced binary tree. The tree will have a root, and each node can have up to two children; a left and...
  • Blog Post: Generics based Framework for .Net Hadoop MapReduce Job Submission

    Over the past month I have been working on a framework to allow composition and submission of MapReduce jobs using .Net. I have put together two previous blog posts on this, so rather than put together a third on the latest change I thought I would create a final composite post. To understand why let's...
  • Blog Post: .Net Hadoop MapReduce Job Framework - Revisited (Archived)

    An updated version of this post can be found at: http://blogs.msdn.com/b/carlnol/archive/2012/04/29/generic-based-framework-for-net-hadoop-mapreduce-job-submission.aspx If you have been using the Framework for Composing and Submitting .Net Hadoop MapReduce Jobs you may want to download an updated...
  • Blog Post: Framework for Composing and Submitting .Net Hadoop MapReduce Jobs (Archived)

    An updated version of this post can be found at: http://blogs.msdn.com/b/carlnol/archive/2012/04/29/generic-based-framework-for-net-hadoop-mapreduce-job-submission.aspx If you have been following my blog you will see that I have been putting together samples for writing .Net Hadoop MapReduce jobs;...
  • Blog Post: Hadoop Streaming in F# and MapReduce (summary)

    With all my recent posts around Hadoop Streaming I thought it would be useful to summarize them into a single post. The main objective of these posts was to put together a codebase to enable F# developers to write Map/Reduce libraries through a simple API. The full code posting can be found here: http...
  • Blog Post: FSharpChart for Valentines Day

    As I have not posted anything about FSharpChart for a while I thought I would do a quick post, one that befits Valentines Day: Plotting the heart was surprisingly easy to do: [ for x in -1.1 .. 0.001 .. 1.0 do      let y1 = abs(x)+sqrt(1.0-x**2.0)      let y2 = abs(x)-sqrt(1.0-x**2.0)      yield (x,...
  • Blog Post: Hadoop XML Streaming and F# MapReduce

    So, to round out the Hadoop Streaming samples I thought I would put together an XML Streaming sample. As always the code can be found here: http://code.msdn.microsoft.com/Hadoop-Streaming-and-F-f2e76850 XML Streaming Reader So how does one stream in XML? If you read the Hadoop Streaming documentation...
  • Blog Post: A lazy evaluation of F# Seq.groupBy for sorted sequences

    In doing some recent work with Hadoop I needed to process a sequence which was grouped by a projected key. Although Seq.groupBy can perform this operation, it makes no assumption about the ordering of the original sequence. As a consequence the resulting sequence is not lazily evaluated...
  • Blog Post: Hadoop Binary Streaming and PDF File Inclusion

    In a previous post I talked about Hadoop Binary Streaming for the processing of Microsoft Office Word documents. However, due to their popularity, I thought adding support for Adobe PDF documents would be beneficial. To this end I have updated the source code to support processing of both...
  • Blog Post: Hadoop Binary Streaming and F# MapReduce

    As mentioned in my previous post, Hadoop Streaming supports not only text streaming but also Binary Streaming. As such I wanted to put together a sample that supports processing Office documents; more on support for PDF in a later post. As always the code can be downloaded from: http:...
  • Blog Post: MapReduce Tester: A Quick Word

    In my previous post I talked a little about testing the Hadoop Streaming F# MapReduce code; but it is worth saying a few words about the tester application. The complete code for this blog post and the F# MapReduce code can be found at: http://code.msdn.microsoft.com/Hadoop-Streaming-and-F-f2e76850 As...
  • Blog Post: Hadoop Streaming and F# MapReduce

    And now for something completely different. As you may know, Microsoft has recently announced plans for a Hadoop adoption for both Windows Server and Windows Azure. You can find out more about Hadoop and Windows Azure at Apache Hadoop-based Services for Windows Azure and Availability of Community...
  • Blog Post: F# Project and Item Templates (revised)

    Over the past few months I have posted a few Project and Item Templates for F#. Here is a quick summary of those templates. The Project Templates include: WPF Project, WinForms Application, Console Application, Windows Service, and F# Class Library. These Project Templates install the following Item Templates...
  • Blog Post: F# WPF Project Template

    This template is for a basic F# Windows Presentation Foundation (WPF) project. This template can be found on the Visual Studio Gallery: http://visualstudiogallery.msdn.microsoft.com/33dab4ce-62b8-419e-8072-92bc88556f78 When the template is installed you get the following template added to your...
  • Blog Post: F# Dynamic Lookup Operator and Regex

    In a recent post I talked about using String.forall for string matching over the use of Regex: http://blogs.msdn.com/b/carlnol/archive/2011/10/20/f-string-extension-methods-rather-than-using-regex.aspx I do however use Regex a lot for other types of processing. One thing I find myself doing a bit is...
  • Blog Post: F# Parallel Execution and Future Tasks

    In a previous post I talked about some options around performing parallel executions by providing some wrappers that allowed a collection of functions to be executed in parallel: http://blogs.msdn.com/b/carlnol/archive/2011/07/17/f-and-running-parallel-tasks.aspx However there is another approach one...
  • Blog Post: FSharpChart minor release available (version 0.56)

    The new version of FSharp.Chart (formerly FSharpChart) can now be found on GitHub at: http://fsharp.github.io/FSharp.Charting/ I have provided a minor update to FSharpChart, which can be downloaded from the usual place: http://code.msdn.microsoft.com/FSharpChart-b59073f5 This is a minor release...
  • Blog Post: F# String extension methods rather than using RegEx

    In my previous post I talked about some extension methods for string checking: http://blogs.msdn.com/b/carlnol/archive/2011/10/19/string-extension-methods-rather-than-using-regex.aspx But what would a post be if there were not an F# equivalent? Interestingly the ability to perform checks on each character...
Page 1 of 2 (38 items)