Benjamin Guinebertière

This blog is about Microsoft Azure. Older stuff include architecture, SOA, BizTalk, ...

Browse by Tags

Tagged Content List
  • Blog Post: start a Pig + Jython job in HDInsight thru WebHCat

    You can also use HDInsight with Hive + Python . The drawback of the latter is that you use streaming between Hive and Python. In Hadoop streaming is just a way to call stdin/stdout inter process communication. So if you just do simple operations like string concatenations between two fields in Python...
  • Blog Post: HDInsight + PowerBI: un exemple simple

    En octobre dernier, j’ai eu l’occasion de montrer comment analyser des données venant de logs Web et Twitter avec PIG et HIVE dans Hadoop, puis de croiser les résultats dans Excel, ce qui permet de décliner le résultat dans Power BI. Je mets ici les diapos et les vidéos (les vidéos sont les vidéos de...
  • Blog Post: How to deploy a Python module to Windows Azure HDInsight

    Introduction In a previous post , I explained how to run Hive + Python in HDInsight (Hadoop as a service in Windows Azure). The sample showed a Python script using standard modules such as hashlib. In real life, modules need to be installed on the machine before they can be used. Recently, I had to use...
  • Blog Post: A simple example: how to call Python from Hive in HDInsight

    Introduction Hadoop framework distributes code execution automatically in a multi node cluster. This code is also distributed against the dataset. Code development in Hadoop can be done in Java and one has to implement a map function and a reduce function; both manipulate keys and values as inputs and...
  • Blog Post: How to use HDInsight from Linux

    HDinsight is very easy to use from PowerShell, but how would you create and delete a cluster from Linux? How would you submit a job and get the result? Here is is a simple sample and pointers to further documentation. 1. Create a cluster You can create a cluster with the Windows Azure Command Line Interface...
Page 1 of 1 (5 items)