Connect Tableau Desktop to SparkSQL

Last (but not least) post of 2014, and a new Hacking challenge. Based on the work I've done on SQLDeveloper (, I was wondering how to connect Tableau Desktop to my SparkSQL cluster. Install Tableau Desktop I'm quite new to Tableau, but it's worth giving a try. However, spending $999 for a challenge isn't worth it, … Continue reading Connect Tableau Desktop to SparkSQL


Use Spark-SQL on SQL Developer

I'm describing here how I set SQL Developer to connect / query my Spark cluster. I made it work on my local environment below: Ubuntu precise 64 bits (1 master, 2 slaves) Hadoop Hortonworks Hive version, metastore hosted on MySQL database Spark 1.1.0 prebuilt for Hadoop 2.4 SQL Developer Note that I've … Continue reading Use Spark-SQL on SQL Developer

Processing GDELT data using Hadoop InputFormat and SparkSQL

GDELT A quick overview of GDELT public data set: "GDELT Project monitors the world's broadcast, print, and web news from nearly every corner of every country in over 100 languages and identifies the people, locations, organisations, counts, themes, sources, and events driving our global society every second of every day, creating a free open platform … Continue reading Processing GDELT data using Hadoop InputFormat and SparkSQL