Using AWS Simple Email Service with Oozie in Cloudera

I’ve moved to a new role where I will be doing a lot more “devops” type work which hopefully comes with more interesting subjects to start writing about. Oozie and AWS Simple Email Services The C...

Mar 10, 2017 Big Data, Programming, AWS

Wordle Interpretation of CV

Today someone showed me a handle word cloud tool called wordle. I put the skills from my CV into it and had a play with the output… I think it looks quite cool.

Feb 14, 2017 Random

First go at creating a Garmin watch face

For a while Garmin have been adding “wearable technology” functionality to their watches - notifications from the phone etc. I got a Garmin Fenix 3 for Christmas and had a play with the vast number...

Jan 3, 2016 Programming

Scala eXchange 2015 - Embracing the community

Today I’m at my first Scala eXchange conference - the 5th annual one to be precise. The day started with the keynote session by @jessitron which was both inspiring and enlightening. The general to...

Dec 10, 2015 Conference, Programming

Unit testing HDFS code

I need to write a couple of unit tests for some code to add a log entry into HDFS but I don’t want to have to rely on having access to full blown HDFS cluster or a local install to achieve this. T...

Jul 2, 2015 Big Data, Programming

Writing a Flume Interceptor

Here we are in June, some five months since the last post and I finally have some time and content to sit and write a post. In April 2013 I started working with Hadoop, the plan was to suck in ser...

Jun 17, 2015 Big Data, Programming

Quick introduction to pyspark

All the work I have been doing with AWS has been using Python, specifically boto3 the rework of boto. One of the intentions is to limit bandwidth when transferring data to S3 the idea is to send p...

Jan 13, 2015 Programming, Spark

Client side encryption using Boto3 and AWS KMS

Towards the end of 2014 Amazon released the KMS service to provide a cheaper cut down offering for Key Management Services than those provided with the CloudHSM solutions (although it still uses ha...

Jan 6, 2015 Amazon Web Services, Programming

Adventures with Spark, part two

Some time ago, back in September, I wrote a post on starting my adventures with Spark but didn’t progress things very far. On thing that was holding me back was a reasonably real world problem to ...

Dec 3, 2014 Programming, Spark

AWS HTTPSConnectionPool max retries exceeded

I’m working with a new AWS account and I am moving to testing Boto3 to use the KMS service. I needed to make sure that the AWS account and secret keys were updated so ran aws configure to quickly u...

Dec 1, 2014 Amazon Web Services