Posted in Tips, Uncategorized | Posted on 27-05-2015
I am excited to say that the blog post that I have co-authored Avoiding The Mess In The Hadoop Cluster (Part 1) has been published by GetInData and and Apache Software Foundation.
In the first part of this blog series, we describe possible open-source solutions for data cataloguing, data discovery and process scheduling such as Apache Hive, HCatalog and Apache Falcon.
If interested, please read more at Avoiding The Mess In The Hadoop Cluster (Part 1).
A typical day of a data engineer at Spotify revolves around Hadoop and music. However after some time of simultaneous developing MapReduce jobs, maintaining a large cluster and listening to perfect music for every moment, something surprising might happen…!
Well, after some time, a data engineer starts discovering Hadoop (and its related concepts) in the lyrics of many popular songs. How can Coldplay, Black Eyed Peas, Michael Jackson or Justin Timberlake sing about Hadoop?
Maybe it is some kind of illness? Definitely! A doctor could call it “inlusio elephans” ;)