Thursday, June 26, 2008

Ad-hoc Hadoop


I am back working on my favorite software platform "Hadoop". I have been working on writing applications on Hadoop for quite some time and I love the immense power of easily available, reliable and scalable distributed computing at my disposal.

I have few things in mind though, sometimes I want to do a lot of Ad-hoc analysis and at those times the bottleneck of having to write a script or some java-code is still a hassle. SQL based database query rocks for that purpose. I have to agree though that its my relative inexperience with python and lazyness which is the problem here not Hadoop BUT I can still see that a Ad-hoc query tool on top of hadoop would be great help. Pig Latin from yahoo was a good step in that direction and I liked it before I got frustrated with bugs and not working features in it.

Hadoop can become a very powerfull backend-analysis tool for any data oriented company. The market opportunities are very very big as well coz I guess world is full of lazy people like me :)

No comments: