Hive Archives - Joydeep's Corner

HBase and Map-Reduce

Posted on April 7, 2010February 18, 2021 by jss

HBase + Map-Reduce is a really awesome combination. In all the back and forths about NoSQL – one of the things that’s often missed out is how convenient it is to be able to do scalable data analysis directly against large online data sets (that new distributed databases like HBase […]

Continue

Update on Hive+Hadoop+S3+EC2

Posted on May 20, 2009May 9, 2012 by jss

A formal recipe on running SQL queries using EC2 against S3 files is now posted at: http://wiki.apache.org/hadoop/Hive/HiveAws/HivingS3nRemotely But not before hitting a few more bugs ( HADOOP-5861 ). Running a TPCH query using Hive was a pretty high point. (I did have to omit the order by clauses though :-() […]

Continue

Hive + Hadoop + S3 + EC2 = It works!

Posted on May 14, 2009May 14, 2009 by jss

I have been enjoying my vacation time in India for the last few weeks and one of the fun projects i had taken up was getting a good story around running Hive on Amazon Infrastructure (AWS) . The use case i had in mind was something like this: A user […]

Continue

Curt Monash reports on Hadoop/Hive @ Facebook

Posted on May 12, 2009May 9, 2012 by jss

Curt Monash posted a blog post on our (myself and Ashish Thusoo’s) conversation with him regarding Hadoop and Hive and their deployment and usage at Facebook. It is heartening to see the mainstream database and analytics community starting to cover Hadoop and Hive. Even though these projects are rapidly becoming […]

Continue