« (In) The new world of Big Data, how different is the world of databases? Part II | Main

Comments

Shivrajk

Thanks Anant for putting it down here in lucid way, what you presented at Apigee campus. Looking forward for the next post.

Daniel Graham

I'm really enjoying these Big Data blogs. Especially liked "Pig and Hive add data models" --more please. Hope you can review AsterData which merges MapReduce + parallel RDBMS (best of both?
Please add LinkedIn login -- Facebook for adults.

Forcecarrier.wordpress.com

Pretty useful indeed. Should some of these indirect costs also need to be taken into consideration as some of these are proportional to the size of data

- Bandwidth costs in transporting the data to the final destination. Assumption is that the environment where the data is generated is not necessarily the same as the the environment in which it is getting processed.

- Intermediate storage (before the data is transported to final destination S3, EBS etc)

- CPU cost. All the intermediate compute (to run scribe, collectors etc ) required to receive the logs, store it temporarily and send it to the final destination

Verify your Comment

Previewing your Comment

This is only a preview. Your comment has not yet been posted.

Working...
Your comment could not be posted. Error type:
Your comment has been saved. Comments are moderated and will not appear until approved by the author. Post another comment

The letters and numbers you entered did not match the image. Please try again.

As a final step before posting your comment, enter the letters and numbers you see in the image below. This prevents automated programs from posting comments.

Having trouble reading this image? View an alternate.

Working...

Post a comment

Comments are moderated, and will not appear until the author has approved them.