The 10 worst big data practices

See the original posting on JavaWorld

Yes, you can haz big data. However, you can haz it the right way or the wrong way. Here are the top 10 worst practices to avoid.

1. Choosing MongoDB as your big data platform. Why am I picking on MongoDB? I’m not, but for whatever reason, the NoSQL database most abused at this point is MongoDB. While MongoDB has an aggregation framework that tastes like MapReduce and even a (very poorly documented) Hadoop connector, its sweet spot is as an operational database, not an analytical system.

When your sentence begins, “We will use Mongo to analyze …,” stop right there and think about what you’re doing. Sometimes you really mean “collect for later analysis,” which might be OK, depending on what you’re doing. However, if you really mean you’re going to use MongoDB as some kind of sick data-warehousing technology, your project may be doomed at the start.

To read this article in full or to leave a comment, please click here