Open Source Big Data Reporting & ETL show promise
With Hadoop/HBase/Hive, Cassandra, etc. you can store and manipulate petabytes of data. But what if you want to get nice-looking reports or compare data held in a NoSQL solution with data held...
Big Data Apps and Big Data PaaS
Enterprises no longer lack data. Data can be obtained from everywhere. The hard part is converting data into valuable information that can trigger positive actions. The problem is that you...
Trident Storm, Real-Time Analytics for Big Data
I already mentioned Storm in a previous post. Trident is an extension of Storm that makes it an easy-to-use distributed real-time analytics framework for Big Data. Both Trident and Storm were developed...
Hadoop for Real-Time: Spark, Shark, Spark Streaming, Bagel, etc. will be...
The website defines Spark as a MapReduce-like cluster computing framework designed to support low-latency iterative jobs. However it would be easier to say that Spark is Hadoop for real-time. Spark...
Mesos: Your next highly distributed Cloud architecture framework
I initially complained about the complexity of installing Mesos when I was playing around with Spark and Shark. However, when I saw the Twitter Mesos and Framework presentation, I understood why Mesos...
Scaling Machine Learning
There is still a vacuum for easy and scalable solutions in the machine learning space. At the moment everybody is talking about Hadoop as the de facto standard for Big Data. Unfortunately...
The Big Data Revolution is likely to hit Gartner’s “trough of...
Big Data is hyped right now. Everything that comes close to Hadoop or NoSQL turns into gold! Unfortunately, we are getting close to Gartner’s “Peak of Inflated Expectations”. Hadoop does an excellent...
Solving the pressing need for Linux talent…
The Linux Foundation recently shared the infographic below. Click on it and you get the associated report. The short message is: if you are an expert in Linux, you are in high demand because companies...
A Layman’s Guide to the Big Data Ecosystem
Charles “Chuck” Butler, a colleague at Canonical, wrote a very nice blog post explaining the basics of Big Data. It not only explains them but also allows anybody to set up Big Data solutions...
A Big Data-Base that is fast but inaccurate: BlinkDB
The idea might sound strange at first. Why would you want a database that delivers inaccurate data? The answer is that BlinkDB trades accuracy for speed. When you query data you can specify when you want the...