Posts tagged ‘hadoop’

Hadoop & Spark are Dominating Big Data, But the Market Demands Even More

According to industry insider and InfoWorld columnist Andy Oliver, what you need to know about Hadoop is that it is no longer Hadoop. At least, it isn’t the Hadoop that everyone once knew and may or may not have loved. Hadoop’s co-creator Doug Cutting believes that the changes are a direct result of the open source roots of Hadoop and related projects, most notably Spark. Together, Hadoop and Spark are dominating the big data marketplace, with Hadoop commanding half of big data’s $100 billion annual market value, and Spark surpassing MapReduce in terms of popularity (at least among those searching for big data products on Google). While Hadoop is the go-to big data framework and Spark reigns supreme when it […] Continue Reading

Why Hadoop Is Important in Handling Big Data?

Hadoop is changing the perception of handling Big Data especially the unstructured data. Let’s know how Apache Hadoop software library, which is a framework, plays a vital role in handling Big Data.  Apache Hadoop enables surplus data to be streamlined for any distributed processing system across clusters of computers using simple programming models. It truly is made to scale up from single servers to a large number of machines, each and every offering local computation, and storage space. Instead of depending on hardware to provide high-availability, the library itself is built to detect and handle breakdowns at the application layer, so providing an extremely available service along with a cluster of computers, as both versions might be vulnerable to failures. […] Continue Reading

Are We Stuck At Big Data Base Camp?

Big Data. It’s been the subject on businesses’ lips for some time now due to its potential to improve performance across the board, delivering better results and increasing revenues. This is an unprecedented opportunity in today’s tough economic times, so why haven’t we seen more businesses shouting about the benefits they’ve seen on their Big Data journeys? Is it simply a case of businesses keeping their cards close to their chest as they work to steal a march on competitors? Or perhaps organisations still haven’t rolled out their Big Data projects so have nothing to shout about? Research from Oracle and Quocirca revealed that only an elite group of businesses fully understand the concept of Big Data in Europe and […] Continue Reading

#bdw12 Jairam Chandar “Hadoop at DataSift”

Jairam Chandar from DataSift talkin “Hadoop at DataSift” The Apache Hadoop software library is a framework that allows for the distributed processing of large data sets across clusters of computers using simple programming models. It is designed to scale up from single servers to thousands of machines, each offering local computation and storage. Rather than rely on hardware to deliver high-avaiability, the library itself is designed to detect and handle failures at the application layer, so delivering a highly-availabile service on top of a cluster of computers, each of which may be prone to failures. Find out more about Hadoop

#bdw12 – Check out “Data Stories” – Podcast

DATA STORIES is a bi-weekly podcast on data visualization with Enrico Bertini and Moritz Stefaner. Their  first guest on the show is Andy Kirk who is on our all star panel on April 24th for the Big Data week community meetup, also Moritz just came back from Malofiej 20, the Infographic World Summit, and we used the chance to discuss our impressions of the event — the conference, the awards, the workshops and the general vibe. On this show Andy Kirk discusses Big Data week and his involvement with it how he is supporting the event itself and also how data visualisation has exploded over the last two years and interesting debate on the Infographic World summit. LISTEN to the […] Continue Reading

Show Buttons
Hide Buttons