Americas

  • United States
brandon_butler
Senior Editor

Amazon: New analytics tool can scrutinize massive amounts of data

Feature
Nov 15, 20133 mins
Amazon Web ServicesAmazon.comCloud Computing

Amazon Web Services rolls out Kenesis can handle up to terabytes of data an hour

Amazon Web Services this week rolled out a new cloud-based data analytics tool named Kenesis, which can analyze massive amounts of data in real time and be paid for by the hour.

Kenesis is an application that sits in the cloud and receives data from any number of sources: databases within Amazon’s cloud, like warehouse tool Redshift; NoSQL database DynamoDB; or relational database RDS. It then performs analytics on the data and spits out returns on the data. AWS developed the program using its own combination of hardware and software. The system is scalable too, able to handle up to terabytes of data an hour from potentially thousands of sources.

[MORE AWS: Amazon ratchets up its enterprise focus]

In announcing the tool at re:Invent, the company’s customer conference, AWS showed off how Kenesis can be used to analyze thousands of updates to Twitter in real-time, allowing queries to be performed on the data. For example, Kenesis was able to pinpoint the most popular word that was tweeted within an hour-long timespan of Tweets that were uploaded into the system. The data that Kenesis generates can then be offloaded into one of Amazon’s storage platforms like Simple Storage Service (S3). It could also be used to analyze real-time financial transactions, in-bound marketing or metering data, for example.

The new service compliments data analysis tools that AWS already has. RedShift, for example, has the ability to run analyses on data stored there, but it’s meant for longer-term data that is stored in its cloud. Kenesis is meant for rapid, real-time analysis of data.

Kenesis also fits in well with a growing number of Amazon partner companies who offer tools to help make sense of data that AWS analyzes. Jaspersoft, for example, is a company that can take the results of queries that RedShift has done and create visualizations from it and set up alerts. That sort of platform is a natural fit for being able to provide customers actionable insight from analysis that AWS performs.

The move represents AWS’s continued push into giving customers more options for analyzing their data as well. AWS already has a Hadoop system named Elastic Map Reduce (EMR), which is a pay-by-the-hour Hadoop cluster. S3 has scaled to store literally trillions of objects in AWS’s cloud. Having new tools to be able to run analytics jobs on all that data is an area experts were expecting AWS to make announcements in at re:Invent.

The service was released in limited preview starting today.

Senior Writer Brandon Butler covers cloud computing for Network World and NetworkWorld.com. He can be reached at BButler@nww.com and found on Twitter at @BButlerNWW. Read his Cloud Chronicles here.  http://www.networkworld.com/community/blog/26163

brandon_butler
Senior Editor

Senior Editor Brandon Butler covers the cloud computing industry for Network World by focusing on the advancements of major players in the industry, tracking end user deployments and keeping tabs on the hottest new startups. He contributes to NetworkWorld.com and is the author of the Cloud Chronicles blog. Before starting at Network World in January 2012, he worked for a daily newspaper in Massachusetts and the Worcester Business Journal, where he was a senior reporter and editor of MetroWest 495 Biz. Email him at bbutler@nww.com and follow him on Twitter @BButlerNWW.

More from this author