Skip Links

MongoDB refines load balancing

The MongoDB data store can now randomize placement of data on different shards to smooth load balancing

By Joab Jackson, IDG News Service
March 19, 2013 11:21 AM ET

IDG News Service - Following the tradition set by recent versions, the new release of the MongoDB NoSQL data store comes with a batch of new features designed to appeal to the enterprise market, including a new built-in search engine, more support for geospatial data and the ability to balance workloads across multiple servers more effectively.

"We're moving more quickly," said Kelly Stirman, 10gen director of product marketing, referring to how MongoDB's growing user base is giving the company more resources. MongoDB 2.4, available Tuesday, was released only six months after the last major version, 2.2 "We've substantially increased the size of the engineering corporation and the company has grown dramatically over the past year," Stirman said.

[ MORE ON DATABASES: It's an unstructured world and Rackspace wants to play in it ]

The company has also updated the commercially supported version of this open source data store, called MongoDB Enterprise.

Since 10Gen began work on MongoDB in 2007, the data store has been downloaded more than 4 million times. The document data store was designed to ingest and read large amounts of data very quickly, and has proved itself to popular in the fields of analysis, content management, mobile and social infrastructure and user data management. 10gen supports more than 600 commercial customers with the enterprise version of the data store, including Craigslist, Disney, Electronic Arts, eBay, Foursquare, Intuit, LexisNexis, MTV, Salesforce.com and Telefonica.

One of the chief new features of MongoDB 2.4 is hashed-based sharding. Sharding takes place when different parts of a data table are spread across multiple servers. Hashed-based sharding randomly distributes new entries across all the available servers. As a result, data is distributed more evenly, minimizing hotspots that occur when too much frequently consulted data -- such as recently captured data -- gets placed on a single server.

"You get a nice distribution of all the documents across all the shards for reads and writes," Stirman said, adding that range-based sharding -- which was the previous default sharding algorithm -- will continue to be available.

The new built-in search engine, still in beta mode, may eliminate the need to maintain an external search engine, such as Apache Lucene/Solr. It offers simple text search, so it does not have all the capabilities of stand-alone search engine, such as natural language processing. But because it is built into MongoDB itself, it is a lot easier to configure and maintain, Stirman said.

"It will be good enough for a lot of applications, and the community will be excited because they won't have to worry about integrating another technology, especially if they are deploying across multiple data centers. With MongoDB, that is pretty easy," Stirman said.

MongoDB 2.4 contains a number of new techniques that may allow developers to make better use of data. One new feature, capped arrays, should find itself useful in interactive Web 2.0 environments. A capped array is an array with a predefined limit of the number of items it can hold. So it could be used for a website for displaying the 20 most popular user comments, for instance.

Latest News
rssRss Feed
View more Latest News