Network World

MIT baby-talk project spawns massive IP SAN

MIT Media Lab builds a 1.4-petabyte SAN to study how babies learn to talk.
By Lucas Mearian , Computerworld , 05/16/2006

Imagine a storage array with the capacity of a stack of iPods three times the height of the Empire State Building, yet one that can be managed with common Ethernet networking tools. That is what a group of MIT scientists and four storage vendors are building.

The storage array will support an MIT Media Lab effort called the Human Speechome Project, which is studying how babies develop the ability to talk. The project started three months ago when MIT associate professor Deb Roy began recording his baby's everyday life with 14 fish-eye lens cameras set up throughout his house, giving researchers a bird's-eye view of every room.

To store and then process the video and audio data, a massive storage-area network (SAN) was needed to archive and search what is expected to be 1.4 petabytes (1,400TB) of data over the span of the three-year project.
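As a rough sense of scale, the capacity figure can be turned into an average daily ingest rate. The sketch below uses only the 1.4-petabyte and three-year figures from the article; the decimal units, the 365-day year, and the assumption that data accumulates evenly are illustrative choices, not details reported by the project.

```python
# Figures from the article: 1.4 PB expected over a three-year project.
# Assumptions (not from the article): decimal units (1 PB = 1,000 TB),
# 365-day years, and data that accumulates at a uniform rate.
TOTAL_PB = 1.4
PROJECT_YEARS = 3
TB_PER_PB = 1000
DAYS_PER_YEAR = 365


def average_daily_ingest_tb(total_pb: float, years: float) -> float:
    """Return the average number of terabytes stored per day."""
    total_tb = total_pb * TB_PER_PB
    return total_tb / (years * DAYS_PER_YEAR)


daily_tb = average_daily_ingest_tb(TOTAL_PB, PROJECT_YEARS)
print(f"{daily_tb:.2f} TB per day on average")
```

Under those assumptions the archive grows by a little over a terabyte a day. The real rate would vary with recording hours and with how much derived data each analysis adds.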

The SAN is being built from commodity hardware and uses a 10 Gigabit Ethernet IP network for data transfer between the backend SAN and hundreds of servers.

"I think here what we're seeing is what the future of storage is going to be like. This is a great marriage between industry and the academic world," said Frank Moss, director of the Media Lab and a former CEO of Tivoli Systems, a maker of storage management software now owned by IBM.

Moss spoke at a press conference held Monday at MIT's Media Lab in Cambridge, Mass.

The Human Speechome Project computing infrastructure is expected to be composed of more than 300 Hammer Z-Rack storage enclosures from Bell Microproducts, about 3,000 SATA (Serial Advanced Technology Attachment) hard disk drives from Seagate Technology, and 400 blade processors and more than a hundred 10 Gigabit Ethernet switches from Marvell Technology Group Ltd.

The high-throughput switches are needed for the storage I/O anticipated by researchers, who expect to process 700TB of data during every 12-hour analytical run. To meet the performance requirements, 150-drive stripes (aggregated virtual volumes) will be created using the native virtualization capabilities of Bell's Z-SAN. Protection against data loss will be delivered through RAID 10 mirrors (duplicate copies) of the raw video data, transform data, and metadata files.
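The 700TB-per-run figure implies a sustained transfer rate that helps explain the need for 10 Gigabit Ethernet. The sketch below is a back-of-the-envelope calculation, not the project's own sizing: it assumes decimal units, that each byte crosses the network once per run, and that links run at full line rate with no protocol overhead. The function names are hypothetical.

```python
import math

# Figures from the article: 700 TB processed in each 12-hour run,
# carried over 10 Gigabit Ethernet.
# Assumptions (not from the article): decimal units, each byte moved
# once per run, and links running at full line rate with no overhead.
TB = 10**12
DATA_PER_RUN_BYTES = 700 * TB
RUN_SECONDS = 12 * 3600
LINK_BITS_PER_SECOND = 10 * 10**9


def sustained_bytes_per_second(total_bytes: int, seconds: int) -> float:
    """Average rate needed to move total_bytes within the time window."""
    return total_bytes / seconds


def min_links(bytes_per_second: float, link_bits_per_second: int) -> int:
    """Smallest number of fully utilized links that can carry the rate."""
    return math.ceil(bytes_per_second * 8 / link_bits_per_second)


rate = sustained_bytes_per_second(DATA_PER_RUN_BYTES, RUN_SECONDS)
links = min_links(rate, LINK_BITS_PER_SECOND)
print(f"{rate / 10**9:.1f} GB/s sustained")
print(f"at least {links} fully utilized 10 Gigabit Ethernet links")
```

Under these assumptions the run needs roughly 16 GB/s of aggregate throughput, more than a single 10 Gigabit Ethernet link can carry, so the load has to be spread across many links and drives in parallel.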
