- How to use electrical outlets and cheap lasers to steal data
- The botnet world is booming
- NTIA seeks volunteers to review broadband applications
- The 10 dumbest mistakes network managers make
- What's driving this university to IPv6? Going green
Yahoo could add millions of dollars to its bottom line using a new artificial intelligence system to figure out what news stories its readers like, a company researcher said Thursday.
In the past few months, Yahoo has been testing a system that helps it identify the most elusive aspect of a story: whether it is hot or not.
Yahoo Senior Researcher Deepak Agarwal is part of a team of about 10 researchers that has spent the past few months testing whether their artificial intelligence algorithms can help boost site traffic within the Yahoo Today module -- the part of the Yahoo.com page that features the top news stories of the day.
Internally, they call the system the Content Optimization Knowledge Engine (COKE), and the geeks who work on the system jokingly call themselves Cokeheads.
Yahoo's editors already have pretty good instincts about whether a story is hot, and convincing the people in charge of the front page to let machines make calls about story placement was initially a challenge, said Agarwal, speaking at the New Paradigms in Using Computers conference at IBM's Almaden research facility. "Editors have been doing this for years," he said. "Convincing them that this would work was not easy."
But the researchers have shown how the system can help humans better understand what's going on with the Web site. They've created a dashboard that can instantly show editors how certain stories are playing among different groups -- young adults vs. young boomers, for example.
To solve the hotness problem developers had to go beyond the usual approach to selecting stories, he explained, instead of basing suggestions solely on factors such as age, gender and location of the reader, and keywords in the article. "We tried to personalize, sorted by age and gender, but it did not work," he said.
That's because this approach didn't take into account things such as the fact that certain types of stories are more likely to be clicked at certain times of the day.
So researchers developed a model that takes a variety of factors into account and measures how popular stories are doing almost instantaneously. "We had to build a dynamic model," Agarwal said.
Collecting and crunching all of this data quickly enough for it to be relevant in a world where stories grow cold after six hours is the hard part, he said. Building a system that could be used even when traffic demands increased was also a top priority, he added.
Partner Content
Explore the Ultrium Edge
The powerful tape technology can address data security with tape encryption as well as long term data protection.
Find Out More
Disk and Tape Square Off
Discover what disk and tape really cost and which solution provides lower total cost of ownership and optimizes energy use for your organization
Download this White Paper
Don't Fall for the Myths
The Clipper Group explores the truth behind the myths of tape, digging into the misconceptions in the disk vs. tape debate.
Review this information
information examination
An examination of information security issues, methods and securing data with LTO-4 tape drive encryption
Read this analysis
Comment