Yahoo's COKE project adds life to front page
By
Robert McMillan
,
IDG News Service
, 07/10/2008
- Share/Email
- Tweet This
- Print
Yahoo could add millions of dollars to its bottom line using a new artificial intelligence system to figure out what news
stories its readers like, a company researcher said Thursday.
In the past few months, Yahoo has been testing a system that helps it identify the most elusive aspect of a story: whether
it is hot or not.
Yahoo Senior Researcher Deepak Agarwal is part of a team of about 10 researchers that has spent the past few months testing
whether their artificial intelligence algorithms can help boost site traffic within the Yahoo Today module -- the part of
the Yahoo.com page that features the top news stories of the day.
Internally, they call the system the Content Optimization Knowledge Engine (COKE), and the geeks who work on the system jokingly
call themselves Cokeheads.
Yahoo's editors already have pretty good instincts about whether a story is hot, and convincing the people in charge of the
front page to let machines make calls about story placement was initially a challenge, said Agarwal, speaking at the New Paradigms in Using Computers conference at IBM's Almaden research facility. "Editors have been doing this for years," he said. "Convincing them that this
would work was not easy."
But the researchers have shown how the system can help humans better understand what's going on with the Web site. They've
created a dashboard that can instantly show editors how certain stories are playing among different groups -- young adults
vs. young boomers, for example.
To solve the hotness problem developers had to go beyond the usual approach to selecting stories, he explained, instead of
basing suggestions solely on factors such as age, gender and location of the reader, and keywords in the article. "We tried
to personalize, sorted by age and gender, but it did not work," he said.
That's because this approach didn't take into account things such as the fact that certain types of stories are more likely
to be clicked at certain times of the day.
So researchers developed a model that takes a variety of factors into account and measures how popular stories are doing almost
instantaneously. "We had to build a dynamic model," Agarwal said.
Collecting and crunching all of this data quickly enough for it to be relevant in a world where stories grow cold after six
hours is the hard part, he said. Building a system that could be used even when traffic demands increased was also a top priority,
he added.
It's still not clear when, or if, COKE will leave the research phase, but the early results show promise. COKE has boosted
click rates by 25% to 30%, Agarwal said. That could translate to several millions of dollars in revenue per year.
The IDG News Service is a Network World affiliate.
Partner Content
www.bmc.com
Gartner 2009 Magic Quadrant for Job Scheduling
Gartner has positioned BMC CONTROL-M in the Leaders Quadrant of their "2009 Magic Quadrant for Job Scheduling." The report assesses the ability to execute and completeness of vision of key vendors in the marketplace. Read a full copy today, courtesy of BMC Software.
Download whitepaper
Dell's SMART Approach to Workload Automation
Read a compelling case study by EMA, Inc. to learn how Dell uses BMC CONTROL-M to cut cost and increase productivity with workload automation.
Download whitepaper
Workload Automation Cost Savings 2 Minute Video
A major computer manufacturer uses BMC CONTROL-M and just four people to schedule and run over 85,000 jobs every month. By switching to BMC CONTROL-M, they more than quadrupled the workload without adding a single staff member. See how in this 2-minute video overview.
Go to video
Comment