Machine learning fuels Sun music recommendation technology
Open source project aims to overhaul music search
By
Jon Brodkin
,
Network World
, 10/31/2007
- Share/Email
- Tweet This
- Print
Software that listens to and analyzes music is driving a Sun open source project, which aims to build a music recommendation system that surpasses the systems used today by iTunes and Amazon.
Automated recommendations today are typically based not on analysis of the music, but on who’s listening to which songs, said
Paul Lamere, principal investigator of Sun’s Search Inside the Music. If one customer buys a Beatles album, he’ll be told what other music has been purchased by fellow Beatles fans.
Listeners’ preferences and buying habits vary widely, so these systems lead to some wacky recommendations. Lamere said one
popular music service has told people who purchased the Britney Spears hit “Baby One More Time” that they might also enjoy
an audio recording of the report on prewar intelligence by the U.S. Senate intelligence committee.
“It’s kind of a funny recommendation, and you have to think there’s something broken here,” Lamere said Tuesday at the Sun
Labs Innovation Update in Burlington, Mass., where he demonstrated the Sun music search technology. Sun officials say they
plan to release the software as open source, perhaps within six months.
Sun has built “one of the best music similarity algorithms” that’s based on the actual sound, with machine learning that analyzes features such as frequency and beats per minute to map out the rhythm structure, and determine the genre and
which instruments are playing, Lamere said. Sun has taken advantage of prior research into speech recognition technology to
tease out the features that correspond with the timbre of music and can be measured with computers, he said.
This technology could level the playing field between popular artists and newcomers who are trying to get attention in the
increasingly crowded World Wide Web. It’s hard to find new and relevant music, Lamere said, because there are millions of
tracks online and that number will expand into the billions, with the Internet acting as a repository for “the entire history
of all recorded music.”
“Recommendation technology is key,” Lamere said. “The Web is going to be filled with billions of tracks and there’s going
to be millions of tracks arriving every week. The question is, when you have a million songs in your in-box, how are you going
to find something you really like?”
The project is reminiscent of Pandora, a free Internet radio service that recommends songs based on the results of the Music Genome Project, which analyzes sound
based on hundreds of musical attributes.
Sound recognition technology is just one piece of Sun’s project, though. Sun’s other innovation is a tagging system that categorizes
music based not on who’s purchased it but on its attributes, described with tags like “quirky,” “indie,” “rock,” “fast,” “frenzied,”
“90s,” or “cute” and “fun.”
Sun is compiling these tags by searching reviews, lyrics, music blogs, social tagging sites and artist biographies, and incorporating
the information into a prototype search engine Lamere demonstrated on Tuesday. Compiling the tags based on a comprehensive
search of the Web prevents people from gaming the system by generating their own tags to enhance the popularity of certain
tracks, he said.
Comment