Online dating: The technology behind the attraction

When Joe wanted to find love, he turned to science.

Rather than hang out in bars or hope that random dates worked out, the 34-year-old aerospace engineer signed up for eHarmony.com , an online dating service that uses detailed profiles, proprietary matching algorithms and a tightly controlled communications process to help people find their perfect soul mate.

Avatars tackle the first date for you

Blocking the bad guys

Your profile's long, scary shelf life

Over a three-month period last fall, Joe found 500 people who appeared to fit his criteria. He initiated contact with 100 of them, corresponded with 50 and dated three before finding the right match. He's now happily in a relationship, and although he was skeptical at first, he says high tech played a big role in his success.

Internet dating sites are the love machines of the Web, and they're big business. eHarmony and similar sites drew 22.1 million unique visitors during just one month, December 2008, according to comScore Media Metrix .

And unlike many social networking sites, they actually make money -- the top sites bring in hundreds of millions per year, mostly in subscription fees.

These online dating services run on a curious mix of technology, science (some say pseudoscience), alchemy and marketing. Under the covers, they combine large databases with business intelligence, psychological profiling, matching algorithms and a variety of communications technologies (is your online avatar ready for a little virtual dating?) to match up lonely singles -- and to convert one-time visitors into paying monthly subscribers.

All is not chocolates and roses online, however. Security is one big challenge for e-dating services, which can attract pedophiles, sexual predators, scammers, spammers and plain old liars -- most notably, people who say they're single when in fact they're married. And sticky questions have yet to be answered over what rights such sites have to your personal information -- how they use it to market other services to you, if and how they share it with advertisers, and how long they store it after you've moved on.

Finally, there's the biggest question of all -- do these tech-driven, algorithm-heavy sites work any better to help people find true love than the local bar, church group or chance encounter in the street?

Armed with these questions, a passably decent head shot, and a very patient wife, I set out to discover what's under the covers in the world of online dating.

The business model behind online dating

A well-oiled Internet dating machine can generate well in excess of $200 million a year in a market that's expected to top $1.049 billion in 2009 -- only gaming and digital music sites generate higher revenues -- and is expected to grow at a rate of 10% annually, according to Forrester Research .

Most online dating sites generate the bulk of that revenue from subscriptions, although free, advertising-supported sites are starting to gain some ground.

In fact, Plenty of Fish , a free service, was the second-most-visited online dating site last year, behind Singlesnet, according to Hitwise , a Web site traffic monitoring service.

Most dating sites allow users to sign up and create a profile for free.

Before communicating with matches, however, visitors must sign on as a paying member.

To succeed, a site needs to do the following:

Offer excellent response times. People want instant gratification, so the sites try to give users at least some matches as soon as they've created an account and completed their profiles.

Convert at least 10% of visitors who register into paying customers -- preferably more.

Deliver an acceptable range of probable matches and offer a variety of ways to pursue those prospects, including high-tech developments from video chat to photo-realistic avatars.

Keep the quality of the prospect pool high by weeding out inactive and misbehaving users and by blocking the 10% or more of new accounts every day that are estimated to be scammers, con artists, criminals, sexual predators and other undesirables that can overwhelm a site and drive away paying customers.

The battle isn't over once a service has its inventory in place and has paying customers. The business needs to keep priming the pump to bring on new subscribers because the typical customer -- one of the 10% who actually pay -- stays on less than three months.

But one man's folly is another man's fortune: A large percentage of customers fall off the love wagon after finding their "one true love." They keep coming back over and over again, producing a revenue stream that has a very long tail, says Herb Vest, CEO and founder of the dating site True.com .

Step 1: A perfect match, served up fast

Online dating sites take two basic approaches to provide users with matches.

Online personals services such as Yahoo Personals (which costs $29.99 for one month, $59.97 for three months or $95.94 for six months), are glorified search engines -- big, searchable databases. Users fill out a short profile with check-box items and short descriptions about themselves.

They then narrow down the search by filtering prospects using criteria such as gender, ZIP code, race, religion, marital status and whether or not a person is a smoker. Users filter through the results themselves, deciding on their own which prospects to pursue.

The "scientific" matching services, such as eHarmony (which costs $59.95 for one month, $119.85 for three or $179.70 for six), PerfectMatch and Chemistry.com, attempt to identify the most compatible matches for the user by asking anywhere from a few dozen to several hundred questions. The services then assemble a personality profile and use that against an algorithm that ranks users within a set of predefined categories; from there, the system produces a list of appropriate matches.

Some sites take a hybrid approach. PerfectMatch.com, for example, issues recommended picks but also lets customers browse the "inventory" for themselves.

The technology that powers these dating sites ranges from incredibly simple to incredibly complicated. Unsurprisingly, eHarmony has one of the most sophisticated data centers. Joseph Essas, vice president of technology, says the company stores 4 terabytes of data on some 20 million registered users, each of whom has filled out a 400-question psychological profile (eHarmony's founder is a clinical psychologist).

The company uses proprietary algorithms to score that data against 29 "dimensions of compatibility" -- such as values, personality styles, attitudes and interests -- and match up customers with the best possible prospects for a long-term relationship.

A giant Oracle 10G database spits out a few preliminary candidates immediately after a user signs up, to prime the pump, but the real matching work happens later, after eHarmony's system scores and matches up answers to hundreds of questions from thousands of users. The process requires just under 1 billion calculations that are processed in a giant batch operation each day. These MapReduce operations execute in parallel on hundreds of computers and are orchestrated using software written to the open-source Hadoop software platform.

Once matches are sent to users, the users' actions and outcomes are fed back into the model for the next day's calculations. For example, if a customer clicked on many matches that were at the outset of his or her geographical range -- say, 25 miles away -- the system would assume distance wasn't a deal-breaker and next offer more matches that were just a bit farther away.

"Our biggest challenge is the amount of data that we have to constantly score, move, apply and serve to people, and that is fluid," Essas says. To that end, the architecture is designed to scale quickly to meet growth and demand peaks around major holidays. The highest demand comes just before Valentine's Day. "Our demand doubles, if not quadruples," Essas says.

PerfectMatch.com, which claims to have 5 million members, uses a matching algorithm, but its psychological test is shorter than that required by eHarmony. "We wanted to take the basic concept of the Myers-Briggs indicator and apply that to relationships," says Founder and CEO Duane Dahl. The core architecture of the system consists of five front-end Web servers and a large, back-end SQL Server database, plus a variety of servers that handle messaging, marketing and other functions. The matching process is immediate.

True.com also offers "scientific compatibility" matching based on how users answer about 200 questions. The site uses about 200 servers, including a 64-bit, 32-processor Unisys server running Microsoft SQL Server. The matching algorithm's calculations are performed on an array of 64-bit servers that hold a compressed version of the entire multi-terabyte database in memory to facilitate fast matching. "The system can shoot back [matches] with little or no delay," says CEO Vest.

On the other end of the spectrum, Plentyoffish.com's philosophy is to keep it simple. The service focuses on searching and filters: It uses a short questionnaire, and while it does offer some matching capabilities if users want them, CEO Markus Frind says he doesn't promote them -- and he is disdainful of the complex matching algorithms offered by some competitors.

The business operates on just three Web servers, five messaging servers and five database servers (the entire database is just 200GB in size), yet it serves up 200 billion pages a month to some 12 million users. "My entire cost is only a few hundred thousand dollars a year," says Frind. The biggest piece isn't the technology, he says, but the bandwidth required to keep traffic to the site flowing smoothly.

Step 2: From "just looking" to "paying customer"

When it comes to converting users to paid subscribers, the battle is all uphill in an industry in which more than 90% of users never pay a dime. That's where having extensive demographic and psychological data on customers comes in handy.

In fact, online dating sites are so adept at using personal data, potential customers can be forgiven for wondering just who is being "matched up" -- two strangers bent on true love, or lonely customers and the matchmaking site that needs them. (See Online dating: Your profile's long, scary shelf life for details on the ways dating sites mine the data they collect.)

Yahoo Personals uses all of the information at its disposal to tailor its sales pitch to the user. "We try to take advantage of what we know about the user and where they are in their level of engagement with the product," says Ellen Perelman, general manager.

Once users sign up for a free account and fill out a short questionnaire, Yahoo uses targeted messaging to push them through a "conversion tunnel." The messages that users see to persuade them to sign on as paying customers vary depending on the user's profile and his or her behavior on the site.

Similarly, PerfectMatch.com puts users on different "message tracks" based on their profile and what they're doing on the site at any given time. "Everything you do or don't do triggers a response," says Dahl. "We take the information and do a comparative analysis on the fly to serve up the best possible offers to you based on your profile."

Users who aren't "taking full advantage of the site" -- who haven't posted a photo, for example, or have failed to review all their matches -- are targeted by the system. "You will get an e-mail message custom to your situation, encouraging you to perform the action needed," Dahl says.

eHarmony, which has the most comprehensive user profiles, may be the most sophisticated in the ways in which it leverages that information. It pulls information -- more than a terabyte of data each day -- from its Oracle database into high-performance Netezza data warehouse appliances that slice and dice users into behavioral and demographic "buckets."

"We use [Netezza] to do a lot of offline calculations to try to understand patterns and business intelligence about user behavior," explains Essas. Some of that feeds back into the matching process, but it also helps eHarmony persuade users to subscribe to its service. "Because we know more about them, we can target them much better," says Essas. Messaging is tailored to each user's behavior on the site -- and their personality type.

Step 3: Make a high-quality connection

Once users have paid for a subscription, online dating sites offer different tech-driven options for contacting and getting to know prospective dates, everything from chat rooms to instant messaging, e-mail and even video chat.

eHarmony controls the process by moving users through a series of proscribed communication steps on its Web site. The idea is to make users of the site comfortable with each other, but sometimes the technology just gets in the way, or backfires, users say.

Mary, a 45-year-old executive for a large IT consultancy, says the process of moving from eHarmony's prewritten questions and responses to online chat to e-mail to telephone can be tedious when what you really want is to meet someone. "You continuously go through this job interview." Then, after all that, people will suddenly cut off communications. "What happened?" she asks.

1 2 Page 1
Page 1 of 2
The 10 most powerful companies in enterprise networking 2022