HOME PAGE FOR THE WORLD'S BUSINESS LEADERSFree Trial Issue 
CIO NetworkDigital EntertainmentEnterpriseImagingIntelligent InfrastructureInternetPersonal TechSciencesWireless

  
Research
Two Thumbs Up
Leah Hoffmann, 11.15.05, 10:00 AM ET

 By This Author
Leah Hoffmann
• First Job: Steven Pinker
• First Job: Todd Hanson
• Can A Computer Read You Like A Book?
More Headlines

Related Quotes
32.69- 0.07
413.30- 2.40
82.92- 0.18
28.03+ 0.09

Most Popular Stories
What Would An iPhone Look Like?
401(k)s In The Crosshairs
Fictional Savings
California Charges Dunn
Dow's Still Trucking

New York -

Ever searched Google for advice on vacation destinations? Unless you're traveling to an obscure village in Siberia, chances are your search turned up thousands of relevant Web sites. Looking at all of them would be tedious and time consuming. But what if there were a way to sort them based on whether or not they're favorable?

As it turns out, there is. "Sentiment analysis," as the field of research is known, is a hot topic among computer scientists these days. The goal is to create computer programs that can determine whether a document is positive or negative. And corporations, like IBM (nyse: IBM - news - people ), Microsoft (nasdaq: MSFT - news - people ), Google (nasdaq: GOOG - news - people ), and Amazon.com (nasdaq: AMZN - news - people ), are paying attention to the results. Successful applications could help automate market and product research and dramatically alter the future of a simple Internet search.

The most common approaches begin by identifying certain "indicator words" within a text--words like "good," "bad" or "beautiful"--that convey positive or negative emotions. But that's not as easy as it sounds.

"The variety of words that people use for subjective expressions is staggering," says Janyce Wiebe, a professor of computer science at the University of Pittsburgh. Wiebe and her colleagues have already assembled a dictionary of some 8,000 indicator words and phrases.

"The dictionary tells you whether a word is positive or negative when it's taken out of context," Wiebe explains. "The challenge is to figure out whether it's positive or negative in each individual instance."

There are a number of different ways to accomplish this. Wiebe uses a program that can--with assistance from humans who "train" it to recognize the right answers--learn how context impacts the meanings of words.

Peter Turney, a researcher at the Canadian National Research Center's Institute for Information Technology, uses his own dictionary of indicator words to assign each of a document's adjectives a positive or negative value. He then averages these values to create an overall opinion score. Accuracy rates for these methods range from 70% to 85%, depending on the kind of document.

If you're trying to classify documents within a specific field--reviews for a particular product, for instance--you can increase accuracy by customizing your indicator words. (After all, a "hot" refrigerator is bad, but a "hot" nightclub is not.) That's the approach taken by Norwegian search company Fast Search and Transfer ASA, which unveiled a customizable sentiment analysis program, Marketrac, last year. FAST has already licensed Marketrac to more than a dozen clients, who use it to keep track of everything from online hotel reviews to the speeches of European business leaders. Licensing fees for the program are more than $100,000 per year.

Other potential applications in the field of sentiment analysis include automated flame detectors for online bulletin boards, tracking systems for stock market reports and programs that monitor movie or product reviews. You may also one day be able to do a simple Web search to find out what people are saying about a given issue.

"When people begin expressing their emotions, their language gets very florid and very complicated, very quickly," says Lillian Lee, a computer science professor at Cornell University. It will be a few years before scientists are able to refine their algorithms and get greater accuracy with a range of documents. Once they do, however, it's a safe bet that companies will be giving them a thumbs up.

Want to track news by this author or about this industry? Forbes Attache makes it easy. Click here.




More On This Topic
Companies: GOOG | IBM | MSFT | AMZN

Article Controls
E-mail | Print | Comments | Request Reprints | E-Mail Newsletters | My Yahoo! | RSS


Related Sections
Home > Technology > Innovation


Today On Forbes.com
Fictional Savings

The alliance between General Motors and Renault/Nissan was never likely. Here's why.

StreetTalk: Forget The Record Books
StreetTalk: Forget The Record Books
  The Next YouTube?
The Next YouTube?
  Mad. Ave Goes (Soft) Porn
Mad. Ave Goes (Soft) Porn
  Feats Of Clay
Feats Of Clay

News Headlines | More From Forbes.com | Special Reports    
Subscriptions >

Free Trial Issue of Forbes Forbes Gift Subscription
Subscribe To Newsletters Subscriber Customer Service
Buy Audio Version of Forbes



  
E-Mail Alerts


Companies
 Amazon.com
 Microsoft
 IBM
 GOOG
Topics
 Software
 E-Commerce
 Internet
 Research And Development
Enter E-Mail Address:


FAQ  Privacy Policy

Free Trial Issue
Gift Subscriptions

  
Trading Center
Brought to you by the sponsors below

 
 

Free Credit Report more >

Credit Reporting & Monitoring:
Discover how it can protect your credit well-being.
First Name
Last Name


ForbesAutos.com


Find the Hybrid that's Right for You in The World's Largest
Luxury Showroom


CEO Book Club more >
Book Review
The Philosopher Kings Of Hedging
Book Review
The Philosopher Kings Of Hedging
Robert Lenzner
Steven Drobny reveals insights from the hedge fund all-stars.

Search Books

 
 
Advanced Search
 
 
New & Notable

 
    
 
    
Blank Slate more >
Blank Slate
What if you could pick one thing and start over from scratch? What would you change?


SitemapHelpContact UsInvestment NewslettersForbes ConferencesForbes MagazinesForbes Autos
Ad Information   Forbes.com Wireless   RSS   Reprints/Permissions   Subscriber Services  
© Forbes.com Inc.™   All Rights Reserved   Privacy Statement   Terms, Conditions and Notices


Stock quotes are delayed at least 15 minutes for Nasdaq, at least 20 minutes for NYSE/AMEX. U.S. indexes are delayed at least 15 minutes with the exception of Nasdaq, Dow Jones Industrial Average and S&P 500 which are 2 minutes delayed.


Powered By