|
Improving your ability to find information online
Conceptual Indexing
| ||||||||||||||||||||||||||||||||
Can't find what you want? |
How often have you failed to find what you wanted in an online search because the words you used failed to match words in the material that you needed? Concept-based retrieval systems attempt to reach beyond the standard keyword approach of simply counting the words from your request that occur in a document. The Conceptual Indexing Project is developing techniques that use knowledge of concepts and their interrelationships to find correspondences between the concepts in your request and those that occur in text passages. Our goal is to improve the convenience and effectiveness of online information access. |
The Paraphrase Problem |
The central focus of this project is the "paraphrase problem," in which
the words used in a query are different from, but conceptually related to,
those in material that you need.
For example, in a collection of articles by James Fallows, then Washington
editor of the Atlantic Monthly, the query, "change in the deficit,"
results in several relevant passages including "Last year's reductions in tax
rates are part of the reason for the deficits, as are the administration's
plans for a sustained military buildup." Based on this passage a user can
then decide whether to read the rest of the article. While the query and
the passage convey similar ideas, the wording in each is different, a typical
case of the paraphrase problem.
In addressing the paraphrase problem, three challenges must be met:
|
Elements of the Technology |
The technology, which is called "Precison Content Retrieval," is composed of two parts:
|
| Introduction | Key Ideas | Examples |
| Benefits | Papers | People |
For more information about the Conceptual Indexing Project, contact indexing-info@east.sun.com.
Sun Microsystems Laboratories (Burlington, MA)
Knowledge Technology Group, Sun Microsystems Laboratories