Query-free Information Retrieval | ||
![]() | ||
| Peter E. Hart and Jamey Graham | ||
| California Research Center | ||
| Ricoh Innovations, Inc. | ||
| 2882 Sand Hill Road, Suite 115 | ||
| Menlo Park, CA 94025 | ||
| Email: fixit@crc.ricoh.com | ||
| Abstract | ||
|
We introduce query-free information retrieval, a paradigm in
which queries are constructed autonomously and information relevant to
a user is offered without explicit request. Query-free methods offer
an apparently new approach for integrating knowledge-based
applications with legacy databases. We describe a fielded system,
FIXIT, which integrates an expert diagnostic system with a
pre-existing full-text database of maintenance manuals. The reported
results suggest that query-free information retrieval can liberate the
user from burdensome information retrieval activities while incurring
only modest system development costs and minimal run-time
costs.
| ||
Expert systems designers in particular face these problems because modern expert systems are seldom fielded as isolated, independent software applications. Instead, they are most frequently embedded in some larger environment that presents both opportunities and challenges for sub-system integration.
This paper reports work aimed at addressing some of these questions in a specific system context. We describe a fielded system, FIXIT, that integrates a probabilistic expert system with a full-text database of maintenance documentation. This legacy database is the online source of the actual printed manuals used by technical support personnel. In FIXIT, it entirely replaces the ``help-text'' found in conventional expert systems with far more comprehensive and up-to-date reference material.
FIXIT users are either service technicians or customer support representatives. With appropriate knowledge bases, they might be either troubleshooting a complex copier or helping a customer solve a less-complicated problem over the phone. In either case, because the expert system directly supports the user's diagnostic goal, we regard it as the base application.
The dialog between user and base application establishes a context and from time to time generates a need for additional diagnostic and repair information. In principal, such information could be made available directly from the on-line database, and a motivated user might wish to access the database directly and search for relevant topics. Of course, this requires familiarity with access paths and query procedures and also might be a distraction from the primary task. Our challenge is to provide the user with the benefits of a combined expert and full-text database system without requiring knowledge of query procedures, without distracting attention from the diagnostic task, and without imposing noticeable runtime costs.
We have approached this challenge by introducing an intelligent agent that analyzes interactions between user and expert system and automatically constructs database queries based on this analysis. The user is unobtrusively notified when information relevant to the current diagnostic context has been returned, and may immediately access it if desired. From the user's perspective all database machinery is entirely transparent; indeed no formal query language is even made available. Hence we term this approach query-free information retrieval.
As we hope will be apparent from what follows, the introduction of the intelligent agent additionally offers one solution to a fundamental problem facing designers of cooperative information systems: How can legacy systems of substantial complexity be integrated within a larger system context? By requiring that all interactions with the legacy database be mediated by the agent, we have been able to isolate the database system cleanly while still supporting query-free information retrieval.
FIXIT is comprised of the three subsystems already mentioned: the probabilistic expert system, the legacy full-text database system (to which we added a new, semantically-based, indexing structure that supports limited natural language queries), and the intelligent agent that effectively integrates them. The following sections describe these system components, provide implementation details, illustrate the runtime behavior of FIXIT, report on operational experience, and close with some observations about query-free information retrieval and the potential for generalizing the underlying paradigm.
Expert System Component. For our purely diagnostic applications, we elected to use the belief net (or Bayesian net) representation that has received considerable attention in recent years1.
Belief networks use conditional probabilities of the form
p(sj|fi) to represent associations between a fault fi and an observable symptom sj that it may produce. A knowledge base for a belief net expert system consists principally of a collection of conditional probabilities of this form.The relations among faults and symptoms are conveniently represented as a directed acyclic graph as shown in Figure 1.
|
| Figure 1 - A simplified belief net for copier diagnosis |
Nodes represent either faults, observable symptoms or unobservable internal variables. Arcs are informally ``causal'', pointing from underlying faults to the symptoms they may cause. Formally, arcs correspond exactly to those conditional probabilities required by the expert to represent the important probabilistic dependencies among faults, observables, and unobservable internal variables of the problem domain.
At knowledge engineering time the expert specifies the conditional probability of observing a particular symptom given the occurrence of some fault. At runtime these conditional probabilities are inverted: Given some sequence
{s1, s2, ... , sj} of observed symptoms, we compute for each possible fault fi the conditional probability p(fi|s1, s2, ... , sj) of that fault given the sequence of observations. At each stage in the sequence a decision is taken whether to accept the currently most probable fault as the diagnosis or to make another observation. That decision can be based on either formal utility theory, informal heuristics, or the user's own judgment.
|
| Figure 2 - A fragment from the table of contents tree |
A ToC tree data structure mirrors the form of the Table of Contents of the maintenance documentation, with each node in the tree corresponding to a topic in the documentation. However, the node contains not the literal topic name, but instead contains the constituents obtained by parsing the topic name.
|
| Figure 3 - ToC Information Retrieval example |
The figure displays two sets of results for each topic the semantic pattern is compared with: local and global. The local results represent the comparison between semantic pattern and the individual topic.
{s1, s2, ... , sj-1} have already been observed and we now observe the jth symptom sj. For each activated fault fi we easily compute Dpij as p(fi|s1, s2, ... , sj) - p(fi|s1, s2, ... , sj-1). In words, Dpij measures the marginal effect on fi of observing sj in the context of the previous j-1 observations.|Dpij|; i.e., we say a symptom sj supports an activated fault fi when |Dpij| > 0.01.sj that strongly supports some fault diagnosis in one sequence of observations may be irrelevant in another sequence. This will occur whenever sj is largely unnecessary or redundant given previous observations. From an information retrieval standpoint we consider this to be desirable behavior, arguing that the user is most likely to be interested in symptoms that substantially effect the diagnosis in the context of previous observations. FIXIT Implementation
FIXIT's component subsystems originated from different sources. For the belief net expert system shell we use DXpress(TM), a commercial product currently available from Knowledge Industries. Several probabilistic knowledge bases were built in collaboration with Ricoh technical experts, and cover a variety of the most widely distributed Ricoh products. (For our illustrative example in the next section we use the simplest fax machine knowledge base, which covers about a hundred or so malfunctions.) The full-text database is prepared and maintained by Ricoh operational staff. The ToC information retrieval system was developed by one of us2.
|
| Figure 4a - FIXIT's Diagnosic window |
Indicators'') and then a specific symptom (e.g. ``Clear Copy Indicator'') results both in asserting that symptom as ``observed'' and posting it to the current history of the session (bottom left). The expert system computes the probability of each of the hundred or so faults given this symptom, and displays the leading candidates in the lower half of the window (``Possible Problems'') along with their probabilities and fault categories. The agent indicates, via a text icon, that relevant documentation was found for some faults.
There are three kinds of icons used by the agent to indicate that relevant documentation is available for a fault. The primary text icon, as previously mentioned, indicates the availability of topics relevant to both the fault and to one or more of the observed symptoms. The secondary text icons represent the availability of documentation relevant to the fault. (We use two different icons to denote ``many'' or ``few'' available topics.)
Paper Nonfeed-RX''. By selecting a topic in the list we
can see which document this topic comes from: ``User's'' manual.
|
| Figure 4b - Relevant topics for a fault |
|
| Figure 4c - Documentation browser |
1. Replacing Paper'' is indeed relevant to the fault ``Paper Nonfeed-RX''.The context-sensitive nature of FIXIT's information retrieval can be illustrated by continuing the diagnosis. Figure 5a presents the same diagnostic session after asserting a few more symptoms. The number one fault is now ``
Paper Nonfeed-RX''.
|
| Figure 5a - Agent indicates availability of primary topics |
Paper Nonfeed-RX'' fault and the
``Chk Cassette Area-No'' observed symptom.
|
| Figure 5b - User selects a primary topic |
|
| Figure 5c - Documentation browser displaying primary topic |
In January 1996, following continuing effective use, the system was rolled out to additional Ricoh locations. At the same time, knowledge engineering activities were expanded to increase the coverage of Ricoh's broad product line, and development of a Japanese version was begun. As of this writing it seems fair to say that, viewed from the standpoint of operational managers (rather than from only our perspective as researchers and developers), the system is successful.
What has been learned from this experience?
Related Work. Decision support systems utilizing multiple technologies can provide fertile ground for exploring new software architectures. As with many developments in this emerging field, the work reported here bears a tantalizing similarity to several other previously-described ideas and yet appears to be different from any of them. For example, work on information filtering might appear to be closely related to ours, but the emphasis there is on a user's ``...relatively stable, long-term... goals... that may change slowly over time.''4 This contrasts sharply with the query-free emphasis on contextually-dependent user needs that typically change in the few seconds required to complete a single interaction with the base task. Alternatively, work on plan recognition might in principle support query-free retrieval from external information resources, but the focus there is more typically on generating natural language output from formal internal representations5.
FIXIT's design philosophy of providing unrequested yet ``relevant'' information resembles the philosophy underlying the query relaxation approach6. There, however, the emphasis is on generalizing a query already posed by a user in order to find ``neighboring'' information, rather than on avoiding explicit queries entirely. Possibly, the two approaches could be usefully combined.
Related Goals. A common thread running through all these approaches is the goal of providing a user with relevant information beyond that which has been explicitly requested. We believe this to be an important goal because the ever-increasing availability-- and most especially the heterogeneity-- of data, computational and communication resources threatens to overwhelm the abilities of users to access them effectively. This trend has been evident for many years, but the needs have of course become especially acute with the explosive growth of the World Wide Web. In the context of both the Web as well as in more localized contexts, we should also be thinking about invoking active computation rather than only accessing static databases7.
Lessons from FIXIT. We think the present work provides evidence that the fundamental architecture of FIXIT --namely, a base application which furnishes contextual clues about relevant external information-- is an effective way to provide users with additional useful information that was not explicitly requested. Moreover, the intelligent agent at the core of FIXIT provides the architectural benefit of cleanly separating system components.
At a more detailed level, what can be learned from FIXIT's specific mechanisms? Plainly, the activation predicate and support predicate are tied directly to the belief net formalism. On the other hand, the abstractions underlying these predicates-- focus of attention and support for hypothesis-- have great generality and appeal. The critical assumption underlying their utility is that the user is engaged in a task-oriented dialog whose goals are known or can be determined. While that assumption is not always valid there are of course many application settings in which it is, and in such settings useful if imperfect surrogates for these abstractions may be identified.
Future Research. In the near term, FIXIT might be improved by extending the activation and support predicates, moving beyond the current static thresholds to more refined dynamic ones. More ambitiously, the intelligent agent could be improved to exploit user interactions with the full-text database, rather than with the expert system alone, to assess the current context. Either improvement might lead to greater precision in retrieving database topics.
We think that other exciting opportunities lie in the direction of applying the underlying approach to new application areas. Of these, the Web has been an irresistible magnet for new applications, but at the same time raises difficult interoperability issues at several levels: interoperability across protocol domains8; interoperability across pre-existing ontologies9; and interoperability by means of cooperative information retrieval agents10. In this expansive-- literally global-- context FIXIT's architecture perhaps suggests a solution to one important sub-problem: How can processes for retrieving information from distributed sources be invoked without explicit user request?
While we have developed considerable confidence in FIXIT's architecture, we have no doubt that many challenges lie ahead.
References
1. D. Heckerman and M.P. Wellman, ``Bayesian Networks'', Comm. ACM, Vol. 38, No. 3, March 1995.
2. J. Graham, ``A natural language method of semantic pattern matching for user manual text retrieval'', Ricoh California Research Center (CRC) Technical Report CRC-TR-91-16, March 28, 1991.
3. J. Allen, Natural Language Understanding, The Benjamin/Cummings Publishing Company, Menlo Park, 1987.
4. N.J. Belkin and W.B. Croft, ``Information filtering and information retrieval: two sides of the same coin?'', Comm. ACM, Vol. 35, No. 12, 1992.
5. U. Wolz, ``Providing opportunistic enrichment in customized on-line assistance'', Proc. 1993 International Workshop on Intelligent User Interfaces, Orlando, Florida, January 1993. ACM Press.
6. T. Gaasterland, ``Restricting query relaxation through user constraints'', Int. Conf. on Intelligent and Cooperative Information Systems, Rotterdam, The Netherlands, May 1993, IEEE Computer Society Press.
7. C. Brown, L. Gasser, D. O'Leary, and A. Sangster, ``AI on the WWW: Supply and Demand Agents'', IEEE Expert/Intelligent Systems & Their Applications, Vol. 10, No. 4, August 1995.
8. E. Mena, V. Kashyap, A. P. Sheth, A. Illarramendi, ``OBSERVER: An Approach for Query Processing in Global Information Systems based on Interoperation across Pre-existing Ontologies'', International Conference on Cooperative Information Systems, Brussels, Belgium, June 1996, IEEE-CS Press, 1996.
9. A. Paepcke, S.B. Cousins, H. Garcia-Molina, S.W.Hassan, S.P. Ketchpel, M. Roscheisen, and T. Winograd, ``Using Distributed Objects for Digital Library Interoperability'', Computer, Vol. 29, No. 5, May 1996, pp. 61-68.
10. L.F. Bic, M. Fukuda, and M.B. Dillencourt, ``Distributed Computing Using Autonomous Objects'', Computer, Vol.,No., August 1996, pp. 55-61.