FOA Home | UP: Mathematical Foundations

Dimensionaility Reduction

As Section §3.5 has already suggested, our interest in matching queries and documents within a vector space can benefit greatly from other kinds of matching that have arisen in other kinds of vector spaces [Schutze93] . We review several other basic features of the mathematical topic of linear algebra before applying them to the problem of FOA.

It is worth remembering that similarity information can come from many sources. For example, we will later (cf. Section §6.1 ) have much to say about how CITATION structure can be represented as a graph, with each document represented by a node in the graph and a directed edge going from $d_{i}$ to $d_{j}$, when $d_{j}$ appears in the bibliography of $d_{i}$. Note that the documents' citation information is entirely independent of the words they contain, and can be the basis for another characterization of the topical similarity between documents: Two documents are about the same topic to the extent that they share the same documents in their bibliographies. For now, such bibliometric data need only seem like a plausible, new way to analyze document content. Chapter 6 will discuss other features that we might also exploit.

Our goal in casting similarity matching of queries with documents in general, mathematical terms is to make the resulting solutions sufficiently broad to handle any kind of features, keywords or bibliographic citations.


Top of Page | UP: Mathematical Foundations | ,FOA Home

FOA © R. K. Belew - 00-09-21