-----BEGIN PGP SIGNED MESSAGE-----
Rels is a program that determines the relevance of text documents to a
set of keywords expressed in boolean infix notation. The relevance is
determined by comparing the phonetic representation of the keywords
with the phonetic representation of every word in a
document. (Phonetic searching has some degree of tolerance to
misspelled words.) The names of the relevant files are
printed to the standard output, in order of relevance.
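As a rough illustration of the idea described above, the following Python
sketch ranks documents by phonetic keyword matches. It is not the Rels
source: the announcement does not say which phonetic encoding or scoring
rule Rels uses, so classic Soundex and a simple match count are assumed
here, and the names soundex, relevance, and rank are hypothetical.

```python
import re
from collections import Counter

def soundex(word):
    """Classic four-character Soundex code. This is an assumed stand-in;
    the phonetic encoding Rels actually uses is not stated."""
    codes = {}
    for letters, digit in (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
                           ("l", "4"), ("mn", "5"), ("r", "6")):
        for ch in letters:
            codes[ch] = digit
    word = re.sub(r"[^a-z]", "", word.lower())
    if not word:
        return ""
    result = word[0].upper()
    prev = codes.get(word[0], "")
    for ch in word[1:]:
        digit = codes.get(ch, "")
        if digit and digit != prev:
            result += digit
        if ch not in "hw":  # h and w do not separate letters with equal codes
            prev = digit
    return (result + "000")[:4]

def relevance(text, keywords):
    """Assumed scoring: total phonetic matches, or 0 if any keyword is absent."""
    counts = Counter(soundex(w) for w in re.findall(r"[A-Za-z]+", text))
    hits = [counts[soundex(k)] for k in keywords]
    return sum(hits) if all(hits) else 0

def rank(documents, keywords):
    """Return the names of relevant documents, most relevant first."""
    scored = [(relevance(text, keywords), name)
              for name, text in documents.items()]
    ordered = sorted(scored, key=lambda pair: (-pair[0], pair[1]))
    return [name for score, name in ordered if score > 0]

docs = {
    "a": "directory listing of a directry",  # note the misspelling
    "b": "a lisitng only",
    "c": "directory listing",
}
print(rank(docs, ["directory", "listing"]))
```

Because "directry" and "directory" share the same Soundex code, the
misspelled word still counts as a match, which is the tolerance to
misspellings mentioned above.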
For example, the command:
    rel "(directory & listing)" /usr/share/man/cat1
(i.e., find the relevance of all files in the catman directory that
contain both of the words "directory" and "listing") will list 21 of
the 782 catman files (totaling 6.8 MB), of which "ls.1" is the fifth
most relevant. In other words, to find the command that lists
directories in a Unix system, the "literature search" was cut, on
average, from 359 to 5 files, a reduction of approximately 98%.
Although this example is elementary, a similar saving can be
demonstrated in searching for documents in email repositories and text
archives.
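The reduction quoted in the example can be checked with a few lines of
arithmetic. The sketch below uses only the two figures given above (359
files before, 5 files after); the function name reduction is
hypothetical.

```python
def reduction(files_before, files_after):
    """Fraction of the search effort saved by ranking the files first."""
    return 1.0 - files_after / files_before

# Figures taken from the example: 359 files on average, cut to 5.
print(f"{reduction(359, 5):.1%}")  # prints 98.6%
```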
Additional applications include information robots (i.e., "mailbots"
or "infobots"), where the disposition (i.e., delivery, filing, or
viewing) of text documents can be determined dynamically, based on
the relevance of the document to a set of criteria framed in boolean
infix notation. In other words, the program can be used to order,
or rank, text documents based on a "context" specified in a general
mathematical language, similar to that used in calculators.
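To make the calculator analogy concrete, here is a minimal Python sketch
of evaluating a boolean infix expression against the words of a
document. It is an illustration, not the Rels parser: only "&" and
parentheses appear in this announcement, so the "|" (or) and "!" (not)
operators, their precedence, and the name evaluate are assumptions.

```python
import re

def evaluate(expr, present):
    """Evaluate a boolean infix expression such as "(directory & listing)".
    present(word) reports whether a keyword occurs in the document.
    Assumed precedence, highest first: "!", "&", "|"."""
    tokens = re.findall(r"[()&|!]|[A-Za-z]+", expr)
    pos = 0

    def peek():
        return tokens[pos] if pos < len(tokens) else None

    def take():
        nonlocal pos
        pos += 1

    def parse_or():
        value = parse_and()
        while peek() == "|":
            take()
            rhs = parse_and()  # always parse the right side, then combine
            value = value or rhs
        return value

    def parse_and():
        value = parse_not()
        while peek() == "&":
            take()
            rhs = parse_not()
            value = value and rhs
        return value

    def parse_not():
        if peek() == "!":
            take()
            return not parse_not()
        return parse_atom()

    def parse_atom():
        tok = peek()
        if tok is None or tok in (")", "&", "|"):
            raise ValueError("keyword or '(' expected")
        take()
        if tok == "(":
            value = parse_or()
            if peek() != ")":
                raise ValueError("')' expected")
            take()
            return value
        return bool(present(tok))

    result = parse_or()
    if pos != len(tokens):
        raise ValueError("unexpected token: " + tokens[pos])
    return result

words = {"directory", "listing", "file"}
print(evaluate("(directory & listing)", words.__contains__))      # True
print(evaluate("directory & !listing", words.__contains__))       # False
print(evaluate("(missing | listing) & file", words.__contains__)) # True
```

In a mailbot setting, the result of such an evaluation could decide
whether a message is delivered, filed, or shown, with present() backed
by a phonetic comparison rather than the exact set lookup used here.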
There is a companion application, wgetrels, which is an intelligent
Internet Web page search engine.
Title: rels-1.4
Version: 1.40
Entered-date: February 13, 1998
Description: Rels is a program that determines the relevance of text
documents to a set of keywords expressed in boolean
infix notation. The relevance is determined by comparing
the phonetic representation of the keywords with the
phonetic representation of every word in a document.
(Phonetic searching has some degree of tolerance to
misspelled words.) The names of the relevant files
are printed to the standard output, in order
of relevance.
Keywords: relevance full text database information retrieval phonetic
Author: john _at_ johncon.com (John Conover)
Maintained-by: john _at_ johncon.com (John Conover)
Primary-site: sunsite.unc.edu
Alternate-site:
Original-site: johncon.com
Platform: Linux, USG, BSD
Copying-policy: No limitations for non-commercial use
- --
John Conover, 631 Lamont Ct., Campbell, CA., 95008, USA.
VOX 408.370.2688, FAX 408.379.9602
john _at_ johncon.com
- --
This article has been digitally signed by the moderator, using PGP.
http://www.iki.fi/mjr/cola-public-key.asc has PGP key for validating signature.
Send submissions for comp.os.linux.announce to: linux-announce _at_ news.ornl.gov
PLEASE remember a short description of the software and the LOCATION.
This group is archived at http://www.iki.fi/liw/linux/cola.html
-----BEGIN PGP SIGNATURE-----
Version: 2.6.3ia
Charset: latin1
iQCVAgUBNPVpqlrUI/eHXJZ5AQEhlgQAngS/wfB1aAHXF8fcWru0qqm0Ie0Z9XMm
zg+lXUsbcYdnoBalmOQRwBaFmN7BmpJE4S7fz11w0XZX2R4Z0mt2y3e0Qnei7dhT
nj5YNdFUpwLRLT8JkkVZJmVIdE/f4qi+/kUzW3v0mEwhqumPCtBBqlUVNPrZILk6
aaKhY9Ctq6U=
=MoCB
-----END PGP SIGNATURE-----