
Re: Another feature for Ht://Dig



On Mon, 13 Dec 1999, Marc Britten wrote:

> well, right now I don't think I have too much time for either the SQL or
> this, however just off the top of my head, I remember talking about

Fair enough.  Both of those projects are probably more than one-person
jobs, but I'm way out of the loop.

> natural language searches a while back with someone (name escapes me, but
> they usually do), he said the easiest way around this is to figure out
> which words to drop, and which ones to group
> 
> from your examples
> 
> > How do I compile a Linux kernel?
> becomes
> 
> compile AND "Linux kernel"
> 
> and
> 
> > How can I setup a PPP connection to my ISP?
> becomes
> 
> setup AND "PPP connection"
> 
> I'm not sure how well htdig treats groups/phrases such as the "PPP
> connection" (most however know how to use them)

The 3.1.x series doesn't support phrase searching.  3.2.x does, hence my
request for you to port your patch over. :)  3.1.4 also has a number of
bug fixes that aren't in those 3 patches for 3.1.3, and since 3.2.x isn't
even in beta yet, the 3.1.4 patch is more important.

> it's easier than teaching a search engine English, and basically how we
> actually parse sentences. (this is coming from the linked database
> school of AI)

I can see why.  Not sure how to do the grouping though once we do have a
search engine that supports phrases.  Do you know of any existing code that
does this?  Maybe someone did a thesis paper on this?
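
For illustration only, here is a minimal sketch of the drop-and-group idea
quoted above.  It is not ht://Dig code and not an existing implementation.
It assumes a tiny hand-picked stop-word list (STOP_WORDS and
question_to_query are hypothetical names) and one naive grouping rule:
words that survive and sit next to each other, with no dropped word
between them, are joined into one quoted phrase.

```python
import re

# Hypothetical stop-word list; a real one would need to be much longer.
STOP_WORDS = {
    "a", "an", "the", "how", "what", "do", "does", "can",
    "i", "to", "my", "is", "of", "in", "on", "for",
}


def question_to_query(question):
    """Turn a plain-English question into a boolean query string.

    Stop words are dropped.  Each run of adjacent surviving words
    becomes one term; runs longer than one word are quoted as a phrase.
    """
    words = re.findall(r"[A-Za-z0-9']+", question)
    runs = []      # completed runs of adjacent kept words
    current = []   # run currently being built
    for word in words:
        if word.lower() in STOP_WORDS:
            # A dropped word ends the current run, if there is one.
            if current:
                runs.append(current)
                current = []
        else:
            current.append(word)
    if current:
        runs.append(current)

    terms = []
    for run in runs:
        if len(run) > 1:
            terms.append('"%s"' % " ".join(run))
        else:
            terms.append(run[0])
    return " AND ".join(terms)


print(question_to_query("How do I compile a Linux kernel?"))
print(question_to_query("How can I setup a PPP connection to my ISP?"))
```

The first question comes out as compile AND "Linux kernel", matching the
example above.  The second keeps ISP as an extra term, because this rule
has no way to know that "to my ISP" should be dropped entirely.  It also
depends on a stop word separating the verb from the noun phrase, so
"compile Linux kernel" would turn into a single three-word phrase.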

[snip]

-- 
Aaron Turner, Core Developer       http://vodka.linuxkb.org/~aturner/
Linux Knowledge Base Organization  http://linuxkb.org/
Because world domination requires quality open documentation.
aka: aturner@vicinity.com, aturner@pobox.com, ion_beam_head@ashtech.net