Re: Site Search Concepts

This WebDNA talk-list message is from

2003


It keeps the original formatting.
numero = 47925
interpreted = N
texte = Dan,This is what I was thinking of, but after re-reading it I'm not sure if it would work for your case. Perhaps with some modifications. It's neat code nevertheless:http://developer.apple.com/internet/javascript/iframe.htmlGK | Howdy, | | I will be embarking on some yet unexplored (for me) terrain here in a few weeks, and I was | wondering if I could get some direction from the list. I'm not looking for anyone to do my work, I | just need some basic advice as to a concept/startegy. | | Ok, so I want to build a site search. That's it. Basically a mini-Google. User enters in their | search term, WebDNA searches the entire site(s) spits out the results and viola! Done. I have done | this easy enough with small databases, of course (particularly Storebuilder's products.db), and I | 'get it', but what's got me baffled at the moment is how do I index all of the text of every | single page of every single site that the search needs to look through, especially considering | that I am retro-fitting this search onto an existing (webDNA) site with numerous pages to look | through and mucho text, and NOT starting from scratch (which would be much easier)? | | Can [search] 'see' .html pages, and if so could I jimmy up some sort of [listfiles]/[search] | widget that would treat the entire site as a giant database? And If so, how would I get it to look | only at text and not HTML within each doc? In terms of scalability, assuming this is even | possible, wouldn't it kill any decent server to have to look through a 'giant' database like that | all the time, especially if multiple requests are made? | | Or do I actually need to make a special 'index.db' that catalogues everything? If so, how best to | catalogue each pages (voluminous) text? [include]s? | | Am I over thinking this? Should I just buy a good search (Atomz, Google, whatever)? Know any good | (and affordable) ones? | | Ok enough for now, and thanks in adavance. | | -Dan | ------------------------------------------------------------ | http://www.StrongGraphicDesign.com | (208) 319-0137 | Toll-free p/f 877-561-1656 | ------------------------------------------------------------ | | ------------------------------------------------------------- | This message is sent to you because you are subscribed to | the mailing list . | To unsubscribe, E-mail to: | To switch to the DIGEST mode, E-mail to | Web Archive of this list is at: http://webdna.smithmicro.com/ ------------------------------------------------------------- This message is sent to you because you are subscribed to the mailing list . To unsubscribe, E-mail to: To switch to the DIGEST mode, E-mail to Web Archive of this list is at: http://webdna.smithmicro.com/ Associated Messages, from the most recent to the oldest:

    
  1. Re: Site Search Concepts (Dan Strong 2003)
  2. Re: Site Search Concepts (Clint Davis 2003)
  3. Re: Site Search Concepts (Tom Duke 2003)
  4. Re: Site Search Concepts (Clint Davis 2003)
  5. Re: Site Search Concepts (Dan Strong 2003)
  6. Re: Site Search Concepts (Alex McCombie 2003)
  7. Re: Site Search Concepts (Dan Strong 2003)
  8. Re: Site Search Concepts (Dan Strong 2003)
  9. Re: Site Search Concepts (Gary Krockover 2003)
  10. Re: Site Search Concepts (Dale's Stuff 2003)
  11. Re: Site Search Concepts (Dan Strong 2003)
  12. Re: Site Search Concepts (Gary Krockover 2003)
  13. Re: Site Search Concepts (Donovan 2003)
  14. Site Search Concepts (Dan Strong 2003)
Dan,This is what I was thinking of, but after re-reading it I'm not sure if it would work for your case. Perhaps with some modifications. It's neat code nevertheless:http://developer.apple.com/internet/javascript/iframe.htmlGK | Howdy, | | I will be embarking on some yet unexplored (for me) terrain here in a few weeks, and I was | wondering if I could get some direction from the list. I'm not looking for anyone to do my work, I | just need some basic advice as to a concept/startegy. | | Ok, so I want to build a site search. That's it. Basically a mini-Google. User enters in their | search term, WebDNA searches the entire site(s) spits out the results and viola! Done. I have done | this easy enough with small databases, of course (particularly Storebuilder's products.db), and I | 'get it', but what's got me baffled at the moment is how do I index all of the text of every | single page of every single site that the search needs to look through, especially considering | that I am retro-fitting this search onto an existing (webDNA) site with numerous pages to look | through and mucho text, and NOT starting from scratch (which would be much easier)? | | Can [search] 'see' .html pages, and if so could I jimmy up some sort of [listfiles]/[search] | widget that would treat the entire site as a giant database? And If so, how would I get it to look | only at text and not HTML within each doc? In terms of scalability, assuming this is even | possible, wouldn't it kill any decent server to have to look through a 'giant' database like that | all the time, especially if multiple requests are made? | | Or do I actually need to make a special 'index.db' that catalogues everything? If so, how best to | catalogue each pages (voluminous) text? [include]s? | | Am I over thinking this? Should I just buy a good search (Atomz, Google, whatever)? Know any good | (and affordable) ones? | | Ok enough for now, and thanks in adavance. | | -Dan | ------------------------------------------------------------ | http://www.StrongGraphicDesign.com | (208) 319-0137 | Toll-free p/f 877-561-1656 | ------------------------------------------------------------ | | ------------------------------------------------------------- | This message is sent to you because you are subscribed to | the mailing list . | To unsubscribe, E-mail to: | To switch to the DIGEST mode, E-mail to | Web Archive of this list is at: http://webdna.smithmicro.com/ ------------------------------------------------------------- This message is sent to you because you are subscribed to the mailing list . To unsubscribe, E-mail to: To switch to the DIGEST mode, E-mail to Web Archive of this list is at: http://webdna.smithmicro.com/ Gary Krockover

DOWNLOAD WEBDNA NOW!

Top Articles:

Talk List

The WebDNA community talk-list is the best place to get some help: several hundred extremely proficient programmers with an excellent knowledge of WebDNA and an excellent spirit will deliver all the tips and tricks you can imagine...

Related Readings:

RAW=T..Strange behaviour (2000) Separate SSL Server (1997) using showpage and showcart commands (1996) carriage returns in data (1997) help needed: Non-english characters in WebCatalog (1997) unable to launch acgi in WebCat (1997) [SearchString] problem with [search] context (1997) Hiding a subsection of text (2002) Linebreak as a delimiter in listwords? (2003) Re:Emailer setup (1997) emailer (1997) Re1000001: Setting up shop (1997) autocommit problem (1998) Replace context problem ... (1997) Format Thousands Looks Busted (2000) WebCAT has the devil in it! (2003) Off Topic: Sound Clips (2003) Next (1997) & not allowed in db by definition? (1999) X etc.... (1999)