Re: Site Search Concepts
This WebDNA talk-list message is from 2003
It keeps the original formatting.
numero = 47925
interpreted = N
texte = Dan,This is what I was thinking of, but after re-reading it I'm not sure if itwould work for your case. Perhaps with some modifications. It's neat codenevertheless:http://developer.apple.com/internet/javascript/iframe.htmlGK| Howdy,|| I will be embarking on some yet unexplored (for me) terrain here in a fewweeks, and I was| wondering if I could get some direction from the list. I'm not looking foranyone to do my work, I| just need some basic advice as to a concept/startegy.|| Ok, so I want to build a site search. That's it. Basically a mini-Google.User enters in their| search term, WebDNA searches the entire site(s) spits out the results andviola! Done. I have done| this easy enough with small databases, of course (particularlyStorebuilder's products.db), and I| 'get it', but what's got me baffled at the moment is how do I index all ofthe text of every| single page of every single site that the search needs to look through,especially considering| that I am retro-fitting this search onto an existing (webDNA) site withnumerous pages to look| through and mucho text, and NOT starting from scratch (which would be mucheasier)?|| Can [search] 'see' .html pages, and if so could I jimmy up some sort of[listfiles]/[search]| widget that would treat the entire site as a giant database? And If so,how would I get it to look| only at text and not HTML within each doc? In terms of scalability,assuming this is even| possible, wouldn't it kill any decent server to have to look through a'giant' database like that| all the time, especially if multiple requests are made?|| Or do I actually need to make a special 'index.db' that catalogueseverything? If so, how best to| catalogue each pages (voluminous) text? [include]s?|| Am I over thinking this? Should I just buy a good search (Atomz, Google,whatever)? Know any good| (and affordable) ones?|| Ok enough for now, and thanks in adavance.|| -Dan| ------------------------------------------------------------| http://www.StrongGraphicDesign.com| (208) 319-0137 | Toll-free p/f 877-561-1656| ------------------------------------------------------------|| -------------------------------------------------------------| This message is sent to you because you are subscribed to| the mailing list
.| To unsubscribe, E-mail to: | To switch to the DIGEST mode, E-mail to| Web Archive of this list is at: http://webdna.smithmicro.com/-------------------------------------------------------------This message is sent to you because you are subscribed to the mailing list .To unsubscribe, E-mail to: To switch to the DIGEST mode, E-mail to Web Archive of this list is at: http://webdna.smithmicro.com/
Associated Messages, from the most recent to the oldest:
Dan,This is what I was thinking of, but after re-reading it I'm not sure if itwould work for your case. Perhaps with some modifications. It's neat codenevertheless:http://developer.apple.com/internet/javascript/iframe.htmlGK| Howdy,|| I will be embarking on some yet unexplored (for me) terrain here in a fewweeks, and I was| wondering if I could get some direction from the list. I'm not looking foranyone to do my work, I| just need some basic advice as to a concept/startegy.|| Ok, so I want to build a site search. That's it. Basically a mini-Google.User enters in their| search term, WebDNA searches the entire site(s) spits out the results andviola! Done. I have done| this easy enough with small databases, of course (particularlyStorebuilder's products.db), and I| 'get it', but what's got me baffled at the moment is how do I index all ofthe text of every| single page of every single site that the search needs to look through,especially considering| that I am retro-fitting this search onto an existing (webDNA) site withnumerous pages to look| through and mucho text, and NOT starting from scratch (which would be mucheasier)?|| Can [search] 'see' .html pages, and if so could I jimmy up some sort of[listfiles]/[search]| widget that would treat the entire site as a giant database? And If so,how would I get it to look| only at text and not HTML within each doc? In terms of scalability,assuming this is even| possible, wouldn't it kill any decent server to have to look through a'giant' database like that| all the time, especially if multiple requests are made?|| Or do I actually need to make a special 'index.db' that catalogueseverything? If so, how best to| catalogue each pages (voluminous) text? [include]s?|| Am I over thinking this? Should I just buy a good search (Atomz, Google,whatever)? Know any good| (and affordable) ones?|| Ok enough for now, and thanks in adavance.|| -Dan| ------------------------------------------------------------| http://www.StrongGraphicDesign.com| (208) 319-0137 | Toll-free p/f 877-561-1656| ------------------------------------------------------------|| -------------------------------------------------------------| This message is sent to you because you are subscribed to| the mailing list .| To unsubscribe, E-mail to: | To switch to the DIGEST mode, E-mail to| Web Archive of this list is at: http://webdna.smithmicro.com/-------------------------------------------------------------This message is sent to you because you are subscribed to the mailing list .To unsubscribe, E-mail to: To switch to the DIGEST mode, E-mail to Web Archive of this list is at: http://webdna.smithmicro.com/
Gary Krockover
DOWNLOAD WEBDNA NOW!
Top Articles:
Talk List
The WebDNA community talk-list is the best place to get some help: several hundred extremely proficient programmers with an excellent knowledge of WebDNA and an excellent spirit will deliver all the tips and tricks you can imagine...
Related Readings:
RAW=T..Strange behaviour (2000)
Separate SSL Server (1997)
using showpage and showcart commands (1996)
carriage returns in data (1997)
help needed: Non-english characters in WebCatalog (1997)
unable to launch acgi in WebCat (1997)
[SearchString] problem with [search] context (1997)
Hiding a subsection of text (2002)
Linebreak as a delimiter in listwords? (2003)
Re:Emailer setup (1997)
emailer (1997)
Re1000001: Setting up shop (1997)
autocommit problem (1998)
Replace context problem ... (1997)
Format Thousands Looks Busted (2000)
WebCAT has the devil in it! (2003)
Off Topic: Sound Clips (2003)
Next (1997)
& not allowed in db by definition? (1999)
X etc.... (1999)