Re: How do I get Google to crawl a WebCat site?
This WebDNA talk-list message is from 2003
It keeps the original formatting.
numero = 48673
interpreted = N
texte = > Let's look at CXS and for example say we want to search for X-Ray Film>> The first three pages aren't relevant to our search at all as far as> the search engines are concerned. In other words these pages are not> very interesting at all to the search engine because they contain no> textual information and that's what they eat! : c)I'm not trying to be thick-headed here, but my point right now is not searchresults as they appear in Google. Right now, Google would not have any informationon the pages you listed, simply because it has never been there. This is myproblem. Of all the links on the home page (with several being text links) Googlehas never even visited them. It's not that the site cannot be found in the searchengine, it's that the robots are not even accessing the pages in the first place.Once I can get the robots to at least visit the pages in question, I can worryabout where the pages show up in the search results. The problem that I am havingis that I might as well have no links at all on the home page at all. I could haveevery word in the dictionary on the catalog page, but if Googlebot doesn't gothere, it does me no good. I'm not basing my contention that the search enginesare not spidering this site based on results in the search engine. I am basingthis on the results of my server's logs. Google has never been to this link:http://www.cxsonline.com/text/searchindex.tmplI want to know why. Bear in mind that the link to this address was a 10x10 whiteimage prior to today (Googlebot hit the site Monday). Since Googlebot neverdownloaded this image, it could not have known that. If it had downloaded theimage, then at least I would have a plausible explanation as to why it ignored thelink.Dennis-------------------------------------------------------------This message is sent to you because you are subscribed to the mailing list
.To unsubscribe, E-mail to: To switch to the DIGEST mode, E-mail to Web Archive of this list is at: http://webdna.smithmicro.com/
Associated Messages, from the most recent to the oldest:
> Let's look at CXS and for example say we want to search for X-Ray Film>> The first three pages aren't relevant to our search at all as far as> the search engines are concerned. In other words these pages are not> very interesting at all to the search engine because they contain no> textual information and that's what they eat! : c)I'm not trying to be thick-headed here, but my point right now is not searchresults as they appear in Google. Right now, Google would not have any informationon the pages you listed, simply because it has never been there. This is myproblem. Of all the links on the home page (with several being text links) Googlehas never even visited them. It's not that the site cannot be found in the searchengine, it's that the robots are not even accessing the pages in the first place.Once I can get the robots to at least visit the pages in question, I can worryabout where the pages show up in the search results. The problem that I am havingis that I might as well have no links at all on the home page at all. I could haveevery word in the dictionary on the catalog page, but if Googlebot doesn't gothere, it does me no good. I'm not basing my contention that the search enginesare not spidering this site based on results in the search engine. I am basingthis on the results of my server's logs. Google has never been to this link:http://www.cxsonline.com/text/searchindex.tmplI want to know why. Bear in mind that the link to this address was a 10x10 whiteimage prior to today (Googlebot hit the site Monday). Since Googlebot neverdownloaded this image, it could not have known that. If it had downloaded theimage, then at least I would have a plausible explanation as to why it ignored thelink.Dennis-------------------------------------------------------------This message is sent to you because you are subscribed to the mailing list .To unsubscribe, E-mail to: To switch to the DIGEST mode, E-mail to Web Archive of this list is at: http://webdna.smithmicro.com/
Dennis J. Bonsall, Jr.
DOWNLOAD WEBDNA NOW!
Top Articles:
Talk List
The WebDNA community talk-list is the best place to get some help: several hundred extremely proficient programmers with an excellent knowledge of WebDNA and an excellent spirit will deliver all the tips and tricks you can imagine...
Related Readings:
Two submit buttons ? (1997)
[protect admin] (1997)
Search context not finding recent entries (1998)
[WebDNA] New problem with [ShowNext] (2010)
Re[2]: Unix Webcat Permission - Suggestions (2000)
Null Chars (1999)
HTML Mail & Line breaks... (2004)
[ShowNext] feature in 2.0 (1997)
Europe, (1998)
Graphing Modules (2004)
year 2000 + and webmerch, macauth? (1998)
Separate SSL Server (1997)
Nesting format tags (1997)
WebCat2final1 crashes (1997)
View order not right (1997)
Appendfile memory usage (redux) (2003)
WARNING: MacOS The installer is broken... (2000)
PCS Frames (1997)
Credit Card processing (1998)
View Source from cache (1997)