How do I get Google to crawl a WebCat site?
This WebDNA talk-list message is from 2003
It keeps the original formatting.
numero = 48645
interpreted = N
texte = I've developed numerous WebCat sites over the years, but have always had dismal results with search engines. While I have not had problems getting the home pages listed, I have never been able to get the remainder of the site crawled. I figured that part of the problem is related to the fact that all the links contain a '?' with at least 'cart=[cart]' after that point. However, on my most recent project, I have created a link on the home page that takes the search engine directly to the catalog without a question mark in the URL at all.

But Googlebot has visited the site three times over the past few months. According to my logs, it has requested robots.txt (as expected), then the home page. After that, it simply leaves and goes no further. There is no robots.txt file at all, so there are no restrictions on the search engines.

I was hoping that someone on this list who knows more about search engines than I do might be able to tell me what I am doing wrong. The URL for the site is http://www.cxsonline.com.

Thanks in advance for the assistance,
Dennis
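For anyone wanting to check the same thing on their own site, here is a minimal sketch (not part of the original message) that lists the links on a saved copy of a home page and separates those with a query string from those without. The sample markup, the paths `/catalog/` and `/search.tpl`, and the function names are all hypothetical; only the base URL and the `cart` parameter name come from the post above.

```python
from html.parser import HTMLParser
from urllib.parse import parse_qs, urljoin, urlsplit

# Hypothetical home-page markup: one plain link and one link that
# carries a cart parameter in its query string.
SAMPLE_HTML = (
    '<a href="/catalog/">Catalog</a> '
    '<a href="/search.tpl?cart=12345&category=tools">Tools</a>'
)


class LinkCollector(HTMLParser):
    """Collect the href value of every <a> tag fed to the parser."""

    def __init__(self):
        super().__init__()
        self.hrefs = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.hrefs.append(value)


def classify_links(html, base_url):
    """Return (clean, with_query): absolute URLs without and with a '?' part."""
    parser = LinkCollector()
    parser.feed(html)
    clean, with_query = [], []
    for href in parser.hrefs:
        url = urljoin(base_url, href)
        if urlsplit(url).query:
            with_query.append(url)
        else:
            clean.append(url)
    return clean, with_query


def has_cart_param(url):
    """True if the URL's query string contains a 'cart' parameter."""
    query = urlsplit(url).query
    return "cart" in parse_qs(query, keep_blank_values=True)
```

Calling `classify_links(SAMPLE_HTML, "http://www.cxsonline.com/")` gives the two lists; if the `clean` list is empty for a real page, a crawler that avoids query-string URLs would have nothing to follow from it.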
Associated Messages, from the most recent to the oldest:
Dennis J. Bonsall, Jr.