Re: [OT] Google Info

This WebDNA talk-list message is from

2004


It keeps the original formatting.
numero = 55989
interpreted = N
texte = Thanks for this I'll explain further... If you type 'web hosting aberdeen' into google within the top 2 or 3 you will see a link to http://www.europahosting.co.uk/web-hosting-aberdeen/. Try it again with birmingham instead of aberdeen and you'll get http://www.europahosting.co.uk/web-hosting-birmingham/ so you immediately think he's obviously done lots of doorway pages. OK so now go to his site and type in anything like http://www.europahosting.co.uk/fkjfjdfhj and you will always get the same page, so it would appear that they catch the error and create a page - easy enough so far. The problem starts when you look at the description in Google, this doesn't appear anywhere on the returned pages, there are no keywords fields and not a lot of text on the page. If I use http://www.delorie.com/web/headers.html to check the headers info all the pages are returned with a 410 Gone error which you would think that Google would object to as well. So why is he so far up the results? Is it just a matter of quantity? They appear to have somehow submitted hundreds of URL's that Google has now indexed with all the location variations to the URL but they seem to catch the fact that it's Googlebot visiting and present them with something different. This allows them to have 600+ pages indexed in the search engines with /web-hosting-aberdeen /web-hosting-birmingham etc. When I pretend to be Google using the method below I get "HTTP/1.1 503 Service Temporarily Unavailable" so maybe they are also checking for Google ip numbers? This has really got me curious, so any of you care to offer anything? Mainly I would like to know why their so highly rated. Cheers ======================================== Steve Craig - Asylum Interactive Ltd Tel +44 1224 642960 Fax +44 1224 642962 ======================================== http://www.asylumweb.com Email: steve@asylumweb.com ======================================== > From: Joe D'Andrea > Reply-To: (WebDNA Talk) > Date: Tue, 10 Feb 2004 08:57:05 -0500 > To: (WebDNA Talk) > Subject: Re: [OT] (more) Google Info > > Try this: > >> telnet theirsite.com 80 > > once connected, type the following with a CR at the end of each line, and two > CRs at the end. You should see the html that their site returns if it's > recognizing google. Try it without the user-agent header to see if it's > different. > > get / http/1.1 > host: theirsite.com > user-agent: Googlebot/2.1 (+http://www.googlebot.com/bot.html) > > > ~joe > > > -- > _______________________________________________________________ > Joseph D'Andrea ~ http://www.west21.com/ ~ JoeDan@West21.com > WEST21.com Internet services for the 21st Century > webhosting ~ co-location ~ wireless access ~ WebCat programming > > ------------------------------------------------------------- > This message is sent to you because you are subscribed to > the mailing list . > To unsubscribe, E-mail to: > To switch to the DIGEST mode, E-mail to > > Web Archive of this list is at: http://webdna.smithmicro.com/ > ------------------------------------------------------------- This message is sent to you because you are subscribed to the mailing list . To unsubscribe, E-mail to: To switch to the DIGEST mode, E-mail to Web Archive of this list is at: http://webdna.smithmicro.com/ Associated Messages, from the most recent to the oldest:

    
  1. Re: [OT] Google Info ( Steve Craig 2004)
  2. Re: [OT] Google Info ( Steve Craig 2004)
  3. Re: [OT] Google Info ( Howard Wolosky 2004)
  4. Re: [OT] Google Info ( Stuart Tremain 2004)
  5. Re: [OT] Google Info ( Frank Nordberg 2004)
  6. Re: [OT] Google Info ( Jeff Logan 2004)
  7. Re: [OT] Google Info ( "Andrew Simpson" 2004)
  8. Re: [OT] Google Info ( Frank Nordberg 2004)
  9. Re: [OT] Google Info ( "Andrew Simpson" 2004)
  10. Re: [OT] Google Info ( Gary Krockover 2004)
  11. Re: [OT] Google Info ( Donovan Brooke 2004)
  12. Re: [OT] Google Info ( John Peacock 2004)
  13. Re: [OT] Google Info ( Steve Craig 2004)
  14. Re: [OT] Google Info ( devaulw@onebox.com 2004)
  15. Re: [OT] Google Info ( devaulw@onebox.com 2004)
  16. Re: [OT] Google Info ( Christer Olsson 2004)
  17. Re: [OT] Google Info ( Joe D'Andrea 2004)
  18. Re: [OT] Google Info ( Donovan Brooke 2004)
  19. [OT] Google Info ( Steve Craig 2004)
Thanks for this I'll explain further... If you type 'web hosting aberdeen' into google within the top 2 or 3 you will see a link to http://www.europahosting.co.uk/web-hosting-aberdeen/. Try it again with birmingham instead of aberdeen and you'll get http://www.europahosting.co.uk/web-hosting-birmingham/ so you immediately think he's obviously done lots of doorway pages. OK so now go to his site and type in anything like http://www.europahosting.co.uk/fkjfjdfhj and you will always get the same page, so it would appear that they catch the error and create a page - easy enough so far. The problem starts when you look at the description in Google, this doesn't appear anywhere on the returned pages, there are no keywords fields and not a lot of text on the page. If I use http://www.delorie.com/web/headers.html to check the headers info all the pages are returned with a 410 Gone error which you would think that Google would object to as well. So why is he so far up the results? Is it just a matter of quantity? They appear to have somehow submitted hundreds of URL's that Google has now indexed with all the location variations to the URL but they seem to catch the fact that it's Googlebot visiting and present them with something different. This allows them to have 600+ pages indexed in the search engines with /web-hosting-aberdeen /web-hosting-birmingham etc. When I pretend to be Google using the method below I get "HTTP/1.1 503 Service Temporarily Unavailable" so maybe they are also checking for Google ip numbers? This has really got me curious, so any of you care to offer anything? Mainly I would like to know why their so highly rated. Cheers ======================================== Steve Craig - Asylum Interactive Ltd Tel +44 1224 642960 Fax +44 1224 642962 ======================================== http://www.asylumweb.com Email: steve@asylumweb.com ======================================== > From: Joe D'Andrea > Reply-To: (WebDNA Talk) > Date: Tue, 10 Feb 2004 08:57:05 -0500 > To: (WebDNA Talk) > Subject: Re: [OT] (more) Google Info > > Try this: > >> telnet theirsite.com 80 > > once connected, type the following with a CR at the end of each line, and two > CRs at the end. You should see the html that their site returns if it's > recognizing google. Try it without the user-agent header to see if it's > different. > > get / http/1.1 > host: theirsite.com > user-agent: Googlebot/2.1 (+http://www.googlebot.com/bot.html) > > > ~joe > > > -- > _______________________________________________________________ > Joseph D'Andrea ~ http://www.west21.com/ ~ JoeDan@West21.com > WEST21.com Internet services for the 21st Century > webhosting ~ co-location ~ wireless access ~ WebCat programming > > ------------------------------------------------------------- > This message is sent to you because you are subscribed to > the mailing list . > To unsubscribe, E-mail to: > To switch to the DIGEST mode, E-mail to > > Web Archive of this list is at: http://webdna.smithmicro.com/ > ------------------------------------------------------------- This message is sent to you because you are subscribed to the mailing list . To unsubscribe, E-mail to: To switch to the DIGEST mode, E-mail to Web Archive of this list is at: http://webdna.smithmicro.com/ Steve Craig

DOWNLOAD WEBDNA NOW!

Top Articles:

Talk List

The WebDNA community talk-list is the best place to get some help: several hundred extremely proficient programmers with an excellent knowledge of WebDNA and an excellent spirit will deliver all the tips and tricks you can imagine...

Related Readings:

passing search criteria (1997) HTML editing and webcatalog (2000) How to Display text in empty fields (1997) Running 2 two WebCatalog.acgi's (1996) Resume Catalog ? (1997) Buying sans cart (1997) Reversed words (1997) Quickie question on the email templates (1997) Date field search needs ... (1998) Creating main- and sub-category search (1997) Re1000001: Setting up shop (1997) Needed, Freelance Web Developer (2007) WebCatalog can't find database (1997) Emailer setup (1997) Merchant account (1998) Help formatting search results w/ table (1997) formatting a number (1999) Separate SSL Server (1997) Superfilous Characters (1998) problems with 2 tags (1997)