Avoiding Google and other search engines.

topic posted Sun, September 25, 2005 - 6:07 PM by  Paul
I have some pages on my personal web site that are x-rated. What can I do so these pages aren't referenced by search engines?

I don't care if the rest of the pages are hit... in fact it is fun to see my host reports that show me which keywords and phrases are getting hits. But I'd prefer if some pages just weren't in the loop.

Thanks.
posted by:
Paul
Seattle
  • Re: Avoiding Google and other search engines.

    Sun, September 25, 2005 - 7:13 PM
    put the following into your robots.txt file:

    User-agent: *
    Disallow: /privatedata

    cahnge privatedata to the name of the file you want to protect. leave off the file append (IE: HTM,PHP etc) so it will wildcard. Use a seperate line for each page.

    HTH
    • Re: Avoiding Google and other search engines.

      Sun, September 25, 2005 - 8:41 PM
      Thanks a lot Chris,

      Sorry... I am a hobbyist, not a professional...

      Are saying I can change my page name from "porno1.html" to "porno1.xyz" and it will work in a browser, be linkable... but won't be searched?

      Also, I was unaware of the purpose of the robots.txt I have always seen it, my host provider put it there. I just added the line at the end:

      # Dont allow search engines to index specific folder
      User-agent: *
      Disallow: /fpdb
      Disallow: /stats
      Disallow: /_borders
      Disallow: /_derived
      Disallow: /_fpclass
      Disallow: /_overlay
      Disallow: /_private
      Disallow: /_contentindex
      Disallow: /_themes
      Disallow: /_vti_bin
      Disallow: /_vti_cnf
      Disallow: /_vti_log
      Disallow: /_vti_map
      Disallow: /_vti_pvt
      Disallow: /_vti_txt
      Disallow: /_vti_script
      Disallow: /XXX

      So an XXX folder anywhere in a folder tree will stop further searching in that tree? Or does the XXX have to be a root? (I currently have more folders below XXX -- I was doing that for my organization -- I didn't know it would help with this!)

      Do I need to do both the robot.txt and renaming exentions to xyz?

      Also, is there a way to get my current XXX pages out of Google?

      Thanks again.