+ Reply to Thread
Results 1 to 9 of 9

Thread: Google has started to ignore my robots.txt

  1. #1
    galaxyAbstractor's Avatar
    galaxyAbstractor is offline Community Advocate galaxyAbstractor is on a distinguished road
    Join Date
    Oct 2007
    Location
    Land of Null and Insanity
    Posts
    5,495

    Google has started to ignore my robots.txt

    why?

    Code:
    User-agent: *
    Disallow: /bigjoe/
    That's one part of my robots.txt that got indexed. I belive it is strongly against the rules that search engines index sites in the robots.txt.
    Edit:
    see:


    Tillåten=allowed
    Last edited by galaxyAbstractor; 05-18-2008 at 11:46 AM. Reason: Automerged Doublepost

  2. #2
    DeadBattery's Avatar
    DeadBattery is offline Community Support Team DeadBattery is a name known to allDeadBattery is a name known to all
    Join Date
    Mar 2008
    Location
    localhost
    Posts
    4,019

    Re: Google has started to ignore my robots.txt

    Sometimes, it takes time to have Google process things.
    That was probably from the last time GoogleBot went to your forum.
    I suggest waiting a few days and then see what happens.


  3. #3
    galaxyAbstractor's Avatar
    galaxyAbstractor is offline Community Advocate galaxyAbstractor is on a distinguished road
    Join Date
    Oct 2007
    Location
    Land of Null and Insanity
    Posts
    5,495

    Re: Google has started to ignore my robots.txt

    Quote Originally Posted by aopsftw View Post
    Sometimes, it takes time to have Google process things.
    That was probably from the last time GoogleBot went to your forum.
    I suggest waiting a few days and then see what happens.

    I made that robots.txt at january and google bot can access all. The Bigjoe thing was added 2 weeks ago and google dled it 3 hours ago.

  4. #4
    tittat's Avatar
    tittat is offline x10 Spammer tittat is an unknown quantity at this point
    Join Date
    Sep 2007
    Location
    Kerala,India
    Posts
    2,479

    Re: Google has started to ignore my robots.txt

    How you save your file is it robots.txt or robot.txt ???
    please double check that 's' .Many people often make this mistake.
    PLAY ONLINE GAMES
    WWW.TMONDO.COM PlayFar Flash Games
    Former X10 Forum Senior Moderator(Retired)


  5. #5
    galaxyAbstractor's Avatar
    galaxyAbstractor is offline Community Advocate galaxyAbstractor is on a distinguished road
    Join Date
    Oct 2007
    Location
    Land of Null and Insanity
    Posts
    5,495

    Re: Google has started to ignore my robots.txt

    Quote Originally Posted by tittat View Post
    How you save your file is it robots.txt or robot.txt ???
    please double check that 's' .Many people often make this mistake.
    it is robots.txt but I noticed there is difference between /bigjoe/ and /bigjoe

  6. #6
    tittat's Avatar
    tittat is offline x10 Spammer tittat is an unknown quantity at this point
    Join Date
    Sep 2007
    Location
    Kerala,India
    Posts
    2,479

    Re: Google has started to ignore my robots.txt

    But i think Disallow: /bigjoe/ is correct itself. Am i right?

    where is your robots.txt located. Is it right inside your public_html folder itself?
    Please give your site URL. Is it an Addon-Domain or Parked one?
    PLAY ONLINE GAMES
    WWW.TMONDO.COM PlayFar Flash Games
    Former X10 Forum Senior Moderator(Retired)


  7. #7
    ASPX.King's Avatar
    ASPX.King is offline x10 Sophmore ASPX.King is an unknown quantity at this point
    Join Date
    Mar 2008
    Posts
    155

    Re: Google has started to ignore my robots.txt

    you need to disallow /bigjoe/* and /bigjoe/sub_dir/* and all the other sub-folders
    don't forget the *
    to be on the safe side, just put in every single filename!
    If my post helped you, you know what to do... hints: reputation, blue, checkbox

    Official Member of the Anti-Apple Club

  8. #8
    dWhite Guest

    Re: Google has started to ignore my robots.txt

    http://www.robotstxt.org/robotstxt.html

    About /robots.txt
    In a nutshell

    Web site owners use the /robots.txt file to give instructions about their site to web robots; this is called The Robots Exclusion Protocol.

    It works likes this: a robot wants to vists a Web site URL, say http://www.example.com/welcome.html. Before it does so, it firsts checks for http://www.example.com/robots.txt, and finds:

    User-agent: *
    Disallow: /

    The "User-agent: *" means this section applies to all robots. The "Disallow: /" tells the robot that it should not visit any pages on the site.

    There are two important considerations when using /robots.txt:

    * robots can ignore your /robots.txt. Especially malware robots that scan the web for security vulnerabilities, and email address harvesters used by spammers will pay no attention.
    * the /robots.txt file is a publicly available file. Anyone can see what sections of your server you don't want robots to use.


    So don't try to use /robots.txt to hide information.

  9. #9
    skads is offline x10Hosting Member skads is an unknown quantity at this point
    Join Date
    Mar 2008
    Posts
    2

    Re: Google has started to ignore my robots.txt

    thx

+ Reply to Thread

Similar Threads

  1. 2 more possiblites for google
    By vonnesh in forum Scripts & 3rd Party Apps
    Replies: 21
    Last Post: 04-03-2010, 07:49 AM
  2. Replies: 7
    Last Post: 03-02-2008, 02:29 PM
  3. Google What???
    By Brandon in forum Off Topic
    Replies: 15
    Last Post: 11-30-2007, 09:30 PM
  4. Google, Yahoo, MSN Search and the Government
    By Chris S in forum Crossfire
    Replies: 21
    Last Post: 02-01-2006, 02:42 PM
  5. Google wants your car listings, events, etc.
    By n4tec in forum Off Topic
    Replies: 3
    Last Post: 10-26-2005, 06:32 PM

Posting Permissions

  • You may not post new threads
  • You may not post replies
  • You may not post attachments
  • You may not edit your posts
x10hosting free hosting for the masses
dedicated servers