Skip to content

robots.txt review #2924

@ghostdevv

Description

@ghostdevv

I can remove the redundant rules. I would like someone on the core or maintainer teams to request/sign off because I want changes to be as focused as possible, and my main goal is to fix the malformed robots.txt that is giving incorrect access to bots

I had Composer 2.5 LLM run a report for me on redundant fields, and this is what it came up with, and all items seem reasonable to me :

editing to remove direct quote of AI response for human-written summary. Thanks to a suggestion from @trueberryless

  1. Remove the disallow rules after Disallow: / covering lines 19–44 because Disallow: /. covers everything that is not explicitly allowed. Thus, delete - I think this is a good idea, so we do not reveal extra paths that we do not want bots/people to see.
  2. `/opensearch.xml' is both allowed and disallowed (line 15, 41). I think this should be allowed. There feels like there is a story around this that I would like to understand before taking any action.
  3. Specific AI classifier blocks are duplicates and can be simplified to just list out the crawlers and blocks once. I am neutral on this and can see value in having it both ways.

If I should implement any of these items, please let me know, and I can do it.

Originally posted by @MaxwellCohen in #2892 (comment)

Metadata

Metadata

Assignees

No one assigned

    Labels

    frontFrontend, Design

    Type

    No fields configured for Task.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions