Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

A minimum of 4 seconds.


I just had a look and there's a non-standard "crawl-delay directive" extension to robots.txt that can be used to ask a spider to take some time between page visits:

  User-agent: bingbot
  Allow : /
  Crawl-delay: 10
https://en.wikipedia.org/wiki/Robots_exclusion_standard#Craw...


Hello, MojeekBot doesn't observe the crawl-delay directive but thanks for the reminder of it as it's beneficial for us to know if site owners require more grace between requests.


Hey. Good job with Mojeek. It seems the crawl-delay directive is not part of the robots.txt standard. It probably should be but that's not up to you!




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: