Comment by πŸ’€ requiem

=> Re: "My intermittent capsule outages are being caused by what..." | In: u/jsreed5

That’s the way to do it! Also can you publish which crawler it is - what IP it is from? Maybe the creator will see it here…

=> πŸ’€ requiem

2024-05-13 Β· 9 months ago

2 Later Comments ↓

=> πŸš€ jsreed5 [OP] Β· 2024-05-13 at 14:24:

Good point! The crawler's IP address is 104.207.150.107.

=> πŸ’€ requiem Β· 2024-05-13 at 16:29:

Reverse DNS resolves to celery.eu.org; over HTTP it says 'unplanned maintenance', copy-pasting the IP into the browser redirects you to a rickroll. TBH I would just keep the domain in your blacklist for now.

Original Post

=> πŸš€ jsreed5

My intermittent capsule outages are being caused by what appears to be a very aggressive crawler. The capsule's robots.txt file tells bots not to index my CGI scripts, but this crawler is ignoring the file and sending multiple requests per second against my scripts, which overloads the server and causes it to crash. I've temporarily solved the problem by blocking the crawler entirely; I'll look for a more permanent solution.

=> πŸ’¬ 3 comments Β· 3 likes Β· 2024-05-13 Β· 9 months ago

Proxy Information
Original URL
gemini://bbs.geminispace.org/u/requiem/16907
Status Code
Success (20)
Meta
text/gemini; charset=utf-8
Capsule Response Time
72.726365 milliseconds
Gemini-to-HTML Time
0.366514 milliseconds

This content has been proxied by September (3851b).