@jasonkoebler we were fortunate to have enough technical skill to add rate limiting for specific URLs on a per-user-agent basis. If you look around the Meta developer forums, you can find evidence of this happening to other people, and I cannot imagine how less technically savvy companies deal with it, since there isn't really a way to contact someone at Meta about it. If you block this bot, Facebook won't be able to load previews of links to your site.
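For anyone wondering what rate limiting specific URLs on a per-user-agent basis might look like, here is a minimal sketch. It is not the setup described in the toot above; the class name, the rule format, and the `/articles/` path prefix are all hypothetical, and a real deployment would more likely do this at the proxy or CDN layer.

```python
import time
from collections import defaultdict, deque


class PerAgentPathLimiter:
    """Sliding-window limiter keyed on (user-agent rule, URL path).

    Hypothetical helper for illustration only.
    """

    def __init__(self, rules, window_seconds=1.0, clock=time.monotonic):
        # rules: list of (user_agent_substring, path_prefix, max_requests_per_window)
        self.rules = [(ua.lower(), prefix, limit) for ua, prefix, limit in rules]
        self.window = window_seconds
        self.clock = clock
        # (user_agent_substring, path) -> timestamps of recently allowed requests
        self.hits = defaultdict(deque)

    def allow(self, user_agent, path):
        ua = (user_agent or "").lower()
        for ua_sub, prefix, limit in self.rules:
            if ua_sub in ua and path.startswith(prefix):
                now = self.clock()
                recent = self.hits[(ua_sub, path)]
                # Drop timestamps that have aged out of the window.
                while recent and now - recent[0] >= self.window:
                    recent.popleft()
                if len(recent) >= limit:
                    return False  # the caller would answer with HTTP 429
                recent.append(now)
                return True
        return True  # no rule matched, so the request is not limited


# Assumed rule: at most 2 requests per second, per URL, for the crawler.
limiter = PerAgentPathLimiter([("facebookexternalhit", "/articles/", 2)])
```

Other user agents and other paths pass through untouched, so link previews keep working as long as the crawler stays under the limit. The sketch never evicts idle keys, so a long-running service would need some cleanup on top of it.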
@jasonkoebler I'm listening to your segment on the podcast about crawlers and it reminded me of an incident we ran into with the FacebookExternalHit crawler (https://developers.facebook.com/docs/sharing/webmasters/web-crawlers) back in late March. For reasons I cannot possibly fathom, the crawler got into a loop on our site, scanning the same few URLs over and over again at a rate of up to 80 requests per second for at least a few weeks. I don't think Meta had a separate crawler for training AI models at the time, but I'd have to check.
Happy 12th birthday to Miles!
#DogsOfMastodon
@coffeegeek this is the test result from mixing one gallon of distilled water with a small packet of Third Wave Water’s espresso machine formulation. If I recall correctly, the SCA says you should be between 35 and 85 ppm? I guess I’ll try half a packet. I’m also curious to test the output of a Brita filter. #coffee #espresso @coffee @espresso
=> Go to ezarowny@file-explorers.club account