:ellie: Noëlle the 8-Bit🏳️‍🌈🎄 is a user on elekk.xyz. You can follow them or interact with them if you have an account anywhere in the fediverse. If you don't, you can sign up here.
:ellie: Noëlle the 8-Bit🏳️‍🌈🎄 @noelle

- heads up!

Mastodon.social's posts are not federating out in a timely fashion because another instance is lagging, which is causing mastodon.social's delivery jobs to pile up. Right now there are about 300,000 delivery jobs waiting to go out, but they're reliant on that one instance not taking 10 seconds to respond to every request.

There's not a whole lot m.s can do except wait it out; hopefully this will be resolved soon.

· Web · 22 · 26

@noelle how does this happen? I have no idea how the protocol works but shouldn’t it like idk connect to another instance instead

@jordyd In theory, yes, but a given masto instance only has so many workers it can deploy, and when they /all/ get tied up trying to connect to a single other instance...

@noelle @jordyd Fairness algorithms can be surprisingly hard to tune correctly.

@jordyd @noelle The issue is how HTML protocol requests work. In order to try and send something, it has to wait for the other server to respond to initiate the send.

The server needs to send 300,000 things to the other server. It's not smart enough to say "Hey, let's skip all the requests to this server because it's being too slow"

So it just keeps sending them, one at a time, and waiting. While it does have multiple threads to do this, it seems all are taken up with the huge load.

@viomi @noelle glad we all decided to do everything over HTTP instead of TCP (not on you guys, on the industry as a whole)

@jordyd @noelle The solution is, we add a feature to mastodon where if after a certain amount of times trying to make a connection, it temporarily stops trying to send those requests to server 2 and somehow stores them to wait to send them until the requests to said server are going faster. Not sure if you'd throw them all in the postresql or like.. Take up a shitton of RAM temporarily

@viomi @noelle who needs a database when you have a swap file amirite

@noelle If possible, I would appreciate an update when this is resolved. Don't go out of your way, but if you do hear more it would be nice if you shared.

@minego when I, from m.s, see this and want to reassure you that you'll get a message when it's resolved. But... like..... Yeah.... You wont see this for like 30min or something @noelle

@maloki @noelle I just got this, but I don't know how long ago you wrote it.

Lb: so I guess my friends on m.s could be trying to talk to me and I can't hear them oh cool.

@noelle This sounds like the Fediverse has a denial-of-service vulnerability

@noelle Is it .cloud's fault? .cloud feels like it's taking ten seconds to respond to each request right now.

@ocdtrekkie @noelle i've seen a relatively large number of instances who's tls certs expired in my pleroma log that is probably causing it tbh.

@jeff @noelle My impression was most of those instances are just gone outright. I migrated my follows here, and while trying to track down the missing ones, it just seems like a lot of them don't exist anymore, but Mastodon views it as a TLS failure.

@noelle please let it not be my instance... please let it not be my instance...

@noelle
Oh wow, feels like slowloris :D
Seriously tho, one single instance can slow down the entire fediverse?