[RFC] Use of Automated Moderation Tools

Crashdoom (he/him) · 1 year ago

[RFC] Use of Automated Moderation Tools

Elbrar · 1 year ago

I think 1-3 are fine (since nothing really happens without a human involved), but 4 should come in after several months of testing the model to make sure its false positive rate is as close to 0 as possible.

I in general think that LLM/“AI” stuff is massively overblown when used for creating content, but when analyzing stuff, it’s much more reasonable to employ as a referral to humans to make the final decision.

I guess I’ve just been lucky in that I’ve not gotten any spam yet on masto…

[RFC] Use of Automated Moderation Tools

[RFC] Use of Automated Moderation Tools

1. Monitoring of Public Streaming Feed

2. Building of a local AI spam-detection model

3. Use of local posts for non-spam training

4. Temporarily limiting suspected spam accounts