[RFC] Use of Automated Moderation Tools

Crashdoom (he/him) · 1 year ago

[RFC] Use of Automated Moderation Tools

Crashdoom (he/him) · 1 year ago

#2: I’ve had some light experience before specifically with TensorFlow Lite models during my degree program. For the Coral Edge TPU, we wanted to off-load the processing to try to get the speed as near to zero latency as possible, though admittedly, it would potentially be superfluous. I’m also looking into some existing models I could potentially use but hadn’t found any that particularly stood out, but if anyone has any recommendations I’d love to check them out!

#3: Good question; If the system flags a post automatically as potentially spam, and the team determine it’s not spam, I would probably like to be able to train on that message as “ham” / not-spam to avoid future false positives. But, that would be an extension of the scope of what we’d train on, so I’d very much like feedback on that too.

#4: Yes, when a user is limited the profile will show a content warning before the contents of the profile. I believe the prompt is something like “This user has been hidden by the moderators of [instance name]”. For repeated mis-identifications, yes, edge-cases like this we could approve the user and exempt them from future automated reports.

[RFC] Use of Automated Moderation Tools

[RFC] Use of Automated Moderation Tools

1. Monitoring of Public Streaming Feed

2. Building of a local AI spam-detection model

3. Use of local posts for non-spam training

4. Temporarily limiting suspected spam accounts