Jan 29, 2026
What attack strategies have adversaries used to exploit reputation‑based moderation systems?
by manipulating the signals platforms use to trust accounts—aging and “priming” sockpuppets, camouflaging language to fool classifiers, and gaming community reporting or automated nudges—thereby bypas...