automod

package

v0.0.0-...-9815da9 Latest Latest Go to latest Published: Jun 4, 2024 License: Apache-2.0, MIT Imports: 2 Imported by: 0

Details

Valid go.mod file

The Go module system was introduced in Go 1.11 and is the official dependency management solution for Go.
Redistributable license

Redistributable licenses place minimal restrictions on how software can be used, modified, and redistributed.
Tagged version

Modules with tagged versions give importers more predictable builds.
Stable version

When a project reaches major version v1 it is considered stable.
Learn more about best practices

Repository

github.com/bluesky-social/indigo

Links

Open Source Insights

README ¶

`indigo/automod`: rules engine for anti-spam and other moderation tasks

automod is a "rules engine" framework, used to augment human moderators in the atproto network by proactively identifying patterns of behavior and content. Batches of rules are processed for novel "events" such as a new post or update of an account handle. Counters and other statistics are collected, which can drive subsequent rule invocations. The outcome of rules can be moderation events like "report account for human review" or "label post". Much of what this framework does is simply aggregating and maintaining caches of relevant metadata about accounts and pieces of content, so that rules have efficient access to this information.

A primary design goal is to have a flexible framework to allow new rules to be written and deployed rapidly in response to new patterns of spam and abuse.

Some example rules are included in the automod/rules package, but the expectation is that some real-world rules will be kept secret.

Code for subscribing to a firehose is not included here; see ../cmd/hepa for a service daemon built on this package.

API reference documentation can be found on pkg.go.dev.

Architecture

The runtime (automod.Engine) manages network requests, caching, and configuration. Outside calling code makes concurrent calls to the Process* methods that the runtime provides. The runtime constructs event context structs (eg, automod.RecordContext), hydrates relevant metadata from (cached) external services, and then executes a configured set of rules on the event. Rules may request additional information (via the Context object), do arbitrary local compute, and then update the context with "effects" (such as moderation actions). After all rules have run, the runtime will inspect the context and persist any side-effects, such as updating counter state and pushing any new moderation actions to external services.

The runtime maintains state in several "stores", each of which has an interface and both in-memory and Redis implementations. The automod stores are semi-ephemeral: they are persisted and are important state for rules to work as expected, but they are not a canonical or long-term store for moderation decisions or actions. It is expected that Redis is used in virtually all deployments. The store types are:

automod/cachestore: generic data caching with expiration (TTL) and explicit purging. Used to cache account-level metadata, including identity lookups and (if available) private account metadata
automod/countstore: keyed integer counters with time bucketing (eg, "hour", "day", "total"). Also includes probabilistic "distinct value" counters (eg, Redis HyperLogLog counters, with roughly 2% precision)
automod/setstore: configurable static string sets. May eventually be runtime configurable
automod/flagstore: mechanism to keep track of automod-generated "flags" (like labels or hashtags) on accounts or records. Mostly used to detect new flags. May eventually be moved in to the moderation service itself, similar to labels

Prior Art

The SQRL language and runtime was originally developed by an industry vendor named Smyte, then acquired by Twitter, with some core Javascript components released open source in 2023. The SQRL documentation is extensive and describes many of the design trade-offs and features specific to rules engines. Bluesky considered adopting SQRL but decided to start with a simpler runtime with rules in a known language (golang).
Reddit's automod system is simple an accessible for non-technical sub-reddit community moderators. Discord has a large ecosystem of bots which can help communities manage some moderation tasks, in particular mitigating spam and brigading.
Facebook's FXL and Haxl rule languages have been in use for over a decade. The 2012 paper "The Facebook Immune System" gives a good overview of design goals and how a rules engine fits in to a an overall anti-spam/anti-abuse pipeline.
Email anti-spam systems like SpamAssassin and rspamd have been modular and configurable for several decades.

Documentation ¶

Overview ¶

Auto-Moderation rules engine for anti-spam and other moderation tasks.

This package (`github.com/bluesky-social/indigo/automod`) contains a "rules engine" to augment human moderators in the atproto network. Batches of rules are processed for novel "events" such as a new post or update of an account handle. Counters and other statistics are collected, which can drive subsequent rule invocations. The outcome of rules can be moderation events like "report account for human review" or "label post". A lot of what this package does is collect and maintain caches of relevant metadata about accounts and pieces of content, so that rules have efficient access to this information.

See `automod/README.md` for more background, and `cmd/hepa` for a daemon built on this package.

Index ¶

Variables
type AccountContext
type AccountMeta
type BlobRuleFunc
type Engine
type IdentityRuleFunc
type NotificationContext
type NotificationRuleFunc
type Notifier
type PostRuleFunc
type ProfileRuleFunc
type RecordContext
type RecordOp
type RecordRuleFunc
type RuleSet
type SlackNotifier

Constants ¶

This section is empty.

Variables ¶

View Source

var (
	ReportReasonSpam       = engine.ReportReasonSpam
	ReportReasonViolation  = engine.ReportReasonViolation
	ReportReasonMisleading = engine.ReportReasonMisleading
	ReportReasonSexual     = engine.ReportReasonSexual
	ReportReasonRude       = engine.ReportReasonRude
	ReportReasonOther      = engine.ReportReasonOther

	PeriodTotal = countstore.PeriodTotal
	PeriodDay   = countstore.PeriodDay
	PeriodHour  = countstore.PeriodHour

	CreateOp = engine.CreateOp
	UpdateOp = engine.UpdateOp
	DeleteOp = engine.DeleteOp
)

Functions ¶

This section is empty.

Types ¶

type AccountContext ¶

type AccountContext = engine.AccountContext

type AccountMeta ¶

type AccountMeta = engine.AccountMeta

type BlobRuleFunc ¶

type BlobRuleFunc = engine.BlobRuleFunc

type Engine ¶

type Engine = engine.Engine

type IdentityRuleFunc ¶

type IdentityRuleFunc = engine.IdentityRuleFunc

type NotificationContext ¶

type NotificationContext = engine.NotificationContext

type NotificationRuleFunc ¶

type NotificationRuleFunc = engine.NotificationRuleFunc

type Notifier ¶

type Notifier = engine.Notifier

type PostRuleFunc ¶

type PostRuleFunc = engine.PostRuleFunc

type ProfileRuleFunc ¶

type ProfileRuleFunc = engine.ProfileRuleFunc

type RecordContext ¶

type RecordContext = engine.RecordContext

type RecordOp ¶

type RecordOp = engine.RecordOp

type RecordRuleFunc ¶

type RecordRuleFunc = engine.RecordRuleFunc

type RuleSet ¶

type RuleSet = engine.RuleSet

type SlackNotifier ¶

type SlackNotifier = engine.SlackNotifier

Source Files ¶

View all Source files

Directories ¶

Path	Synopsis
cachestore Automod component for caching arbitrary data (as JSON strings) with a fixed TTL and purging.	Automod component for caching arbitrary data (as JSON strings) with a fixed TTL and purging.
capture Automod development helpers for fetching and saving snapshots of real-world content and metadata.	Automod development helpers for fetching and saving snapshots of real-world content and metadata.
countstore Interface for fast atomic counters, and separate implementations using redis and in-process memory.	Interface for fast atomic counters, and separate implementations using redis and in-process memory.
engine Core automod rules engine implementation and related types	Core automod rules engine implementation and related types
flagstore Interface for storing "flags", a form of private automod metadata.	Interface for storing "flags", a form of private automod metadata.
keyword String processing helpers for doing fuzzy detection and normalized token matching against keyword lists.	String processing helpers for doing fuzzy detection and normalized token matching against keyword lists.
cmd/kw-cli
rules Example automod rules and helpers.	Example automod rules and helpers.
setstore Interface for simple sets of strings, with fast inclusion checks.	Interface for simple sets of strings, with fast inclusion checks.
visual automod helpers for visual content (image blobs)	automod helpers for visual content (image blobs)

?	: This menu
/	: Search site
f or F	: Jump to
y or Y	: Canonical URL