Bluesky’s Stackable Approach to Moderation

March 12, 2024

by The Bluesky Team

Bluesky was created to put users and communities in control of their social spaces online. The first generation of social media platforms connected the world, but ended up consolidating power in the hands of a few corporations and their leaders. Our online experience doesn’t have to depend on billionaires unilaterally making decisions over what we see. On an open social network like Bluesky, you can shape your experience for yourself.

Today, we’re excited to announce that we’re open-sourcing Ozone, our collaborative moderation tool. With Ozone, individuals and teams can work together to review and label content across the network. Later this week, we’re opening up the ability for you to run your own independent moderation services, seamlessly integrated into the Bluesky app. This means that you'll be able to create and subscribe to additional moderation services on top of what Bluesky requires, giving you unprecedented control over your social media experience.

At Bluesky, we’re investing in safety from two angles. First, we've built our own moderation team dedicated to providing around-the-clock coverage to uphold our community guidelines. Additionally, we recognize that there is no one-size-fits-all approach to moderation — no single company can get online safety right for every country, culture, and community in the world. So we’ve also been building something bigger — an ecosystem of moderation and open-source safety tools that gives communities power to create their own spaces, with their own norms and preferences. Still, using Bluesky feels familiar and intuitive. It's a straightforward app on the surface, but under the hood, we have enabled real innovation and competition in social media by building a new kind of open network.

In designing these moderation services, Bluesky operated by three principles:

Simple and Powerful: Give users a pleasant default experience, with customization options under the hood
User Choice: Empower users and communities to develop their own moderation systems
Openness: Create an open system that increases trust in the governance of our digital spaces

We first shared our vision for composable moderation last year, before Bluesky even had 20,000 users. Now, we serve over 5 million users. In this blog post, we’ll dive into how moderation works on the network, where you can choose and customize all the pieces that make up your social media experience.

Sensible Defaults with User Choice

In everything we build, we aim to provide a polished user experience with further customization options for those of you who want them. When you sign up for Bluesky, you will be subscribed to Bluesky’s built-in moderation service by default. This is similar to how custom feeds work on Bluesky: we’ll show you a couple of feeds by default, but you can also create and subscribe to more. Bluesky’s moderation service combines several automated moderation systems with around-the-clock coverage by our team, which resolves user reports according to our community guidelines. This provides a strong foundation for moderation on the app. You can read our 2023 Moderation Report for more details.

Bluesky’s vision for moderation is a stackable ecosystem of services. Starting this week, you'll have the power to install filters from independent moderation services, layering them like building blocks on top of the Bluesky app's foundation. This allows you to create a customized experience tailored to your preferences (see example below).

In the first stage of this week’s rollout, filters from these independent moderation services will be available on the desktop version of the app. Soon, they’ll also be available on mobile, so you can shape your social media experience across all platforms.

This hybrid approach is intended to provide a cohesive experience, where our in-house moderation works in conjunction with additional layers customized to each community. The Bluesky app, as an online space that we created and maintain, will always have the foundation of the moderation we provide. Independent moderation services will let you and community builders further customize your own spaces, and open APIs will let developers evolve and innovate on these systems.
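To illustrate the stacking idea, here is a minimal Python sketch of how a client might combine labels from several subscribed services into one decision per post. It is not the Bluesky app's implementation: the `Labeler` class, the `decide` function, the `show`/`warn`/`hide` actions, the service names, and the "spam" label are all assumptions made for this example. Only The Chiller and its "rude" label come from the screenshots described in this post.

```python
from dataclasses import dataclass, field

# Hypothetical visibility actions, ordered from least to most restrictive.
SEVERITY = {"show": 0, "warn": 1, "hide": 2}


@dataclass
class Labeler:
    """A hypothetical moderation service that a user can subscribe to."""
    name: str
    # Maps a label value (e.g. "rude") to the action the user picked for it.
    preferences: dict = field(default_factory=dict)


def decide(post_labels, stack):
    """Return the most restrictive action requested by any subscribed labeler.

    post_labels: list of (labeler_name, label_value) pairs attached to a post.
    stack: the labelers the user subscribes to, default service included.
    """
    subscribed = {labeler.name: labeler for labeler in stack}
    action = "show"
    for source, value in post_labels:
        labeler = subscribed.get(source)
        if labeler is None:
            continue  # Labels from services the user has not installed are ignored.
        wanted = labeler.preferences.get(value, "show")
        if SEVERITY[wanted] > SEVERITY[action]:
            action = wanted
    return action


# The default service plus one independent service layered on top.
default = Labeler("bluesky-default", {"spam": "hide"})
chiller = Labeler("the-chiller", {"rude": "warn"})

# This post carries a label from The Chiller and one from an uninstalled service.
post_labels = [("the-chiller", "rude"), ("spider-shield", "spider")]
print(decide(post_labels, [default, chiller]))  # warn
```

In this sketch the strictest requested action wins, so adding a service can only make filtering tighter. That is one possible design choice for combining layers, not a description of how the app resolves conflicts.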

Configurable options per label from an example moderation service.

Example of a post that the moderation service The Chiller has labeled as “rude.”

One team will never be perfect at moderation and curation for the entire world, with its wide variety of contexts, cultures, and preferences. So we’re excited about opening the ecosystem to empower experts, developers, and users with local context to provide their own input that you can additionally subscribe to, on top of Bluesky’s moderation service.

For additional information, read our technical explainer for moderation architecture across the AT Protocol here.

How will this all work?

Here’s what users, moderators, and developers can expect to see this week:

From a user perspective:

First and foremost, we want Bluesky to be a great and intuitive experience as soon as you install the app. But if you want to customize your experience, you can easily browse and select from other independent moderation services and subscribe to them in the Bluesky app — as easily as you’d follow another account.

For example, someone could make a moderation service that blocks photos of spiders from Bluesky. Let’s call it the Spider Shield. If you get a jump scare from seeing spiders in your otherwise peaceful nature feed, you could install this moderation service, and any labeled spider pictures would immediately disappear from your experience.

Moderation services can also accept reports, so if you came across an unlabeled picture of a spider, you could report it to the Spider Shield for review.

From a moderator perspective:

If you want to offer a moderation layer on top of what the Bluesky app provides, you can do this without running a lot of infrastructure or building your own client app. We’ve built open source software to simplify the process of running a moderation service. While you need some technical know-how for now, we expect this process to get simpler over time.

Screenshot of the Ozone interface, which a team can use to inspect and label content on the network.

Today, you can already run a mute list or block list that other users can subscribe to. But it often gets tied to your account in a way that makes it hard to delegate responsibility to others. Once you’re running a popular blocklist, that list becomes associated with your account, and users may start directly tagging you in the app. This can get overwhelming at scale.

Ozone, the open source moderation labeling system we’re releasing today, lets you set up a service like a blocklist, but more nuanced — instead of just adding accounts, you can label specific posts too. You will have access to a reporting queue, and users will be able to send reports via the in-app reporting flow. You will be able to set custom labels, and specify what those labels should do. Moderation services will not be tied to individual users, and multiple people can manage them. Tooling designed for teams and communities can help take the burden off individuals and make it possible to run a sustainable moderation service.

To make this more concrete, let’s say you’re the creator of the previously mentioned Spider Shield labeler that labels photos of spiders. You can set up an Ozone dashboard that provides a queue of spider pictures that have been reported, reducing the need for people to tag you directly every time they find a new spider picture online. It’s customizable — you could create one kind of label that blocks pictures of real spiders, and one kind of label that blurs out illustrations of spiders. You could recruit others who don’t like spiders to help you manage the reports, and even hand the project off to someone else altogether without disrupting the people who are using it.
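The Spider Shield workflow above can be sketched in a few lines of Python. This is a toy model, not Ozone's data model or API: the `LabelDefinition` and `ReportQueue` classes, the label names, the `hide` and `blur` effects, and the post identifiers are all hypothetical. It shows two custom labels with different effects and a shared queue that several moderators can work through.

```python
from collections import deque
from dataclasses import dataclass


@dataclass(frozen=True)
class LabelDefinition:
    """A hypothetical custom label and what it does to labeled content."""
    value: str
    effect: str  # "hide" or "blur" in this sketch
    description: str


# Two kinds of label, as in the Spider Shield example.
LABELS = {
    "spider-photo": LabelDefinition("spider-photo", "hide", "Photo of a real spider"),
    "spider-illustration": LabelDefinition("spider-illustration", "blur", "Drawing of a spider"),
}


class ReportQueue:
    """A shared queue of reported posts that any team member can review."""

    def __init__(self):
        self._pending = deque()
        self.applied = {}   # post identifier -> LabelDefinition
        self.history = []   # (moderator, post identifier, label value or None)

    def report(self, post_uri, reporter):
        """Add a user report to the back of the queue."""
        self._pending.append((post_uri, reporter))

    def review(self, moderator, label_value=None):
        """Resolve the oldest report; label_value=None dismisses it unlabeled."""
        post_uri, _reporter = self._pending.popleft()
        if label_value is not None:
            self.applied[post_uri] = LABELS[label_value]
        self.history.append((moderator, post_uri, label_value))
        return post_uri

    def __len__(self):
        return len(self._pending)


queue = ReportQueue()
queue.report("post/1", "alice")
queue.report("post/2", "bob")

# Two different moderators share the work.
queue.review("mod-a", "spider-photo")
queue.review("mod-b", "spider-illustration")

print(queue.applied["post/1"].effect)  # hide
print(queue.applied["post/2"].effect)  # blur
```

Because the queue and label definitions belong to the service rather than to one person, handing the project to a new team only changes who calls `review`.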

From a developer perspective:

As a developer, you've got options when it comes to labeling content. You can use our software, like Ozone, or you can apply labels directly through the API. Ozone is built to help humans review moderation reports, but you can also use automated labeling to power your moderation services. Check out Ozone's open-source repo here.

If you want to set up Spider Shield as a fully automated service that uses machine learning to find and label spider pictures, you can do that without even touching Ozone, our moderation tool. And if you want to customize Ozone for your own purposes, you can submit a PR or fork the project. Our goal in building this open source moderation tooling is to help apps in the AT Protocol ecosystem handle trust & safety challenges without having to start from scratch.
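As a rough sketch of what a fully automated Spider Shield could look like, the Python below runs a classifier over incoming posts and emits a label for each confident match. Everything here is an assumption made for illustration: the `Label` fields, the `auto_label` function, the threshold, and the service identifier are hypothetical, and `fake_classifier` is a stand-in for a real machine learning model. Publishing the resulting labels through the actual API is out of scope for this example.

```python
from dataclasses import dataclass
from datetime import datetime, timezone
from typing import Callable, Iterable, List, Tuple


@dataclass(frozen=True)
class Label:
    """A hypothetical label record produced by an automated service."""
    src: str  # identifier of the labeling service
    uri: str  # identifier of the labeled post
    val: str  # label value, e.g. "spider"
    cts: str  # creation timestamp, ISO 8601


def auto_label(
    posts: Iterable[Tuple[str, bytes]],
    classifier: Callable[[bytes], float],
    src: str,
    threshold: float = 0.9,
) -> List[Label]:
    """Label every post whose image scores at or above the threshold."""
    labels = []
    for uri, image in posts:
        if classifier(image) >= threshold:
            now = datetime.now(timezone.utc).isoformat()
            labels.append(Label(src=src, uri=uri, val="spider", cts=now))
    return labels


def fake_classifier(image: bytes) -> float:
    """Stand-in for a real model: returns a made-up spider probability."""
    return 0.97 if b"spider" in image else 0.02


posts = [("post/1", b"spider on a leaf"), ("post/2", b"mountain lake")]
labels = auto_label(posts, fake_classifier, src="spider-shield.example")
print([label.uri for label in labels])  # ['post/1']
```

Keeping the classifier as a plain function argument means the same loop works whether the scores come from a model, a keyword rule, or a human-reviewed list.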

The generic, customizable nature of labels allows you to get creative with them — it would be possible to use labels to “verify” nature accounts that don’t post pictures of spiders, for example. Although the initial functionality of labelers is intended to hide, block, or blur content, it could eventually be used for curation or verification too. We think that the atproto developer ecosystem will find even more ways to use labels and independent moderation services, and will drive innovation in how moderation works on social networks.

Moderation services can work across the entire atproto network, not just the Bluesky app. Imagine if someone creates a new photo-sharing app called Skygram. The Spider Shield moderation service built for Bluesky could easily be used on Skygram too. That's the power of "composable moderation" — all the pieces can be mixed and matched in tons of different ways, even across completely separate apps.

We think it's important for Bluesky to lay the groundwork for a great experience in the app, which is why our moderation service is the default for all Bluesky app users. But we also believe in giving users the freedom to choose and the right to leave. So if Bluesky's moderation doesn’t meet your needs and you want an altogether different experience, you can make that happen. You'll need to build or use a different client app with your own moderation service, but this option gives you full flexibility to implement your own moderation system from the ground up. However, all content shown in the Bluesky app must adhere to Bluesky’s community guidelines.

FAQ

Where can I find the open-sourced Ozone tool?

You can find the GitHub repository here.

Why are you open-sourcing Ozone?

By making Ozone open-source and providing it as a ready-to-use tool for independent moderators on the AT Protocol, we're creating a system that encourages collaboration and transparency. Unlike most social media companies that develop their safety tools in private, Ozone's development will be out in the open. This means that new social apps built on the AT Protocol can benefit from all the improvements made by Bluesky and other organizations. By working together and sharing knowledge, we can create better tools faster and build a social media ecosystem that works for everyone.

How is this different from community moderation on Mastodon?

Moderation on Bluesky is not tied to your server, like it is on Mastodon. Defederation, a way of addressing moderation issues in Mastodon by disconnecting servers, is not as relevant on Bluesky because there are other layers to the system. Server operators can set rules for what content they will host, but tools like blocklists and moderation services are what help communities self-organize around moderation preferences. Our post on federation goes into more detail on how Bluesky differs from Mastodon.

What do I need to do to moderate my own community on Bluesky?

All users of Bluesky’s client app are subscribed by default to Bluesky’s moderation. If you would like to run a moderation service that layers on top of these defaults, you can create a new account for that new service and get it up and running with Ozone.

However, if you want to opt out of our defaults, this is still possible — we believe it’s important to give users the right to leave and not lock you in. You would need to use or develop a separate client app that connects to the AT Protocol, but this option gives you full flexibility to implement your own moderation system from the ground up.

How will running a moderation service be sustainable?

Moderation services, much like feed generators, will likely start off as community-run projects. Just like the 40,000+ custom feeds on Bluesky, or the many Mastodon instances that exist, they may continue to operate as independent projects of individuals or organizations. However, there is also nothing stopping a moderation service from having paid subscribers.
