matrix-org · Yoric · May 24, 2021 · May 24, 2021 · May 24, 2021 · May 24, 2021
diff --git a/proposals/3215-towards-decentralized-moderation.md b/proposals/3215-towards-decentralized-moderation.md
@@ -70,101 +70,128 @@ can be invited to moderation rooms act upon abuse reports:
 
 ### Invariants
 
-- Each room MAY have a state event `m.room.moderation_room`. If specified, this is the room ID towards which
-    abuse reports MUST be sent. As rooms may be deleted `m.room.moderation_room` MAY be an invalid room ID.
+- Each room MAY have a state event `m.room.moderated_by`. If specified, this is the room ID towards which
+    abuse reports MUST be sent. As rooms may be deleted `m.room.moderated_by` MAY be an invalid room ID.
+    A room that has a state event `m.room.moderated_by` supports moderation.
 
 ```jsonc
 {
-    "state_key": "m.room.moderation_room",
-    "type": "m.room.moderation_room",
+    "state_key": "m.room.moderated_by",
+    "type": "m.room.moderated_by",
     "content": {
         "room_id": XXX, // The room picked for moderation.
+        "user_id": XXX, // The bot in charge of forwarding reports to `room_id`.
     }
     // ... usual fields
 }
 ```
 
-### Client behavior
+- Each room MAY have state events `m.room.moderator_of`. A room that has a state event `m.room.moderation.
+
+```jsonc
+{
+    "state_key": "m.room.moderation.moderator_of.XXX", // XXX is the ID of the Community Room, i.e. the room being moderated.
+    "type": "m.room.moderation.moderator_of",
+    "content": {
+        "user_id": XXX, // The bot in charge of forwarding reports to this room.
+    }
+    // ... usual fields
+}
+```
 
+### Client behavior
 
 #### Opting in for moderation
 
-When a user Alice creates a room or when a room moderator accesses the room's configuration, they MAY opt-in for moderation.
-When they do, they MUST pick a moderation room. The client SHOULD check that the moderation room is a room in which Alice
-has a powerlevel sufficient for sending messages.
+When a user Alice creates a room ("the Community Room") or when a room moderator accesses the Community Room's configuration,
+they MAY opt-in for moderation. When they do, they MUST pick a Moderation Room. The Client SHOULD check that:
+- the Moderation Room is a room in which Alice has a powerlevel sufficient for sending messages;
+- the Moderation Room has a state event `m.room.moderation.moderator_of`.
 
-This room ID is materialized as a state event `m.room.moderation_room`, as described above.
+If Alice has opted-in for moderation, mased on the Moderation Room's Room ID and `m.room.moderation.moderator_of`, the Client
+MUST create a state event `m.room.moderated_by` (see above) in the Community Room.
 
-Similarly, if a moderator has opted in for moderation in a room, a moderator MAY opt out of moderation for that room.
-This is materialized as deleting `m.room.moderation_room`.
+Similarly, if a moderator has opted in for moderation in a Community Room, a moderator MAY opt out of moderation for that
+Community Room. This is materialized as deleting `m.room.moderated_by`.
+
+#### Rejecting moderation
+
+A member of a Moderation Room may disconnect the Moderation Room from a Community Room by removing state event
+`m.room.moderation.moderator_of.XXX`. This may serve to reconfigure moderation if a Community Room is deleted
+or grows sufficiently to require its dedicated moderation team/bots.
 
 #### Reporting an event
 
-Any member of a room that supports moderation MAY report an event from that room, by sending a `m.abuse.report` event
+Any member of a Community Room that supports moderation MAY report an event from that room, by sending a `m.abuse.report` event
 with content
 
+| field    | Description |
+|----------|-------------|
 | event_id | **Required** id of the event being reported. |
 | room_id  | **Required** id of the room in which the event took place. |
+| moderated_by_id | **Required** id of the moderation room, as taken from `m.room.moderated_by`. |
 | nature   | **Required** The nature of the event, see below. |
 | comment  | Optional. String. A freeform description of the reason for sending this abuse report. |
 
 `nature` is an enum:
 
-- `abuse.disagreement`: disagree with other user;
-- `abuse.toxic`: toxic behavior, including insults, unsollicited invites;
-- `abuse.illegal`: illegal behavior, including child pornography, death threats,...;
-- `abuse.spam`: commercial spam, propaganda, ... whether from a bot or a human user;
-- `abuse.room`: report the entire room, e.g. for voluntarily hosting behavior that violates server ToS;
-- `abuse.other`: doesn't fit in any category above.
+- `m.abuse.disagreement`: disagree with other user;
+- `m.abuse.toxic`: toxic behavior, including insults, unsollicited invites;
+- `m.abuse.illegal`: illegal behavior, including child pornography, death threats,...;
+- `m.abuse.spam`: commercial spam, propaganda, ... whether from a bot or a human user;
+- `m.abuse.room`: report the entire room, e.g. for voluntarily hosting behavior that violates server ToS;
+- `m.abuse.other`: doesn't fit in any category above.
 
 We expect that this enum will be amended by further MSCs.
 
 The rationale for requiring a `nature` is twofold:
 
-- a Client may give to give a users the opportunity to think a little about whether the behavior they is truly abuse;
+- a Client may give to give a users the opportunity to think a little about whether the behavior they report truly is abuse;
 - this gives the Client the ability to split between
-    - `abuse.room`, which should be routed to an administrator;
+    - `abuse.room`, which should be routed to an administrator (in the current MSC, using the existing moderation API);
     - `abuse.disagreement`, which may better be handled by blurring messages from offending user;
     - everything else, which needs to be handled by a room moderator or a bot.
 
-Any `m.abuse.report` message sent to a moderation room is an abuse report.
-
-This proposal does not specify behavior when `m.room.moderation_room` is not set or when the room doesn't exist.
+To send an `m.abuse.report`, the Client posts the `m.abuse.report` message as DM to the `user_id` specified in the 
+`m.room.moderated_by`.
 
+This proposal does not specify behavior when `m.room.moderated_by` is not set or when the `user_id` doesn't exist.
 
-### Server behavior
+### Built-in routing bot behavior
 
-#### Routing messages
+Users should not need to join the moderation room to be able to send `m.abuse.report` messages to it, as it would
+let them snoop on reports from other users. Rather, we introduce a built-in bot as part of this specification: the
+Routing Bot. This Routing Bot is part of the server and has access to priviledged information such as room membership.
 
-When user Alice attempts to send a `m.abuse.report` message _M_ to room _R_:
+1. When the Routing Bot is invited to a room, it always accepts invites.
+2. When the Routing Bot receives a message other than `m.abuse.report`, it ignores the message.
+3. When the Routing Bot receives a message _M_ with type `m.abuse.report` from Alice:
+    - If the Routing Bot is not a member of _M_`.moderated_by_id`, reject the message.
+    - If Alice is not a member of _M_.`room_id`, reject the message.
+    - If room _M_.`moderated_by_id`  does not contain a state event `m.room.moderation.moderator_of.XXX`, where `XXX`
+        is _M_.`room_id`
+            - Reject the message.
+        - Otherwise
+            - Call _S_ the above state event
+            - If _S_ does not have type `m.room.moderation.moderator_of`, reject the message.
+            - If _S_ is missing field `user_id`, reject the message.
+            - If _S_.`user_id` is not the id of the Routing Bot, reject the message.
+            - If event _M_.`event_id` did not take place in room _M_.`room_id`, reject the message.
+            - If Alice could not witness event _M_.`event_id`, reject the message.
+            - Copy the message to room _M_.
 
-- if Alice is not a member of _M_`.room_id`, reject the message;
-- if room _M_.`room_id` does not have a state event `m.room.moderation_room`, reject the message;
-- if room _M_.`room_id` has a state event `m.room.moderation_room` and its value is other than _R_, reject the message;
-- if event _M_.`event_id` did not take place in room _M_`.room_id`, reject the message;
-- if Alice could not witness event _M_.`event_id`, reject the message;
-- otherwise, send the message to room _R_ **even if Alice is not a member of room _R_**.
 
-**Note** This may needs a new API comparable to https://spec.matrix.org/unstable/server-server-api/#knocking-upon-a-room . To be specified.
-
-### Possible bot behavior
+### Possible Moderation Bot behavior
 
 This section is provided as an illustration of the spec, not as part of the spec.
 
-A possible setup would involve two bots, both members of a moderation room _MR_.
+A possible setup would involve two Moderation Bots, both members of a moderation room _MR_.
 
-- A classifier bot consumes `m.abuse.report` messages, discards messages from users who have joined recently or never
+- A Classifier Bot consumes `m.abuse.report` messages, discards messages from users who have joined recently or never
     been active in the room (possible bots/sleeping bots), then collates reports against users. If there are more than
     e.g. 10 reports in the last hour against a single user, post a `m.policy.rule.user` message in the same room specifying that the user
     should undergo temporary ban.
-- Another bot consumes `m.policy.rule.user` messages and implement bans.
-
-## Open questions
-
-- If all the moderators of room _R_ leave its moderation room _MR_ or are kick/banned from _MR_, we can end up with an orphan
-    room _R_, which sends its moderation on _MR_ but doesn't have moderators in _MR_. Do we need to handle this?
-- Should we allow the members or moderators of a moderation room _MR_ to reject a room _R_ from moderation? If so,
-    how do we implement this?
+- A Ban Bot consumes `m.policy.rule.user` messages and implements bans.
 
 ## Security considerations
 
@@ -176,7 +203,8 @@ room. There is the possibility that this mechanism could be abused.
 We believe that it cannot readily be abused for spam, as these are structured data messages, which are usually not visible to members
 of the moderation room.
 
-However, it is possible that it can become a vector for attacks if combined with a bot that treats said structured data messages.
+However, it is possible that it can become a vector for attacks if combined with a bot that treats said structured data messages,
+e.g. a Classifier Bot and/or a Ban Bot.
 
 ### Revealing oneself
 
@@ -209,18 +237,28 @@ As bots are invited to moderation rooms, a compromised bot has access to all mod
 
 ## Alternatives
 
+### MSC 2938
 MSC 2938 (by the same author) has previously been posted to specify a mechanism for reporting events to room moderators. The current MSC is considered
-    - simpler to implement;
     - more reliable (it does not need to roll out its own federation communication);
-    - less specialized.
+    - less specialized/more general.
 
 I am not aware of other proposals that cover the same needs.
 
+### Alternatives to the Routing Bot
+
+The "knocking" protocol is an example of an API that lets users inject state events in a room in which they do
+not belong. It is possible that we could follow the example of this protocol and implement a similar "abuse" API.
+
+However, this would require implementing yet another new communication protocol based on PDUs/EDUs, including a
+(small) custom encryption/certificate layer and another retry mechanism. The author believes that this would entail
+a higher risk and result in code that is harder to test and trust.
+
 ## Unstable prefix
 
 During experimentation
 
-- `m.room.moderation_room` will be prefixed `org.matrix.msc3215.room.moderation_room`;
+- `m.room.moderated_by` will be prefixed `org.matrix.msc3215.room.moderated_by`;
+- `m.room.moderator_of` will be prefixed `org.matrix.msc3215.room.moderator_of`;
 - `m.abuse.report` will be prefixed `org.matrix.msc3215.abuse.report`;
 - `abuse.*` will be prefixed `org.matrix.msc3215.abuse.nature.*`.