Internet Engineering Task Force (IETF)                         J. Lennox
Request for Comments: 8861                                   8x8 / Jitsi
Category: Standards Track                                  M. Westerlund
ISSN: 2070-1721                                                 Ericsson
                                                                   Q. Wu
                                                                  Huawei
                                                              C. Perkins
                                                   University of Glasgow
                                                            January 2021

Sending Multiple RTP Streams in a Single RTP Session: Grouping RTP

Control Protocol (RTCP) Reception Statistics and Other Feedback

Abstract

RTP allows multiple RTP streams to be sent in a single session but requires each Synchronization Source (SSRC) to send RTP Control Protocol (RTCP) reception quality reports for every other SSRC visible in the session. This causes the number of RTCP reception reports to grow with the number of SSRCs, rather than the number of endpoints. In many cases, most of these RTCP reception reports are unnecessary, since all SSRCs of an endpoint are normally co-located and see the same reception quality. This memo defines a Reporting Group extension to RTCP to reduce the reporting overhead in such scenarios.

Status of This Memo

This is an Internet Standards Track document.

This document is a product of the Internet Engineering Task Force (IETF). It represents the consensus of the IETF community. It has received public review and has been approved for publication by the Internet Engineering Steering Group (IESG). Further information on Internet Standards is available in Section 2 of RFC 7841.

Information about the current status of this document, any errata, and how to provide feedback on it may be obtained at https://www.rfc-editor.org/info/rfc8861.

Copyright Notice

Copyright © 2021 IETF Trust and the persons identified as the document authors. All rights reserved.

This document is subject to BCP 78 and the IETF Trust's Legal Provisions Relating to IETF Documents (https://trustee.ietf.org/license-info) in effect on the date of publication of this document. Please review these documents carefully, as they describe your rights and restrictions with respect to this document. Code Components extracted from this document must include Simplified BSD License text as described in Section 4.e of the Trust Legal Provisions and are provided without warranty as described in the Simplified BSD License.

Table of Contents

   1.  Introduction
   2.  Terminology
   3.  RTCP Reporting Groups
     3.1.  Semantics and Behavior of RTCP Reporting Groups
     3.2.  Identifying Members of an RTCP Reporting Group
       3.2.1.  Definition and Use of the RTCP RGRP SDES Item
       3.2.2.  Definition and Use of the RTCP RGRS Packet
     3.3.  Interactions with the RTP/AVPF Feedback Profile
     3.4.  Interactions with RTCP Extended Report (XR) Packets
     3.5.  Middlebox Considerations
     3.6.  SDP Signaling for Reporting Groups
   4.  Properties of RTCP Reporting Groups
     4.1.  Bandwidth Benefits of RTCP Reporting Groups
     4.2.  Compatibility of RTCP Reporting Groups
   5.  Security Considerations
   6.  IANA Considerations
   7.  References
     7.1.  Normative References
     7.2.  Informative References
   Authors' Addresses

1. Introduction

The Real-time Transport Protocol (RTP) [RFC3550] is a protocol for group communication, supporting multiparty multimedia sessions. A single RTP session can support multiple participants sending data at once and can also support participants sending multiple simultaneous RTP streams. Examples of the latter might include a participant with multiple cameras who chooses to send multiple views of a scene, or a participant that sends audio and video flows multiplexed in a single RTP session. Rules for handling RTP sessions containing multiple RTP streams are described in [RFC3550], with some clarifications in [RFC8108].

An RTP endpoint will have one or more Synchronization Sources (SSRCs). It will have at least one RTP stream, and thus at least one SSRC, for each media source it sends, and it might use multiple SSRCs per media source when using media scalability features [RFC6190], forward error correction, RTP retransmission [RFC4588], or similar mechanisms. An endpoint that is not sending any RTP streams will have at least one SSRC to use for reporting and any feedback messages. Each SSRC has to send RTP Control Protocol (RTCP) Sender Reports (SRs) corresponding to the RTP packets it sends and Receiver Reports (RRs) for traffic it receives. (SRs and RRs are described in [RFC3550].) That is, every SSRC will send RTCP packets to report on every other SSRC. This rule is simple, but it can be quite inefficient for endpoints that send large numbers of RTP streams in a single RTP session. Consider a session comprising ten participants, each sending three media sources, each media source associated with its own RTP stream. There will be 30 SSRCs in such an RTP session, and each of those 30 SSRCs will send an RTCP SR/RR packet (containing several report blocks) per reporting interval as each SSRC reports on all the others. However, the three SSRCs comprising each participant are commonly co-located such that they see identical reception quality. If there was a way to indicate that several SSRCs are co- located and see the same reception quality, then two-thirds of those RTCP reports could be suppressed. This would allow the remaining RTCP reports to be sent more often, while keeping within the same RTCP bandwidth fraction.

This memo defines such an RTCP extension: RTCP Reporting Groups. This extension is used to indicate the SSRCs that originate from the same endpoint and therefore have identical reception quality, hence allowing the endpoints to suppress unnecessary RTCP reception quality reports.

2. Terminology

The key words "MUST", "MUST NOT", "REQUIRED", "SHALL", "SHALL NOT", "SHOULD", "SHOULD NOT", "RECOMMENDED", "NOT RECOMMENDED", "MAY", and "OPTIONAL" in this document are to be interpreted as described in BCP 14 [RFC2119] [RFC8174] when, and only when, they appear in all capitals, as shown here.

3. RTCP Reporting Groups

An RTCP Reporting Group is a set of SSRCs that are co-located at a single endpoint (which could be an end host or a middlebox) in an RTP session. Since they are co-located, every SSRC in the RTCP Reporting Group will have an identical view of the network conditions and will see the same lost packets, jitter, etc. This allows a single representative to send RTCP reception quality reports on behalf of the rest of the Reporting Group, reducing the number of RTCP packets that need to be sent without loss of information.

3.1. Semantics and Behavior of RTCP Reporting Groups

A group of co-located SSRCs that see identical network conditions can form an RTCP Reporting Group. If Reporting Groups are in use, an RTP endpoint with multiple SSRCs MAY put those SSRCs into a Reporting Group if their view of the network is identical, i.e., if they report on traffic received at the same interface of an RTP endpoint. SSRCs with different views of the network MUST NOT be put into the same Reporting Group.

An endpoint that has combined its SSRCs into an RTCP Reporting Group will choose one (or a subset) of those SSRCs to act as "reporting source(s)" for that RTCP Reporting Group. A reporting source will send RTCP SR/RR reception quality reports on behalf of the other members of the RTCP Reporting Group. A reporting source MUST suppress the RTCP SR/RR reports that relate to other members of the Reporting Group and only report on remote SSRCs. The other members (non-reporting sources) of the RTCP Reporting Group will suppress their RTCP reception quality reports and will instead send an RTCP Reporting Group Reporting Sources (RGRS) packet (see Section 3.2.2) to indicate that they are part of an RTCP Reporting Group and give the SSRCs of the reporting sources.

If there are large numbers of remote SSRCs in the RTP session, then the reception quality reports generated by the reporting source might grow too large to fit into a single compound RTCP packet, forcing the reporting source to use a round-robin policy to determine what remote SSRCs it includes in each compound RTCP packet, and so reducing the frequency of reports on each SSRC. To avoid this, in sessions with large numbers of remote SSRCs, an RTCP Reporting Group MAY use more than one reporting source. If several SSRCs are acting as reporting sources for an RTCP Reporting Group, then each reporting source MUST have non-overlapping sets of remote SSRCs it reports on.

An endpoint MUST NOT create an RTCP Reporting Group that comprises only a single local SSRC (i.e., an RTCP Reporting Group where the reporting source is the only member of the group), unless it is anticipated that the group might have additional SSRCs added to it in the future.

If a reporting source leaves the RTP session (i.e., if it sends an RTCP BYE packet or it leaves the session without sending a BYE according to the rules of [RFC3550], Section 6.3.7), the remaining members of the RTCP Reporting Group MUST (a) have another reporting source -- if one exists -- report on the remote SSRCs that the leaving SSRC had reported on, (b) choose a new reporting source, or (c) disband the RTCP Reporting Group and begin sending reception quality reports per [RFC3550] and [RFC8108].

The RTCP timing rules assign different bandwidth fractions to senders and receivers. This lets senders transmit RTCP reception quality reports more often than receivers. If a reporting source in an RTCP Reporting Group is a receiver but one or more non-reporting SSRCs in the RTCP Reporting Group are senders, then the endpoint MAY treat the reporting source as a sender for the purpose of RTCP bandwidth allocation, increasing its RTCP bandwidth allocation, provided it also treats one of the senders as if it were a receiver and makes the corresponding reduction in RTCP bandwidth for that SSRC. However, the application needs to consider the impact on the frequency of transmitting of the synchronization information included in RTCP SRs.

3.2. Identifying Members of an RTCP Reporting Group

When RTCP Reporting Groups are in use, the other SSRCs in the RTP session need to be able to identify which SSRCs are members of an RTCP Reporting Group. Two RTCP extensions are defined to support this: the RTCP Reporting Group (RGRP) Source Description (SDES) item is used by the reporting source(s) to identify an RTCP Reporting Group, and the RTCP RGRS packet is used by other members of an RTCP Reporting Group to identify the reporting source(s).

3.2.1. Definition and Use of the RTCP RGRP SDES Item

This document defines a new RTCP RGRP SDES item to identify an RTCP Reporting Group. The motivation for giving a Reporting Group an identifier is to ensure that (1) the RTCP Reporting Group and its member SSRCs can be correctly associated when there are multiple reporting sources and (2) a reporting SSRC can be associated with the correct Reporting Group if an SSRC collision occurs.

This document defines the RTCP RGRP SDES item. The RTCP RGRP SDES item MUST be sent by the reporting sources in a Reporting Group and MUST NOT be sent by other members of the Reporting Group or by SSRCs that are not members of any RTCP Reporting Group. Specifically, every reporting source in an RTCP Reporting Group MUST include an RTCP SDES packet containing an RGRP item in every compound RTCP packet in which it sends an RR or SR packet (i.e., in every RTCP packet it sends, unless Reduced-Size RTCP [RFC5506] is in use).

Syntactically, the format of the RTCP RGRP SDES item is identical to that of the RTCP SDES CNAME item [RFC7022], except that the SDES item type field MUST have value RGRP=11 instead of CNAME=1. The value of the RTCP RGRP SDES item MUST be chosen with the same concerns about global uniqueness and the same privacy considerations as the RTCP SDES CNAME. The value of the RTCP RGRP SDES item MUST be stable throughout the lifetime of the Reporting Group, even if some or all of the reporting sources change their SSRC due to collisions or if the set of reporting sources changes.

An RTP mixer or translator that forwards RTCP SR or RR packets from members of a Reporting Group MUST forward the corresponding RTCP RGRP SDES items as well, even if it otherwise strips SDES items other than the CNAME item.

3.2.2. Definition and Use of the RTCP RGRS Packet

A new RTCP packet type is defined to allow the members of an RTCP Reporting Group to identify the reporting sources for that group. This allows participants in an RTP session to distinguish an SSRC that is sending empty RTCP reception reports because it is a member of an RTCP Reporting Group from an SSRC that is sending empty RTCP reception reports because it is not receiving any traffic. It also explicitly identifies the reporting sources, allowing other members of the RTP session to (1) know which SSRCs are acting as the reporting sources for an RTCP Reporting Group and (2) detect if RTCP packets from any of the reporting sources are being lost.

The format of the RTCP RGRS packet is defined below. It comprises the fixed RTCP header that indicates the packet type and length, the SSRC of the packet sender, and a list of reporting sources for the RTCP Reporting Group of which the packet sender is a member.

    0                   1                   2                   3
    0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1
   +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
   |V=2|P|    SC   | PT=RGRS(212)  |             length            |
   +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
   |                     SSRC of packet sender                     |
   +=+=+=+=+=+=+=+=+=+=+=+=+=+=+=+=+=+=+=+=+=+=+=+=+=+=+=+=+=+=+=+=+
   :          List of SSRC(s) for the Reporting Source(s)          :
   :                              ...                              :
   +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+

The fields in the RTCP RGRS packet have the following definitions:

   version (V):  2-bit unsigned integer.  This field identifies the RTP
   
      version.  The current RTP version is 2.
   
   padding (P):  1 bit.  If set, the padding bit indicates that the RTCP
      packet contains additional padding octets at the end that are not
      part of the control information but are included in the length
      field.  See [RFC3550].
   
   Source Count (SC):  5-bit unsigned integer.  Indicates the number of
      reporting source SSRCs that are included in this RTCP packet.  As
      the RTCP RGRS packet MUST NOT be sent by reporting sources, all
      the SSRCs in the list of reporting sources will be different from
      the SSRC of the packet sender.  Every RTCP RGRS packet MUST
      contain at least one reporting source SSRC.
   
   Payload type (PT):  8-bit unsigned integer.  The RTCP packet type
      number that identifies the packet as being an RTCP RGRS packet.
      The RGRS RTCP packet has the value 212.
   
   Length:  16-bit unsigned integer.  The length of this packet in
      32-bit words minus one, including the header and any padding.
      This is in line with the definition of the length field used in
      RTCP SRs and RRs [RFC3550].  Since all RTCP RGRS packets include
      at least one reporting source SSRC, the length will always be 2 or
      greater.
   
   SSRC of packet sender:  32 bits.  The SSRC of the sender of this
      packet.
   
   List of SSRCs for the Reporting Source(s):  A variable number (as
      indicated by the SC header field) of 32-bit SSRC values of the
      reporting sources for the RTCP Reporting Group of which the packet
      sender is a member.

Every source that belongs to an RTCP Reporting Group but is not a reporting source MUST include an RTCP RGRS packet in every compound RTCP packet in which it sends an RR or SR packet (i.e., in every RTCP packet it sends, unless Reduced-Size RTCP [RFC5506] is in use). Each RTCP RGRS packet MUST contain the SSRC identifier of at least one reporting source. If there are more reporting sources in an RTCP Reporting Group than can fit into an RTCP RGRS packet, the members of that Reporting Group MUST send the SSRCs of the reporting sources in a round-robin fashion in consecutive RTCP RGRS packets, such that all the SSRCs of the reporting sources are included over the course of several RTCP reporting intervals.

An RTP mixer or translator that forwards RTCP SR or RR packets from members of a Reporting Group MUST also forward the corresponding RGRS RTCP packets. If the RTP mixer or translator rewrites SSRC values of the packets it forwards, it MUST make the corresponding changes to the RTCP RGRS packets.

3.3. Interactions with the RTP/AVPF Feedback Profile

The use of the RTP/AVPF Feedback Profile [RFC4585] allows SSRCs to send rapid RTCP feedback requests and codec control messages. If the use of the RTP/AVPF profile has been negotiated in an RTP session, members of an RTCP Reporting Group can send rapid RTCP feedback and codec control messages per [RFC5104], per [RFC4585] as updated by Section 5.4 of [RFC8108], and by the following considerations.

The members of an RTCP Reporting Group will all see identical network conditions. Accordingly, one might therefore think that it doesn't matter which SSRC in the Reporting Group sends the RTP/AVPF feedback or codec control messages. There might be, however, cases where the sender of the feedback/codec control message has semantic importance, or when only a subset of the members of an RTCP Reporting Group might want to send RTP/AVPF feedback or a codec control message in response to a particular event. For example, an RTP video sender might choose to treat packet loss feedback received from SSRCs known to be audio receivers with less urgency than feedback that it receives from video receivers when deciding what packets to retransmit, and a multimedia receiver using Reporting Groups might want to choose the outgoing SSRC for feedback packets to reflect this.

Each member of an RTCP Reporting Group SHOULD therefore send RTP/AVPF feedback/codec control messages independently of the other members of the Reporting Group, to respect the semantic meaning of the message sender. The suppression rules of [RFC4585] will ensure that only a single copy of each feedback packet is (typically) generated, even if several members of a Reporting Group send the same feedback. When an endpoint knows that several members of its RTCP Reporting Group will be sending identical feedback and that the sender of the feedback is not semantically important, that endpoint MAY choose to send all its feedback from the reporting source and deterministically suppress feedback packets generated by the other sources in the Reporting Group.

It is important to note that the RTP/AVPF timing rules operate on a per-SSRC basis. Using a single reporting source to send all feedback for a Reporting Group will hence limit the amount of feedback that can be sent to that which can be sent by one SSRC. If this limit is a problem, then the Reporting Group can allow each of its members to send its own feedback, using its own SSRC.

If the RTP/AVPF feedback messages or codec control requests are sent as compound RTCP packets, then those compound RTCP packets MUST include either an RTCP RGRS packet or an RTCP RGRP SDES item, depending on whether they are sent by the reporting source or a non-reporting source in the RTCP Reporting Group, respectively. The contents of noncompound RTCP feedback or codec control messages are not affected by the use of RTCP Reporting Groups.

3.4. Interactions with RTCP Extended Report (XR) Packets

When using RTCP Extended Report (XR) packets [RFC3611] with RTCP Reporting Groups, it is RECOMMENDED that the reporting source be used to send the RTCP XR packets. If multiple reporting sources are in use, the reporting source that sends the SR/RR packets that relate to a particular remote SSRC SHOULD send the RTCP XR reports about that SSRC. This is motivated as one commonly combine the RTCP XR metrics with the regular report block to more fully understand the situation. Receiving these blocks in different compound packets reduces their value, as the measuring intervals are not synchronized in those cases.

Some RTCP XR report blocks are specific to particular types of media and might be relevant to only some members of a Reporting Group. For example, it would make no sense for an SSRC that is receiving video to send a Voice over IP (VoIP) metric RTCP XR report block. Such media-specific RTCP XR report blocks MUST be sent by the SSRC to which they are relevant and MUST NOT be included in the common report sent by the reporting source. This might mean that some SSRCs send RTCP XR packets in compound RTCP packets that contain an empty RTCP SR/RR packet and that the time period covered by the RTCP XR packet is different from that covered by the RTCP SR/RR packet. If it is important that the RTCP XR packet and RTCP SR/RR packet cover the same time period, then that source SHOULD be removed from the RTCP Reporting Group, and standard RTCP packets be sent instead.

3.5. Middlebox Considerations

Many different types of middleboxes are used with RTP. RTCP Reporting Groups are potentially relevant to those types of RTP middleboxes that have their own SSRCs and generate RTCP reports for the traffic they receive. RTP middleboxes that do not have their own SSRC and that do not send RTCP reports on the traffic they receive cannot use the RTCP Reporting Group extension, since they generate no RTCP reports to that group.

An RTP middlebox that has several SSRCs of its own can use the RTCP Reporting Group extension to group the RTCP reports it generates. This can occur, for example, if a middlebox is acting as an RTP mixer for both audio and video flows that are multiplexed onto a single RTP session, where the middlebox has one SSRC for the audio mixer and one for the video mixer part, and when the middlebox wants to avoid cross-reporting between audio and video.

A middlebox cannot use the RTCP Reporting Group extension to group RTCP packets from the SSRCs that it is forwarding. It can, however, group the RTCP packets from the SSRCs it is forwarding into compound RTCP packets, following the rules in Section 6.1 of [RFC3550] and Section 5.3 of [RFC8108]. If the middlebox is using RTCP Reporting Groups for its own SSRCs, it MAY include RTCP packets from the SSRCs that it is forwarding as part of the compound RTCP packets its reporting source generates.

A middlebox that forwards RTCP SR or RR packets sent by members of a Reporting Group MUST forward the corresponding RTCP RGRP SDES items, as described in Section 3.2.1. A middlebox that forwards RTCP SR or RR packets sent by members of a Reporting Group MUST also forward the corresponding RTCP RGRS packets, as described in Section 3.2.2. Failure to forward these packets can cause compatibility problems, as described in Section 4.2.

If a middlebox rewrites SSRC values in the RTP and RTCP packets that it is forwarding, then it MUST make the corresponding changes in RTCP SDES packets containing RGRP items and in RTCP RGRS packets, to allow them to be associated with the rewritten SSRCs.

3.6. SDP Signaling for Reporting Groups

This document defines the "a=rtcp-rgrp" Session Description Protocol (SDP) [RFC4566] attribute to indicate if the session participant is capable of supporting RTCP Reporting Groups for applications that use SDP for configuration of RTP sessions. It is a property attribute and hence takes no value. The multiplexing category [RFC8859] is IDENTICAL, as the functionality applies at the RTP session level. A participant that proposes the use of RTCP Reporting Groups SHALL itself support the reception of RTCP Reporting Groups. The formal definition of this attribute is as follows:

      Name:  rtcp-rgrp
      Value:  None
      Usage Level:  session, media
      Charset Dependent:  no
      Example:  a=rtcp-rgrp

When using SDP Offer/Answer [RFC3264], the following procedures are to be used:

Generating the initial SDP offer:

If the offerer supports the RTCP Reporting Group extensions and is willing to accept RTCP packets containing those extensions, then it MUST include an "a=rtcp-rgrp" attribute in the initial offer. If the offerer does not support RTCP Reporting Group extensions or is not willing to accept RTCP packets containing those extensions, then it MUST NOT include the "a=rtcp-rgrp" attribute in the offer.

Generating the SDP answer:

If the SDP offer contains an "a=rtcp-rgrp" attribute, and if the answerer supports RTCP Reporting Groups and is willing to receive RTCP packets using the RTCP Reporting Group extensions, then the answerer MAY include an "a=rtcp-rgrp" attribute in the answer and MAY send RTCP packets containing the RTCP Reporting Group extensions. If the offer does not contain an "a=rtcp-rgrp" attribute, or if the offer does contain such an attribute but the answerer does not wish to accept RTCP packets using the RTCP Reporting Group extensions, then the answer MUST NOT include an "a=rtcp-rgrp" attribute.

Offerer processing of the SDP answer:

If the SDP answer contains an "a=rtcp-rgrp" attribute and the corresponding offer also contained an "a=rtcp-rgrp" attribute, then the offerer MUST be prepared to accept and process RTCP packets that contain the Reporting Group extensions and MAY send RTCP packets that contain the Reporting Group extensions. If the SDP answer contains an "a=rtcp-rgrp" attribute but the corresponding offer did not contain the "a=rtcp-rgrp" attribute, then the offerer MUST reject the call. If the SDP answer does not contain an "a=rtcp-rgrp" attribute, then the offerer MUST NOT send packets containing the RTCP Reporting Group extensions and does not need to process packets containing the RTCP Reporting Group extensions.

In declarative usage of SDP, such as the Real-Time Streaming Protocol (RTSP) [RFC7826] and the Session Announcement Protocol (SAP) [RFC2974], the presence of the attribute indicates that the session participant MAY use RTCP Reporting Groups in its RTCP transmissions. An implementation that doesn't explicitly support RTCP Reporting Groups MAY join an RTP session as long as it has been verified that the implementation doesn't suffer from the problems discussed in Section 4.2.

4. Properties of RTCP Reporting Groups

This section provides additional information on what the resulting properties are (i.e., resulting effects or impacts) as related to the design specified in Section 3. The content of this section is non- normative.

4.1. Bandwidth Benefits of RTCP Reporting Groups

To understand the benefits of RTCP Reporting Groups, consider a scenario in which the two endpoints in a session each have a hundred sources, of which eight each are sending within any given reporting interval.

For ease of analysis, we can make the simplifying approximation that the duration of the RTCP reporting interval is equal to the total size of the RTCP packets sent during an RTCP interval, divided by the RTCP bandwidth. (This will be approximately true in scenarios where the bandwidth is not so high that the minimum RTCP interval is reached.) To further simplify, we can assume that RTCP senders are following the recommendations regarding compound RTCP packets in [RFC8108]; thus, the per-packet transport-layer overhead will be small relative to the RTCP data. Thus, only the actual RTCP data itself need be considered.

In a report interval in this scenario, there will, as a baseline, be 200 SDES packets, 184 RR packets, and 16 SR packets. This amounts to approximately 6.5 KB of RTCP packets per report interval, assuming 16-byte CNAMEs and no other SDES information.

Using the original "everyone reports on every sender" feedback rules [RFC3550], each of the 184 receivers will send 16 report blocks, and each of the 16 senders will send 15. This amounts to approximately 76 KB of report block traffic per interval; 92% of RTCP traffic consists of report blocks.

If Reporting Groups are used, however, there is only 0.4 KB of reports per interval, with no loss of useful information. Additionally, there will be (assuming 16-byte RGRPs and a single reporting source per Reporting Group) an additional 2.4 KB per cycle of RTCP RGRP SDES items and RGRS packets. Put another way, the unmodified reporting interval per [RFC3550] is approximately 9 times longer than if Reporting Groups are in use.

4.2. Compatibility of RTCP Reporting Groups

The RTCP traffic generated by receivers using RTCP Reporting Groups might appear, to observers unaware of these semantics, to be generated by receivers who are experiencing a network disconnection, as the non-reporting sources appear not to be receiving a given sender at all.

This could be a potentially critical problem for such a sender using RTCP for congestion control, as such a sender might think that it is sending so much traffic that it is causing complete congestion collapse.

However, such an interpretation of the session statistics would require a fairly sophisticated RTCP analysis. Any receiver of RTCP statistics that is just interested in information about itself needs to be prepared for the possibility that any given reception report might not contain information about a specific media source, because reception reports in large conferences can be round-robined.

Thus, the extent to which such backward-compatibility issues would actually cause trouble in practice is unclear.

5. Security Considerations

The security considerations of [RFC3550] and [RFC8108] apply. If the RTP/AVPF profile is in use, then the security considerations of [RFC4585] (and [RFC5104], if used) also apply. If RTCP XR is used, the security considerations of [RFC3611], including security considerations regarding any XR report blocks used, also apply.

The RTCP RGRP SDES item is vulnerable to malicious modifications unless integrity protection is used. A modification of this item's length field causes the parsing of the RTCP packet in which it is contained to fail. Depending on the implementation, parsing of the full compound RTCP packet can also fail, causing the whole packet to be discarded. A modification of the value of this SDES item would make the receiver of the report think that the sender of the report was a member of a different RTCP Reporting Group. This will potentially create an inconsistency, when the RGRS reports the source as being in the same Reporting Group as another source with another Reporting Group identifier. The impacts on a receiver implementation that such inconsistencies could cause are difficult to fully predict. One case is that when congestion control or other adaptation mechanisms are used, an inconsistent report can result in a media sender reducing its bitrate. However, a direct modification of the RR or a feedback message itself would be a more efficient attack and would be equally costly to perform.

The new RGRS RTCP packet type is very simple. The common RTCP packet type header shares the same security risks as those that affect previous RTCP packet types. Errors or modification of the length field can cause the full compound packet to fail header validation (see Appendix A.2 of [RFC3550]), resulting in the whole compound RTCP packet being discarded. Modification of the SC field or the P field would cause an inconsistency when processing the RTCP packet, likely resulting in the packet being classified as invalid. A modification of the PT field would cause the packet to be interpreted according to some other packet type's rules. In such a case, the result might be more or less predictable but would be specific to the packet type. Modification of the "SSRC of packet sender" field would attribute this packet to another sender, resulting in a receiver believing that the Reporting Group also applies for this SSRC, if it exists. If it doesn't exist, unless corresponding modifications are also done on an SR/RR packet and an SDES packet, the RTCP packet SHOULD be discarded. If consistent changes are done, such a scenario could be part of a resource exhaustion attack on a receiver implementation. Modification of the "List of SSRCs for the Reporting Source(s)" field would change the SSRC the receiver expects to report on behalf of this SSRC. If that SSRC exists, this situation could potentially change the Reporting Group used for this SSRC. A change to another Reporting Group belonging to another endpoint is likely detectable, as there would be a mismatch between the SSRC of the packet sender's endpoint information, transport addresses, SDES CNAME, etc., and the corresponding information from the Reporting Group indicated.

In general, the Reporting Group is providing limited-impact attacks on the endpoints. The most significant result from a deliberate attack would be to cause the information to be discarded or be inconsistent, including the discarding of all RTCP packets that are modified. This causes a lack of information at any receiver entity, possibly disregarding the endpoint's participation in the session.

To protect against such attacks from external non-trusted entities, integrity and source authentication SHOULD be applied. This can be done, for example, by using the Secure Real-time Transport Protocol (SRTP) [RFC3711] with appropriate key management; other options exist, as discussed in "Options for Securing RTP Sessions" [RFC7201].

The Reporting Group Identifier has properties that could potentially impact privacy. If this identifier were to be generated by an implementation in a way that makes it long-term stable or predictable, it could be used for tracking a particular endpoint. Therefore, it is RECOMMENDED that it be generated as a short-term persistent RGRP, following the rules for short-term persistent CNAMEs in [RFC7022]. The rest of the information revealed, i.e., the SSRCs, the size of the Reporting Group, and the number of reporting sources in a Reporting Group, is of a less sensitive nature, considering that the SSRCs and the communication would be revealed without this extension anyway. By encrypting the Reporting Group extensions, the confidentiality of the SSRC values would be preserved, but the values can still be revealed if SRTP [RFC3711] is used. The size of the Reporting Groups and the number of reporting sources are likely determinable from analysis of the packet pattern and sizes. However, this information appears to have limited value.

6. IANA Considerations

IANA has registered a new RTCP RGRP SDES item in the "RTP SDES Item Types" registry, as follows:

        +=======+========+============================+===========+
        | Value | Abbrev | Name                       | Reference |
        +=======+========+============================+===========+
        | 11    | RGRP   | Reporting Group Identifier | RFC 8861  |
        +-------+--------+----------------------------+-----------+

Table 1: New RTCP RGRP SDES Item: Reporting Group

Identifier

The definition of the RTCP RGRP SDES item is given in Section 3.2.1 of this memo.

IANA has registered a new RTCP packet type in the "RTCP Control Packet Types (PT)" registry, as follows:

    +=======+========+===================================+===========+
    | Value | Abbrev | Name                              | Reference |
    +=======+========+===================================+===========+
    | 212   | RGRS   | Reporting Group Reporting Sources | RFC 8861  |
    +-------+--------+-----------------------------------+-----------+

Table 2: New RTCP Packet Type: Reporting Group Reporting Sources

The definition of the RTCP RGRS packet type is given in Section 3.2.2 of this memo.

IANA has also registered a new SDP attribute.

SDP Attribute ("att-field"):

      Contact Name:         IESG
      
      Contact Email:        iesg@ietf.org
      
      Attribute name:       rtcp-rgrp
      
      Long form:            RTCP Reporting Groups
      
      Type of name:         att-field
      
      Type of attribute:    Media or session level
      
      Subject to charset:   No
      
      Purpose:              To negotiate or configure the use of the
                            RTCP Reporting Group extension
      
      Reference:            RFC 8861
      
      Value:                None
      
      Mux Category:         IDENTICAL

The definition of the "a=rtcp-rgrp" SDES attribute is given in Section 3.6 of this memo.

7. References

7.1. Normative References

   [RFC2119]  Bradner, S., "Key words for use in RFCs to Indicate
              Requirement Levels", BCP 14, RFC 2119,
              DOI 10.17487/RFC2119, March 1997,
              <https://www.rfc-editor.org/info/rfc2119>.
   
   [RFC3264]  Rosenberg, J. and H. Schulzrinne, "An Offer/Answer Model
              with Session Description Protocol (SDP)", RFC 3264,
              DOI 10.17487/RFC3264, June 2002,
              <https://www.rfc-editor.org/info/rfc3264>.
   
   [RFC3550]  Schulzrinne, H., Casner, S., Frederick, R., and V.
              Jacobson, "RTP: A Transport Protocol for Real-Time
              Applications", STD 64, RFC 3550, DOI 10.17487/RFC3550,
              July 2003, <https://www.rfc-editor.org/info/rfc3550>.
   
   [RFC4566]  Handley, M., Jacobson, V., and C. Perkins, "SDP: Session
              Description Protocol", RFC 4566, DOI 10.17487/RFC4566,
              July 2006, <https://www.rfc-editor.org/info/rfc4566>.
   
   [RFC7022]  Begen, A., Perkins, C., Wing, D., and E. Rescorla,
              "Guidelines for Choosing RTP Control Protocol (RTCP)
              Canonical Names (CNAMEs)", RFC 7022, DOI 10.17487/RFC7022,
              September 2013, <https://www.rfc-editor.org/info/rfc7022>.
   
   [RFC8108]  Lennox, J., Westerlund, M., Wu, Q., and C. Perkins,
              "Sending Multiple RTP Streams in a Single RTP Session",
              RFC 8108, DOI 10.17487/RFC8108, March 2017,
              <https://www.rfc-editor.org/info/rfc8108>.
   
   [RFC8174]  Leiba, B., "Ambiguity of Uppercase vs Lowercase in RFC
              2119 Key Words", BCP 14, RFC 8174, DOI 10.17487/RFC8174,
              May 2017, <https://www.rfc-editor.org/info/rfc8174>.
   
   [RFC8859]  Nandakumar, S., "A Framework for Session Description
              Protocol (SDP) Attributes When Multiplexing", RFC 8859,
              DOI 10.17487/RFC8859, January 2021,
              <https://www.rfc-editor.org/info/rfc8859>.

7.2. Informative References

   [RFC2974]  Handley, M., Perkins, C., and E. Whelan, "Session
              Announcement Protocol", RFC 2974, DOI 10.17487/RFC2974,
              October 2000, <https://www.rfc-editor.org/info/rfc2974>.
   
   [RFC3611]  Friedman, T., Ed., Caceres, R., Ed., and A. Clark, Ed.,
              "RTP Control Protocol Extended Reports (RTCP XR)",
              RFC 3611, DOI 10.17487/RFC3611, November 2003,
              <https://www.rfc-editor.org/info/rfc3611>.
   
   [RFC3711]  Baugher, M., McGrew, D., Naslund, M., Carrara, E., and K.
              Norrman, "The Secure Real-time Transport Protocol (SRTP)",
              RFC 3711, DOI 10.17487/RFC3711, March 2004,
              <https://www.rfc-editor.org/info/rfc3711>.
   
   [RFC4585]  Ott, J., Wenger, S., Sato, N., Burmeister, C., and J. Rey,
              "Extended RTP Profile for Real-time Transport Control
              Protocol (RTCP)-Based Feedback (RTP/AVPF)", RFC 4585,
              DOI 10.17487/RFC4585, July 2006,
              <https://www.rfc-editor.org/info/rfc4585>.
   
   [RFC4588]  Rey, J., Leon, D., Miyazaki, A., Varsa, V., and R.
              Hakenberg, "RTP Retransmission Payload Format", RFC 4588,
              DOI 10.17487/RFC4588, July 2006,
              <https://www.rfc-editor.org/info/rfc4588>.
   
   [RFC5104]  Wenger, S., Chandra, U., Westerlund, M., and B. Burman,
              "Codec Control Messages in the RTP Audio-Visual Profile
              with Feedback (AVPF)", RFC 5104, DOI 10.17487/RFC5104,
              February 2008, <https://www.rfc-editor.org/info/rfc5104>.
   
   [RFC5506]  Johansson, I. and M. Westerlund, "Support for Reduced-Size
              Real-Time Transport Control Protocol (RTCP): Opportunities
              and Consequences", RFC 5506, DOI 10.17487/RFC5506, April
              2009, <https://www.rfc-editor.org/info/rfc5506>.
   
   [RFC6190]  Wenger, S., Wang, Y.-K., Schierl, T., and A.
              Eleftheriadis, "RTP Payload Format for Scalable Video
              Coding", RFC 6190, DOI 10.17487/RFC6190, May 2011,
              <https://www.rfc-editor.org/info/rfc6190>.
   
   [RFC7201]  Westerlund, M. and C. Perkins, "Options for Securing RTP
              Sessions", RFC 7201, DOI 10.17487/RFC7201, April 2014,
              <https://www.rfc-editor.org/info/rfc7201>.
   
   [RFC7826]  Schulzrinne, H., Rao, A., Lanphier, R., Westerlund, M.,
              and M. Stiemerling, Ed., "Real-Time Streaming Protocol
              Version 2.0", RFC 7826, DOI 10.17487/RFC7826, December
              2016, <https://www.rfc-editor.org/info/rfc7826>.

Authors' Addresses

Jonathan Lennox
8x8, Inc. / Jitsi
Jersey City, NJ 07302
United States of America

   Email: jonathan.lennox@8x8.com

Magnus Westerlund
Ericsson
Torshamnsgatan 23
SE-164 80 Kista
Sweden

   Email: magnus.westerlund@ericsson.com

Qin Wu
Huawei
101 Software Avenue, Yuhua District
Nanjing, Jiangsu 210012
China

   Email: bill.wu@huawei.com

Colin Perkins
University of Glasgow
School of Computing Science
Glasgow
G12 8QQ
United Kingdom

Email:

          csp@csperkins.org