Network Working Group
Request for Comments: 2805
Category: Informational
N. Greene
Nortel Networks
M. Ramalho
Cisco Systems
B. Rosen
Marconi
April 2000

Media Gateway Control Protocol Architecture and Requirements

Status of this Memo

This memo provides information for the Internet community. It does not specify an Internet standard of any kind. Distribution of this memo is unlimited.

Copyright Notice

Copyright © The Internet Society (2000). All Rights Reserved.

Abstract

This document describes protocol requirements for the Media Gateway Control Protocol between a Media Gateway Controller and a Media Gateway.

Table of Contents

   1.  Introduction
   2.  Terminology
   3.  Definitions
   4.  Specific functions assumed within the MG
   5.  Per-Call Requirements
      5.1.  Resource Reservation
      5.2.  Connection Requirements
      5.3.  Media Transformations
      5.4.  Signal/Event Processing and Scripting
      5.5.  QoS/CoS
      5.6.  Test Support
      5.7.  Accounting
      5.8.  Signalling Control
   6.  Resource Control
      6.1.  Resource Status Management
      6.2.  Resource Assignment
   7.  Operational/Management Requirements
      7.1.  Assurance of Control/Connectivity
      7.2.  Error Control
      7.3.  MIB Requirements
   8.  General Protocol Requirements
      8.1.  MG-MGC Association Requirements
      8.2.  Performance Requirements
   9.  Transport
      9.1.  Assumptions made for underlying network
      9.2.  Transport Requirements
   10.  Security Requirements
   11.  Requirements specific to particular bearer types
      11.1.  Media-specific Bearer types
         11.1.1.  Requirements for TDM PSTN (Circuit)
         11.1.2.  Packet Bearer type
         11.1.3.  Bearer type requirements for ATM
      11.2.  Application-Specific Requirements
         11.2.1.  Trunking Gateway
         11.2.2.  Access Gateway
         11.2.3.  Trunking/Access Gateway with fax ports
         11.2.4.  Trunking/Access Gateway with text telephone
         11.2.5.  Network Access Server
         11.2.6.  Restricted Capability Gateway
         11.2.7.  Multimedia Gateway
         11.2.8.  Audio Resource Function
         11.2.9.  Multipoint Control Units
   12.  References
   13.  Acknowledgements
   14.  Authors' Addresses
   15.  Full Copyright Statement

1. Introduction

This document describes requirements to be placed on the Media Gateway Control Protocol. When the word protocol is used on its own in this document it implicitly means the Media Gateway Control Protocol.

2. Terminology

In this document, the key words "MUST", "MUST NOT", "REQUIRED", "SHALL", "SHALL NOT", "SHOULD", "SHOULD NOT", "RECOMMENDED", "MAY", and "OPTIONAL" are to be interpreted as described in RFC 2119 [1] and indicate requirement levels for the protocol.

3. Definitions

   *    Connection

Under the control of a Media Gateway Controller (MGC), the Media Gateway (MG) realizes connections. In this document, connections are associations of resources hosted by the MG. They typically involve two terminations, but may involve more.

   *    Line or Loop

An analogue or digital access connection from a user terminal which carries user media content and telephony access signalling (DP, DTMF, BRI, proprietary business set).

   *    Media Gateway (MG) function

A Media Gateway (MG) function provides the media mapping and/or transcoding functions between potentially dissimilar networks, one of which is presumed to be a packet, frame or cell network. For example, an MG might terminate switched circuit network (SCN) facilities (trunks, loops), packetize the media stream, if it is not already packetized, and deliver packetized traffic to a packet network. It would perform these functions in the reverse order for media streams flowing from the packet network to the SCN.

Media Gateways are not limited to SCN <-> packet/frame/cell functions: a conference bridge with all packet interfaces could be an MG, as could an interactive voice response (IVR) unit, an audio resource function, or a voice recognition system with a cell interface.

   *    Media Gateway unit (MG-unit)

An MG-unit is a physical entity that contains an MG function and may also contain other functions, e.g. an SG function.

   *    Media Gateway Controller (MGC) function

A Media Gateway Controller (MGC) function controls an MG.

   *    Media Resource

Examples of media resources are codecs, announcements, tones, modems, interactive voice response (IVR) units, and bridges.

   *    Signaling Gateway (SG) function

An SG function receives/sends SCN native signalling at the edge of a data network. For example, the SG function may relay, translate or terminate SS7 signaling in an SS7-Internet Gateway. The SG function may also be co-resident with the MG function to process SCN signalling associated with line or trunk terminations controlled by the MG, such as the "D" channel of an ISDN PRI trunk.

   *    Termination

A termination is a point of entry and/or exit of media flows relative to the MG. When an MG is asked to connect two or more terminations, it understands how the flows entering and leaving each termination are related to each other.

Terminations are, for instance, DS0s, ATM VCs and RTP ports. Another term for a termination is bearer point.

   *    Trunk

An analog or digital connection from a circuit switch which carries user media content and may carry telephony signalling (MF, R2, etc.). Digital trunks may be transported and may appear at the Media Gateway as channels within a framed bit stream, or as an ATM cell stream. Trunks are typically provisioned in groups, each member of which provides equivalent routing and service.

   *    Type of Bearer

A Type of Bearer definition provides the detailed requirements for its particular application/bearer type. A particular class of Media Gateway, for example, would support a particular set of Bearer types.

4. Specific functions assumed within the MG

This section provides an environment for the definition of the general Media Gateway Control Protocol requirements.

MGs can be architected in many different ways depending on where the media conversions and transcoding (if required) are performed, the level of programmability of resources, how conferences are supported, and how associated signalling is treated. The functions assumed to be within the MG must not be biased towards a particular architecture.

For instance, announcements in an MG could be provided by media resources or by the bearer point resource or termination itself. Further, this difference must not be visible to the MGC: the MGC must be able to issue the identical request to two different implementations and achieve the identical functionality.

Depending on the application of the MG (e.g., trunking, residential), some functions listed below will be more prominent than others, and in some cases, functions may even disappear.

Although media adaptation is the essence of the MG, it is not necessary for it to be involved every time. An MG may join two terminations/resources of the same type (i.e., the MG behaves as a switch). The required media conversion depends on the media type supported by the resources being joined together.

In addition to the media adaptation function, resources have a number of unique properties, for instance:

   *    certain types of resources have associated signalling
        capabilities (e.g., PRI signalling, DTMF),
   
   *    some resources perform maintenance functions (e.g., continuity
        tests),
   
   *    the MGC needs to know the state changes of resources (e.g., a
        trunk group going out of service),
   
   *    the MG retains some control over the allocation and control of
        some resources (e.g., resource name space: RTP port numbers).

Therefore, an MG realizes point-to-point connections and conferences, and supports several resource functions. These functions include media conversion, resource allocation and management, and event notifications. Termination-associated signalling is handled either through event notifications or by the signalling backhaul part of an MG-unit (i.e. NOT directly handled by the MG).

MGs must also support some level of system related functions, such as establishing and maintaining some kind of MG-MGC association. This is essential for MGC redundancy, fail-over and resource sharing.

Therefore, an MG is assumed to contain these functions:

   *    Reservation and release of resources
   
   *    Ability to provide state of resources
   
   *    Maintenance of resources - It must be possible to perform
        maintenance operations independently of other termination
        functions; for instance, some maintenance states should not
        affect the resources associated with that resource. Examples of
        maintenance functions are loopbacks and continuity tests.
   
   *    Connection management, including connection state.
   
   *    Media processing, using media resources: these provide services
        such as transcoding, conferencing, interactive voice response
        units, and audio resource function units. Media resources may or
        may not be directly part of other resources.
   
   *    Incoming digit analysis for terminations, interpretation of
        scripts for terminations
   
   *    Event detection and signal insertion for per-channel signalling
   
   *    Ability to configure signalling backhauls (for example, a
        Sigtran backhaul)
   
   *    Management of the association between the MGC and MG, or between
        the MGC and MG resources.

5. Per-Call Requirements

5.1. Resource Reservation

The protocol must:

   a.   Support reservation of bearer terminations and media resources
        for use by a particular call and support their subsequent
        release (which may be implicit or explicit).
   
   b.   Allow release, in a single exchange of messages, of all
        resources associated with a particular set of connectivity
        and/or associations between a given number of terminations.
   
   c.   Not require (or allow) the MG to maintain a sense of future
        time: a reservation remains in effect until explicitly released
        by the MGC.
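
As a non-normative illustration of the requirements above, the following Python sketch models an MG-side reservation table. The class name, method names and identifier strings are assumptions made for this example only; none of them is defined by this document. Reservations carry no timer, so they persist until released, and every resource held by a call can be released in one operation.

```python
class ReservationTable:
    """Hypothetical MG-side record of which call holds which resource."""

    def __init__(self):
        self._owner = {}  # resource name -> call identifier

    def reserve(self, call_id, resource):
        """Reserve a resource for a call; fail if it is already held."""
        if resource in self._owner:
            return False
        self._owner[resource] = call_id
        return True

    def release(self, resource):
        """Explicitly release one resource."""
        self._owner.pop(resource, None)

    def release_call(self, call_id):
        """Release every resource held by a call in one operation."""
        freed = sorted(r for r, c in self._owner.items() if c == call_id)
        for r in freed:
            del self._owner[r]
        return freed


table = ReservationTable()
table.reserve("call-1", "ds0/1")
table.reserve("call-1", "rtp/5004")
table.reserve("call-2", "ds0/2")
freed = table.release_call("call-1")
```

Because the table holds no expiry information, a resource reserved for "call-2" stays reserved until an explicit release arrives.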

5.2. Connection Requirements

The protocol must:

   a.   Support connections involving packet and circuit bearer
        terminations in any combination, including "hairpin" connections
        (connections between two circuit connections within the same
        MG).
   
   b.   Support connections involving TDM, Analogue, ATM, IP or FR
        transport in any combination.
   
   c.   Allow the specification of bearer plane (e.g. Frame Relay, IP,
        etc.) on a call by call basis.
   
   d.   Support unidirectional, symmetric bi-directional, and asymmetric
        bi-directional flows of media.
   
   e.   Support multiple media types (e.g. audio, text, video, T.120).
   
   f.   Support point-to-point and point-to-multipoint connections.
   
   g.   Support creation and modification of more complex flow
        topologies e.g. conference bridge capabilities.  Be able to add
        or delete media streams during a call or session, and be able to
        add or subtract participants to/from a call or session.
   
   h.   Support inclusion of media resources into call or session as
        required.  Depending on the protocol and resource type, media
        resources may be implicitly included, class-assigned, or
        individually assigned.
   
   i.   Provide unambiguous specification of which media flows pass
        through a point and which are blocked at a given point in time,
        if the protocol permits multiple flows to pass through the same
        point.
   
   j.   Allow modifications of an existing termination, for example, use
        of higher compression to compensate for insufficient bandwidth
        or changing transport network connections.
   
   k.   Allow the MGC to specify that a given connection has higher
        priority than other connections.
   
   l.   Allow a reference to a port/termination on the MG to be a
        logical identifier, with a one-to-one mapping between a logical
        identifier and a physical port.

   m.   Allow the MG to report events such as resource reservation and
        connection completion.
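
The one-to-one mapping between logical identifiers and physical ports required above can be sketched as a pair of dictionaries kept in step. This is an illustrative assumption about one possible MG-internal structure; the class name and the identifier formats are hypothetical.

```python
class PortMap:
    """Hypothetical one-to-one map between logical names and physical ports."""

    def __init__(self):
        self._by_logical = {}
        self._by_physical = {}

    def bind(self, logical, physical):
        # Reject any binding that would break the one-to-one property.
        if logical in self._by_logical or physical in self._by_physical:
            raise ValueError("identifier already bound")
        self._by_logical[logical] = physical
        self._by_physical[physical] = logical

    def physical(self, logical):
        return self._by_logical[logical]

    def logical(self, physical):
        return self._by_physical[physical]


ports = PortMap()
ports.bind("trunk-a/1", ("slot 3", "span 1", "timeslot 1"))
```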

5.3. Media Transformations

The Protocol must:

   a.   Support mediation/adaptation of flows between different types of
        transport
   
   b.   Support invocation of additional processing such as echo
        cancellation.
   
   c.   Support mediation of flows between different content encoding
        (codecs, encryption/decryption)
   
   d.   Allow the MGC to specify whether text telephony/FAX/data modem
        traffic is to be terminated at the MG, modulated/demodulated,
        and converted to packets or forwarded by the MG in the media
        flow as voice band traffic.
   
   e.   Allow the MGC to specify that Dual-Tone MultiFrequency (DTMF)
        digits or other line and trunk signals and general Multi-
        Frequency (MF) tones are to be processed in the MG and how these
        digits/signals/tones are to be handled. The MGC must be able to
        specify any of the following handling of such
        digits/signals/tones:
   
        1.   The digits/signals/tones are to be encoded normally in the
             audio RTP stream (e.g., no analysis of the
             digits/signals/tones).

        2.   Analyzed and sent to the MGC.

        3.   Received from the MGC and inserted in the line-side audio
             stream.

        4.   Analyzed and sent as part of a separate RTP stream (e.g.,
             DTMF digits sent via an RTP payload separate from the audio
             RTP stream).

        5.   Taken from a separate RTP stream and inserted in the
             line-side audio stream.

        6.   Handled according to a script of instructions.

        For all but the first case, an option to mute the
        digits/signals/tones with silence, comfort noise, or other means
        (e.g., notch filtering of some telephony tones) must be
        provided.  As detection of these events may take up to tens of
        milliseconds, the first few milliseconds of such
        digit/signal/tone may be encoded and sent in the audio RTP
        stream before the digit/signal/tone can be verified. Therefore
        muting of such digits/signals/tones in the audio RTP stream with
        silence or comfort noise is understood to occur at the earliest
        opportunity after the digit/signal/tone is verified.
   
   f.   Allow the MGC to specify signalled flow characteristics on
        circuit as well as on packet bearer connections, e.g.
        u-law/a-law.
   
   g.   Allow for packet/cell transport adaptation only (no media
        adaptation) e.g. mid-stream (packet-to-packet)
        transpacketization/transcoding, or ATM AAL5 to and from ATM AAL2
        adaptation.
   
   h.   Allow the transport of audio normalization levels as a setup
        parameter, e.g., for conference bridging.
   
   i.   Allow conversion to take place between media types e.g., text to
        speech and speech to text.
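
The six handling options listed under item e, and the rule that muting must be available for all but the first, can be summarized in a short Python sketch. The enumeration member names are assumptions chosen for this example; the document itself assigns no names to the options.

```python
from enum import Enum


class ToneHandling(Enum):
    """Hypothetical names for the six handling options in item e."""
    IN_BAND = 1            # encoded normally in the audio RTP stream
    REPORT_TO_MGC = 2      # analyzed and sent to the MGC
    INSERT_FROM_MGC = 3    # received from the MGC, inserted line-side
    SEND_SEPARATE_RTP = 4  # analyzed and sent in a separate RTP stream
    INSERT_FROM_RTP = 5    # taken from a separate RTP stream, inserted line-side
    SCRIPTED = 6           # handled according to a script of instructions


def muting_allowed(mode):
    """Muting must be offered for every option except the first."""
    return mode is not ToneHandling.IN_BAND


mutable = [m.name for m in ToneHandling if muting_allowed(m)]
```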

5.4. Signal/Event Processing and Scripting

The Protocol must:

   a.   Allow the MGC to enable/disable monitoring for specific
        supervision events at specific circuit terminations
   
   b.   Allow the MGC to enable/disable monitoring for specific events
        within specified media streams
   
   c.   Allow reporting of detected events on the MG to the MGC. The
        protocol should provide the means to minimize the messaging
        required to report commonly-occurring event sequences.
   
   d.   Allow the MGC to specify other actions (besides reporting) that
        the MG should take upon detection of specified events.
   
   e.   Allow the MGC to enable and/or mask events.
   
   f.   Provide a way for MGC to positively acknowledge event
        notification.
   
   g.   Allow the MGC to specify signals (e.g., supervision, ringing) to
        be applied at circuit terminations.
   
   h.   Allow the MGC to specify content of extended duration
        (announcements, continuous tones) to be inserted into specified
        media flows.
   
   i.   Allow the MGC to specify alternative conditions (detection of
        specific events, timeouts) under which the insertion of
        extended-duration signals should cease.
   
   j.   Allow the MGC to download, and specify a script to be invoked on
        the occurrence of an event.
   
   k.   Specify common events and signals to maximize MG/MGC
        interworking.
   
   l.   Provide an extension mechanism for implementation defined events
        and signals with, for example, IANA registration procedures. It
        may be useful to have an Organizational Identifier (e.g. ITU,
        ETSI, ANSI) as part of the registration mechanism.
   
   m.   Allow the MGC to request the arming of a mid-call trigger even
        after the call has been set up.
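
A minimal sketch of the enable/disable, reporting and "other actions" requirements above follows. It assumes a per-termination table in which each armed event maps to a list of actions; the class, the action name "notify" and the event and signal names are all hypothetical.

```python
class EventMonitor:
    """Hypothetical per-termination event table held by the MG."""

    def __init__(self):
        self._armed = {}         # event name -> list of actions
        self.notifications = []  # events queued for reporting to the MGC
        self.signals = []        # signals applied locally by the MG

    def enable(self, event, actions=("notify",)):
        self._armed[event] = list(actions)

    def disable(self, event):
        self._armed.pop(event, None)

    def detect(self, event):
        """Called when the MG detects an event on the termination."""
        for action in self._armed.get(event, []):
            if action == "notify":
                self.notifications.append(event)
            else:
                # Any other action is treated as a signal to apply.
                self.signals.append(action)


mon = EventMonitor()
mon.enable("off-hook", actions=("notify", "dial-tone"))
mon.detect("off-hook")
mon.detect("on-hook")  # not armed, so ignored
```

Letting the MG apply dial tone itself on off-hook, rather than waiting for the MGC to request it, is one way to reduce messaging for commonly occurring event sequences.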

5.5. QoS/CoS

The Protocol must:

   a.   Support the establishment of a bearer channel with a specified
        QoS/CoS.
   
   b.   Support the ability to specify QoS for the connection between
        MGs, and by direction.
   
   c.   Support a means to change QoS during a connection, as a whole
        and by direction.
   
   d.   Allow the MGC to set QoS thresholds and receive notification
        when such thresholds cannot be maintained.
   
   e.   Allow the jitter buffer parameters on RTP channels to be
        specified at connection setup.
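
The threshold notification in item d can be illustrated by comparing measured values against limits set by the MGC. The metric names and numeric limits below are invented for the example and are not taken from this document.

```python
def qos_violations(thresholds, measured):
    """Return the names of metrics whose measured value exceeds its limit."""
    return sorted(
        name for name, limit in thresholds.items()
        if measured.get(name, 0) > limit
    )


# Hypothetical limits set by the MGC and values observed by the MG.
thresholds = {"jitter_ms": 30, "loss_pct": 1.0, "delay_ms": 150}
measured = {"jitter_ms": 42, "loss_pct": 0.2, "delay_ms": 150}
violations = qos_violations(thresholds, measured)
```

In this sketch a value equal to its limit is not reported; only a value above the limit would trigger a notification to the MGC.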

5.6. Test Support

The protocol must:

   a.   Support the different types of PSTN Continuity Testing (COT)
        for both the originating and terminating ends of the circuit
        connection (2-wire and 4-wire).
   
   b.   Specifically support test line operation (e.g. 103, 105, 108).

5.7. Accounting

The protocol must:

   a.   Support a common identifier to mark resources related to one
        connection.
   
   b.   Support collection of specified accounting information from MGs.
   
   c.   Provide the mechanism for the MGC to specify that the MG report
        accounting information automatically at end of call, in mid-call
        upon request, at specific time intervals as specified by the MGC
        and at unit usage thresholds as specified by the MGC.
   
   d.   Specifically support collection of:

        *    start and stop time, by media flow,

        *    volume of content carried (e.g. number of packets/cells
             transmitted, number received with and without error,
             inter-arrival jitter), by media flow,

        *    QoS statistics, by media flow.
   
   e.   Allow the MGC to have some control over which statistics are
        reported, to enable it to manage the amount of information
        transferred.
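
The per-flow statistics in item d and the selective reporting in item e might be represented as follows. The record layout and field names are assumptions for illustration; the document does not define an accounting format.

```python
from dataclasses import dataclass


@dataclass
class FlowAccounting:
    """Hypothetical per-media-flow accounting record."""
    call_id: str          # common identifier tying resources to one connection
    flow_id: str
    start_time: float
    stop_time: float
    packets_sent: int
    packets_received: int
    packets_in_error: int
    jitter_ms: float

    def report(self, fields):
        """Return only the statistics the MGC asked for."""
        return {name: getattr(self, name) for name in fields}


record = FlowAccounting("call-1", "audio-0", 100.0, 160.5, 3025, 3010, 2, 4.5)
summary = record.report(["start_time", "stop_time", "packets_sent"])
```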

5.8. Signalling Control

Establishment and provisioning of signalling backhaul channels (via SIGTRAN for example) is out of scope. However, the MG must be capable of supporting detection of events, and application of signals associated with basic analogue line, and CAS type signalling. The protocol must:

   a.   Support the signalling requirements of analogue lines and
        Channel Associated Signaling (CAS).
   
   b.   Support national variations of such signalling.
   
   c.   Provide mechanisms to support signalling without requiring
        MG-MGC timing constraints beyond those specified in this
        document.
   
   d.   Not create a situation where the MGC and the MG must be
        homologated together as a mandatory requirement of using the
        protocol; i.e. it must be possible to optionally conceal
        signaling type variation from the MGC.

6. Resource Control

6.1. Resource Status Management

The protocol must:

   a.   Allow the MG to report changes in status of physical entities
        supporting bearer terminations, media resources, and facility-
        associated signalling channels, due to failures, recovery, or
        administrative action. It must be able to report whether a
        termination is in service or out of service.
   
   b.   Support administrative blocking and release of TDM circuit
        terminations.

Note: as the above point only relates to ISUP-controlled circuits, it may be unnecessary to require this since the MGC controls their use. However, it may be meaningful for MF and R2-signalled trunks, where supervisory states are set to make the trunks unavailable at the far end.

   c.   Provide a method for the MGC to request that the MG release all
        resources under the control of a particular MGC currently in
        use, or reserved, for any or all connections.
   
   d.   Provide an MG Resource Discovery mechanism which must allow an
        MGC to discover what resources the MG has. Expressing resources
        can be an arbitrarily difficult problem and the initial release
        of the protocol may have a simplistic view of resource
        discovery.

At a minimum, resource discovery must enumerate the names of available circuit terminations and the allowed values for parameters supported by terminations.

The protocol should be defined so that simple gateways could respond with a relatively short, pre-stored response to the discovery request mechanism. In general, if the protocol defines a mechanism that allows the MGC to specify a setting or parameter for a resource or connection in the MG, and MGs are not required to support all possible values for that setting or parameter, then the discovery mechanism should provide the MGC with a method to determine which values of such settings or parameters are supported in a particular MG.

   e.   Provide a mechanism to discover the current available resources
        in the MG, where resources are dynamically consumed by
        connections and the MGC cannot reasonably or reliably track the
        consumption of such resources. It should also be possible to
        discover resources currently in use, in order to reconcile
        inconsistencies between the MGC and the MG.
   
   f.   Not require an MGC to implement an SNMP manager function in
        order to discover capabilities of an MG that may be specified
        during context establishment.
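
Item e asks that resources currently in use be discoverable so that inconsistencies between the MGC and the MG can be reconciled. A minimal sketch of such a reconciliation, assuming both views are available as sets of termination names (the names and the result keys are hypothetical):

```python
def reconcile(mgc_view, mg_view):
    """Compare the MGC's record of in-use terminations with an MG audit."""
    mgc_view, mg_view = set(mgc_view), set(mg_view)
    return {
        # In use on the MG, but the MGC has no record of them.
        "unknown_to_mgc": sorted(mg_view - mgc_view),
        # Recorded by the MGC, but the MG does not report them in use.
        "unknown_to_mg": sorted(mgc_view - mg_view),
    }


result = reconcile({"ds0/1", "ds0/2"}, {"ds0/2", "ds0/3"})
```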

6.2. Resource Assignment

The protocol must:

   a.   Provide a way for the MG to indicate that it was unable to
        perform a requested action because of resource exhaustion, or
        because of temporary resource unavailability.
   
   b.   Provide an ability for the MGC to indicate to an MG the resource
        to use for a call (e.g. DS0) exactly, or indicate a set of
        resources (e.g. pick a DS0 on a T1 line or a list of codec
        types) via a "wild card" mechanism from which the MG can select
        a specific resource for a call (e.g. the 16th timeslot, or
        G.723).
   
   c.   Allow the use of DNS names and IP addresses to identify MGs and
        MGCs. This shall not preclude using other identifiers for MGs or
        MGCs when other non IP transport technologies for the protocol
        are used.
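
The exact and "wild card" forms of resource selection in item b can be illustrated with shell-style pattern matching from the Python standard library. The termination naming scheme shown is an assumption for the example; this document does not define one.

```python
from fnmatch import fnmatchcase


def select(pattern, free_terminations):
    """Return the first free termination matching a wildcard pattern."""
    for name in sorted(free_terminations):
        if fnmatchcase(name, pattern):
            return name
    return None  # the MG would report that no matching resource is available


free = {"t1-1/ds0-16", "t1-1/ds0-17", "t1-2/ds0-01"}
exact = select("t1-2/ds0-01", free)   # MGC names the resource exactly
any_on_t1_1 = select("t1-1/*", free)  # MG picks any DS0 on the first T1
none_left = select("t1-3/*", free)    # nothing matches
```

Here the MG chooses the lowest-sorted match; a real MG could apply any selection policy, since the wildcard leaves the choice to it.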

7. Operational/Management Requirements

7.1. Assurance of Control/Connectivity

To provide assurance of control and connectivity, the protocol must provide the means to minimize duration of loss of control due to loss of contact, or state mismatches.

The protocol must:

   a.   Support detection and recovery from loss of contact due to
        failure/congestion of communication links or due to MG or MGC
        failure.

Note that failover arrangements are one of the mechanisms which could be used to meet this requirement.

   b.   Support detection and recovery from loss of synchronized view of
        resource and connection states between MGCs and MGs (e.g.
        through the use of audits).
   
   c.   Provide a means for MGC and MG to provide each other with
        booting and reboot indications, and what the MG's configuration
        is.
   
   d.   Permit more than one backup MGC and provide an orderly way for
        the MG to contact one of its backups.
   
   e.   Provide for an orderly switchback to the primary MGC after it
        recovers. How MGCs coordinate resources between themselves is
        outside the scope of the protocol.
   
   f.   Provide a mechanism so that when an MGC fails, connections
        already established can be maintained. The protocol does not
        have to provide a capability to maintain connections in the
        process of being connected, but not actually connected when the
        failure occurs.
   
   g.   Allow the recovery or redistribution of traffic without call
        loss.
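
Items d and e call for an orderly way to contact a backup MGC and to switch back to the primary. One simple policy, shown here only as an assumption, is for the MG to hold an ordered list and always prefer the earliest reachable entry. The host names are placeholders.

```python
def next_mgc(candidates, reachable):
    """Pick the first reachable MGC, trying the primary before backups."""
    for mgc in candidates:
        if mgc in reachable:
            return mgc
    return None  # no controller can be contacted


candidates = [
    "mgc-primary.example.net",
    "mgc-b1.example.net",
    "mgc-b2.example.net",
]
during_outage = next_mgc(candidates, {"mgc-b1.example.net", "mgc-b2.example.net"})
after_recovery = next_mgc(candidates, set(candidates))
```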

7.2. Error Control

The protocol must:

   a.   Allow for the MG to report reasons for abnormal failure of lower
        layer connections e.g. TDM circuit failure, ATM VCC failure.
   
   b.   Allow for the MG to report Usage Parameter Control (UPC) events.
   
   c.   Provide means to ameliorate potential synchronization or focused
        overload of supervisory/signaling events that can be detrimental
        to either MG or MGC operation. Power restoration or signaling
        transport re-establishment are typical sources of potentially
        detrimental signaling showers from MG to MGC or vice-versa.
   
   d.   Allow the MG to notify the MGC that a termination was terminated
        and communicate a reason when a termination is taken
        out-of-service unilaterally by the MG due to abnormal events.
   
   e.   Allow the MGC to acknowledge that a termination has been taken
        out-of-service.
   
   f.   Allow the MG to request the MGC to release a termination and
        communicate a reason.
   
   g.   Allow the MGC to specify, as a result of such a request, its
        decision to take the termination down, leave it as is, or modify
        it.

7.3. MIB Requirements

The Protocol must define a common MG MIB, which must be extensible, but must:

   a.   Provide information on:

        *    mapping between resources and supporting physical entities.

        *    statistics on quality of service on the control and
             signalling backhaul interfaces.

        *    statistics required for traffic engineering within the MG.
   
   b.   The protocol must allow the MG to provide to the MGC all
        information the MGC needs to provide in its MIB.
   
   c.   MG MIB must support implementation of H.341 by either the MG,
        MGC, or both acting together.

8. General Protocol Requirements

The protocol must:

   a.   Support multiple operations to be invoked in one message and
        treated as a single transaction.
   
   b.   Be both modular and extensible. Not all implementations may wish
        to support all of the possible extensions for the protocol. This
        will permit lightweight implementations for specialized tasks
        where processing resources are constrained. This could be
        accomplished by defining particular profiles for particular uses
        of the protocol.
   
   c.   Be flexible in allocation of intelligence between MG and MGC.
        For example, an MGC may want to allow the MG to assign
        particular MG resources in some implementations, while in
        others, the MGC may want to be the one to assign MG resources
        for use.
   
   d.   Support scalability from very small to very large MGs: The
        protocol must support MGs with capacities ranging from one to
        millions of terminations.
   
   e.   Support scalability from very small to very large MGC span of
        control: The protocol should support MGCs that control from one
        MG to a few tens of thousands of MGs.
   
   f.   Support the needs of a residential gateway that supports one to
        a few lines, and the needs of a large PSTN gateway supporting
        tens of thousands of lines. Protocol mechanisms favoring one
        extreme or the other should be minimized in favor of more
        general purpose mechanism applicable to a wide range of MGs.
        Where special purpose mechanisms are proposed to optimize a
        subset of implementations, such mechanisms should be defined as
        optional, and should have minimal impact on the rest of the
        protocol.
   
   g.   Facilitate MG and MGC version upgrades independently of one
        another. The protocol must include a version identifier in the
        initial message exchange.
   
   h.   Facilitate the discovery of the protocol capabilities of one
        entity by the other.
   
   i.   Allow commands to be specified as optional (they can be ignored)
        or mandatory (the command must be rejected) and, within a
        command, allow parameters to be specified as optional (they can
        be ignored) or mandatory (the command must be rejected).
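
One reading of item i is sketched below for parameters: a receiver ignores a parameter it does not support when that parameter is marked optional, and rejects the whole command when it is marked mandatory. That reading, the parameter names and the result tuples are assumptions made for this example.

```python
SUPPORTED = {"codec", "echo-cancel"}  # parameters this hypothetical MG understands


def process(params):
    """params maps a parameter name to a (value, mandatory) pair."""
    accepted = {}
    for name, (value, mandatory) in params.items():
        if name in SUPPORTED:
            accepted[name] = value
        elif mandatory:
            return ("rejected", name)  # unsupported mandatory parameter
        # Unsupported optional parameters are silently ignored.
    return ("ok", accepted)


lenient = process({"codec": ("G.711", True), "x-vendor": ("1", False)})
strict = process({"x-vendor": ("1", True)})
```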

8.1. MG-MGC Association Requirements

The Protocol must:

   a.   Support the establishment of a control relationship between an
        MGC and an MG.
   
   b.   Allow multiple MGCs to send control messages to an MG. Thus, the
        protocol must allow control messages from multiple signalling
        addresses to a single MG.
   
   c.   Provide a method for the MG to tell an MGC that the MG received
        a command for a resource that is under the control of a
        different MGC.
   
   d.   Support a method for the MG to control the rate of requests it
        receives from the MGC (e.g. windowing techniques, exponential
        back-off).
   
   e.   Support a method for the MG to tell an MGC that it cannot handle
        any more requests.
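
Item d mentions exponential back-off as one technique for rate control. The sketch below computes a capped geometric series of retry delays; the parameter names and the chosen values are illustrative assumptions, not values recommended by this document.

```python
def backoff_delays(initial, factor, ceiling, attempts):
    """Return the wait before each retry, growing geometrically up to a cap."""
    delays = []
    delay = initial
    for _ in range(attempts):
        delays.append(min(delay, ceiling))
        delay *= factor
    return delays


delays = backoff_delays(initial=0.5, factor=2, ceiling=4.0, attempts=5)
# delays is [0.5, 1.0, 2.0, 4.0, 4.0]
```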

8.2. Performance Requirements

The protocol must:

   a.   Minimize message exchanges between MG and MGC, for example
        during boot/reboot, and during continuity tests.
   
   b.   Support Continuity test constraints, which are a maximum of
        200ms cross-MGC IAM (IAM is the name given to an SS7 connection
        setup message) propagation delay, and a maximum of 200ms from
        end of dialing to IAM emission.
   
   c.   Make efficient use of the underlying transport mechanism. For
        example, protocol PDU sizes vs. transport MTU sizes need to be
        considered in designing the protocol.
   
   d.   Not contain inherent architectural or signaling constraints that
        would prohibit peak calling rates on the order of 140
        calls/second on a moderately loaded network.
   
   e.   Allow for default/provisioned settings so that commands need
        only contain non-default parameters.
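Item e can be illustrated with a small sketch in which the sender strips parameters that match the provisioned defaults and the receiver restores them. The parameter names and default values below are invented for the example and are not taken from any protocol definition.

```python
# Illustrative provisioned defaults; the names and values are assumptions.
PROVISIONED_DEFAULTS = {"codec": "PCMU", "echo_cancel": True, "packet_ms": 20}


def minimal_parameters(desired, defaults=PROVISIONED_DEFAULTS):
    """Sender side: keep only parameters that differ from the defaults."""
    return {k: v for k, v in desired.items()
            if k not in defaults or defaults[k] != v}


def effective_parameters(command_params, defaults=PROVISIONED_DEFAULTS):
    """Receiver side: fill in every parameter the command did not carry."""
    merged = dict(defaults)
    merged.update(command_params)
    return merged
```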

9. Transport

9.1. Assumptions made for underlying network

The protocol must assume that the underlying network:

   a.   May be over large shared networks: proximity assumptions are not
        allowed.
   
   b.   Does not assure reliable delivery of messages.
   
   c.   Does not guarantee ordering of messages: Sequenced delivery of
        messages associated with the same source of events is not
        assumed.
   
   d.   Does not prevent duplicate transmissions.

9.2. Transport Requirements

The protocol must:

   a.   Provide the ability to abort delivery of obsolete messages at
        the sending end if their transmission has not been successfully
        completed. For example, aborting a command that has been
        overtaken by events.
   
   b.   Support priority messages: The protocol shall allow a command
        precedence to allow priority messages to supersede non-priority
        messages.
   
   c.   Support large fan-out at the MGC.
   
   d.   Provide a way for one entity to correlate commands and responses
        with the other entity.
   
   e.   Provide a reason for any command failure.
   
   f.   Ensure that the loss of a packet does not stall messages that
        are unrelated to the message(s) contained in the lost packet.

Note that there may be enough protocol reliability requirements here to warrant a separate reliable transport layer, written apart from the Media Gateway Control Protocol. The Megaco reliable transport requirements also need to be compared with the similar Sigtran requirements.
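Several of the points in sections 9.1 and 9.2 (duplicate transmissions, out-of-order arrival, correlation of commands with responses, and aborting obsolete commands) can be sketched with per-command transaction identifiers. The `Sender` and `Receiver` classes and the use of integer identifiers are assumptions for illustration. Retransmission timers and the wire format are not modelled.

```python
import itertools


class Sender:
    def __init__(self):
        self._ids = itertools.count(1)
        self.pending = {}  # transaction id -> command awaiting a response

    def send(self, command):
        tid = next(self._ids)
        self.pending[tid] = command
        return (tid, command)

    def abort(self, tid):
        """Drop an obsolete command whose delivery has not completed."""
        return self.pending.pop(tid, None) is not None

    def on_response(self, tid, result):
        """Correlate a response with its command; None if unknown or stale."""
        command = self.pending.pop(tid, None)
        return None if command is None else (command, result)


class Receiver:
    def __init__(self, execute):
        self.execute = execute
        self.completed = {}  # transaction id -> cached response

    def on_message(self, tid, command):
        if tid not in self.completed:
            # First arrival: execute exactly once, whatever the arrival order.
            self.completed[tid] = self.execute(command)
        # Duplicate arrival: replay the cached response without re-executing.
        return (tid, self.completed[tid])
```

Because each transaction is tracked independently, a lost or delayed message leaves unrelated transactions unaffected, which is the property item f asks for.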

10. Security Requirements

Security mechanisms may be specified as provided in underlying transport mechanisms, such as IPSEC. The protocol, or such mechanisms, must:

   a.   Allow for mutual authentication at the start of an MGC-MG
        association.
   
   b.   Allow for preservation of the integrity of control messages once
        the association has been established.
   
   c.   Allow for optional confidentiality protection of control
        messages.  The mechanism should allow a choice in the algorithm
        to be used.
   
   d.   Operate across untrusted domains in a secure fashion.
   
   e.   Support non-repudiation for a customer-located MG talking to a
        network operator's MGC.
   
   f.   Define mechanisms to mitigate denial of service attacks.

Note: the protocol document will need to include an extended discussion of security requirements, offering more precision on each threat and giving a complete picture of the defense, including non-protocol measures such as configuration.

   g.   It would be desirable for the protocol to be able to pass
        through commonly-used firewalls.

11. Requirements specific to particular bearer types

The bearer types listed in Table 1 can be packaged into different types of MGs. Examples are listed in the following sections. How they are packaged is outside the scope of the general Media Gateway control protocol. The protocol must support all of the bearer types listed in Table 1.

Table 1: Bearer Types and Applications

     Bearer Type                   Applications       Transit Network
     ================================================================
     Trunk+ISUP                    trunking/access    IP, ATM, FR
                                   Voice,Fax,NAS,
                                   Multimedia
     
     Trunk+MF                      trunking/access    IP, ATM, FR
                                   Voice,Fax,NAS,
                                   Multimedia
     
     ISDN                          trunking/access    IP, ATM, FR
                                   Voice,Fax,NAS,
                                   Multimedia
     
     Analogue                      Voice,Fax,         IP, ATM, FR
                                   Text Telephony
     
     Termination in a Restricted   Voice,Fax,         IP, ATM, FR
     Capability Gateway            Text Telephony
     
     Application Termination       IVR,ARF, Announcement Server,
                                   Voice Recognition Server,...
     
     Multimedia H.323              H.323 Multimedia   IP, ATM, FR
                                   Gateway and MCU
     
     Multimedia H.320              H.323 GW and MCU   ISDN, IP, ATM, FR

11.1. Media-specific Bearer Types

This section describes requirements for handling terminations attached to specific types of networks.

11.1.1. Requirements for TDM PSTN (Circuit)

This bearer type is applicable to a Trunking GW, Access GW, ...

The protocol must allow:

   a.   the MGC to specify the encoding to use on the attached circuit.
   
   b.   In general, if something is set by a global signalling protocol
        (e.g. ISUP allows mu-law or A-law to be signalled) then it must
        be settable by the protocol.
   
   c.   TDM attributes:
   
   *    Echo cancellation,
   
   *    PCM encoding or other voice compression (e.g. mu-law or A-law),
   
   *    encryption,
   
   *    rate adaptation (e.g. V.110, or V.120).
   
   d.   for incoming calls, identification of a specific TDM circuit
        (timeslot and facility).
   
   e.   for calls outgoing to the circuit network, identification of a
        specific circuit or identification of a circuit group with the
        indication that the MG must select and return the identification
        of an available member of that group.
   
   f.   specification of the default encoding of content passing to and
        from a given circuit, possibly on a logical or physical circuit
        group basis.
   
   g.   specification at any point during the life of a connection of
        variable aspects of the content encoding, particularly including
        channel information capacity.
   
   h.   specification at any point during the life of a connection of
        loss padding to be applied to incoming and outgoing media
        streams at the circuit termination.
   
   i.   specification at any point during the life of a connection of
        the applicability of echo cancellation to the outgoing media
        stream.
   
   j.   Multi-rate calls to/from the SCN.
   
   k.   H-channel (n x 64K) calls to/from the SCN.
   
   l.   B channel aggregation protocols for creating high speed channels
        for multimedia over the SCN.
   
   m.   Modem terminations and negotiations.

The protocol may also allow:

   n.   specification of sub-channel media streams,
   
   o.   specification of multi-channel media streams.
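Items d and e distinguish between naming a specific circuit and naming a circuit group from which the MG selects and returns an available member. A minimal sketch of that selection follows. It assumes a circuit is identified by a (facility, timeslot) pair. The `CircuitGroup` class and its `seize` method are hypothetical.

```python
class CircuitGroup:
    def __init__(self, name, circuits):
        self.name = name
        # circuit id, here a (facility, timeslot) tuple -> currently in use?
        self.busy = {c: False for c in circuits}

    def seize(self, circuit=None):
        """Seize a specific circuit, or any idle member when circuit is None."""
        if circuit is None:
            circuit = next((c for c, used in self.busy.items() if not used), None)
            if circuit is None:
                raise LookupError(f"no idle circuit in group {self.name}")
        elif self.busy.get(circuit, True):
            # Unknown circuits are treated the same as busy ones.
            raise LookupError(f"circuit {circuit} unavailable")
        self.busy[circuit] = True
        return circuit  # the identification reported back to the MGC
```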

11.1.2. Packet Bearer Type

The protocol must be able to specify:

   a.   ingress and egress coding (i.e. the way packets coming in and
        out are encoded) (including encryption).
   
   b.   near and far-end ports and other session parameters for RTP and
        RTCP.

The protocol must support reporting of:

   c.   re-negotiation of codec for cause - for further study
   
   d.   on Trunking and Access Gateways, resources capable of more than
        one active connection at a time must also be capable of mixing
        and packet duplication.

The protocol must allow:

   e.   specification of parameters for outgoing and incoming packet
        flows at separate points in the life of the connection (because
        far-end port addresses are typically obtained through a separate
        signalling exchange before or after the near-end port addresses
        are assigned).
   
   f.   the possibility for each Media Gateway to allocate the ports on
        which it will receive packet flows (including RTCP as well as
        media streams) and report its allocations to the Media Gateway
        Controller for signalling to the far end.  Note that support of
        different IP backbone providers on a per call basis would
        require that the ports on which packets flow be selected by the
        MGC (but only if the IP address of the MG is different for each
        backbone provider).
   
   g.   the specification at any point during the life of a connection
        of RTP payload type and RTP session number for each
        RTP-encapsulated media flow.
   
   h.   the ability to specify whether outgoing flows are to be uni-cast
        or multi-cast. Note that on an IP network this information is
        implicit in the destination address, but in other networks this
        is a connection parameter.
   
   i.   invoking of encryption/decryption on media flows and
        specification of the associated algorithm and key.

The protocol should also allow:

   j.   the MGC to configure non-RTP (proprietary or other) encapsulated
        packet flows.
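Items e, f and g describe a connection whose receive side is allocated by the MG first, while the far-end address and the payload type may be supplied or changed later. The sketch below models that lifecycle. The class names, the starting port number and the example addresses (taken from the documentation address ranges) are assumptions. Placing RTCP on the port following an even RTP port reflects common RTP practice and is not a requirement stated here.

```python
from dataclasses import dataclass
from typing import Optional, Tuple


@dataclass
class RtpConnection:
    local: Tuple[str, int]                     # address and RTP port chosen by the MG
    remote: Optional[Tuple[str, int]] = None   # far end, often learned later
    payload_type: int = 0                      # 0 is the static RTP payload type for PCMU

    @property
    def local_rtcp_port(self):
        return self.local[1] + 1

    def can_send(self):
        # Outgoing flow parameters may arrive at a later point in the call.
        return self.remote is not None


class PortAllocator:
    """MG-side allocation of receive ports (item f)."""

    def __init__(self, address, first_port=16384):
        self.address = address
        self.next_port = first_port

    def create_connection(self):
        conn = RtpConnection(local=(self.address, self.next_port))
        self.next_port += 2  # even port for RTP, following odd port for RTCP
        return conn
```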

11.1.3. Bearer type requirements for ATM

This bearer type is applicable to Trunking GW, Access GW, ....

11.1.3.1. Addressing

   a.   The protocol must be able to specify the following termination
        attributes:
   
   *    VC identifier,
   
   *    VC identifier plus AAL2 slot, and variant of these allowing the
        gateway to choose (part of) the identifier,
   
   *    remote termination network address, remote MG name.
   
   b.   Allow specification of an ATM termination which is to be
        assigned to an MG connection as a VC identifier, a VC identifier
        plus AAL2 slot, a wild-carded variant of either of these. A
        remote termination network address, or a remote MG name could
        also be used when the MG can select the VC and change the VC
        during the life of the connection by using ATM signalling.
   
   c.   Provide an indication by the MG of the VC identifier and
        possibly AAL2 slot of the termination actually assigned to a
        connection.
   
   d.   Provide a means to refer subsequently to that termination.
   
   e.   Refer to an existing VCC as the physical interface + Virtual
        Path Identifier (VPI) + Virtual Circuit Identifier (VCI).
   
   f.   Where the VCC is locally established (SVCs signalled by the
        Gateway through UNI or PNNI signalling or similar), the VCC must
        be indirectly referred to in terms which are of significance to
        both ends of the VCC. For example, a global name or the ATM
        address of the ATM devices at each end of the VCC. However, it
        is possible/probable that there may be several VCCs between a
        given pair of ATM devices. Therefore the ATM address pair must
        be further resolved by a VCC identifier unambiguous within the
        context of the ATM address pair.
   
   g.   Refer to a VCC as the Remote GW ATM End System Address + VCCI.
   
   h.   Allow the VCCI to be selected by the MG or imposed on the MG.
   
   i.   Support all ATM addressing variants (e.g. ATM End System Address
        (AESA) and E.164).

11.1.3.2. Connection related requirements

The protocol must:

   a.   Allow for the de-coupling of creation/deletion of the
        narrow-band connection from the creation/deletion of the
        underlying VCC.
   
   b.   Allow for efficient disconnection of all connections associated
        with a physical port or VCC. As an example, this could aggregate
        disconnections across a broadband circuit which experienced a
        physical error.
   
   c.   Allow the connection established using this protocol to be
        carried over a VCC, which may be a:
   
   *    PVC or SPVC,
   
   *    an SVC established on demand, either by the MGC itself or by a
        broker acting on its behalf or,
   
   *    an SVC originated as required by the local MG, or by the remote
        end to the local MG through UNI or PNNI signalling.
   
   d.   Allow ATM transport parameters and QoS parameters to be passed
        to the MG.
   
   e.   Allow blocking and unblocking of a physical interface, a VCC or
        an AAL1/AAL2 channel.

The protocol should:

   f.   Where a VCC is required to be established on a per narrow-band
        call basis, allow all necessary information to be passed in one
        message.

11.1.3.3. Media adaptation

The protocol must:

   a.   Allow AAL parameters to be passed to the MG.
   
   b.   Allow AAL1/AAL2 multiple narrow-band calls to be mapped to a
        single VCC. For AAL2, these calls are differentiated within each
        VCC by an AAL2 channel identifier. An AAL2 connection may span
        more than 1 VCC and transit AAL2 switching devices.  ITU
        Q.2630.1 [2] defines an end-to-end identifier called the Served
        User Generated Reference (SUGR). It carries information from the
        originating user of the AAL2 signalling protocol to the
        terminating user transparently and unmodified.
   
   c.   Allow unambiguous binding of a narrow band call to an AAL2
        connection identifier, or AAL1 channel, within the specified
        VCC.
   
   d.   Allow the AAL2 connection identifier, or AAL1 channel, to be
        selected by the MG or imposed on the MG.
   
   e.   Allow the use of the AAL2 channel identifier (cid) instead of
        the AAL2 connection identifier.
   
   f.   Allow the AAL2 voice profile to be imposed or negotiated before
        the start of the connection.  AAL2 allows for variable length
        packets and varying packet rates, with multiple codecs possible
        within a given profile. Thus a given call may upgrade or
        downgrade the codec within the lifetime of the call. Idle
        channels may generate zero bandwidth. Thus an AAL2 VCC may vary
        in bandwidth and possibly exceed its contract. Congestion
        controls within a gateway may react to congestion by modifying
        codec rates/types.
   
   g.   Allow the MGC to instruct the MG on how individual narrow-band
        calls behave under congestion.
   
   h.   Allow for the MGC to specify an AAL5 bearer, with the following
        choices:
   
   *    Per ATM Forum standard AF-VTOA-0083 [4],
   
   *    RTP with IP/UDP,
   
   *    RTP without IP/UDP per H.323v2 Annex C [5],
   
   *    Compressed RTP per ATM Forum AF-SAA-0124.000 [6].
   
   i.   Allow unambiguous binding of a narrow band call to an AAL1
        channel within the specified VCC. (In AAL1, multiple narrow-band
        calls may be mapped to a single VCC.)

11.1.3.4. Reporting requirements

The protocol should:

   a.   Allow any end-of-call statistics to show loss/restoration of the
        underlying VCC within the call's duration, together with the
        duration of loss.
   
   b.   Allow notification, as requested by MGC, of any congestion
        avoidance actions taken by the MG.

The protocol must:

   c.   Allow for ATM VCCs or AAL2 channels to be audited by the MGC.
   
   d.   Allow changes in status of ATM VCCs or AAL2 channels to be
        notified as requested by the MGC.
   
   e.   Allow the MGC to query the resource and endpoint availability.
        Resources may include VCCs and DSPs. VCCs may be up or down.
        End-points may be connection-free, connected or unavailable.

11.1.3.5. Functional requirements

The protocol must:

   a.   Allow an MGC to reserve a bearer, and specify a route for it
        through the network.

11.2. Application-Specific Requirements

11.2.1. Trunking Gateway

A Trunking Gateway is an interface between SCN networks and Voice over IP or Voice over ATM networks. Such gateways typically interface to SS7 or other NNI signalling on the SCN and manage a large number of digital circuits.

The protocol must:

   a.   Provide circuit and packet-side loopback.
   
   b.   Provide circuit-side n x 64 kbit/s connections.
   
   c.   Provide subrate and multirate connections (for further study).
   
   d.   Provide the capability to support reporting/generation of
        per-trunk CAS signalling (DP, DTMF, MF, R2, J2, and national
        variants).
   
   e.   Provide the capability to support reporting of detected DTMF
        events either digit-by-digit, as a sequence of detected digits
        with a flexible mechanism for the MG to determine the likely end
        of dial string, or in a separate RTP stream.
   
   f.   Provide the capability to support ANI and DNIS generation and
        reception.

11.2.2. Access Gateway

An Access Gateway connects UNI interfaces, such as ISDN (PRI and BRI) or traditional analog voice terminal interfaces, to a Voice over IP, Voice over ATM, or Voice over Frame Relay network.

The Protocol must:

   a.   Support detection and generation of analog line signaling
        (hook-state, ring generation).
   
   b.   Provide the capability to support reporting of detected DTMF
        events either digit-by-digit, as a sequence of detected digits
        with a flexible mechanism for the MG to determine the likely end
        of dial string, or in a separate RTP stream.
   
   c.   Not require scripting mechanisms, event buffering, or digit map
        storage when implementing restricted function (1-2 line)
        gateways with very limited capabilities.
   
   d.   Provide the capability to support CallerID generation and
        reception.

Proxying of the protocol is for further study.

11.2.3. Trunking/Access Gateway with fax ports

   a.   the protocol must be able to indicate detection of fax media.
   
   b.   the protocol must be able to specify T.38 for the transport of
        the fax.
   
   c.   the protocol must be able to specify G.711 encoding for
        transport of fax tones across a packet network.

11.2.4. Trunking/Access Gateway with text telephone access ports

An access gateway with ports capable of text telephone communication must provide communication between text telephones in the SCN and text conversation channels in the packet network.

Text telephone capability of ports is assumed to be possible to combine with other options for calls as described in section 11.2.5 (e.) on "Adaptable NASes".

The port is assumed to adjust for the differences in the supported text telephone protocols, so that the text media stream can be communicated T.140 coded in the packet network without further transcoding [7].

   The protocol must be capable of reporting the type of text telephone
   that is connected to the SCN port. The foreseen types are the same as
   the ones supported by ITU-T V.18:  DTMF, EDT, Baudot-45, Baudot-50,
   Bell, V.21, Minitel and V.18. It should be possible to control which
   protocols are supported. The SCN port is assumed to contain ITU-T
   V.18 functionality [8].

The protocol must be able to control the following functionality levels of text telephone support:

   a.   Simple text-only support: The call is set into text mode from
        the beginning of the call, in order to conduct a text-only
        conversation.
   
   b.   Alternating text-voice support: The call may begin in voice mode
        or text mode and, at any moment during the call, change mode on
        request by the SCN user. On the packet side, the two media
        streams for voice and text must be opened, and it must be
        possible to control the feeding of each stream by the protocol.
   
   c.   Simultaneous text and voice support: The call is performed in a
        mode when simultaneous text and voice streams are supported. The
        call may start in voice mode and during the call change state to
        a text-and-voice call.

A port may implement only level a, or any level combination of a, b and c, always including level a.

The protocol must support:

   d.   A text based alternative to the interactive voice response, or
        audio resource functionality of the gateway when the port is
        used in text telephone mode.
   
   e.   Selection of which national translation table is to be used
        between the Unicode based T.140 and the 5-7 bit based text
        telephone protocols.
   
   f.   Control of the V.18 probe message to be used on incoming calls.

11.2.5. Network Access Server

A NAS is an access gateway, or Media Gateway (MG), which terminates modem signals or synchronous HDLC connections from a network (e.g. SCN or xDSL network) and provides data access to the packet network. Only those requirements specific to a NAS are described here.

Figure 1 provides a reference architecture for a Network Access Server (NAS). Signaling comes into the MGC and the MGC controls the NAS.

                          +-------+        +-------+
               Signaling  |       |        |       |
               -----------+  MGC  +        |  AAA  |
                          |       |        |       |
                          +---+---+        +--+----+
                              |               |
                        Megaco|_______________|
                                              |
                                              |
                          +---+---+         ~~|~~~
                Bearer    |       |        (      )
               -----------+  NAS  +-------(   IP   )
                          |       |        (      )
                          +-------+         ~~~~~~

Figure 1: NAS reference architecture

The Protocol must support:

   a.   Callback capabilities:
   
   *    Callback
   
   b.   Modem calls.  The protocol must be able to specify the modem
        type(s) to be used for the call.
   
   c.   Carriage of bearer information.  The protocol must be able to
        specify the data rate of the TDM connection (e.g., 64 kbit/s, 56
        kbit/s, 384 kbit/s), if this is available from the SCN.
   
   d.   Rate Adaptation: The protocol must be able to specify the type
        of rate adaptation to be used for the call including indicating
        the subrate, if this is available from the SCN (e.g. 56K, or
        V.110 signaled in Bearer capabilities with subrate connection of
        19.2kbit/s).
   
   e.   Adaptable NASes: The protocol must be able to support multiple
        options for an incoming call to allow the NAS to dynamically
        select the proper type of call.  For example, an incoming ISDN
        call coded for "Speech" Bearer Capability could actually be a
        voice, modem, fax, text telephone, or 56 kbit/s synchronous
        call.  The protocol should allow the NAS to report back to the
        MGC the actual type of call once it is detected.

The 4 basic types of bearer for a NAS are:

   1.   Circuit Mode, 64-kbps, 8-kHz structured, Speech
   
   2.   Circuit Mode, 64-kbps, 8-kHz structured, 3.1-kHz, Audio
   
   3.   Circuit Mode, 64-kbps, 8-kHz structured, Unrestricted Digital
        Transmission-Rate Adapted from 56-kbps
   
   4.   Circuit Mode, 64-kbps, 8-kHz structured, Unrestricted Digital
        Transmission
   
   f.   Passage of Called and Calling Party Number information to the
        NAS from the MGC. Also, passage of Charge Number/Billing Number,
        Redirecting Number, and Original Call Number, if known, to the
        NAS from the MGC. If there are other Q.931 fields that need to
        be passed from the MGC to the MG, then it should be possible to
        pass them [9].
   
   g.   Ability for the MGC to direct the NAS to connect to a specific
        tunnel, for example to an LNS, or to an AAA server.
   
   h.   When asked by the MGC, be able to report capability information,
        for example, connection types (V.34/V.90/Synch ISDN..), AAA
        mechanism (RADIUS/DIAMETER/..), access type (PPP/SLIP/..) after
        restart or upgrade.

11.2.6. Restricted Capability Gateway

The requirements here may also be applied to small analog gateways, and to cable/xDSL modems. See also the section on access gateways.

The Protocol must support:

   a.   The ability to provide a scaled down version of the protocol.
        When features of the protocol are not supported, an appropriate
        error message must be sent. Appropriate default action must be
        defined.  Where this is defined may be outside the scope of the
        protocol.
   
   b.   The ability to provide device capability information to the MGC
        with respect to the use of the protocol.

11.2.7. Multimedia Gateway

The protocol must have sufficient capability to support a multimedia gateway. H.320 and H.324 are characterized by a single data stream with multiple media streams multiplexed on it.

If the mapping is from H.320 or H.324 on the circuit side, and H.323 on the packet side, it is assumed that the MG knows how to map respective subchannels from H.320/H.324 side to streams on packet side. If extra information is required when connecting two terminations, then it must be supplied so that the connections are not ambiguous.

The Multimedia Gateway:

   1)   should support Bonding Bearer channel aggregation,
   
   2)   must support 2xB (and possibly higher rates) aggregation via
        H.221,
   
   3)   must be able to dynamically change the size of audio, video and
        data channels within the H.320 multiplex,
   
   4)   must react to changes in the H.320 multiplex on 20 msec
        boundaries,
   
   5)   must support TCS4/IIS BAS commands,
   
   6)   must support detection and creation of DTMF tones,
   
   7)   should support SNMP MIBs as specified in H.341 [3].
   
   a.   If some of the above cannot be handled by the MGC to MG protocol
        due to timing constraints, then it is likely that the H.245 to
        H.242 processing must take place in the MG. Otherwise, support
        for this functionality in the multimedia gateway is a protocol
        requirement.
   
   b.   It must be possible on a call by call basis for the protocol to
        specify different applications. Thus, one call might be PSTN to
        PSTN under SS7 control, while the next might be ISDN/H.320 under
        SS7 control to H.323.  This is only one example; the key
        requirement is that the protocol not prevent such applications.

11.2.8. Audio Resource Function

An Audio Resource Function (ARF) consists of one or more functional modules which can be deployed on a stand-alone media gateway server (IVR, Intelligent Peripheral, speech/speaker recognition unit, etc.) or on a traditional media gateway. Such a media gateway is known as an Audio Enabled Gateway (AEG) if it performs tasks defined in one or more of the following ARF functional modules:

Play Audio,
DTMF Collect,
Record Audio,
Speech Recognition,
Speaker Verification/Identification,
Auditory Feature Extraction/Recognition, or
Audio Conferencing.

Additional ARF function modules that support human to machine communications through the use of telephony tones (e.g., DTMF) or auditory means (e.g. speech) may be appended to the AEG definition in future versions of these requirements.

Generic scripting packages for any module must support all the requirements for that module. Any package extension for a given module must include, by inheritance or explicit reference, the requirements for that given module.

The protocol requirements for each of the ARF modules are provided in the following subsections.

11.2.8.1. Play Audio Module

   a.   Be able to provide the following basic operation:
   
   -    request an ARF MG to play an announcement.
   
   b.   Be able to specify these play characteristics:
   
   -    Play volume
   
   -    Play speed
   
   -    Play iterations
   
   -    Interval between play iterations
   
   -    Play duration
   
   c.   Permit the specification of voice variables such as DN, number,
        date, time, etc.  The protocol must allow specification of both
        the value (e.g. 234-3456) and the type (e.g. Directory
        number).
   
   d.   Using the terminology that a segment is a unit of playable
        speech, or is an abstraction that is resolvable to a unit of
        playable speech, permit specification of the following segment
        types:
   
   -    A provisioned recording.
   
   -    A block of text to be converted to speech.
   
   -    A block of text to be displayed on a device.
   
   -    A length of silence qualified by duration.
   
   -    An algorithmically generated tone.
   
   -    A voice variable, specified by type and value.  Given a variable
        type and value, the IVR/ARF unit would dynamically assemble the
        phrases required for its playback.
   
   -    An abstraction that represents a sequence of audio segments.
        Nesting of these abstractions must also be permitted.

An example of this abstraction is a sequence of audio segments, the first of which is a recording of the words "The number you have dialed", followed by a Directory Number variable, followed by a recording of the words "is no longer in service".

   -    An abstraction that represents a set of audio segments and which
        is resolved to a single segment by a qualifier.  Nesting of
        these abstractions must be permitted.

For example take a set of audio segments recorded in different languages all of which express the semantic concept "The number you have dialed is no longer in service". The set is resolved by a language qualifier. If the qualifier is "French", the set resolves to the French version of this announcement.

   In the case of a nested abstraction consisting of a set qualified by
   language at one level and a set qualified by gender at another
   level, it would be possible to specify that an announcement be
   played in French and spoken by a female voice.
   
   e.   Provide two different methods of audio specification:
   
   -    Direct specification of the audio components to be played by
        specifying the sequence of segments in the command itself.
   
   -    Indirect specification of the audio components to be played by
        reference to a single identifier that resolves to a provisioned
        sequence of audio segments.
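The segment abstractions in item d (sequences, qualifier-resolved sets, and nesting of both) can be modelled as a small recursive structure. The tuple encoding, the `resolve` function and the recording identifiers below are purely illustrative assumptions. The requirements do not define a representation.

```python
def resolve(segment, qualifiers):
    """Flatten a possibly nested segment into a list of playable units."""
    kind = segment[0]
    if kind == "seq":   # ("seq", [segment, ...])
        return [unit for child in segment[1] for unit in resolve(child, qualifiers)]
    if kind == "set":   # ("set", qualifier_name, {qualifier_value: segment})
        _, name, choices = segment
        return resolve(choices[qualifiers[name]], qualifiers)
    # Leaf: e.g. ("recording", id) or ("variable", type, value).
    return [segment]


# Hypothetical provisioned announcement: a set qualified by language, where
# the French member is itself a set qualified by gender.
NOT_IN_SERVICE = ("set", "language", {
    "English": ("seq", [
        ("recording", "the-number-you-have-dialed"),
        ("variable", "DN", "234-3456"),
        ("recording", "is-no-longer-in-service"),
    ]),
    "French": ("set", "gender", {
        "female": ("recording", "hors-service-fr-f"),
        "male": ("recording", "hors-service-fr-m"),
    }),
})
```

Calling `resolve(NOT_IN_SERVICE, {"language": "French", "gender": "female"})` selects the single French recording spoken by a female voice, while the English qualifier yields the three-part sequence with the Directory Number variable in the middle.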

11.2.8.2. DTMF Collect Module

The DTMF Collect Module must support all of the requirements in the Play Audio Module in addition to the following requirements:

   a.   Be able to provide the following basic operation:
   
   -    request an AEG to play an announcement, which may optionally be
        terminated by DTMF, and then collect DTMF
   
   b.   Be able to specify these event collection characteristics:
   
   -    The number of attempts to give the user to enter a valid DTMF
        pattern.
   
   c.   With respect to digit timers, allow the specification of:
   
   -    Time allowed to enter the first digit.
   
   -    Time allowed for user to enter each digit subsequent to the
        first digit.
   
   -    Time allowed for user to enter a digit once the maximum expected
        number of digits has been entered.
   
   d.   To be able to allow multiple prompt operations for DTMF digit
        collection, voice recording (if supported), and/or speech
        recognition analysis (if supported), provide the following
        types of prompts:
   
   -    Initial Prompt
   
   -    Reprompt
   
   -    Error prompt
   
   -    Failure announcement
   
   -    Success announcement.
   
   e.   To allow digit pattern matching, allow the specification of:
   
   -    maximum number of digits to collect.
   
   -    minimum number of digits to collect.
   
   -    a digit pattern using a regular expression.
   
   f.   To allow digit buffer control, allow the specification of:
   
   -    Ability to clear digit buffer prior to playing initial prompt
        (default is not to clear buffer).
   
   -    Default clearing of buffer following playing of un-interruptible
        announcement segment.
   
   -    Default clearing of buffer before playing a re-prompt in
        response to previous invalid input.
   
   g.   Provide a method to specify DTMF interruptibility on a per audio
        segment basis.
   
   h.   Allow the specification of definable key sequences for DTMF
        digit collection to:
   
   -    Discard collected digits in progress, replay the prompt, and
        resume DTMF digit collection.
   
   -    Discard collected digits in progress and resume DTMF digit
        collection.
   
   -    Terminate the current operation and return the terminating key
        sequence to the MGC.
   
   i.   Provide a way to ask the ARF MG to support the following
        definable keys for digit collection and recording. These keys
        would then be able to be acted upon by the ARF MG:
   
   -    A key to terminate playing of an announcement in progress.
   
   -    A set of one or more keys that can be accepted as the first
        digit to be collected.
   
   -    A key that signals the end of user input.  The key may or may
        not be returned to the MGC along with the input already
        collected.
   
   -    Keys to stop playing the current announcement and resume playing
        at the beginning of the first segment of the announcement, last
        segment of the announcement, previous segment of the
        announcement, next segment of the announcement, or the current
        announcement segment.
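
As an illustration only, the digit timers in item (c) and the pattern matching parameters in item (e) could be modelled as shown below. This is a minimal sketch in Python and is not part of any protocol definition. The names DigitCollectSpec and collect are hypothetical, key presses are supplied as (digit, gap in seconds) pairs instead of coming from a real DTMF detector, and the timer that applies after the maximum number of digits has been entered is omitted for brevity.

```python
import re
from dataclasses import dataclass


@dataclass
class DigitCollectSpec:
    """Hypothetical container for parameters named in items (c) and (e)."""
    min_digits: int
    max_digits: int
    pattern: str              # regular expression the digits must match
    first_digit_timer: float  # seconds allowed to enter the first digit
    inter_digit_timer: float  # seconds allowed for each subsequent digit


def collect(spec, keys):
    """Apply spec to a list of (digit, gap_seconds) key presses.

    gap_seconds is the time since the prompt ended (for the first key) or
    since the previous key. Returns (outcome, digits), where outcome is
    "match", "nomatch", or "timeout".
    """
    digits = ""
    for digit, gap in keys:
        if len(digits) >= spec.max_digits:
            break  # enough digits collected; later keys are ignored here
        limit = spec.first_digit_timer if not digits else spec.inter_digit_timer
        if gap > limit:
            break  # the applicable timer expired before this key arrived
        digits += digit
    if not digits:
        return "timeout", digits
    if len(digits) < spec.min_digits or not re.fullmatch(spec.pattern, digits):
        return "nomatch", digits
    return "match", digits
```

In this sketch a "nomatch" or "timeout" outcome is what would consume one of the attempts counted in item (b); how an MG reports these outcomes to the MGC is left open here.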

11.2.8.3. Record Audio Module

The Record Module must support all of the requirements in the Play Module in addition to the following requirements:

   a.   Be able to provide the following basic operation:
   
   -    request an AEG to play an announcement and then record voice.
   
   b.   Be able to specify these event collection characteristics:
   
   -    The number of attempts to give the user to make a recording.
   
   c.   With respect to recording timers, allow the specification of:
   
   -    Time to wait for the user to initially speak.
   
   -    The amount of silence necessary following the last speech
        segment for the recording to be considered complete.
   
   -    The maximum allowable length of the recording (not including
        pre- and post-speech silence).
   
   d.   To allow multiple prompt operations for DTMF digit collection
        (if supported), voice recording (if supported), speech
        recognition analysis (if supported), and/or speech
        verification/identification (if supported), provide the
        following types of prompts:
   
   -    Initial Prompt
   
   -    Reprompt
   
   -    Error prompt
   
   -    Failure announcement
   
   -    Success announcement.
   
   e.   Allow the specification of definable key sequences for digit
        recording or speech recognition analysis (if supported) to:
   
   -    Discard recording in progress, replay the prompt, and resume
        recording.
   
   -    Discard recording in progress and resume recording.
   
   -    Terminate the current operation and return the terminating key
        sequence to the MGC.
   
   f.   Provide a way to ask the ARF MG to support the following
        definable keys for recording. These keys would then be able to
        be acted upon by the ARF MG:
   
   -    A key to terminate playing of an announcement in progress.
   
   -    A key that signals the end of user input.  The key may or may
        not be returned to the MGC along with the input already
        collected.
   
   -    Keys to stop playing the current announcement and resume playing
        at the beginning of the first segment of the announcement, last
        segment of the announcement, previous segment of the
        announcement, next segment of the announcement, or the current
        announcement segment.
   
   g.   While audio prompts are usually provisioned in IVR/ARF MGs,
        support changing the provisioned prompts in a voice session
        rather than a data session.  In particular, with respect to
        audio management:
   
   -    A method to replace provisioned audio with audio recorded during
        a call. The newly recorded audio must be accessible using the
        identifier of the audio it replaces.
   
   -    A method to revert from replaced audio to the original
        provisioned audio.
   
   -    A method to take audio recorded during a call and store it such
        that it is accessible to the current call only through its own
        newly created unique identifier.
   
   -    A method to take audio recorded during a call and store it such
        that it is accessible to any subsequent call through its own
        newly created identifier.
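
The four audio management methods in item (g) can be pictured with a small in-memory model. The sketch below is illustrative only: the class AudioStore, its method names, and the "rec-N" identifier format are assumptions made for this example and do not come from the requirements themselves.

```python
class AudioStore:
    """Hypothetical in-memory model of the audio management in item (g)."""

    def __init__(self, provisioned):
        self._provisioned = dict(provisioned)  # original prompts, never modified
        self._overrides = {}                   # replacements keyed by provisioned id
        self._persistent = {}                  # recordings visible to later calls
        self._per_call = {}                    # call_id -> {audio_id: recording}
        self._next_id = 0

    def _new_id(self):
        self._next_id += 1
        return f"rec-{self._next_id}"

    def replace(self, audio_id, recording):
        """Make a recording reachable under the id of the audio it replaces."""
        if audio_id not in self._provisioned:
            raise KeyError(audio_id)
        self._overrides[audio_id] = recording

    def revert(self, audio_id):
        """Go back to the original provisioned audio."""
        self._overrides.pop(audio_id, None)

    def store_for_call(self, call_id, recording):
        """Store a recording that only the given call can retrieve."""
        audio_id = self._new_id()
        self._per_call.setdefault(call_id, {})[audio_id] = recording
        return audio_id

    def store_persistent(self, recording):
        """Store a recording that any subsequent call can retrieve."""
        audio_id = self._new_id()
        self._persistent[audio_id] = recording
        return audio_id

    def lookup(self, audio_id, call_id=None):
        """Resolve an id: call-scoped, then persistent, override, provisioned."""
        tables = (self._per_call.get(call_id, {}), self._persistent,
                  self._overrides, self._provisioned)
        for table in tables:
            if audio_id in table:
                return table[audio_id]
        raise KeyError(audio_id)
```

Keeping the provisioned audio untouched and layering replacements over it is one simple way to make the revert operation trivial; other designs are equally possible.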

11.2.8.4. Speech Recognition Module

The speech recognition module can be used for a number of speech recognition applications, such as:

   -    Limited Vocabulary Isolated Speech Recognition (e.g., "yes",
        "no", the number "four"),
   
   -    Limited Vocabulary Continuous Speech Feature Recognition (e.g.,
        the utterance "four hundred twenty-three dollars"), and/or
   
   -    Continuous Speech Recognition (e.g., unconstrained speech
        recognition tasks).

The Speech Recognition Module must support all of the requirements in the Play Module in addition to the following requirements:

   a.   Be able to provide the following basic operation: request an AEG
        to play an announcement and then perform speech recognition
        analysis.
   
   b.   Be able to specify these event collection characteristics:
   
   -    The number of attempts allowed to perform the speech
        recognition task.
   
   c.   With respect to speech recognition analysis timers, allow the
        specification of:
   
   -    Time to wait for the user to initially speak.
   
   -    The amount of silence necessary following the last speech
        segment for the speech recognition analysis segment to be
        considered complete.
   
   -    The maximum allowable length of the speech recognition analysis
        (not including pre- and post-speech silence).
   
   d.   To allow multiple prompt operations for DTMF digit collection
        (if supported), voice recording (if supported), and/or speech
        recognition analysis, provide the following types of prompts:
   
   -    Initial Prompt
   
   -    Reprompt
   
   -    Error prompt
   
   -    Failure announcement
   
   -    Success announcement.
   
   e.   Allow the specification of definable key sequences for digit
        recording (if supported) or speech recognition analysis to:
   
   -    Discard analysis in progress, replay the prompt, and resume
        analysis.
   
   -    Discard analysis in progress and resume analysis.
   
   -    Terminate the current operation and return the terminating key
        sequence to the MGC.
   
   f.   Provide a way to ask the ARF MG to support the following
        definable keys for speech recognition analysis. These keys would
        then be able to be acted upon by the ARF MG:
   
   -    A key to terminate playing of an announcement in progress.
   
   -    A key that signals the end of user input.  The key may or may
        not be returned to the MGC along with the input already
        collected.
   
   -    Keys to stop playing the current announcement and resume playing
        at the beginning of the first segment of the announcement, last
        segment of the announcement, previous segment of the
        announcement, next segment of the announcement, or the current
        announcement segment.
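
The three timers in item (c) (initial wait, trailing silence, and maximum length) interact as sketched below. This is an illustration under stated assumptions, not a definitive algorithm: the function name endpoint, the use of fixed-size frames, and the per-frame speech flags (standing in for the output of some voice activity detector) are all hypothetical. Times are in whole milliseconds to keep the comparisons exact.

```python
def endpoint(frames, frame_ms, initial_timeout_ms, end_silence_ms, max_length_ms):
    """Decide how an utterance ends.

    frames holds one boolean per frame of frame_ms milliseconds, True where
    speech was detected. Returns "no-speech", "complete", "max-length", or
    "incomplete" (the input ran out before any timer fired).
    """
    start = None        # index of the first speech frame
    last_speech = None  # index of the most recent speech frame
    for i, is_speech in enumerate(frames):
        if start is None:
            if is_speech:
                start = last_speech = i
            elif (i + 1) * frame_ms >= initial_timeout_ms:
                return "no-speech"  # user never started speaking in time
            continue
        if is_speech:
            last_speech = i
            # Length is measured from the first speech frame, so leading
            # silence does not count against the maximum.
            if (i - start + 1) * frame_ms >= max_length_ms:
                return "max-length"
        elif (i - last_speech) * frame_ms >= end_silence_ms:
            return "complete"  # enough trailing silence after last speech
    return "no-speech" if start is None else "incomplete"
```

The same three timers appear in the Record Audio and Speaker Verification/Identification modules, so a sketch of this kind would apply there with only the name of the operation changed.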

11.2.8.5. Speaker Verification/Identification Module

The speech verification/identification module returns parameters that indicate either the likelihood of the speaker to be the person that they claim to be (verification task) or the likelihood of the speaker being one of the persons contained in a set of previously characterized speakers (identification task).

The Speaker Verification/Identification Module must support all of the requirements in the Play Module in addition to the following requirements:

   a.   Be able to download parameters, such as speaker templates
        (verification task) or sets of potential speaker templates
        (identification task), either prior to the session or in mid-
        session.
   
   b.   Be able to download application specific software to the ARF
        either prior to the session or in mid-session.
   
   c.   Be able to return parameters indicating either the likelihood of
        the speaker to be the person that they claim to be (verification
        task) or the likelihood of the speaker being one of the persons
        contained in a set of previously characterized speakers
        (identification task).
   
   d.   Be able to provide the following basic operation: request an AEG
        to play an announcement and then perform speech
        verification/identification analysis.
   
   e.   Be able to specify these event collection characteristics: The
        number of attempts allowed to perform the speech
        verification/identification task.
   
   f.   With respect to speech verification/identification analysis
        timers, allow the specification of:
   
   -    Time to wait for the user to initially speak.
   
   -    The amount of silence necessary following the last speech
        segment for the speech verification/identification analysis
        segment to be considered complete.
   
   -    The maximum allowable length of the speech
        verification/identification analysis (not including pre- and
        post-speech silence).
   
   g.   To allow multiple prompt operations for DTMF digit collection
        (if supported), voice recording (if supported), speech
        recognition analysis (if supported), and/or speech
        verification/identification, provide the following types of
        prompts:
   
   -    Initial Prompt
   
   -    Reprompt
   
   -    Error prompt
   
   -    Failure announcement
   
   -    Success announcement.
   
   h.   Allow the specification of definable key sequences for digit
        recording (if supported) or speech recognition (if supported) in
        the speech verification/identification analysis to:
   
   -    Discard speech verification/identification analysis in
        progress, replay the prompt, and resume analysis.
   
   -    Discard speech verification/identification analysis in progress
        and resume analysis.
   
   -    Terminate the current operation and return the terminating key
        sequence to the MGC.
   
   i.   Provide a way to ask the ARF MG to support the following
        definable keys for speech verification/identification analysis.
        These keys would then be able to be acted upon by the ARF MG:
   
   -    A key to terminate playing of an announcement in progress.
   
   -    A key that signals the end of user input.  The key may or may
        not be returned to the MGC along with the input already
        collected.
   
   -    Keys to stop playing the current announcement and resume speech
        verification/identification at the beginning of the first
        segment of the announcement, last segment of the announcement,
        previous segment of the announcement, next segment of the
        announcement, or the current announcement segment.
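
The requirements above name five prompt types and a configurable number of attempts but do not fix the order in which the prompts are played. One plausible ordering is sketched below purely as an illustration. It assumes that the initial prompt is played once, that each failed attempt other than the last is followed by the error prompt and then the reprompt, and that the operation ends with either the success or the failure announcement. The function name prompt_sequence and the boolean attempt results are hypothetical.

```python
def prompt_sequence(max_attempts, attempt_results):
    """Return the ordered list of prompts played for a series of attempts.

    attempt_results is an iterable of booleans (True = valid input) standing
    in for the outcome of each collection, recording, or analysis attempt.
    Missing results are treated as failed attempts.
    """
    played = ["initial"]
    results = iter(attempt_results)
    for attempt in range(1, max_attempts + 1):
        if next(results, False):
            played.append("success")
            return played
        if attempt < max_attempts:
            # Assumed ordering: tell the user what went wrong, then ask again.
            played += ["error", "reprompt"]
    played.append("failure")
    return played
```

For example, with two attempts allowed and both failing, the sketch plays the initial prompt, the error prompt, the reprompt, and finally the failure announcement.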

11.2.8.6. Auditory Feature Extraction/Recognition Module

The auditory feature extraction/recognition module is engineered to continuously monitor the auditory stream for the appearance of particular auditory signals or speech utterances of interest and to report these events (and optionally a signal feature representation of these events) to network servers or MGCs.

The Auditory Feature Extraction/Recognition Module must support the following requirements:

   a.   Be able to download application specific software to the ARF
        either prior to the session or in mid-session.
   
   b.   Be able to download parameters, such as a representation of the
        auditory feature to extract/recognize, either prior to the
        session or in mid-session.
   
   c.   Be able to return parameters indicating the auditory event found
        or a representation of the feature found (i.e., auditory
        feature).

11.2.8.7. Audio Conferencing Module

The protocol must support:

   a.   a mechanism to create multi-point conferences of audio only and
        multimedia conferences in the MG.
   
   b.   audio mixing; mixing multiple audio streams into a new composite
        audio stream
   
   c.   audio switching; selection of incoming audio stream to be sent
        out to all conference participants.
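
The difference between audio mixing (item b) and audio switching (item c) can be shown on a single frame of samples. The sketch below is illustrative only and assumes 16-bit linear PCM held as lists of integers. Selecting the stream with the highest frame energy is an assumption made for this example; the requirements do not say how the switched stream is chosen. The function names mix and switch are hypothetical.

```python
INT16_MIN, INT16_MAX = -32768, 32767


def mix(streams):
    """Sum the streams sample by sample, clipping to the 16-bit range."""
    return [max(INT16_MIN, min(INT16_MAX, sum(samples)))
            for samples in zip(*streams)]


def switch(streams):
    """Return the index of the stream with the highest energy in this frame.

    Raises ValueError if streams is empty.
    """
    energies = [sum(s * s for s in stream) for stream in streams]
    return energies.index(max(energies))
```

Mixing produces a new composite stream that every participant hears, while switching forwards one of the existing streams unchanged.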

11.2.9. Multipoint Control Units

The protocol must support:

   a.   a mechanism to create multi-point conferences of audio only and
        multimedia conferences in the MG.
   
   b.   audio mixing; mixing multiple audio streams into a new composite
        audio stream
   
   c.   audio switching; selection of incoming audio stream to be sent
        out to all conference participants.
   
   d.   video switching; selection of video stream to be sent out to all
        conference participants
   
   e.   lecture video mode; a video selection option where one video
        source is sent out to all conference users
   
   f.   multi-point T.120 data conferencing.
   
   g.   The ability for the MG to function as an H.323 MP, and for the
        MGC to function as an H.323 MC, connected by this protocol
        (MEGACOP/H.248).  It should be possible for audio, data, and
        video MG/MPs to be physically separate while being under the
        control of a single MGC/H.323 MC.

12. References

   [1]  Bradner, S., "Key words for use in RFCs to Indicate Requirement
        Levels", BCP 14, RFC 2119, March 1997.
   
   [2]  ITU-T Recommendation Q.2630.1, AAL type 2 Signalling Protocol
        (Capability Set 1), December 1999.
   
   [3]  ITU-T Recommendation H.341, Line Transmission of Non-Telephone
        Signals, May 1999.
   
   [4]  ATM Forum Technical Committee, af-vtoa-0083.001, Voice and
        Telephony Over ATM to the Desktop Specification, March 1999.
   
   [5]  ITU-T Recommendation H.323v3, Packet-based Multimedia
        Communications Systems (includes Annex C - H.323 on ATM),
        September 1999.
   
   [6]  ATM Forum Technical Committee, af-saa-0124.000, Gateway for
        H.323 Media Transport Over ATM, May 1999.
   
   [7]  ITU-T Recommendation T.140, Protocol for Multimedia Application
        Text Conversation, February 1998.
   
   [8]  ITU-T Recommendation V.18, Operational and Interworking
        Requirements for DCEs Operating in Text Telephone Mode, February
        1998.
   
   [9]  ITU-T Recommendation Q.931, Digital Subscriber Signalling System
        No. 1 (DSS 1) - ISDN User - Network Interface Layer 3
        Specification for Basic Call Control, May 1998.

14. Acknowledgements

The authors would like to acknowledge the many contributors who debated the Media Gateway Control Architecture and Requirements on the IETF Megaco and Sigtran mailing lists. Contributions to this document have also been made through internet-drafts and discussion with members of ETSI Tiphon, ITU-T SG16, TIA TR41.3.4, the ATM Forum, and the Multiservice Switching Forum.

15. Authors' Addresses

Nancy Greene
Nortel Networks
P.O. Box 3511 Stn C
Ottawa, ON, Canada K1Y 4H7

   Phone: (514) 271-7221
   EMail: ngreene@nortelnetworks.com

Michael A. Ramalho
Cisco Systems
1802 Rue de la Port
Wall Township, NJ

   Phone: +1.732.449.5762
   EMail: mramalho@cisco.com

Brian Rosen
Marconi
1000 FORE Drive, Warrendale, PA 15086

   Phone: (724) 742-6826
   EMail: brosen@eng.fore.com

16. Full Copyright Statement

Copyright © The Internet Society (2000). All Rights Reserved.

This document and translations of it may be copied and furnished to others, and derivative works that comment on or otherwise explain it or assist in its implementation may be prepared, copied, published and distributed, in whole or in part, without restriction of any kind, provided that the above copyright notice and this paragraph are included on all such copies and derivative works. However, this document itself may not be modified in any way, such as by removing the copyright notice or references to the Internet Society or other Internet organizations, except as needed for the purpose of developing Internet standards in which case the procedures for copyrights defined in the Internet Standards process must be followed, or as required to translate it into languages other than English.

The limited permissions granted above are perpetual and will not be revoked by the Internet Society or its successors or assigns.

This document and the information contained herein is provided on an "AS IS" basis and THE INTERNET SOCIETY AND THE INTERNET ENGINEERING TASK FORCE DISCLAIMS ALL WARRANTIES, EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO ANY WARRANTY THAT THE USE OF THE INFORMATION HEREIN WILL NOT INFRINGE ANY RIGHTS OR ANY IMPLIED WARRANTIES OF MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE.

Acknowledgement

Funding for the RFC Editor function is currently provided by the Internet Society.