4 Restoration Procedures related to the N4 Interface

23.5273GPP5G SystemRelease 18Restoration proceduresTS

4.1 General

This clause specifies the procedures supported in the 5G System to detect and handle failures affecting the N4 interface.

4.2 N4 Failure and Restart Detection

Across PFCP based interfaces, an SMF and UPF shall utilize the PFCP Heartbeat Request and Heartbeat Response messages to detect a peer PFCP entity failure or restart as described in clause 19A of 3GPP TS 23.007 [2].

A PFCP function shall ignore the Recovery Timestamp received in PFCP Association Setup Request and PFCP Association Setup Response messages (see clause 6.2.6 of 3GPP TS 29.244 [4]).

4.3 UPF Restoration Procedures

4.3.1 General

When a UPF fails, all its Session contexts and PFCP associations affected by the failure become invalid and may be deleted.

4.3.2 Restoration Procedure for PSA UPF Restart

If F-TEID and/or UE IP address allocation is performed in the UPF, the UPF shall ensure that previously used F-TEID values and/or UE IP addresses are not immediately reused after a UPF restart, in order to avoid inconsistent F-TEID and/or UE IP address allocation throughout the network and to enable the restoration of PFCP sessions affected by the failure. How this is ensured is implementation specific.

The UPF shall not send GTP-U Error indication message for a configurable period after an UPF restart when the UPF receives a G-PDU not matching any PDRs.

During or immediately after an UPF Restart, the UPF shall place a local UPF Recovery Time Stamp value in all Heartbeat Request/Response messages.

Immediately after the re-establishment of a PFCP association between the SMF and the UPF, the SMF may start restoring PFCP sessions in the UPF.

The SMF should prioritize the PFCP sessions to restore based on operator’s policy.

The SMF should control the load induced on the UPF when performing the PFCP restoration procedures, the way it is done is implementation specific.

When re-establishing a PFCP session and if F-TEID allocation and/or UE IP address is performed in the PSA UPF by network configuration, the SMF shall include a restoration indication in the PFCP Session Establishment Request message to indicate to the UPF it is for a restoration of an existing PFCP session and the UPF shall accept SMF allocated F-TEID and/or UE IP address if possible. If the UPF cannot accept the requested F-TEID and/or UE IP address because the IP address is no longer available at the UPF, the UPF shall reject the PFCP Session Establishment Request with the cause "PFCP session restoration failure due to requested resource not available".

NOTE: The cause "PFCP session restoration failure due to requested resource not available" corresponds to scenarios where the requested address is no longer available at the UPF, i.e. it cannot be assigned to any PFCP session, due to e.g. the address having been decommissioned by OAM from the UPF or e.g. the address being no longer operational due to a partial hardware failure at the UPF. This cause is not to be used for scenarios where the requested address would be available at the UPF but would have been re-assigned to a different PFCP session, which is a scenario that is not expected to happen based on the requirements specified above.

4.3.3 Restoration Procedure for PSA UPF Failure without Restart

Procedures for PSA UPF failure without restart are implementation specific.

4.3.4 Restoration Procedure for Intermediate UPF Restart

The SMF will receive the UPF recovery time stamps in PFCP heartbeat requests/responses.

After an Intermediate UPF restart, the PFCP association between the SMF(s) and the Intermediate UPF has to be re-established.

The restoration of the PFCP sessions may start immediately after the PFCP association setup procedure:

– if the restoration is supported in the SMF on a proactive basis, the SMF may start re-establishing PFCP sessions matching any PDRs.

– as defined in clause 4.3.2, the SMF should prioritize the PFCP sessions restoration.

– if the restoration is supported in the SMF on a reactive basis:

– the SMF shall establish an PFCP session with a wildcarded PDR to instruct the Intermediate UPF to forward G-PDU packets which are not matching any other PDRs to the SMF (to a F-TEID uniquely assigned in the SMF for this PFCP-u tunnel);

– upon receipt of G-PDUs from this PFCP-u tunnel, the SMF shall then check if it has an active session for each received G-PDU packet:

– if so, the SMF shall perform the PFCP Session establishment procedures to re-establish the corresponding PFCP sessions in the Intermediate UPF;

– otherwise the SMF shall generate a GTP-U Error Indication with a destination address set to the source IP address of the received G-PDU, and send it to the Intermediate UPF. The Intermediate UPF shall forward this GTP-U Error Indication transparently. The SMF shall delete the G-PDU after the check for active sessions.

NOTE 1: The UPF can filter the G-PDU packets with same target F-TEID and send only one such G-PDU to the Intermediate SMF.

When re-establishing a PFCP session and if F-TEID allocation is performed in the Intermediate UPF by network configuration, the SMF shall include a restoration indication in the PFCP Session Establishment Request message to indicate to the UPF it is for a restoration of an existing PFCP session and the UPF shall accept SMF allocated F-TEID if possible. If the UPF cannot accept the requested F-TEID because the IP address is no longer available at the UPF, the UPF shall reject the PFCP Session Establishment Request with the cause "PFCP session restoration failure due to requested resource not available".

The Intermediate UPF shall not send any Error indication messages for an operator configurable period after an Intermediate UPF restart when the Intermediate UPF receives G-PDU not matching any PDRs.

NOTE 2: If restoration on a reactive basis is used, the period needs to be longer than the time required by the SMF to detect the UPF restart, to establish the PFCP association and provision the wildcarded PDR. Otherwise, the period needs to be longer than the time required by the SMF to restore all the PFCP sessions on a proactive basis.

NOTE 3: The cause "PFCP session restoration failure due to requested resource not available" corresponds to scenarios where the requested address is no longer available at the UPF, i.e. it cannot be assigned to any PFCP session, due to e.g. the address having been decommissioned by OAM from the UPF or e.g. the address being no longer operational due to a partial hardware failure at the UPF. This cause is not to be used for scenarios where the requested address would be available at the UPF but would have been re-assigned to a different PFCP session, which is a scenario that is not expected to happen based on the requirements specified above.

4.3.5 Restoration Procedure for Intermediate UPF Failure without Restart

Procedures for Intermediate UPF failure without restart are implementation specific.

4.4 SMF Restoration Procedures

4.4.1 General

When a SMF fails, all its PDU session contexts and PFCP associations affected by the failure may become invalid and may be deleted.

If F-TEID allocation is performed in the SMF, the SMF should ensure as far as possible that previously used F-TEID values are not immediately reused after a SMF restart, in order to avoid inconsistent TEID allocation throughout the network.

NOTE: This is to ensure that F-TEIDs are not reused until earlier PDU sessions using them are released.

4.4.2 Restoration Procedure for SMF Restart

During or immediately after a SMF Restart, the SMF shall place local SMF-C Recovery Time Stamp value in all Heartbeat Request/Response messages.

The UPF will receive the SMF recovery time stamps in PFCP heartbeat requests/responses.

When a UPF detects that a peer PFCP entity in the SMF has restarted (as specified in clause 4.2), the UPF shall delete all session contexts affected by the PFCP entity restart that it may have stored. When the UPF receives a GTP‑U PDU not matching any PDRs, it shall discard the GTP‑U PDU and return a GTP error indication to the originating node (e.g. other UPF, gNB or N3IWF).

4.4.3 Restoration Procedure for SMF Failure without Restart

When a UPF detects that a peer PFCP entity in the SMF is not reachable for a preconfigured time, the UPF shall delete all the session contexts affected by the peer PFCP entity failure that it may have stored.

4.5 N4 path failure

If the N4 path to the UPF is down, the SMF should handle this as an UPF Failure without Restart, see clause 4.3.3.

If the N4 path to the SMF is down, the UPF should handle this as a SMF Failure without Restart, see clause 4.4.3.

4.6 Partial failure handling over N4

4.6.1 General

The SMF and UPF may support the partial failure handling feature over N4. If so, they shall support the requirements specified in this clause.

This feature enables to clean up PFCP sessions optimally in the peer PFCP node (i.e. SMF or UPF), when a hardware or software failure affects a significant number of PFCP sessions. When it is not possible to recover the affected PFCP sessions, it is useful to inform the peer PFCP node about the affected PFCP sessions using an identifier that represents a large set of PFCP sessions, rather than doing so per PFCP session.

A Connection Set Identifier (CSID) shall identify a set of PFCP sessions within a PFCP node that may belong to an arbitrary number of UEs. A CSID is an opaque parameter local to a PFCP node. Each PFCP node that supports the feature shall maintain a local mapping of CSID to its internal resources. When one or more of these resources fail, the corresponding one or more fully qualified CSIDs shall be signalled to the peer PFCP nodes.

The fully qualified CSID (FQ-CSID) is the combination of the PFCP node identity and the CSID assigned by the PFCP node which together globally identifies a set of PFCP sessions.

The node identifier shall be globally unique across all 3GPP EPS and 5GS networks. Its format is defined in 3GPP TS 29.244 [4].

4.6.2 Procedures

The SMF and the UPF may each assign one FQ-CSID to a PFCP session during the PFCP session establishment, by signalling the SMF FQ-CSID in the PFCP Session Establishment Request or the UPF FQ-CSID in the PFCP Session Establishment Response, as shown in Figure 4.6.2-1.

Figure 4.6.2-1 : PFCP Session Establishment procedure with FQ-CSID

The SMF and the UPF shall assign only one FQ-CSID for a given PFCP session and each FQ-CSID shall have exactly one CSID within the FQ-CSID. The peer PFCP node shall store the received FQ-CSID for the related PFCP session.

The receipt of a FQ-CSID also implicitly indicates that the peer FFCP node supports the feature.

The SMF and the UPF may update the FQ-CSID associated to a PFCP session during a PFCP session modification procedure, by signalling the SMF FQ-CSID in the PFCP Session Modification Request or the UPF FQ-CSID in the PFCP Session Modification Response, as shown in Figure 4.6.2-2.

Figure 4.6.2-2: PFCP Session Modification procedure with change of FQ-CSID

An SMF that detects then that it has undergone a partial failure shall send a PFCP Session Set Deletion Request, including all its CSIDs corresponding to the component(s) that failed, to the UPF, if the UPF is known to support the feature, as shown in Figure 4.6.2-3.

Figure 4.6.2-3: SMF initiated PFCP Session Set Deletion procedure

Likewise, an UPF that detects then that it has undergone a partial failure shall send a PFCP Session Set Deletion Request, including all its CSIDs corresponding to the component(s) that failed, to the SMF, if the SMF is known to support the feature, as shown in Figure 4.6.2-4.

Figure 4.6.2-4: UPF initiated PFCP Session Set Deletion procedure

Upon receiving a PFCP Session Set Deletion Request reporting a partial failure for one or more FQ-CSID(s), the SMF or UPF shall find the PFCP sessions matching the FQ-CSID(s) and then proceed with the restoration procedures specified in clause 4 for an SMF or UPF failure.

4.7 Restoration of PFCP sessions associated with a specific FQ-CISD, Group ID or SMF IP Address

4.7.1 General

To reduce signalling latency and achieve a better load balancing among SMFs in a SMF Set when it is deployed, an SMF in a SMF Set and a UPF may support the procedures specified in this clause. These procedures enable an SMF from the same set to request UPF to move PFCP sessions associated with certain FQ-CSIDs (when partial failure handling is supported as specified in clause 4.6), Group IDs or SMF IP addresses, to (another) SMF(s) in the set proactively, without causing massing signalling (per PFCP session) towards UPF(s).

NOTE: The FQ-CSID can only be used in the procedure specified in this clause when the partial failure feature (using FQ-CSID) is deployed and used (not to force the NF to use a Group ID). For a network where the partial failure feature is not deployed, a Group ID or a SMF IP address needs to be used.

4.7.2 Allocation of Group Id or FQ-CSID to a PFCP session

To optimize the resource utilization for PFCP session(s), e.g. to meet different traffic requirements for different APNs/DNNs and/or DCNs/Network Slices, and also to facilitate moving a (sub)set of PDN connections/PFCP sessions among an SMF set, e.g. for a partial or complete SMF failure or a scale-in operation, an SMF in a SMF Set may:

– allocate a globally unique Group Id in the PFCP Session Establishment Request message for a PFCP session during a PFCP session establishment procedure and update the Group Id, if necessary, in subsequent PFCP Session Modification Request messages (see also clause 5.22.X of 3GPP TS 29.244 [43]).

Alternatively, if partial failure handling is supported, the SMF may assign an FQ-CSID to a PFCP session as specified in clause 4.6.

Subsequently, e.g. when a partial or complete failure takes place affecting all PFCP sessions which are sharing either the same FQ-CSID or the Group ID (see clause 4.7.1), a PGW-C/SMF in a PGW-C/SMF Set may trigger the restoration procedure for those affected PFCP sessions as specified in clauses 4.7.3.

4.7.3 Restoration of PFCP sessions associated with an FQ-CSID, Group ID or SMF IP Address

When there is a need to change the SMF controlling certain PFCP sessions, e.g. when a partial or complete failure takes place, the SMF (either the SMF serving the PFCP sessions or another SMF in the same Set taking over the control of the PFCP sessions) may send a PFCP Session Set Modification Request message to the UPF(s) to request the UPF(s) to send subsequent PFCP Session Report Request messages to the alternative SMF (as indicated in the Alternative SMF IP Address IE) for the PFCP sessions which are associated with the FQ-CSID(s) or Group ID(s), or which have their CP F-SEIDs containing the SMF IP Address as shown in Figure 4.7.3-1.

The SMF may instruct the UPF to move sessions associated with different SMF FQ-CSIDs, Group Ids or SMF IP addresses, to different SMF addresses.

Figure 4.7.3-1: SMF initiated PFCP Session Set Modification procedure