Internet-Draft Transport and Services Working Group July 2024
Chen & Li Expires 9 January 2025 [Page]
Workgroup:
Internet Research Task Force
Internet-Draft:
draft-chen-tsvwg-high-speed-data-express-00
Published:
Intended Status:
Informational
Expires:
Authors:
D. Chen
China Mobile
Z. Li
China Mobile

High speed data express in IP: Concept, Reference Architecture and Technologies

Abstract

With the rapid development of AI technology, intelligent computing and super-computing services, the demand for wide-area transmission of massive data is increasing. This paper makes full use of the advantages of IP network, such as statistical reuse and elastic supply, and proposes a reference architecture of high elasticity and high throughput data express based on IP network. In order to support differentiated data express services and create task-type data express services, the "one low and three high" technical system is proposed.

Requirements Language

The key words "MUST", "MUST NOT", "REQUIRED", "SHALL", "SHALL NOT", "SHOULD", "SHOULD NOT", "RECOMMENDED", "MAY", and "OPTIONAL" in this document are to be interpreted as described in RFC 2119 [RFC2119].

Status of This Memo

This Internet-Draft is submitted in full conformance with the provisions of BCP 78 and BCP 79.

Internet-Drafts are working documents of the Internet Engineering Task Force (IETF). Note that other groups may also distribute working documents as Internet-Drafts. The list of current Internet-Drafts is at https://datatracker.ietf.org/drafts/current/.

Internet-Drafts are draft documents valid for a maximum of six months and may be updated, replaced, or obsoleted by other documents at any time. It is inappropriate to use Internet-Drafts as reference material or to cite them other than as "work in progress."

This Internet-Draft will expire on 9 January 2025.

Table of Contents

1. Introduction

With the rapid development of AI technology, intelligent computing, and super-computing services, the demand for wide-area transmission of massive data is increasing. How to transfer a large amount of data quickly, efficiently and conveniently has become an important problem that needs to be solved in the development of communication networks. With the network bandwidth from 10G to 25G, 100G, 200G and even 400G, bandwidth is not the bottleneck of data transmission, the key to data transmission is how to make full use of large bandwidth to achieve efficient throughput.

In most cases, the effective throughput is less than the throughput, which in turn is less than the bandwidth. Bandwidth refers to the number of bits that can be transmitted per second on the link, which depends on the link clock rate and channel coding, also known as line speed (network device transmission capacity). Throughput refers to the actual amount of data transmitted on the bandwidth (terminal device transmission capacity). Goodput refers to the throughput that truly meets service requirements, excluding invalid packets such as retransmission and discarding (service data transmission capability)..

In-depth analysis shows that the data transmission throughput performance is mainly affected by five factors: transmission distance, network packet loss rate, effective data length , sending window and the impact factor. As the transmission distance increases, RTT (round trip delay) increases. In reliable transmission mode, the reply time of ACK acknowledgement messages becomes longer, and the sending sliding window cannot be adjusted in time. Network packet loss rate will affect the rate adjustment of congestion control algorithm, and increase the tail delay of data transmission. Effective data length refers to the effective data length of the packet header removed, which can be regarded as MSS. Increasing MSS will help improve throughput. The sending window affects the duration of continuous data sending. If the window is too small, data will be sent before the ACK reply is completed, and sending will be paused. The impact factor F is a comprehensive parameter, such as the CPU frequency, PCIE rate, disk I/O rate, and bandwidth that affect the packet processing rate, 0<F<1, if these have no impact, the value of F is 1. In summary, we can draw the following conclusion: in addition to improving the network environment (improving the physical bandwidth, etc.), the end-to-end data sending and receiving mechanism and transmission protocol are also a key part of the effective throughput.

In order to improve the data transmission efficiency and reduce the data transmission cost as much as possible, we propose to build a highly elastic and high-throughput data transmission network based on IP network, and use elastic bandwidth, load balancing, security encryption and other technologies to realize new data express services.

2. Reference Architecture

The data express solution enables massive data transmission from the data source to the destination. Data sources are devices or systems that generate or store large amounts of data, such as enterprise servers, databases, and storage devices. Data sources can be multiple nodes distributed across different geographies and network environments, or they can be a single node concentrated in one location. Data destination refers to the device or system that receives or processes a large amount of data, such as cloud computing platform, intelligent computing center, and supercomputing center. The data destination can be a public service provided by a third party or a cloud data center built by a data express operator.

Based on the massive transmission requirements from the data source to the destination end, the overall architecture of the data express solution is designed according to the hierarchical idea, which mainly includes three parts: cloud private network infrastructure, network controller and data express operation platform, as shown in

                         +--------------------+
                         |                    |
                         |    Data Express    |
          Task  -------> | Operation Platform |
        Delivery         |                    |
                         +---------+----------+
                                   |
                        +----------+-----------+
                        |                      |
        +---------------+  Network Controller  +---------+
        |               |                      |         |
        |               +-----------+----------+         |
        |                           |                    |
        |                           |                    |
+-------+---------------------------+--------------------+----------+
|                                                                   |
| +-----------+                                       +-----------+ |
| |Data Source|                                       |  Super    | |
| |  Memory   |                                +------+ Computing | |
| | card/disk |                Wide Area       |      |  Center   | |
| +-----+-----+     +------+Interconnection +--+--+   +-----------+ |
|       |           |Cloud +----------------+Cloud|                 |
|       |           |  PE  |                | PE  |                 |
|       |           +---+--+                +--+--+   +-----------+ |
| +-----v------+        |                      |      |  General  | |
| |Data Express|        |                      +------+ Computing | |
| |  Station   +--------+                             |   Center  | |
| +------------+                                      +-----------+ |
|                                                                   |
|                      Infrastructure                               |
+-------------------------------------------------------------------+

3. "One low, three high" technology system

TBD

4. Summary

TBD

5. Security Considerations

TBD.

6. IANA Considerations

This document has no requests to IANA.

7. Normative References

[RFC2119]
Bradner, S., "Key words for use in RFCs to Indicate Requirement Levels", BCP 14, RFC 2119, DOI 10.17487/RFC2119, , <https://www.rfc-editor.org/info/rfc2119>.

Authors' Addresses

Danyang Chen
China Mobile
Beijing
100053
China
Zhiqiang Li
China Mobile
Beijing
100053
China