Computer Science Seminar: ‘TENSOR: Lightweight BGP Non-Stop Routing,’ Featuring Yunming Xiao

Yunming Xiao of Northwestern University will deliver a talk, “TENSOR: Lightweight BGP Non-Stop Routing,” at 12:45–1:45 p.m. Tuesday, October 31, 2023, at Stuart Building, room 111.

Abstract

As the solitary inter-domain protocol, BGP plays an important role in today’s Internet. Its failures threaten network stability and will usually result in large-scale packet losses. Thus, the non-stop routing (NSR) capability that protects inter-domain connectivity from being disrupted by various failures, is critical to any Autonomous System (AS) operator. Replicating the BGP and underlying TCP connection status is key to realizing NSR. But existing NSR solutions, which heavily rely on OS kernel modifications, have become impractical due to providers’ adoption of virtualized network gateways for better scalability and manageability. We tackle this problem by proposing TENSOR, which incorporates a novel kernel-modification-free replication design and lightweight architecture. More concretely, the kernel-modification-free replication design mitigates the reliance on OS kernel modification and hence allows the virtualization of the network gateway. Meanwhile, lightweight virtualization provides strong performance guarantees and improves system reliability. Moreover, TENSOR provides a solution to the split-brain problem that affects NSR solutions. Through extensive experiments, we show that TENSOR realizes NSR while bearing little overhead compared to open-source BGP implementations. Further, our two-year operational experience on a fleet of 400 servers controlling over 31,000 BGP peering connections at Tencent demonstrates that TENSOR reduces the development, deployment, and maintenance costs significantly – at least by factors of 20, 5, and 10, respectively, while retaining the same SLA with the NSR-enabled routers.

Bio

Yunming Xiao is a final year Ph.D. candidate at Northwestern University, advised by Professor Aleksandar Kuzmanovic. He has interned at multiple companies such as HPE Labs, Google, Nokia Bell Labs, ByteDance, solving various research or software engineering problems. He also has a long-term collaboration with Tencent. Earlier, he obtained his B.Eng. from Beijing University of Posts and Telecommunications, China. He has published papers at networking venues such as ACM SIGCOMM, SIGMETRICS, CoNEXT, WWW, etc. His research interests lie in two primary directions: (i) enhancing the security and privacy measures of Internet services, and (ii) advancing the reliability of data center networks, aiming to minimize downtime.