LoongServe: Efficiently Serving Long-Context Large Language Models with Elastic Sequence Parallelism
Bingyang Wu, Shengyu Liu, Yinmin Zhong, Peng Sun, Xuanzhe Liu, Xin Jin
ACM SOSP, November 2024
[PDF]
DistServe: Disaggregating Prefill and Decoding for Goodput-optimized Large Language Model Serving
Yinmin Zhong, Shengyu Liu, Junda Chen, Jianbo Hu, Yibo Zhu, Xuanzhe Liu, Xin Jin, Hao Zhang
USENIX OSDI, July 2024
[PDF]
dLoRA: Dynamically Orchestrating Requests and Adapters for LoRA LLM Serving
Bingyang Wu, Ruidong Zhu, Zili Zhang, Peng Sun, Xuanzhe Liu, Xin Jin
USENIX OSDI, July 2024
[PDF]
Burstable Cloud Block Storage with Data Processing Units
Junyi Shu, Kun Qian, Ennan Zhai, Xuanzhe Liu, Xin Jin
USENIX OSDI, July 2024
[PDF]
SoCFlow: Efficient and Scalable DNN Training on SoC-Clustered Edge Servers
Daliang Xu, Mengwei Xu, Chiheng Lou, Li Zhang, Gang Huang, Xin Jin, Xuanzhe Liu
ACM ASPLOS, April 2024
[PDF]
Jolteon: Unleashing the Promise of Serverless for Serverless Workflows
Zili Zhang, Chao Jin, Xin Jin
USENIX NSDI, April 2024
[PDF]
Fast Vector Query Processing for Large Datasets Beyond GPU Memory with Reordered Pipelining
Zili Zhang, Fangyue Liu, Gang Huang, Xuanzhe Liu, Xin Jin
USENIX NSDI, April 2024
[PDF]
Scaling Large Language Model Training to More Than 10,000 GPUs
Ziheng Jiang, Haibin Lin, Yinmin Zhong, Qi Huang, Yangrui Chen, Zhi Zhang, Yanghua Peng, Xiang Li, Cong Xie, Shibiao Nong, Yulu Jia, Sun He, Hongmin Chen, Zhihao Bai, Qi Hou, Shipeng Yan, Ding Zhou, Yiyao Sheng, Zhuo Jiang, Haohan Xu, Haoran Wei, Zhang Zhang, Pengfei Nie, Leqi Zou, Sida Zhao, Liang Xiang, Zherui Liu, Zhe Li, Xiaoying Jia, Jiaxin Ye, Xin Jin, Xin Liu
USENIX NSDI, April 2024
[PDF]
DistMind: Efficient Resource Disaggregation for Deep Learning Workloads
Xin Jin, Zhihao Bai, Zhen Zhang, Yibo Zhu, Yinmin Zhong, Xuanzhe Liu
IEEE/ACM Transactions on Networking, January 2024
[PDF]
SDCC: Software-defined Collective Communication for Distributed Training
Xin Jin, Zhen Zhang, Yunshan Jia, Yun Ma, Xuanzhe Liu
SCIENCE CHINA Information Sciences , November 2023
[PDF]
Halfmoon: Log-Optimal Fault-Tolerant Stateful Serverless Computing
Sheng Qi, Xuanzhe Liu, Xin Jin
ACM SOSP, October 2023
[PDF]
Automated Verification of an In-Production DNS Authoritative Engine
Naiqian Zheng, Mengqi Liu, Yuxing Xiang, Linjian Song, Dong Li, Feng Han, Nan Wang, Yong Ma, Zhuo Liang, Dennis Cai, Ennan Zhai, Xuanzhe Liu, Xin Jin
ACM SOSP, October 2023
[PDF]
Oobleck: Resilient Distributed Training of Large Models Using Pipeline Templates
Insu Jang, Zhenning Yang, Zhen Zhang, Xin Jin, Mosharaf Chowdhury
ACM SOSP, October 2023
[PDF]
Rise of Distributed Deep Learning Training in the Big Model Era: From a Software Engineering Perspective
Xuanzhe Liu, Diandian Gu, Zhenpeng Chen, Jinfeng Wen, Zili Zhang, Yun Ma, Haoyu Wang, Xin Jin
ACM TOSEM, September 2023
[PDF]
Ditto: Efficient Serverless Analytics with Elastic Parallelism
Chao Jin, Zili Zhang, Xingyu Xiang, Songyun Zou, Gang Huang, Xuanzhe Liu, Xin Jin
ACM SIGCOMM, September 2023
[PDF]
Klotski: Efficient and Safe Network Migration of Large Production Datacenters
Yihao Zhao, Xiaoxiang Zhang, Hang Zhu, Ying Zhang, Zhaodong Wang, Yuandong Tian, Alex Nikulkov, Joao Ferreira, Xuanzhe Liu, Xin Jin
ACM SIGCOMM, September 2023
[PDF]
Understanding the Micro-Behaviors of Hardware Offloaded Network Stacks with Lumina
Zhuolong Yu, Bowen Su, Wei Bai, Shachar Raindel, Vladimir Braverman, Xin Jin
ACM SIGCOMM, September 2023
[PDF]
XRON: A Hybrid Elastic Cloud Overlay Network for Video Conferencing at Planetary Scale
Bingyang Wu, Kun Qian, Bo Li, Yunfei Ma, Qi Zhang, Zhigang Jiang, Jiayu Zhao, Dennis Cai, Ennan Zhai, Xuanzhe Liu, Xin Jin
ACM SIGCOMM, September 2023
[PDF]
FaaSLight: General Application-level Cold-start Latency Optimization for Function-as-a-Service in Serverless Computing
Xuanzhe Liu, Jinfeng Wen, Zhenpeng Chen, Ding Li, Junkai Chen, Yi Liu, Haoyu Wang, Xin Jin
ACM TOSEM, July 2023
[PDF]
Rise of the Planet of Serverless Computing: A Systematic Review
Jinfeng Wen, Zhenpeng Chen, Xin Jin, Xuanzhe Liu
ACM TOSEM, July 2023
[PDF]
AlpaServe: Statistical Multiplexing with Model Parallelism for Deep Learning Serving
Zhuohan Li, Lianmin Zheng, Yinmin Zhong, Vincent Liu, Ying Sheng, Xin Jin, Yanping Huang, Zhifeng Chen, Hao Zhang, Joseph E. Gonzalez, Ion Stoica
USENIX OSDI, July 2023
[PDF]
Scalable and Efficient Full-Graph GNN Training for Large Graphs
Xinchen Wan, Kaiqiang Xu, Xudong Liao, Yilun Jin, Kai Chen, Xin Jin
ACM SIGMOD, June 2023
[PDF]
Transparent GPU Sharing in Container Clouds for Deep Learning Workloads
Bingyang Wu, Zili Zhang, Zhihao Bai, Xuanzhe Liu, Xin Jin
USENIX NSDI, April 2023
[PDF]
Fast, Approximate Vector Queries on Very Large Unstructured Datasets
Zili Zhang, Chao Jin, Linpeng Tang, Xuanzhe Liu, Xin Jin
USENIX NSDI, April 2023
[PDF]
Disaggregated RAID Storage in Modern Datacenters
Junyi Shu, Ruidong Zhu, Yun Ma, Gang Huang, Hong Mei, Xuanzhe Liu, Xin Jin
ACM ASPLOS, March 2023
[PDF]
ElasticFlow: An Elastic Serverless Training Platform for Distributed Deep Learning
Diandian Gu, Yihao Zhao, Yinmin Zhong, Yifan Xiong, Zhenhua Han, Peng Cheng, Fan Yang, Gang Huang, Xin Jin, Xuanzhe Liu
ACM ASPLOS, March 2023
[PDF]
Mandheling: Mixed-Precision On-Device DNN Training with DSP Offloading
Daliang Xu, Mengwei Xu, Qipeng Wang, Shangguang Wang, Yun Ma, Kang Huang, Gang Huang, Xin Jin, Xuanzhe Liu
ACM MobiCom, October 2022
[PDF]
MiCS: Near-linear Scaling for Training Gigantic Model on Public Cloud
Zhen Zhang, Shuai Zheng, Yida Wang, Justin Chiu, George Karypis, Trishul Chilimbi, Mu Li, Xin Jin
VLDB, October 2022
[PDF]
Optimizing Half Precision Winograd Convolution on ARM Many-Core Processors
Dedong Xie, Zhen Jia, Zili Zhang, Xin Jin
ACM APSys, August 2022
[PDF]
Multi-Resource Interleaving for Deep Learning Training
Yihao Zhao, Yuanqiang Liu, Yanghua Peng, Yibo Zhu, Xuanzhe Liu, Xin Jin
ACM SIGCOMM, August 2022
[PDF]
Meissa: Scalable Network Testing for Programmable Data Planes
Naiqian Zheng, Mengqi Liu, Ennan Zhai, Hongqiang Harry Liu, Yifan Li, Kaicheng Yang, Xuanzhe Liu, Xin Jin
ACM SIGCOMM, August 2022
[PDF]
Melon: Breaking the Memory Wall for Resource-Efficient On-Device Machine Learning
Qipeng Wang, Mengwei Xu, Chao Jin, Xinran Dong, Jinliang Yuan, Xin Jin, Gang Huang, Yunxin Liu, Xuanzhe Liu
ACM MobiSys, June 2022
[PDF]
Multi-Objective Congestion Control
Yiqing Ma, Han Tian, Xudong Liao, Junxue Zhang, Weiyan Wang, Kai Chen, Xin Jin
European Conference on Computer Systems (EuroSys), April 2022
[PDF]
NetVRM: Virtual Register Memory for Programmable Networks
Hang Zhu, Tao Wang, Yi Hong, Dan R.K. Ports, Anirudh Sivaraman, Xin Jin
USENIX NSDI, April 2022
[PDF]
An Empirical Study on Challenges of Application Development in Serverless Computing
Jinfeng Wen, Zhenpeng Chen, Yi Liu, Yiling Lou, Yun Ma, Gang Huang, Xin Jin, Xuanzhe Liu
ACM ESEC/FSE, August 2021
[PDF]
Programmable Packet Scheduling with a Single Queue
Zhuolong Yu, Chuheng Hu, Jingfeng Wu, Xiao Sun, Vladimir Braverman, Mosharaf Chowdhury, Zhenhua Liu, Xin Jin
ACM SIGCOMM, August 2021
[PDF]
Network Planning with Deep Reinforcement Learning
Hang Zhu, Varun Gupta, Satyajeet Singh Ahuja, Yuandong Tian, Ying Zhang, Xin Jin
ACM SIGCOMM, August 2021
[PDF]
Jaqen: A High-Performance Switch-Native Approach for Detecting and Mitigating Volumetric DDoS Attacks with Programmable Switches
Zaoxing Liu, Hun Namkung, Georgios Nikolaidis, Jeongkeun Lee, Changhoon Kim, Xin Jin, Vladimir Braverman, Minlan Yu, Vyas Sekar
USENIX Security, August 2021
[PDF]
Runtime Recovery of Web Applications under Zero-Day ReDoS Attacks
Zhihao Bai, Ke Wang, Hang Zhu, Yinzhi Cao, Xin Jin
IEEE Symposium on Security and Privacy (S&P), May 2021
[PDF]
Enabling Edge-Cloud Video Analytics for Robotic Applications
Yiding Wang, Weiyan Wang, Duowen Liu, Xin Jin, Junchen Jiang, Kai Chen
IEEE International Conference on Computer Communications (INFOCOM), May 2021
[PDF]
Twenty Years After: Hierarchical Core-Stateless Fair Queueing
Zhuolong Yu, Jingfeng Wu, Vladimir Braverman, Ion Stoica, Xin Jin
USENIX NSDI, April 2021
[PDF]
Ship Compute or Ship Data? Why Not Both?
Jie You, Jingfeng Wu, Xin Jin, Mosharaf Chowdhury
USENIX NSDI, April 2021
[PDF]
PipeSwitch: Fast Pipelined Context Switching for Deep Learning Applications
Zhihao Bai, Zhen Zhang, Yibo Zhu, Xin Jin
USENIX OSDI, November 2020
[PDF]
RackSched: A Microsecond-Scale Scheduler for Rack-Scale Computers
Hang Zhu, Kostis Kaffes, Zixu Chen, Zhenming Liu, Christos Kozyrakis, Ion Stoica, Xin Jin
USENIX OSDI, November 2020
[PDF]
Pegasus: Tolerating Skewed Workloads in Distributed Storage with In-Network Coherence Directories
Jialin Li, Jacob Nelson, Ellis Michael, Xin Jin, Dan R. K. Ports
USENIX OSDI, November 2020
[PDF]
Concerto: cooperative network-wide telemetry with controllable error rate
Yiran Li, Kevin Gao, Xin Jin, Wei Xu
ACM APSys, August 2020
[PDF]
NetLock: Fast, Centralized Lock Management Using Programmable Switches
Zhuolong Yu, Yiwen Zhang, Vladimir Braverman, Mosharaf Chowdhury, Xin Jin
ACM SIGCOMM, August 2020
[PDF]
Is Network the Bottleneck of Distributed Training?
Zhen Zhang, Chaokun Chang, Haibin Lin, Yida Wang, Raman Arora, Xin Jin
ACM NetAI Workshop, August 2020
[PDF]
Multitenancy for Fast and Programmable Networks in the Cloud
Tao Wang, Hang Zhu, Fabian Ruffy, Xin Jin, Anirudh Sivaraman, Dan R. K. Ports, Aurojit Panda
USENIX HotCloud Workshop, July 2020
[PDF]
On Efficient Constructions of Checkpoints
Yu Chen, Zhenming Liu, Bin Ren, Xin Jin
ICML, July 2020
[PDF]
QPipe: Quantiles Sketch Fully in the Data Plane
Nikita Ivkin, Zhuolong Yu, Vladimir Braverman, Xin Jin
ACM CoNEXT, December 2019
[PDF]
Flash: Efficient Dynamic Routing for Offchain Networks
Peng Wang, Hong Xu, Xin Jin, Tao Wang
ACM CoNEXT, December 2019
[PDF]
Harmonia: Near-Linear Scalability for Replicated Storage with In-Network Conflict Detection
Hang Zhu, Zhihao Bai, Jialin Li, Ellis Michael, Dan Ports, Ion Stoica, Xin Jin
VLDB, November 2019
[PDF]
Neural Packet Classification
Eric Liang, Hang Zhu, Xin Jin, Ion Stoica
ACM SIGCOMM, August 2019
[PDF]
DistCache: Provable Load Balancing for Large-Scale Storage Systems with Distributed Caching
Zaoxing Liu, Zhihao Bai, Zhenming Liu, Xiaozhou Li, Changhoon Kim, Vladimir Braverman, Xin Jin, Ion Stoica
USENIX FAST, February 2019
[PDF]
Best Paper Award
ASAP: Fast, Approximate Graph Pattern Mining at Scale
Anand Padmanabha Iyer, Zaoxing Liu, Xin Jin, Shivaram Venkataraman, Vladimir Braverman, Ion Stoica
USENIX OSDI, October 2018
[PDF]
AWStream: Adaptive Wide-Area Streaming Analytics
Ben Zhang, Xin Jin, Sylvia Ratnasamy, John Wawrzynek, Edward A. Lee
ACM SIGCOMM, August 2018
[PDF]
Towards Fast and Scalable Graph Pattern Mining
Anand Padmanabha Iyer, Zaoxing Liu, Xin Jin, Shivaram Venkataraman, Vladimir Braverman, Ion Stoica
USENIX HotCloud Workshop, July 2018
[PDF]
DumbNet: A Smart Data Center Network Fabric with Dumb Switches
Yiran Li, Da Wei, Xiaoqi Chen, Ziheng Song, Ruihan Wu, Yuxing Li,
Xin Jin, Wei Xu
European Conference on Computer Systems (EuroSys), April 2018
[PDF]
NetChain: Scale-Free Sub-RTT Coordination
Xin Jin, Xiaozhou Li, Haoyu Zhang, Nate Foster, Jeongkeun Lee, Robert Soule, Changhoon Kim, Ion Stoica
USENIX Symposium on Networked Systems Design and Implementation (NSDI),
April 2018
[PDF]
Best Paper Award
Proactive Video Push for Optimizing Bandwidth Consumption in Hybrid CDN-P2P
VoD Systems
Yuanxing Zhang, Chengliang Gao, Yangze Guo, Kaigui Bian, Xin Jin,
Zhi Yang, Lingyang Song, Jiangang Cheng, Hu Tuo, XiaoMing Li
IEEE International Conference on Computer Communications (INFOCOM),
April 2018
[PDF]
SnapLink: Fast and Accurate Vision-Based Appliance Control in Large
Commercial Buildings
Kaifei Chen, Jonathan Furst, John Kolb, Hyung-Sin Kim, Xin Jin,
David E. Culler, Randy H. Katz
ACM IMWUT, December 2017; ACM UbiComp, September 2018
[PDF]
Catalyst: Unlocking the Power of Choice to Speed up Network Updates
Rohan Gandhi, Ori Rottenstreich, Xin Jin
ACM CoNEXT, December 2017
[PDF]
NetCache: Balancing Key-Value Stores with Fast In-Network Caching
Xin Jin, Xiaozhou Li, Haoyu Zhang, Robert Soule, Jeongkeun Lee, Nate
Foster, Changhoon Kim, Ion Stoica
ACM SOSP, October 2017
[PDF]
SketchVisor: Robust Network Measurement for Software Packet Processing
Qun Huang, Xin Jin, Patrick P. C. Lee, Runhui Li, Lu Tang, Yi-Chao
Chen, Gong Zhang
ACM SIGCOMM, August 2017
[PDF]
Competitive Analysis for Online Scheduling in Software-Defined Optical WAN
Su Jia, Xin Jin, Golnaz Ghasemiesfeh, Jiaxin Ding, Jie Gao
IEEE International Conference on Computer Communications
(INFOCOM), May 2017
[PDF]
Optimizing Bulk Transfers with Software-Defined Optical WAN
Xin Jin, Yiran Li, Da Wei, Siming Li, Jie Gao, Lei Xu,
Guangzhi Li, Wei Xu, Jennifer Rexford
ACM SIGCOMM, August 2016
[PDF]
Increasing Large-Scale Data Center Capacity by Statistical Power Control
Guosai Wang, Shuhao Wang, Bing Luo, Weisong Shi, Yinghang Zhu,
Wenjun Yang, Dianming Hu, Longbo Huang, Xin Jin, Wei Xu
European Conference on Computer Systems (EuroSys), April 2016
[PDF]
A 12-Rack, 180-Server Datacenter Network (DCN) Using Multiwavelength
Optical Switching and Full Stack Optimization
Da Wei, Lei Xu, Xin Jin, Yiran Li, Wei Xu
Optical Fiber Communication Conference (OFC), March 2016
[PDF]
Your Data Center Switch is Trying Too Hard
Xin Jin, Nathan Farrington, Jennifer Rexford
Symposium on SDN Research (SOSR), March 2016
[PDF]
CoVisor: A Compositional Hypervisor for Software-Defined Networks
Xin Jin, Jennifer Gossels, Jennifer Rexford, David Walker
USENIX Symposium on Networked Systems Design and Implementation
(NSDI), May 2015
[PDF]
Can Accurate Predictions Improve Video Streaming in Cellular
Networks?
Xuan Kelvin Zou, Jeffrey Erman, Vijay Gopalakrishnan, Emir Halepovic,
Rittwik Jana, Xin Jin, Jennifer Rexford, and Rakesh K. Sinha
ACM International Workshop on Mobile Computing Systems and Applications
(HotMobile), February 2015.
[PDF]
Dynamic Scheduling of Network Updates
Xin Jin, Hongqiang Harry Liu, Rohan Gandhi, Srikanth
Kandula, Ratul Mahajan, Ming Zhang, Jennifer Rexford, Roger Wattenhofer
ACM SIGCOMM, August 2014
[PDF]
Incremental Update for a Compositional SDN Hypervisor
Xin Jin, Jennifer Rexford, David Walker
ACM HotSDN Workshop, August 2014
[PDF]
SoftCell: Scalable and Flexible Cellular Core Network Architecture
Xin Jin, Li Erran Li, Laurent Vanbever,
Jennifer Rexford
ACM CoNEXT, December 2013
[PDF]
Intra-Data-Center Traffic Engineering with Ensemble Routing
Ziyu Shao, Xin Jin, Wenjie Jiang, Minghua Chen,
Mung Chiang
IEEE International Conference on Computer Communications
(INFOCOM), April 2013
[PDF]
CellSDN: Software-Defined Cellular Core Networks
Xin Jin, Li Erran Li, Laurent Vanbever,
Jennifer Rexford
Open Networking Summit (Research Track), April 2013
[PDF]
Virtual Switching Without a Hypervisor for a More Secure Cloud
Xin Jin, Eric Keller, Jennifer Rexford
USENIX Workshop on Hot Topics in Management of Internet, Cloud, and
Enterprise Networks and Services (Hot-ICE), April 2012
[PDF]
Hidden Markov Model with Parameter-Optimized K-Means Clustering for
Handwriting Recognition
Weijie Su, Xin Jin
International Conference on Internet Computing and Information Services
(ICICIS), September 2011
[PDF]
Relative Link Quality Assessment and Hybrid Routing Scheme for
Wireless Mesh Networks
ChaoYi Bian, Xin Jin, Chao Liu, XiaoMing Li, Wei Yan,
IEEE International Conference on Communications (ICC), June 2011
[PDF]
Quantitative Analysis of the VANET Connectivity: Theory and
Application
Xin Jin, Weijie Su, Wei Yan
IEEE Vehicular Technology Conference (VTC), May 2011
[PDF]
A Study of the VANET Connectivity by Percolation Theory
Xin Jin, Weijie Su, Wei Yan
IEEE Consumer Communications and Networking Conference (CCNC),
January 2011
[PDF]