姓名:舒继武

职称:教授

邮箱:shujw@tsinghua.edu.cn

主页:http://storage.cs.tsinghua.edu.cn/~jiwu-shu/

教育背景

工学学士 (计算机科学与技术), 国防科技大学, 中国, 1991;

工学硕士 (计算科学与技术), 江苏大学, 中国, 1996;

工学博士 (计算机科学与技术), 南京大学, 中国, 1998.

社会兼职

中国计算机学会: 学术工委副主任  (2018-);

中国计算机学会: 信息存储技术专业委员会副主任、常务委员 (2006-);

灾备技术国家工程实验室: 副主任、技术委员会委员 (2008-);

高效能服务器和存储技术国家重点实验室: 学术委员(2008-2018.07);

信息存储系统教育部重点实验室学术委员会委员  (2017-);

清华大学信息科学技术学院学术委员会委员   (2011-);

863计划“海量存储系统关键技术” 重大项目: 总体专家组副组长 (2008-2010);

《ACM Transaction on Storage》: Associate Editor (2016-);

《计算机学报》: 编委 (2010-);

《软件学报》: 责任编委 (2012-);

《计算机研究与发展》: 编委 (2010-)

 

研究领域

我及我负责的研究组的主要研究方向包括:① 基于非易失存储器件的新型存储系统与技术、② 基于Flash器件的固态存储系统与技术、③ 网络(/云/大数据)存储系统与关键技术、④ 数据存储可靠性等。

研究概况

 ①在基于非易失存储器件的新型存储系统与技术方面,针对传统存储架构、软件及各层次都是基于传统器件设计而难以发挥NVM器件特性等问题,同时NVM器件本身也有性能和器件寿命等等不足,研究了基于NVM的存储系统与相关技术,相关主要学术论文发表在MICRO’2018/2014、USENIX ATC’2017(2)、EuroSys’2019/2016、DAC’2019/2016/2015、DSN’2014、MSST’2016/2015、ICCD’2014、DATE’2017(2)、ICPP’2016、CF’2015、GLSVLSI’16、ASP-DAC’2018/2016、NVMSA’2018/2016/2015等重要国际会议和ACM TOS 2019/2018/2016、IEEE TPDS 2017等权威国际期刊上,并获得MSST‘2015国际会议的Best Papers Nominated。同时研制出了基于NVM的本地文件系统HiNFS、分布式持久性内存文件系统Octopus。该方面研究已获得国家部委等资助外,还获得三星、华为、阿里、中兴等公司资助,部分成果已转化到公司并获得应用。

②在基于Flash器件的固态存储系统与技术方面,针对Flash的特有特性,主要研究了基于开放通道的固态存储系统(Open-Channel SSD)与相关技术,相关主要学术论文发表在FAST’2014/2013、USENIX’2016、SC’2017、DAC’2019、IPDPS’2014、ICCD’2013、DATE’2014、CODES+ISSS’2017、NVMSA’ 2014等重要国际会议和ACM TOS 2019、IEEE TC 2015/2016、IEEE TPDS 2018等权威国际期刊上,并获得NVMSA’2015国际会议的Best Papers。同时研制出了基于Open-Channel SSDs的高性能SSD存储系统等。部分成果已转化到相关公司等并获得应用。

③在网络(/云/大数据)存储系统与关键技术方面,主要针对网络(/云/大数据)存储系统中面临的性能、扩展性、兼容性、易用性、安全性、易管理等问题进行研究,在存储虚拟化、存储系统在线扩展方法、元数据管理方法、存储Cache策略、分级存储、面向SAN的并行文件系统、云存储安全等方面的研究取得了一些进展。研制出了具有自主知识产权的国产网络存储系统——“清华海量存储网络系统TH-MSNS”,2006年前,TH-MSNS系统已有近100套应用在审计署、公安局、电信、油田、高校等行业或部门;同时,也研制了相关的存储子系统,如:存储管理系统TH-MSIS、分布式虚拟化存储系统TH-DVSM、带外虚拟化存储系统TH-VSM、备份系统TH-Backup、快照系统TH-Snapshot、远程镜像系统TH-Mirror、面向SAN的并行文件系统Redbud、分级存储系统AIP、云存储安全系统Corslet,一些系统已分别在电信、教育等部门获得实际应用;这些系统与应用成果分别获得2007年获国家科技进步二等奖、2015年国家科学技术发明二等奖等; 这些系统相关的主要学术论文发表在IEEE TC 2016/2014/2010/2007/2005、ACM TOS 2010/2007、IEEE TPDS 2016/2015/2014、JPDC 2014/2018等权威国际期刊和IPDPS’2016、CODES/ISSS’2016、IWQos’2014/2013、ICPP’2012、CLUSTER’2012、MSST’2006、CCGrid’2004等重要国际会议上。

④在数据存储可靠性方面,针对存储纠删码容错能力(容1个、2个和多个错误)、性能(编码效率)和存储利用率等方面存在的问题进行研究,提出并实现了一些相关的高效实用的存储纠删码技术,提高了存储系统的可靠性,相关主要学术论文发表在DSN’2016/2014、SRDS’2017/2016/2015/2014、IPDPS’2015、ICPP’2015/2013/2012、IWQos’2016、ISIT’2010等重要国际会议和IEEE TC 2017/2016/2010、IEEE TPDS 2018/2017/2016、IEEE TDSC 2019、ACM TOS 2012/2009等权威国际期刊上,并获得2010年IEEE TC期刊的Featured Article、SRDS’2015国际会议的Nominated Best Papers。

主要研究项目和课题

国家重点研发计划重点专项项目: TB级持久性内存存储技术与系统 (2018-2021);

国家重点研发计划重点专项课题:分布式持久性共享内存框架研制及存储系统应用示范 (2018-2021);

北京市科委重点项目之课题:异构混合内存系统的大数据处理软件平台架构与优化研究(2015-2016);

863项目: 面向大数据的先进存储结构及关键技术(2013-2015);

863课题: 基于新型存储器件的存储体系结构与关键技术研究(2013-2015);

国家自然科学重点基金: 新型分布式存储系统的高可靠性关键技术研究 (2019-2023);

国家自然科学重点基金:网络公用数据存储系统可靠性及其灾备研究(2013-2017);

国家工信部子课题: 云计算平台上面向移动通信业务云服务关键技术与系统(2013-2015);

国家科技支撑计划子课题: 存储虚拟化软件开发及产业化(2010-2013);

国家杰出青年科学基金: 大规模网络存储系统的可靠性关键技术研究 (2010-2013);

国家自然科学基金: 大规模网络存储系统的容错技术研究 (2009-2011);

973一级课题: 下一代互联网存储按需部署模型与服务质量研究 (2004-2009);

863重大课题子课题: 海量存储系统软件和关键技术 (2009-2010).

奖励与荣誉

北京市科学技术进步二等奖——高可扩展海量网络存储系统 (2004);

高等学校科学技术进步二等奖——深腾2600网络服务器系统 (2007);

国家科技进步二等奖——高性能集群计算机与海量存储系统 (2007);

中国电子学会技术发明一等奖——面向社区共享的高可用云存储系统(2014);

中国国家科学技术发明二等奖——面向社区共享的高可用云存储系统(2015);

江苏省科学技术奖一等奖——大规模数据服务系统与平台的关键技术及产业应用(2018)

清华大学学术新人奖 (2005).

教育部:新世纪优秀人才 (2005);

中创软件人才奖(2007);

国家自然科学基金委: 国家杰出青年科学基金获得者 (2009);

教育部长江学者特聘教授 (2014);

IEEE Fellow (2018);

中国计算机学会会士 (2018).

学术成果

  • 代表性会议论文

[1] Fan Yang, Youyou Lu, Youmin Chen, Haiyu Mao, Jiwu Shu, No Compromises: Secure NVM with Crash Consistency, Write-Efficiency and High-Performance, the 55th Annual Design Automation Conference (DAC), June 2-6, 2019, Las Vegas, NV, USA

[2] Fei Li, Youyou Lu, Zhongjie Wu, Jiwu Shu, ASCache: An Approximate SSD Cache for Error-Tolerant Applications, the 55th Annual Design Automation Conference (DAC), June 2-6, 2019, Las Vegas, NV, USA

[3] Youmin Chen, Youyou Lu, Jiwu Shu, Scalable RDMA RPC on Reliable Connection with Efficient Resource Sharing, European Conference on Computer Systems (EuroSys), Dresden, Germany, Mar 25, 2019 - Mar 28, 2019

[4] Haiyu Mao, Mingcong Song, Tao Li, Yutiing Dai, Jiwu Shu, LerGAN: A Zero-free, Low Data Movement and PIM-based GAN Architecture, The 51st Annual IEEE/ACM International Symposium on Microarchitecture (Micro-51), Fukuoka City, Japan, 2018, Oct 20 – 24, pp.670-682 (acceptance rate: 74/351=21.06%)

[5] Siyang Li, Youyou Lu, Jiwu Shu, Fenlin Liu, Yang Hu, Tao Li, LocoFS: A Loosely-Coupled Metadata Service for Distributed File System, The International Conference for High Performance Computing, Networking, Storage and Analysis (SC), Nov. 12-17,2017, Denver, Co (Acceptance Rate: 61/327=18.65%)

[6] Youyou Lu, Jiwu Shu, Youmin Chen, Tao Li , Octopus: an RDMA-enabled Distribute Persistent Memory File System, 2017 USENIX Annual Technical Conference (USENIX ATC), July 12–14, 2017, Santa Clara, CA, USA, pp.773-785 (acceptance rate: 60/283=21%)

[7] Qingda Hu, Jinglei Ren, Anirudh Badam, Jiwu Shu, Tomas Moscibroda, Log-Structured Non-Volatile Main Memory, 2017 USENIX Annual Technical Conference (USENIX ATC), July 12–14, 2017, Santa Clara, CA, USA, pp.703-717 (acceptance rate: 60/283=21%)

[8] Jiacheng Zhang, Jiwu Shu, Youyou Lu, ParaFS: A Log-Structured File System to Exploit the Internal Parallelism of Flash Devices, 2016 USENIX Annual Technical Conference (USENIX ATC), June 22–24, 2016, Denver, Colorado, USA (Acceptance Rate: 19%)

[9] Zhirong Shen, Jiwu Shu, Patrick P.C. Lee, Reconsidering Single Failure Recovery in Cluster File Systems, The 46th Annual IEEE/IFIP International Conference on Dependable Systems and Networks (DSN), 2016, June 28-July 1st, Toulouse, France (Acceptance Rate: 58/259=22.39%)

[10] Zewei Li, Yongpan Liu, Daming Zhang, Chun Jason Xue, Zhangyuan Wang, Xin Shi, Wenyu Sun, Jiwu Shu, Huazhong Yang, HW/SW Co-design of Nonvolatile IO System in Energy Harvesting Sensor Nodes for Optimal Data Acquisition, Design Automation Conference (DAC), 5–9 June 2016, Austin, USA (Acceptance Rate: 152/876=17.4 %)

[11] Jiaxin Ou, Jiwu Shu, Youyou Lu, A High Performance File System for Non-Volatile Main Memory, The European Conference on Computer Systems (EuroSys), 18th-21st, April, 2016, London, UK (Acceptance Rate: 38/180=21.11%)

[12] Zhirong Sheng, Jiwu Shu, Patrick Lee, Seek-Efficient I/O Optimization in Single Failure Recovery for XOR-Coded Storage Systems, The 34th International Symposium on Reliable Distributed Systems (SRDS), September 28-October 1, 2015, Quebec Canada (Nominated Best Papers).

[13] Yongpan Liu, Zewei Li, Hehe Li, Yiqun Wang, Xueqing Li, Kaisheng Ma, Shuangchen Li, Meng-Fan Chang, Sampson John, Yuan Xie, Jiwu Shu, Huazhong Yang, Design Automation Conference (DAC), Moscone Center, San Francisco, CA, June 7-11, 2015, Article No.150, pp.1-6

[14] Chao Zhang, Guangyu Sun, Xian Zhang, Weiqi Zhang, Weisheng Zhao, Tao Wang, Yun Liang, Yongpan Liu, Yu Wang, Jiwu Shu, Hi-fi Playback: Tolerating Position Errors in Shift Operations of Racetrack Memory, The 42nd International Symposium on Computer Architecture (ISCA), Portland, OR, USA, June 13-17, 2015, pp. 694-706. (Acceptance Rate: 58/305=19.02%)

[15] Louyou Lu, Jiwu Shu, Long Sun, Blurred Persistence in Transactional Persistent Memory, The 31st International Conference on Massive Storage Systems and Technologies (MSST), June 1-5, 2015, Santa Clara, California, USA (Nominated Best Papers)

[16] Fan Jie, Jiang Song, Shu Jiwu, Sun Long, Hu Qingda, WL-Reviver: A Framework for Reviving any Wear-Leveling Techniques in the Face of Failures on Phase Change Memory, The Annual IEEE/IFIP International Conference on Dependable Systems and Networks (DSN), June 23 - 26, 2014, Atlanta, Georgia USA, pp.228-239. (Acceptance Rate: 49/181 = 27.07%)

[17] Zhirong Sheng, Jiwu Shu, HV Code: An MDS Code to Improve Efficiency and Reliability of RAID-6 Systems, The Annual IEEE/IFIP International Conference on Dependable Systems and Networks (DSN), June 23 - 26, 2014, Atlanta, Georgia USA, pp.550-56. (Acceptance Rate: 49/181 = 27.07%)

[18] Lu Youyou, Shu Jiwu, Wang Wei, ReconFS: A Reconstructable File System on Flash Storage, FAST, the 12th USENIX Conference on File and Storage Technologies (FAST), February 12-15, 2014, San Jose, CA, USA, pp. 75-88. (Acceptance Rate: 21/133=15.79% (full paper))

[19] Jie Fan, Song Jiang, Jiwu Shu, Youhui Zhang, Weimin Zheng, Partitioning Data Block for Efficient Recovery of Stuck-At-Faults in Phase Change Memory, The 46th Annual IEEE/ACM International Symposium on Microarchitecture (MICRO), December 7-11, 2013, Davis, California, pp. 433-444. (Acceptance Rate: 39/239=16.32%)

[20] Youyou Lu, Jiwu Shu, Weimin Zheng, Extending the Lifetime of Flash-based Storage through Reducing Write Amplification from File Systems, 11th USENIX Conference on File and Storage Technologies (FAST), February 12-15,2013, San Jose, CA, USA, pp. 257-270. (Acceptance Rate: 20/127=15.75% (full paper))

  • 代表性期刊论文

[1] Siyang Li, Fenlin Liu, Jiwu Shu, Youyou Lu, Tao Li, Yang Hu, A Flattened Metadata Service for Distributed File Systems, IEEE Transactions on Parallel and Distributed Systems (TPDS), 29(12):2641-2657, December 2018.

[2] Zhirong Shen, Patrick P. C. Lee, Jiwu Shu and Wenzhong Guo, Encoding-Aware Data Placement for Efficient Degraded Reads in XOR-Coded Storage Systems: Algorithms and Evaluation, IEEE Transactions on Parallel and Distributed Systems (TPDS), 29(12):2757-2770, December 2018.

[3] Youmin Chen, Jiwu Shu, Jiaxin Ou, Youyou Lu, HiNFS: A Persistent Memory File System with both Buffering and Direct-Access, ACM Transactions on Storage, Vol.14, Number 1, April 2018, 4:1-4:30.

[4] Zhirong Shen, Jiwu Shu, Patrick P.C. Lee, and Yingxun Fu, Seek-Efficient I/O Optimization in Single Failure Recovery for XOR-Coded Storage Systems, IEEE Transactions on Parallel and Distributed Systems (TPDS), 28(3):877-890, 2017.

[5] Yingxun Fu, Jiwu Shu, Xianghong Luo, Zhirong Shen, Qingda Hu, Short Code: An Efficient RAID-6 MDS Code for Optimizing Degraded Reads and Partial Stripe Writes, IEEE Transactions on Computers (TC), 66(1), pp. 127-137, 2017.

[6] Youyou Lu, Jiwu Shu, Jia Guo, Peng Zhu, Supporting System Consistency with Differential Transactions in Flash-based SSDs, IEEE Transactions on Computers, 65(2), pp. 627-639, 2016.

[7] Louyou Lu, Jiwu Shu, Long Sun, Blurred Persistence: Efficient Transactions in Persistent Memory, ACM Transactions on Storage, Vol.12, No. 1, Article 3, pp. 1-29, January 2016.

[8] Zhirong Shen, Jiwu Shu and Yingxun Fu, Parity-switched Data Placement: Optimizing Partial Stripe Writes in XOR-Coded Storage Systems, IEEE Transactions on Parallel and Distributed Systems (TPDS), 27(11), pp. 3311-3322, 2016.

[9] Zhirong Shen, Jiwu Shu, HV Code: An All-Around MDS Code for RAID-6 Storage Systems, IEEE Transactions on Parallel and Distributed Systems (TPDS), 27(6), pp.1674-1686, JUNE 2016.

[10] Yingxun Fu, Jiwu Shu, Zhang Guangyan, Reconsidering Single Disk Failure Recovery for Erasure Coded Storage Systems: Optimizing The Load Balancing in Stack-Level, IEEE Transactions on Parallel and Distributed Systems (TPDS), 27(5):1457-1469, 2016.

[11] Guangyan Zhang, Guiyong Wu, shupeng Wang, Jiwu Shu, Weimin Zheng, Keqin Li, CaCo: An Efficient Cauchy Coding Approach for Cloud Storage Systems, IEEE Transactions on Computers, 65(2), pp.435-447, 2016.

[12] Youyou Lu, Jiwu Shu, Jia Guo, Shuai Li, and Onur Mutlu, High-Performance and Lightweight Transaction Support in Flash-based SSDs, IEEE Transactions on Computers, 64(10), pp.2819-2832, 2015.

[13] Zhang Guangyan, Wang Jigang, Li Keqin, Shu Jiwu, and Zheng Weimin, Redistribute Data to Regain Load Balance during RAID-4 Scaling, IEEE Transactions on Parallel and Distributed Systems, Vol.25, No.7, pp.1-11, JULY 2014.

[14] Yi Letian, Shu Jiwu, Zhao Ying, Qian Yingjin, Lu Youyou, Zheng Weiming, Design and implementation of an asymmetric block-based parallel file system, IEEE Transactions on Computers, Vol.62, No.7, pp.1723-1735, JULY 2014.

[15] Xianghong Luo, Jiwu Shu, Generalized X-code: An efficient RAID-6 code for arbitrary size of disk array, ACM Transactions on Storage (TOS), Vol.8, No.3, Article 10, pp.1-19, 2012.

[16] Wang Yang, Shu Ji-Wu, Zhang Guang-Yan, Xue Wei, SOPA: Selecting the Optimal Policy Adaptively for a cache system, ACM Transactions on Storage, Vol. 6, No. 2, Article 7, pp. 1-18, 2010.

[17] Guangyan Zhang, Weimin Zheng, and Jiwu Shu. ALV: A New Data Redistribution Approach to RAID-5 Scaling, IEEE Transactions on Computers, Vol. 59(3), pp. 345-357, 2010.

[18] Guangyan Zhang, Jiwu Shu, Wei Xue, and Weimin Zheng, SLAS: An Efficient Approach to Scaling Round-robin Striped Volumes, ACM Transactions on Storage, 2007, 3(1), Article3, pp.1-39.

[19] Guangyan Zhang, Jiwu Shu, Wei Xue, and Weimin Zheng, Design and Implementation of an Out-of-Band Virtualization System for Large SANs, IEEE Transactions on Computers, 2007, 56(12), pp.1654-1665.

[20] Shu Ji-wu, Li Bigang, Zheng Wei-min, Design and Implementation of a SAN System Based on the Fiber Channel Protocol, IEEE Transactions on Computers, 54(4), 2005, pp. 439-448.

[21] JIWU SHU, WEI XUE, WEIMING ZHENG, A Parallel Transient Stability Simulation for Power System, IEEE Transactions on Power Systems (TOPS), 2005, 20(4), pp.1709-1717.

[22] Li Mingqiang, Shu Jiwu, Preventing Silent Data Corruptions from Propagating During Data Reconstruction, IEEE Transactions on Computers (TC), 59(12), pp.1611-1624, 2010 (Featured Article).

[23] Li Mingqiang, Shu Jiwu, DACO: A High Performance Disk Architecture Designed Specially for Large Scale Erasure Coded Storage Systems, IEEE Transactions on Computers (TC), 59(10), pp.1350-1362, 2010.

[24] Li Mingqiang, Shu Jiwu, Zheng Weimin, GRID Codes: Strip-based Erasure Codes with High Fault Tolerance for Storage Systems, ACM Transactions on Storage (TOS), 2009, 4(4), Article15, pp.1-22.

本研究组主页:http://storage.cs.tsinghua.edu.cn/~jiwu-shu/