Full-time Faculty Members

Guangyu Sun

Assistant Professor
Center for Energy-efficient Computing and Applications (CECA)
School of EECS, Peking University
Phone: +86-10-6275-7978
Office: 515S Science Building #5, Peking University, 5 Yiheyuan Road, Beijing, China, 100871
Email: gsun [at] pku [dot] edu [dot] cn

(Chinese version of this homepage)

News

    We are recruiting undergraduate students for research in the fields of "Storage System", "Low Power Deep Learning", and "Mobile Cloud Computing". Please refer to this (link).
    Two papers accepted to MICRO 2015
    NSFC Proposal "Racetrack memory based ultra-high density memory design and optimization" has been granted
    Paper "InterFS: an Interplanted Distributed File System to Improve Storage Utilization" has been accepted to APSys 2015


Research

    Energy-efficient Memory Architectures (Link)
    Storage System Optimization for New Devices (Link)
    Acceleration Systems for Deep Learning Applications (Link)


Education



Grants

    Intel-CCF Grant, "Optimization of Distributed KV-Store on NVMe based SSDs" (PI), 2015-2016
    NSFC Grant, "Racetrack memory based ultra-high density memory design and optimization" (PI), 2016-2019
    Huaijiao Grant, "PKU-Huaijiao Mobile Cloud Platform" (co-PI), 2015-2016
    Huawei HIRP Grant, "Key-Value Framework Standardization Research" (PI), 2015-2016
    National High-tech R&D Program of China, "Storage System Architecture and Technology Based on the New Memory Devices" (co-PI), 2013-2015
    NSFC Grant, "Energy-Efficient Cache Design using Emerging Technology in Multi-core" (PI), 2013-2015
    AMD Gift Grant, "Exploration of Energy-efficient System Design for Modern Applications in China", 2012-2013


Students

Graduate Students
Peng Wang, Ph.D. Student (co-advised)
Chao Zhang, Ph.D. Student
Chen Zhang, Ph.D. Student (co-advised)
Xian Zhang, Ph.D. Student
Weiqi Zhang, Ph.D. Student
Yijin Guan, Ph.D. Student (co-advised)
Xiaoyang Wang, Ph.D. Student
Undergraduate Students
Zhihang Yuan, 2012
Zhikang Fan, 2012
Runhui Wang, 2012
Bingzhe Wu, 2012
Jiawu Sun, 2012
Yao Fu, 2013
Qingyang Zhu, 2013
Yunqi He, 2013
Zhen Tan, 2013
Chengzhi Li, 2013
Zhe Zhou, 2013


Publications

2017
Wang Kang, Liang Chang, Zhaohao Wang, Weifeng Lv, Guangyu Sun, and Weisheng Zhao, "Pseudo-Differential Sensing Framework for STT-MRAM: A Cross-Layer Perspective," IEEE Transactions on Computers (IEEE TC), Vol. 66, No. 3, pp. 531-544, March 2017.
Shuo Li, Peng Wang, Nong Xiao, Guangyu Sun, and Fang Liu, "SPMS: Strand based Persistent Memory System," in Proceedings of the Design Automation and Test in Europe (DATE 2017), March 27-31, 2017, Lausanne, Switzerland, pp. 622-625.
Haiyu Mao, Xian Zhang, Guangyu Sun, and Jiwu Shu, "Protect Non-volatile Memory from Wear-out Attack based on Timing Difference of Row Buffer Hit/Miss," in Proceedings of the Design Automation and Test in Europe (DATE 2017), March 27-31, 2017, Lausanne, Switzerland, pp. 1623-1626.
Yijin Guan, Zhihang Yuan, Guangyu Sun, and Jason Cong, "FPGA-based Accelerator for Long Short-Term Memory Recurrent Neural Networks," in Proceedings of the 22nd Asia and South Pacific Design Automation Conference (ASP-DAC 2017), January 16-19, 2017, Chiba/Tokyo, Japan, pp. 629-634.
2016
Guangyu Sun, Chao Zhang, Peng Li, Tao Wang, and Yiran Chen, "Statistical Cache Bypassing for Non-Volatile Memory," IEEE Transactions on Computers (IEEE TC), Vol. 65, No. 11, pp. 3427-3440, November 2016.
Chen Zhang, Di Wu, Jiayu Sun, Guangyu Sun, Guojie Luo, and Jason Cong, "Energy-Efficient CNN Implementation on a Deeply Pipelined FPGA Cluster," in Proceedings of IEEE/ACM International Symposium on Low Power Electronics and Design (ISLPED 2016), August 8-10, 2016, San Francisco, USA, pp. 326-331.
Xiaoyang Wang, Chao Zhang, Xian Zhang, and Guangyu Sun, "np-ECC: Nonadjacent Position Error Correction Code for Racetrack Memory," in Proceedings of 2016 IEEE/ACM International Symposium on Nanoscale Architectures (NANOARCH 2016), July 18-20, 2016, Beijing, China, pp. 23-24.
Weiqi Zhang, Chao Zhang, and Guangyu Sun, "Accelerate Context Switch by Racetrack-SRAM Hybrid Cells," in Proceedings of 2016 IEEE/ACM International Symposium on Nanoscale Architectures (NANOARCH 2016), July 18-20, 2016, Beijing, China, pp. 115-116.
Wang Kang, Tingting Pang, Bi Wu, Weifeng Lv, Youguang Zhang, Guangyu Sun, and Weisheng Zhao, "PDS: pseudo-differential sensing scheme for STT-MRAM," in Proceedings of the 53rd ACM/IEEE Design Automation Conference (DAC 2016), June 5-9, 2016, Austin, TX, USA, pp. 120:1-6.
Yue Zhang, Chao Zhang, Jiang Nan, Zhizhong Zhang, Xueying Zhang, Jacques-Olivier Klein, Dafine Ravelosona, Guangyu Sun, and Weisheng Zhao, "Perspectives of Racetrack Memory for Large-Capacity On-Chip Memory: From Device to System," IEEE Transactions on Circuits and Systems, Vol. 63, No. 5, pp. 629-638, May 2016.
Chaofei Yang, Beiye Liu, Yandan Wang, Yiran Chen, Hai Li, Xian Zhang, and Guangyu Sun, "The Applications of NVM Technology in Hardware Security," in Proceedings of the 26th ACM Great Lakes Symposium on VLSI (GLSVLSI 2016), May 18-20, 2016, Boston, MA, USA, pp. 311-316.
Qingda Hu, Guangyu Sun, Jiwu Shu, and Chao Zhang, "Exploring Main Memory Design Based on Racetrack Memory Technology," in Proceedings of the 26th ACM Great Lakes Symposium on VLSI (GLSVLSI 2016), May 18-20, 2016, Boston, MA, USA, pp. 397-402.
Yuze Chi, Guohao Dai, Yu Wang, Guangyu Sun, Guoliang Li, and Huazhong Yang, "NXgraph: An efficient graph processing system on a single machine," in Proceedings of IEEE 32nd International Conference on Data Engineering (ICDE 2016), May 16-20, 2016, Helsinki, Finland, pp. 409-420.
Shuo Wang, Yun Liang, Chao Zhang, Xiaolong Xie, Guangyu Sun, Yongpan Liu, Yu Wang, and Xiuhong Li, "Performance-centric Register File Design for GPUs using Racetrack Memory," in Proceedings of the 21st Asia and South Pacific Design Automation Conference (ASP-DAC 2016), January 25-28, 2016, Macao, China, pp. 25-30.
Xian Zhang, Guangyu Sun, Yaojun Zhang, Wujie Wen, Yiran Chen, Hai Li, and Jia Di, "A Novel PUF based on Cell Error Rate Distribution of STT-RAM," in Proceedings of the 21st Asia and South Pacific Design Automation Conference (ASP-DAC 2016), January 25-28, 2016, Macao, China, pp. 342-347.
Hongbin Zhang, Chao Zhang, Xian Zhang, Guangyu Sun, and Jiwu Shu, "Pin Tumbler Lock: A Shift based Encryption Mechanism for Racetrack Memory," in Proceedings of the 21st Asia and South Pacific Design Automation Conference (ASP-DAC 2016), January 25-28, 2016, Macao, China, pp. 354-359.
2015
Xian Zhang, Guangyu Sun, Chao Zhang, Weiqi Zhang, Yun Liang, Tao Wang, Yiran Chen, and Jia Di, "Fork Path: Improving Efficiency of ORAM by Removing Redundant Memory Accesses," in Proceedings of the 48th Annual IEEE/ACM International Symposium on Microarchitecture (MICRO 2015), December 5-9, 2015, Waikiki, Hawaii, pp. 102-114.
Xiaolong Xie, Yun Liang, Xiuhong Li, Yudong Wu, Guangyu Sun, Tao Wang, and Dongrui Fan, "Enabling Coordinated Register Allocation and Thread-level Parallelism Optimization for GPUs," in Proceedings of the 48th Annual IEEE/ACM International Symposium on Microarchitecture (MICRO 2015), December 5-9, 2015, Waikiki, Hawaii, pp. 395-406.
Yun Liang, Xiaolong Xie, Guangyu Sun, and Deming Chen, "An Efficient Compiler Framework for Cache Bypassing on GPUs," IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems (IEEE TCAD), Vol. 34, No. 10, pp. 1677-1690, October 2015.
Chao Zhang, Guangyu Sun, Xueying Zhang, and Weisheng Zhao, "Thermal Modeling and Architecture Level Temperature Management for Shift Operations of Racetrack Memory," in Proceedings of the 21st National Conference of Information Storage (NCIS 2015), September 17-18, 2015, Changsha, China.
Pengfei Zhu, Guangyu Sun, Peng Wang, and Mingyu Chen, "Improving Memory Access Performance of In-Memory Key-Value Store Using Data Prefetching Techniques," in Proceedings of International Symposium on Advanced Parallel Processing Technology (APPT 2015), August 20-21, 2015, Jinan, Shandong, China, pp. 1-17.
Haiyu Mao, Chao Zhang, Guangyu Sun, and Jiwu Shu, "Exploring Data Placement in Racetrack Memory based Scratchpad Memory," in Proceedings of the 4th IEEE Non-Volatile Memory System and Applications Symposium (NVMSA 2015), August 19-21, 2015, Hong Kong, China, pp. 1-5.
Peng Wang, Le Cao, Chunbo Lai, Leqi Zou, Guangyu Sun, and Jason Cong, "InterFS: An Interplanted Distributed File System to Improve Storage Utilization," in Proceedings of the 6th ACM SIGOPS Asia-Pacific Workshop on Systems (APSys 2015), July 27-28, 2015, Tokyo, Japan, pp. 14:1-8.
Shuangchen Li, Ang Li, Yuan Zhe, Yongpan Liu, Peng Li, Guangyu Sun, Yu Wang, Huazhong Yang, and Yuan Xie, "Leveraging Emerging Nonvolatile Memory in High-Level Synthesis with Loop Transformations," in Proceedings of IEEE/ACM International Symposium on Low Power Electronics and Design (ISLPED 2015), July 22-24, 2015, Rome, Italy, pp. 61-66.
Chunbo Lai, Song Jiang, Liqiong Yang, Shiding Lin, Guangyu Sun, Zhenyu Hou, Can Cui, and Jason Cong, "Atlas: Baidu's Key-value Storage System for Cloud Data," in Proceedings of the 31st International Conference on Massive Storage Systems and Technology (MSST 2015), May 30 - June 5, 2015, Santa Clara, CA, USA, pp. 1-14.
Chao Zhang, Guangyu Sun, Xian Zhang, Weiqi Zhang, Weisheng Zhao, Tao Wang, Yun Liang, Yongpan Liu, Yu Wang, and Jiwu Shu, "Hi-fi Playback: Tolerating Position Errors in Shift Operations of Racetrack Memory," in Proceedings of the 42nd ACM/IEEE International Symposium on Computer Architecture (ISCA 2015), June 13-17, 2015, Portland, OR, USA, pp. 694-706.
Gushu Li, Xiaoming Chen, Guangyu Sun, Henry Hoffmann, Yongpan Liu, Yu Wang, and Huazhong Yang, "An STT-RAM-based Low-Power Hybrid Register File for GPGPUs," in Proceedings of the 52nd Design Automation Conference (DAC 2015), June 7-11, 2015, San Francisco, CA, USA, pp. 103:1-6.
Yue Zhang, Chao Zhang, Jiang Nan, Xueying Zhang, Jacques-Olivier Klein, Dafine Ravelosona, Guangyu Sun, and Weisheng Zhao, "Perspectives of Racetrack Memory Based on Current-Induced Domain Wall Motion: From Device to System," in Proceedings of 2015 IEEE International Symposium on Circuits and Systems (ISCAS 2015), May 24-27, 2015, Lisbon, Portugal, pp. 381-384.
Hehe Li, Yongpan Liu, Qinghang Zhao, Guangyu Sun, Chao Zhang, Yizi Gu, Rong Luo, Huazhong Yang, Meng-Fan Chang, and Xiao Sheng, "An Energy Efficient Backup Scheme with Low Inrush Current for Nonvolatile SRAM in Energy Harvesting Sensor Nodes," in Proceedings of the 18th Design, Automation and Test in Europe (DATE 2015), March 9-13, 2015, Grenoble, France, pp. 7-12.
Guangyu Sun, Chao Zhang, Hehe Li, Yue Zhang, Weiqi Zhang, Yizi Gu, Yinan Sun, Jacques-Olivier Klein, Dafine Ravelosona, Yongpan Liu, Weisheng Zhao, and Huazhong Yang, "From Device To System: Cross-Layer Design Exploration of Racetrack Memory," in Proceedings of the 18th Design, Automation and Test in Europe (DATE 2015), March 9-13, 2015, Grenoble, France, pp. 1018-1023.
Chen Zhang, Peng Li, Guangyu Sun, Yijin Guan, Bingjun Xiao, and Jason Cong, "Optimizing FPGA-based Accelerator Design for Deep Convolutional Neural Networks: An Analytical Approach based on Roofline Model," in Proceedings of 23rd ACM/SIGDA International Symposium on Field-Programmable Gate Arrays (FPGA 2015), February 22-24, 2015, Monterey, CA, USA, pp. 161-170.
Xiaolong Xie, Yun Liang, Yu Wan, Guangyu Sun, and Tao Wang, "Coordinated Static and Dynamic Cache Bypassing for GPUs," in Proceedings of the 21st IEEE International Symposium on High Performance Computer Architecture (HPCA 2015), February 7-11, 2015, San Francisco Bay Area, CA, USA, pp. 76-88.
Chao Zhang, Guangyu Sun, Weiqi Zhang, Fan Mi, Hai Li, and Weisheng Zhao, "Quantitative Modeling of Racetrack Memory, A Tradeoff among Area, Performance, and Power," in Proceedings of 20th Asia and South Pacific Design Automation Conference (ASP-DAC 2015), January 19-22, 2015, Chiba, Japan, pp. 100-105.
2014
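Guangyu Sun, Jiwu Shu, and Peng Wang, "A Survey of Architecture- and System-Level Design and Optimization for Non-Volatile Memory" (in Chinese), Journal of East China Normal University (Natural Science), Vol. 2014, No. 5, pp. 72-81, September 2014.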
Tao Wang, Guangyu Sun, Jiahua Chen, Jian Gong, Haoyang Wu, Xiaoguang Li, Songwu Lu, and Jason Cong, "GRT: a Reconfigurable SDR Platform with High Performance and Usability," ACM SIGARCH Computer Architecture News (CAN), Vol. 42, No. 4, pp. 51-56, September 2014.
Chao Zhang, Guangyu Sun, Peng Li, Tao Wang, Dimin Niu, and Yiran Chen, "SBAC: A Statistics based Cache Bypassing Method for Asymmetric-access Caches," in Proceedings of International Symposium on Low Power Electronics and Design (ISLPED 2014), August 11-13, 2014, La Jolla, CA, USA, pp. 345-350.
Bi Wu, Yuanqing Cheng, Ying Wang, Aida Todri-Sanial, Guangyu Sun, Lionel Torres, and Weisheng Zhao, "An Architecture-level Cache Simulation Framework Supporting Advanced PMA STT-MRAM," in Proceedings of IEEE/ACM International Symposium on Nanoscale Architectures (NANOARCH 2015), July 8-10, 2015, Boston, MA, USA, pp. 7-12.
Tao Zhang, Ke Chen, Cong Xu, Guangyu Sun, Tao Wang, and Yuan Xie, "Half-DRAM: a High-bandwidth and Low-power DRAM Architecture from the Rethinking of Fine-grained Activation," in Proceedings of the 41st International Symposium on Computer Architecture (ISCA 2014), June 14-18, 2014, Minneapolis, USA, pp. 349-360.
Jia Zhan, Yuan Xie, and Guangyu Sun, "NoC-Sprinting: Interconnect for Fine-Grained Sprinting in the Dark Silicon Era," in Proceedings of the 51st Annual Design Automation Conference (DAC 2014), June 1-5, 2014, San Francisco, CA, USA, pp. 1-6.
Jingyu Deng, Yun Liang, Guojie Luo, and Guangyu Sun, "Rapid Design Space Exploration of Two-level Unified Caches," in Proceedings of International Symposium on Circuits and Systems (ISCAS 2014), June 1-5, 2014, Melbourne, Australia, pp. 1937-1940.
Tao Zhang, Ke Chen, Guangyu Sun, and Yuan Xie, "3D-SWIFT: A High-Performance 3D-Stacked Wide IO DRAM," in Proceedings of Great Lakes Symposium on VLSI (GLSVLSI 2014), May 21-23, 2014, Houston, TX, USA, pp. 51-56.
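Guangyu Sun, Peng Wang, and Chao Zhang, "Memory Architectures Based on Emerging Non-Volatile Memory" (in Chinese), Communications of the China Computer Federation (CCF), Vol. 10, No. 4, pp. 18-25, April 2014.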
Peng Wang, Guangyu Sun, Song Jiang, Jian Ouyang, Shiding Lin, Chen Zhang, and Jason Cong, "An Efficient Design and Implementation of LSM-Tree based Key-Value Store on Open-Channel SSD," in Proceedings of ACM European Conference on Computer Systems (EuroSys 2014), April 13-16, 2014, Amsterdam, the Netherlands, pp. 16:1-16:14.
Tao Zhang, Matt Poremba, Cong Xu, Guangyu Sun, and Yuan Xie, "CREAM: a Concurrent-Refresh-Aware DRAM Memory Architecture," in Proceedings of the 20th IEEE International Symposium on High Performance Computer Architecture (HPCA 2014), February 15-19, 2014, Orlando, FL, USA, pp. 368-379.
Zhe Wang, Daniel A. Jimenez, Cong Xu, Guangyu Sun, and Yuan Xie, "Adaptive Placement and Migration Policy for an STT-RAM-Based Hybrid Cache," in Proceedings of the 20th IEEE International Symposium on High Performance Computer Architecture (HPCA 2014), February 15-19, 2014, Orlando, FL, USA, pp. 13-24.
Mengjie Mao, Guangyu Sun, Yong Li, Alex K. Jones, and Yiran Chen, "Prefetching Techniques for STT-RAM based Last-level Cache in CMP Systems," in Proceedings of the 19th Asia and South Pacific Design Automation Conference (ASP-DAC 2014), January 20-23, 2014, Singapore, pp. 67-72.
2013
Jishen Zhao, Guangyu Sun, Gabriel H. Loh, and Yuan Xie, "Optimizing GPU Energy Efficiency with 3D Die-stacking Graphics Memory and Reconfigurable Memory Interface," ACM Transactions on Architecture and Code Optimization (ACM TACO), Vol. 10, No. 4, pp. 24:1-25, December 2013.
Xiaolong Xie, Yun Liang, Guangyu Sun, and Deming Chen, "An Efficient Compiler Framework for Cache Bypassing on GPUs," in Proceedings of IEEE/ACM International Conference on Computer-Aided Design (ICCAD 2013), November 18-21, 2013, San Jose, CA, USA, pp. 516-523.
Tao Zhang, Cong Xu, Guangyu Sun and Yuan Xie, "Lazy Precharge: An Overhead-free Method to Reduce Precharge Overhead for Memory Parallelism Improvement of DRAM System," in Proceedings of 31st IEEE International Conference on Computer Design (ICCD 2013), October 6-9, 2013, Asheville, NC, USA, pp. 138-144.
Xian Zhang, Chao Zhang, Tao Zhang, Guangyu Sun, and Jia Di, "An Efficient Run-time Encryption Scheme for Non-volatile Main Memory," in Proceedings of International Conference on Compilers, Architecture, and Synthesis for Embedded Systems (CASES 2013), September 29 - October 4, 2013, Montreal, Canada, pp. 1-10.
Jian Ouyang, Shiding Lin, Zhenyu Hou, Peng Wang, Yong Wang, and Guangyu Sun, "Active SSD Design for Energy-efficiency Improvement of Web-scale Data Analysis," in Proceedings of International Symposium on Low Power Electronics and Design (ISLPED 2013), September 4-6, 2013, Beijing, China, pp. 286-291.
Peng Wang, Guangyu Sun, Tao Wang, Yuan Xie, and Jason Cong, "Designing Scratchpad Memory Architecture with Emerging STT-RAM Memory Technologies," in Proceedings of IEEE International Symposium on Circuits and Systems (ISCAS 2013), May 19-23, 2013, Beijing, China, pp. 1244-1247.
Yi Zhou, Guangyu Sun, Yu Zhang, and Kun Wang, "Asymmetric-access Aware Optimization for STT-RAM Caches with Process Variations," in Proceedings of Great Lakes Symposium on VLSI (GLSVLSI 2013), May 2-3, 2013, Paris, France, pp. 143-148.
Yanfei Lv, Xuexuan Chen, Guangyu Sun, and Bin Cui, "A Probabilistic Data Replacement Strategy for Flash-based Hybrid Storage System," in Proceedings of the 15th Asia-Pacific Web Conference (APWeb 2013), April 4-6, 2013, Sydney, Australia, pp. 360-371.
Jian Ouyang, Lei Jia, Zhenyu Hou, Guangyu Sun, Guangjun Xie, and Yong Wang, "A Practical Implementation of GPU based Accelerator for Deep Neural Networks," in Proceedings of 4th Workshop on SoCs, Heterogeneous Architectures and Workloads (SHAW-4), February 24, 2013, Shenzhen, China.
2012
Yaojun Zhang, Lu Zhang, Wujie Wen, Guangyu Sun, and Yiran Chen, "Multi-level Cell STT-RAM: Is It Realistic or Just a Dream?," in Proceedings of International Conference on Computer-Aided Design (ICCAD 2012), November 5-8, 2012, San Jose, CA, USA, pp. 526-532.
Jishen Zhao, Guangyu Sun, Gabriel Loh and Yuan Xie, "Energy-Efficient GPU Design with Reconfigurable In-Package Graphics Memory," in Proceedings of International Symposium on Low Power Electronics and Design (ISLPED 2012), July 30 - August 1, 2012, Redondo Beach, CA, USA, pp. 403-408.
Guangyu Sun, Yaojun Zhang, Yu Wang, and Yiran Chen, "Improving Energy Efficiency of Write-asymmetric Memories by Log Style Write," in Proceedings of International Symposium on Low Power Electronics and Design (ISLPED 2012), July 30 - August 1, 2012, Redondo Beach, CA, USA, pp. 173-178.
Guangyu Sun and Yuan Xie, "Performance/Thermal Aware Design of 3D-Stacked L2 Caches for CMPs," ACM Transactions on Design Automation of Electronic Systems (ACM TODAES), Vol. 17, No. 2, pp. 13:1-20, April 2012.
Yibo Chen, Guangyu Sun, Qiaosha Zhou, and Yuan Xie, "3DHLS: Incorporating High-Level Synthesis in Physical Planning of Three-Dimensional (3D) ICs," in Proceedings of Design, Automation and Test in Europe (DATE 2012), March 12-16, 2012, Dresden, Germany, pp. 1185-1190.
Guangyu Sun, Cong Xu, and Yuan Xie, "Modeling and Design Exploration of FBDRAM as On-chip Memory," in Proceedings of Design, Automation and Test in Europe (DATE 2012), March 12-16, 2012, Dresden, Germany, pp. 1507-1512.
2011
Guangyu Sun, Eren Kursun, Jude Rivers, and Yuan Xie, "Exploring the Vulnerability of CMPs to Soft Errors with 3D Stacked Non-Volatile Memory," in Proceedings of IEEE International Conference on Computer Design: VLSI in Computers and Processors (ICCD 2011), October 9-12, 2011, Amherst, MA, USA, pp. 366-372.
Publication list before joining CECA (Link)