Graph query processing using plurality of engines S Elnikety, Y He, S Sakr US Patent 9,053,210, 2015 | 239 | 2015 |
Zero: Memory optimizations toward training trillion parameter models S Rajbhandari, J Rasley, O Ruwase, Y He SC20: International Conference for High Performance Computing, Networking …, 2020 | 206 | 2020 |
Provably-efficient job scheduling for energy and fairness in geographically distributed data centers S Ren, Y He, F Xu 2012 IEEE 32nd International Conference on Distributed Computing Systems, 22-31, 2012 | 145 | 2012 |
Adaptive work-stealing with parallelism feedback K Agrawal, CE Leiserson, Y He, WJ Hsu ACM Transactions on Computer Systems (TOCS) 26 (3), 1-32, 2008 | 134 | 2008 |
The Cilkview scalability analyzer Y He, CE Leiserson, WM Leiserson Proceedings of the twenty-second annual ACM symposium on Parallelism in …, 2010 | 131 | 2010 |
Learning intrinsic sparse structures within long short-term memory W Wen, Y He, S Rajbhandari, M Zhang, W Wang, F Liu, B Hu, Y Chen, ... arXiv preprint arXiv:1709.05027, 2017 | 126 | 2017 |
Deepspeed: System optimizations enable training deep learning models with over 100 billion parameters J Rasley, S Rajbhandari, O Ruwase, Y He Proceedings of the 26th ACM SIGKDD International Conference on Knowledge …, 2020 | 121 | 2020 |
Few-to-many: Incremental parallelism for reducing tail latency in interactive services ME Haque, YH Eom, Y He, S Elnikety, R Bianchini, KS McKinley ACM SIGPLAN Notices 50 (4), 161-175, 2015 | 119 | 2015 |
Predictive parallelization: Taming tail latencies in web search M Jeon, S Kim, S Hwang, Y He, S Elnikety, AL Cox, S Rixner Proceedings of the 37th international ACM SIGIR conference on Research …, 2014 | 109 | 2014 |
Adaptive scheduling with parallelism feedback K Agrawal, Y He, WJ Hsu, CE Leiserson Proceedings of the eleventh ACM SIGPLAN symposium on Principles and practice …, 2006 | 88 | 2006 |
Swayam: distributed autoscaling to meet slas of machine learning inference services with resource efficiency A Gujarati, S Elnikety, Y He, KS McKinley, BB Brandenburg Proceedings of the 18th ACM/IFIP/USENIX middleware conference, 109-120, 2017 | 84 | 2017 |
Zeta: Scheduling interactive services with partial execution Y He, S Elnikety, J Larus, C Yan Proceedings of the Third ACM Symposium on Cloud Computing, 1-14, 2012 | 82 | 2012 |
Performance modeling and scalability optimization of distributed deep learning systems F Yan, O Ruwase, Y He, T Chilimbi Proceedings of the 21th ACM SIGKDD International Conference on Knowledge …, 2015 | 79 | 2015 |
{DeepCPU}: Serving {RNN-based} Deep Learning Models 10x Faster M Zhang, S Rajbhandari, W Wang, Y He 2018 USENIX Annual Technical Conference (USENIX ATC 18), 951-965, 2018 | 77 | 2018 |
Mercury: A memory-constrained spatio-temporal real-time search on microblogs A Magdy, MF Mokbel, S Elnikety, S Nath, Y He 2014 IEEE 30th International Conference on Data Engineering, 172-183, 2014 | 72 | 2014 |
Adaptive parallelism for web search M Jeon, Y He, S Elnikety, AL Cox, S Rixner Proceedings of the 8th ACM European Conference on Computer Systems, 155-168, 2013 | 68 | 2013 |
G-SPARQL: a hybrid engine for querying large attributed graphs S Sakr, S Elnikety, Y He Proceedings of the 21st ACM international conference on Information and …, 2012 | 68 | 2012 |
Delayed-Dynamic-Selective (DDS) prediction for reducing extreme tail latency in web search S Kim, Y He, S Hwang, S Elnikety, S Choi Proceedings of the Eighth ACM International Conference on Web Search and …, 2015 | 60 | 2015 |
Exploiting processor heterogeneity in interactive services S Ren, Y He, S Elnikety, KS McKinley 10th International Conference on Autonomic Computing (ICAC 13), 45-58, 2013 | 58 | 2013 |
Provably efficient online nonclairvoyant adaptive scheduling Y He, WJ Hsu, CE Leiserson IEEE Transactions on Parallel and Distributed Systems 19 (9), 1263-1279, 2008 | 58 | 2008 |