Huishuai Zhang

Cited by

	All	Since 2019
Citations	2456	2263
h-index	22	20
i10-index	31	28

820

410

205

615

201520162017201820192020202120222023202410 32 64 78 130 158 285 534 808 346

Public access

View all

14 articles

2 articles

available

not available

Based on funding mandates

Co-authors

Tie-Yan LiuDistinguished Scientist, Microsoft Research AI4Science | IEEE Fellow | ACM Fellow | AAIA FellowVerified email at microsoft.com
Yingbin LiangThe Ohio State UniversityVerified email at osu.edu
Wei Chen (陈薇)Institute of Computing Technology, Chinese Academy of SciencesVerified email at ict.ac.cn
Da YuSun Yat-sen UniversityVerified email at mail2.sysu.edu.cn
Yuejie ChiCarnegie Mellon UniversityVerified email at cmu.edu
Di HePeking UniversityVerified email at pku.edu.cn
Shuxin ZhengPrincipal Researcher, Microsoft ResearchVerified email at microsoft.com
Yi ZhouUniversity of UtahVerified email at utah.edu
Janardhan KulkarniMicrosoft Research, RedmondVerified email at cs.washington.edu
Liwei WangProfessor, Peking UniversityVerified email at cis.pku.edu.cn
Lifeng LaiProfessor, University of California, DavisVerified email at ucdavis.edu
Gautam KamathAssistant Professor @ University of Waterloo, Faculty Member @ Vector InstituteVerified email at uwaterloo.ca
Qi MengPrincipal Researcher, Microsoft Research AI4ScienceVerified email at pku.edu.cn
Yin Tat LeePaul G. Allen School of Computer Science & Engineering, University of WashingtonVerified email at uw.edu
Caiming XiongSalesforce ResearchVerified email at salesforce.com
Sergey YekhaninMicrosoftVerified email at microsoft.com
Shlomo Shamai (Shitz)Distinguished Professor, Technion - Israel Institute of TechnologyVerified email at ee.technion.ac.il
Hua WangQualcommVerified email at qti.qualcomm.com

Huishuai Zhang

Peking University

Verified email at pku.edu.cn - Homepage

Deep Learning Optimization Information Theory


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
On layer normalization in the transformer architecture R Xiong, Y Yang, D He, K Zheng, S Zheng, C Xing, H Zhang, Y Lan, ... International Conference on Machine Learning, 10524-10533, 2020	765	2020
A nonconvex approach for phase retrieval: Reshaped wirtinger flow and incremental algorithms H Zhang, Y Liang, Y Chi Journal of Machine Learning Research 18 (141), 1-35, 2017	301*	2017
Differentially private fine-tuning of language models D Yu, S Naik, A Backurs, S Gopi, HA Inan, G Kamath, J Kulkarni, YT Lee, ... arXiv preprint arXiv:2110.06500, 2021	212	2021
Provable non-convex phase retrieval with outliers: Median truncatedwirtinger flow H Zhang, Y Chi, Y Liang International conference on machine learning, 1022-1031, 2016	138*	2016
Block-diagonal hessian-free optimization for recurrent and convolutional neural networks H Zhang, C Xiong US Patent 11,386,327, 2022	94*	2022
Do not let privacy overbill utility: Gradient embedding perturbation for private learning D Yu, H Zhang, W Chen, TY Liu arXiv preprint arXiv:2102.12677, 2021	90	2021
Large scale private learning via low-rank reparametrization D Yu, H Zhang, W Chen, J Yin, TY Liu International Conference on Machine Learning, 12208-12218, 2021	72	2021
Sgd converges to global minimum in deep learning via star-convex path Y Zhou, J Yang, H Zhang, Y Liang, V Tarokh arXiv preprint arXiv:1901.00451, 2019	72	2019
Availability attacks create shortcuts D Yu, H Zhang, W Chen, J Yin, TY Liu Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and …, 2022	47*	2022
How does data augmentation affect privacy in machine learning? D Yu, H Zhang, W Chen, J Yin, TY Liu Proceedings of the AAAI Conference on Artificial Intelligence 35 (12), 10746 …, 2021	44	2021
Adaptive inertia: Disentangling the effects of adaptive learning rate and momentum Z Xie, X Wang, H Zhang, I Sato, M Sugiyama International conference on machine learning, 24430-24459, 2022	43*	2022
Understanding generalization error of SGD in nonconvex optimization Y Zhou, Y Liang, H Zhang Machine Learning, 1-31, 2022	41*	2022
Gradient perturbation is underrated for differentially private convex optimization D Yu, H Zhang, W Chen, TY Liu, J Yin arXiv preprint arXiv:1911.11363, 2019	38	2019
Convergence of distributed stochastic variance reduced methods without sampling extra data S Cen, H Zhang, Y Chi, W Chen, TY Liu IEEE Transactions on Signal Processing 68, 3976-3989, 2020	32	2020
Non-convex low-rank matrix recovery with arbitrary outliers via median-truncated gradient descent Y Li, Y Chi, H Zhang, Y Liang Information and Inference: A Journal of the IMA 9 (2), 289-325, 2020	30	2020
The capacity region of the source-type model for secret key and private key generation H Zhang, L Lai, Y Liang, H Wang IEEE Transactions on Information Theory 60 (10), 6389-6398, 2014	29*	2014
Exploring the limits of differentially private deep learning with group-wise clipping J He, X Li, D Yu, H Zhang, J Kulkarni, YT Lee, A Backurs, N Yu, J Bian arXiv preprint arXiv:2212.01539, 2022	28	2022
-SGD: Optimizing ReLU Neural Networks in its Positively Scale-Invariant Space Q Meng, S Zheng, H Zhang, W Chen, ZM Ma, TY Liu arXiv preprint arXiv:1802.03713, 2018	28	2018
Stabilize deep ResNet with a sharp scaling factor H Zhang, D Yu, M Yi, W Chen, TY Liu Machine Learning 111 (9), 3359-3392, 2022	26*	2022
Geometrical properties and accelerated gradient solvers of non-convex phase retrieval Y Zhou, H Zhang, Y Liang 2016 54th Annual Allerton Conference on Communication, Control, and …, 2016	26	2016

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors