Follow
Han Cai
Han Cai
NVIDIA Research
Verified email at nvidia.com - Homepage
Title
Cited by
Cited by
Year
ProxylessNAS: Direct Neural Architecture Search on Target Task and Hardware
H Cai, L Zhu, S Han
ICLR 2019, 2018
22702018
Once for all: Train one network and specialize it for efficient deployment
H Cai, C Gan, T Wang, Z Zhang, S Han
ICLR 2020, 2019
14872019
Product-based neural networks for user response prediction
Y Qu, H Cai, K Ren, W Zhang, Y Yu, Y Wen, J Wang
International Conference on Data Mining (ICDM 2016), 1149-1154, 2016
7882016
Efficient Architecture Search by Network Transformation
H Cai, T Chen, W Zhang, Y Yu, J Wang
AAAI 2018, 2017
760*2017
Long text generation via adversarial training with leaked information
J Guo, S Lu, H Cai, W Zhang, Y Yu, J Wang
AAAI 2018, 2018
6502018
TinyTL: Reduce Activations, Not Trainable Parameters for Efficient On-Device Learning
H Cai, C Gan, L Zhu, S Han
Neural Information Processing Systems (NeurIPS 2020), 2020
363*2020
Hat: Hardware-aware transformers for efficient natural language processing
H Wang, Z Wu, Z Liu, H Cai, L Zhu, C Gan, S Han
ACL 2020, 2020
3182020
Real-time bidding by reinforcement learning in display advertising
H Cai, K Ren, W Zhang, K Malialis, J Wang, Y Yu, D Guo
International Conference on Web Search and Data Mining (WSDM 2017), 661-670, 2017
3142017
Path-Level Network Transformation for Efficient Architecture Search
H Cai, J Yang, W Zhang, S Han, Y Yu
International Conference on Machine Learning (ICML 2018), 2018
2952018
MAgent: A Many-Agent Reinforcement Learning Platform for Artificial Collective Intelligence
L Zheng, J Yang, H Cai, W Zhang, J Wang, Y Yu
NeurIPS 2017 & AAAI 2018 Demo, 2017
2482017
Apq: Joint search for network architecture, pruning and quantization policy
T Wang, K Wang, H Cai, J Lin, Z Liu, H Wang, Y Lin, S Han
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2020
2252020
EfficientViT: Multi-Scale Linear Attention for High-Resolution Dense Prediction
H Cai, J Li, M Hu, C Gan, S Han
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023
218*2023
Mcunetv2: Memory-efficient patch-based inference for tiny deep learning
J Lin, WM Chen, H Cai, C Gan, S Han
Neural Information Processing Systems (NeurIPS 2021), 2021
188*2021
Large-scale Interactive Recommendation with Tree-structured Policy Gradient
H Chen, X Dai, H Cai, W Zhang, X Wang, R Tang, Y Zhang, Y Yu
AAAI 2019, 2018
1632018
Activation Maximization Generative Adversarial Nets
Z Zhou, H Cai, S Rong, Y Song, K Ren, W Zhang, J Wang, Y Yu
ICLR 2018, 2018
1172018
Enable deep learning on mobile devices: Methods, systems, and applications
H Cai, J Lin, Y Lin, Z Liu, H Tang, H Wang, L Zhu, S Han
ACM Transactions on Design Automation of Electronic Systems (TODAES) 27 (3 …, 2022
1162022
Lite Pose: Efficient Architecture Design for 2D Human Pose Estimation
Y Wang, M Li, H Cai, W Chen, S Han
CVPR 2022, 2022
1142022
AutoML for Architecting Efficient and Specialized Neural Networks
H Cai, J Lin, Y Lin, Z Liu, K Wang, T Wang, L Zhu, S Han
IEEE Micro, 2019
62*2019
Network Augmentation for Tiny Deep Learning
H Cai, C Gan, J Lin, S Han
ICLR 2022, 2021
352021
EfficientViT-SAM: Accelerated Segment Anything Model Without Performance Loss
Z Zhang, H Cai, S Han
CVPR 2024 eLVM, 2024
332024
The system can't perform the operation now. Try again later.
Articles 1–20