Minchen Yu

Citeret af

	Alle	Siden 2019
Henvisninger	448	448
h-index	5	5
i10-indeks	5	5

140

105

20192020202120222023202412 49 64 94 132 96

Offentlig adgang

Se alle

5 artikler

0 artikler

tilgængelige

ikke tilgængelige

Baseret på krav i forbindelse med finansiering

Medforfattere

Wei WangThe Hong Kong University of Science and TechnologyVerificeret mail på cse.ust.hk
Chengliang ZhangHong Kong University of Science and TechnologyVerificeret mail på cse.ust.hk
Feng YanUniversity of HoustonVerificeret mail på central.uh.edu
Ruichuan ChenDistinguished Member of Technical Staff @ Bell LabsVerificeret mail på ruichuan.org
Huangshi TianHong Kong University of Science and TechnologyVerificeret mail på cse.ust.hk
Zhifeng JiangThe Hong Kong University of Science and TechnologyVerificeret mail på cse.ust.hk
Bo LiChair Professor at hong kong university of science and technologyVerificeret mail på cse.ust.hk
Yinghao YuEngineer, AlibabaVerificeret mail på connect.ust.hk
Qizhen WengHong Kong University of Science and TechnologyVerificeret mail på connect.ust.hk
Yizhou ShanHuawei CloudVerificeret mail på ucsd.edu
Binhang Yuan（袁彬航）Hong Kong University of Science and TechnologyVerificeret mail på ust.hk
Suyi LiHKUSTVerificeret mail på connect.ust.hk

Følg

Minchen Yu

The Chinese University of Hong Kong, Shenzhen

Verificeret mail på cuhk.edu.cn - Startside

cloud computing serverless computing big data systems machine learning systems


Titel Sortér efter henvisninger Sortér efter årstal Sortér efter titel	Citeret af Citeret af	År
{MArk}: Exploiting cloud services for {Cost-Effective},{SLO-Aware} machine learning inference serving C Zhang, M Yu, W Wang, F Yan 2019 USENIX Annual Technical Conference (USENIX ATC 19), 1049-1062, 2019	279	2019
Gillis: Serving large neural networks in serverless functions with automatic model partitioning M Yu, Z Jiang, HC Ng, W Wang, R Chen, B Li 2021 IEEE 41st International Conference on Distributed Computing Systems …, 2021	49	2021
Following the data, not the function: Rethinking function orchestration in serverless computing M Yu, T Cao, W Wang, R Chen 20th USENIX Symposium on Networked Systems Design and Implementation (NSDI …, 2023	43*	2023
Continuum: A platform for cost-aware, low-latency continual learning H Tian, M Yu, W Wang Proceedings of the ACM Symposium on Cloud Computing, 26-40, 2018	35	2018
Enabling cost-effective, slo-aware machine learning inference serving on public cloud C Zhang, M Yu, W Wang, F Yan IEEE Transactions on Cloud Computing 10 (3), 1765-1779, 2020	27	2020
FaaSwap: SLO-Aware, GPU-Efficient Serverless Inference via Model Swapping M Yu, A Wang, D Chen, H Yu, X Luo, Z Li, W Wang, R Chen, D Nie, ... arXiv preprint arXiv:2306.03622, 2023	5	2023
{CrystalPerf}: Learning to Characterize the Performance of Dataflow Computation through Code Analysis H Tian, M Yu, W Wang 2021 USENIX Annual Technical Conference (USENIX ATC 21), 253-267, 2021	4	2021
CaraServe: CPU-Assisted and Rank-Aware LoRA Serving for Generative LLM Inference S Li, H Lu, T Wu, M Yu, Q Weng, X Chen, Y Shan, B Yuan, W Wang arXiv preprint arXiv:2401.11240, 2024	3	2024
RepBun: Load-balanced, shuffle-free cluster caching for structured data M Yu, Y Yu, Y Zheng, B Yang, W Wang IEEE INFOCOM 2020-IEEE Conference on Computer Communications, 954-963, 2020	3	2020

Systemet kan ikke foretage handlingen nu. Prøv igen senere.

Artikler 1–9

Henvisninger pr. år

Dublerede henvisninger

Flettede henvisninger

Tilføj medforfattereMedforfattere

Følg

Citeret af

Medforfattere