Publications
Underlined names indicate students/researchers advised/co-advised by me.
2024
- [TACO] Potamoi: Accelerating Neural Rendering via a Unified Streaming Architecture. In ACM Transactions on Architecture and Code Optimization, 2024
- [ICCD, Best Paper Nominee] AutoVCoder: A Systematic Framework for Automated Verilog Code Generation using LLMs. In IEEE International Conference on Computer Design, 2024
- [ICCD, Best Paper Nominee] Continuous Energy Efficiency Optimization for Autonomous Embedded Systems Using Shadow Cycles. In IEEE International Conference on Computer Design, 2024
- [TC] Improving Efficiency in Multi-modal Autonomous Embedded Systems through Adaptive Gating. In IEEE Transactions on Computers, 2024
- [RTSS] Jigsaw: Taming BEV-centric Perception on Dual-SoC for Autonomous Driving. In IEEE Real-Time Systems Symposium, 2024
- [TACO] A2: Towards Accelerator Level Parallelism for Autonomous Micromobility Systems. In ACM Transactions on Architecture and Code Optimization, 2024
- [SC] Boosting Data Center Performance via Intelligently Managed Multi-backend Disaggregated Memory. In International Conference for High Performance Computing, Networking, Storage, and Analysis, 2024
- [TPDS] WASP: Efficient Power Management Enabling Workload-Aware, Self-Powered AIoT Devices. In IEEE Transactions on Parallel and Distributed Systems, 2024
- [IWQoS, Best Paper Nominee] CPM: A Cross-layer Power Management Facility to Enable Highly-efficient Real-time AIoT Systems. In IEEE/ACM International Symposium on Quality of Service, 2024
- [VLDB] FlowWalker: A Memory-efficient and High-performance GPU-based Dynamic Graph Random Walk Framework. In Proceedings of the 50th International Conference on Very Large Data Bases, 2024
- [ISCA, Best Paper Nominee] A Tale of Two Domains: Exploring Efficient Architecture Design for Truly Autonomous Things. In Proceedings of the 51st Annual International Symposium on Computer Architecture, 2024
- [ICME] M2SN: Adaptive and Dynamic Multi-modal Shortcut Network Architecture for Latency-aware Applications. In IEEE International Conference on Multimedia and Expo, 2024
- [IPDPS] CoCG: Fine-grained Cloud Game Co-location on Heterogeneous Platform. In Proceedings of the 38th IEEE International Parallel and Distributed Processing Symposium, 2024
- [ICDE] Graph Contrastive Learning for Truth Inference. In IEEE International Conference on Data Engineering, 2024
2023
- [SCIS] Power Synchronization: Taming Massive Diversified Serverless Functions under Power Constraints. In Science China Information Sciences, 2023
- [RTSS] SMG: A System-level Modality Gating Facility for Fast and Energy-Efficient Multimodal Computing. In IEEE Real-Time Systems Symposium, 2023
- [JSAC] Practical Network Modeling Using Weak Supervision Signals for Human-Centric Networking in Metaverse. IEEE Journal on Selected Areas in Communications, 2023
- [SoCC] Not All Resources are Visible: Exploiting Fragmented Shadow Resources in Shared-State Scheduler Architecture. ACM Symposium on Cloud Computing, 2023
- [ECAI] Label Aggregation with Self-Supervision Enhanced Graph Transformer. European Conference on Artificial Intelligence, 2023
- [IISWC] MMBench: Benchmarking End-to-End Multi-modal DNNs and Understanding Their Hardware-Software Implications. IEEE International Symposium on Workload Characterization, 2023
- [Euro-Par, Best Paper Nominee] MMExit: Enabling Fast and Efficient Multi-modal DNN Inference with Adaptive Network Exits. In European Conference on Parallel Processing, 2023
- [ISCA] Architecting Efficient Multi-modal AIoT Systems. In Proceedings of the 50th Annual International Symposium on Computer Architecture, 2023
- [TC] Optimizing GPU-based Graph Sampling and Random Walk for Efficiency and Scalability. IEEE Transactions on Computers, 2023
- [PPoPP] CoWalker: High-Throughput GPU Random Walk with Fine-tuned Concurrent Query Processing (Poster). In ACM SIGPLAN Annual Symposium on Principles and Practice of Parallel Programming, 2023
2022
- [IEEE CAL] Characterizing and Understanding End-to-End Multi-modal Neural Networks on GPUs. IEEE Computer Architecture Letters, 2022
- [IPDPS] Enabling Efficient Request Management through Microservice Level Parallelism. In IEEE International Parallel and Distributed Processing Symposium, 2022
- [FCS] Performance Optimization for Cloud Computing Systems in the Microservice Era: State-of-the-Art and Research Opportunities. Frontiers of Computer Science, 2022
2021
- [TC] Tapping into NFV Environment for Opportunistic Serverless Edge Function Deployment. IEEE Transactions on Computers, 2021
- ArXiv
- [IPDPS] AlphaR: Learning-Powered Resource Management for Irregular, Dynamic Microservice Graph. In IEEE International Parallel and Distributed Processing Symposium (IPDPS), 2021
2020
- [SC] ANT-man: Towards Agile Power Management in the Microservice Era. In International Conference for High Performance Computing, Networking, Storage and Analysis, 2020
- [AAAI] Fine-Grained Machine Teaching with Attention Modeling. In AAAI Conference on Artificial Intelligence, 2020
- [TCC] Integrated Power Anomaly Defense: Towards Oversubscription-Safe Data Centers. IEEE Transactions on Cloud Computing, 2020
2019
- [ICPP] When Power Oversubscription Meets Traffic Flood Attack: Re-Thinking Data Center Peak Load Management. In International Conference on Parallel Processing, 2019
- [ICPP] Unleashing the Scalability Potential of Power-Constrained Data Center in the Microservice Era. In International Conference on Parallel Processing, 2019
2018
- [ICCD, Best Paper Award] Power Grab in Aggressively Provisioned Data Centers: What is the Risk and What Can Be Done About It. In International Conference on Computer Design, 2018
2016
- [ISCA] Power Attack Defense: Securing Battery-Backed Data Centers. International Symposium on Computer Architecture, 2016
- [JCRD] Green Hierarchical Management for Distributed Datacenter Containers. Journal of Computer Research and Development, 2016