Hi there! I’m a Research Scientist at ByteDance (San Jose), focusing on AI infrastructure and system efficiency at scale. I received my Ph.D. in Computer Science from City University of Hong Kong. My work centers on designing next-generation data compression techniques (including neural approaches) and building high-performance systems for neural network acceleration and edge computing.
Research interests
Neural & Learned Compression System Efficiency at Scale Hardware-Aware ML Acceleration
Google Scholar: Cited by …
🔥 News
- 2026.04: Joined ByteDance San Jose as Research Scientist.
- 2026.01: BAHOP and ECM accepted at ICASSP 2026.
- 2025.12: Hitcher accepted at KDD 2026, Parallel-SA at DATE 2026.
- 2025.05: WISE accepted at CVPR 2025, PMR at USENIX ATC 2025, Easz and DAWN at DAC 2025.
📔 Blog
Hello, Blog · 开张第一篇
这个博客用来记录压缩、系统效率和 AI 基础设施方向的随手笔记。支持中英双语,评论由 GitHub Discussions 驱动。
📝 Publications
Compression · LLM Efficiency
ECM: Enhancing Compressibility of Quantized Vision Encoder and LLM for Large Vision-Language Models
ICASSP 2026
Corresponding
Accelerating General-purpose Lossless Compression via Simple and Scalable Parameterization
ACM MM 2022
Trace: A Fast Transformer-based General-purpose Lossless Compressor
The Web Conference (WWW) 2022
Systems · Edge AI
DATE
Parallel-SA: Point Cloud Processing Acceleration via Parallel Set Abstraction
DATE 2026
Corresponding
STEM: Streaming-based FPGA Acceleration for Large-Scale Compactions in LSM KV
ICDE 2024
Corresponding
Whole-Slide-Image · Medical Imaging
Advances in Multiple Instance Learning for Whole Slide Image Analysis: Techniques, Challenges, and Future Directions
Preprint 2024
Corresponding
🎖 Honors and Awards
- 2025.11 DAAD Interpretable AI Postdoctoral Fellowship
- 2024.12 NeurIPS Outstanding Reviewer
- 2023.10 TinyML Contest ICCAD 2023, Second Place, San Francisco, USA
- 2023.09 Outstanding Academic Performance Award, City University of Hong Kong
- 2023.07 DAC Young Research Fellow, San Francisco, USA
- 2022.10 EDAthon 2022, Second Place, Hong Kong
🤝 Services
- ML conference referee: NeurIPS 24/25/26, ACM MM 23/24, ICLR 25/26, ICML 25/26, CVPR 25/26, AAAI 26, ARR
- System TPC: USENIX ATC 25, GLSVLSI, RTCSA
- Journal referee: ACM TECS, TMLR, IEEE TKDE
👩🏫 Supervision
- Shashwat Jaiswal, summer intern (Ph.D. student at UIUC), 2026
- Yusheng Zheng, summer intern (Ph.D. student at UCSC), 2026
- Jun Wang, Ph.D. student with Prof. Jason Xue (2024–present)
- Weilan Wang, Ph.D. student with Prof. Jason Xue (2024–2026)
- Dongdong Tang, Ph.D. student with Prof. Jason Xue (2024–2025)