Research
Research Interests
- Efficient machine learning systems (training/inference on parallel/distributed/heterogeneous hardware)
- AI algorithm-hardware Co-design (GPU)
- Effective efficiency algorithms (model compression, data efficiency, parameter-efficient tuning, etc.)
- Large-scale DL/AI applications (Large Language Model, Agent, Image/Video Generation, DLRM, etc.)