Controllable and Efficient Generative Models
- Zero-shot Video Editing
- Video Diffusion / Video DiT Models
- Efficient Diffusion / DiT Inference
- Model Compression and Quantization
- Structure-preserving Generation
Personal Homepage
Ph.D. Candidate, Electronics Science and Technology · Zhejiang University
My research sits at the intersection of generative modeling and efficient AI systems, with a focus on controllable video generation and editing for diffusion/Video DiT models.
I am a direct Ph.D. student in Electronics Science and Technology at Zhejiang University, advised by Prof. Kejie Huang.
My research lies at the intersection of generative modeling and efficient AI systems. I study controllable video generation and editing for diffusion/Video DiT models, as well as efficiency techniques such as model compression and inference acceleration.
Before focusing on AI algorithms, I was trained in Microelectronics Science and Engineering. This background allows me to approach generative models from a system perspective, bridging algorithm design with efficient hardware-aware deployment.
AAAI 2025
Submitted (NeurIPS 2025, OpenReview)
PIERS 2021 · Best Student Paper Award (Optics and Photonics Section, 5th)
Manuscript under review
Manuscript under review
Developed zero-shot video editing methods for text-to-video diffusion and Video DiT models, with focus on structure preservation, controllability, and efficiency.
Proposed dataset condensation and decoupled quantization methods to improve QAT efficiency and reduce quantization errors for Transformer detectors.
Explored hardware-software co-design with photodetector sensor arrays for low-power, low-latency near-sensor computing.
Ph.D. in Electronics Science and Technology (Direct Ph.D. Program), 2021 - Present
B.Eng. in Microelectronics Science and Technology, 2017 - 2021 (strong academic performance)
Email: cailingling22@zju.edu.cn
Links: GitHub · OpenReview