Sizhuo Ma
Research Scientist, Snap Research
I am a Senior Research Scientist at Snap Research. My current research focuses on high-fidelity image & video restoration and enhancement, diffusion models, and efficient vision-language models. Please check my Google Scholar page for a complete list of publications.
I received my PhD in Computer Sciences at UW-Madison, advised by Professor Mohit Gupta. My PhD work focused on solving geometry and motion-related computer vision problems using novel computational cameras, including light field cameras, structured light, and single-photon cameras.
Research
Towards Redundancy Reduction in Diffusion Models for Efficient Video Super-Resolution
arXiv, 2025

GuideSR: Rethinking Guidance for One-Step High-Fidelity Diffusion-Based Super-Resolution
arXiv, 2025
RobustSAM: Segment Anything Robustly on Degraded Images
CVPR 2024, Highlight
Quanta Burst Photography
SIGGRAPH 2020 / ACM Transactions on Graphics