Fangzhou Mu

fmu at wisc.edu | LinkedIn | Scholar | GitHub | CV

I am currently a Senior Deep Learning Algorithms Engineer at NVIDIA. I graduated with a PhD in Computer Sciences from the University of Wisconsin-Madison (advisor: Prof. Yin Li) in December 2023. Previously, I received Master’s degrees in Computer Sciences and Pharmaceutical Sciences from UW-Madison, and a Bachelor’s degree in Biology from Zhejiang University.

I am broadly interested in computer vision and machine learning. I have worked on computational imaging, 3D vision, image generation and manipulation, video understanding, and foundations of deep learning. These days, I combine (Multimodal) Foundation Models to enable (1) visual inference by re-creating the inputs (analysis by synthesis), and (2) content generation and manipulation through visual understanding (synthesis by analysis).

News

Feb 26, 2024 Three papers accepted to CVPR 2024.
Feb 16, 2024 One paper accepted to ICLR 2024.
Jan 16, 2024 Joining NVIDIA as a Senior Deep Learning Algorithms Engineer.

Selected Publications

  1. Towards 3D Vision with Low-Cost Single-Photon Cameras
    Fangzhou Mu*, Carter Sifferman*, Sacha Jungerman, Yiquan Li, Zhiyue Han, Michael Gleicher, Mohit Gupta, and Yin Li
    In Computer Vision and Pattern Recognition (CVPR), 2024
  2. SnAG: Scalable and Accurate Video Grounding
    Fangzhou Mu*, Sicheng Mo*, and Yin Li
    In Computer Vision and Pattern Recognition (CVPR), 2024
  3. FreeControl: Training-Free Spatial Control of Any Text-to-Image Diffusion Model with Any Condition
    Sicheng Mo*, Fangzhou Mu*, Kuan Heng Lin, Yanli Liu, Bochen Guan, Yin Li, and Bolei Zhou
    In Computer Vision and Pattern Recognition (CVPR), 2024
  4. Learned Compressive Representations for Single-Photon 3D Imaging
    In International Conference on Computer Vision (ICCV), 2023
  5. Physics to the Rescue: Deep Non-Line-of-Sight Reconstruction for High-Speed Imaging
    Transactions on Pattern Analysis and Machine Intelligence (TPAMI) / International Conference on Computational Photography (ICCP), 2022
  6. 3D Photo Stylization: Learning to Generate Stylized Novel Views from A Single Image
    Fangzhou Mu, Jian Wang†, Yicheng Wu†, and Yin Li†
    In Computer Vision and Pattern Recognition (CVPR), 2022 (Oral)
  7. SmartAdapt: Multi-Branch Object Detection Framework for Videos on Mobiles
    In Computer Vision and Pattern Recognition (CVPR), 2022
  8. Gradients as Features for Deep Representation Learning
    Fangzhou Mu, Yingyu Liang, and Yin Li
    In International Conference on Learning Representations (ICLR), 2020