Sayan Deb Sarkar
I'm a 1st-year PhD student at Stanford University in the Gradient Spaces Group,
advised by Prof. Iro Armeni,
affiliated with Stanford Vision Lab (SVL).
Before starting PhD, I was a CS master student at ETH Zürich supervised by Prof. Marc Pollefeys, working on
aligning real-world 3D environments from multi-modal data. I graduated with a Bachelors in
Information Technology from Manipal University, India, where I spent time working on face recognition and medical imaging problems.
In 2020-21, I spent a wonderful time working with Shreyas Hampali,
Sinisa Stekovic and Mahdi Rad at
Prof. Vincent Lepetit's
lab on hand-object pose estimation and monte carlo scene search for 3D scene understanding.
I view them as mentors entering research, and strive to learn from them.
I am always looking for research collaborations, get in touch if you have something relevant.
If you're around the Bay Area, feel free to reach out for a cup of coffee!
Email  / 
CV  / 
Google Scholar  / 
Github  / 
Twitter  / 
LinkedIn
|
|
News
- 2024-09 I joined Stanford University for PhD in Computer Vision!
- 2023-08 I will be a research intern at Qualcomm XR Labs, Amsterdam for the Fall!
- 2023-07 SGAligner accepted to ICCV 2023. First first-author submission, first accept!
- 2022-09 I started Computer Science MSc at ETH Zurich!
- 2022-07 Keypoint Transformer accepted as Oral
at CVPR 2022!
- 2021-05 I started at Mercedes-Benz R & D as a Computer Vision Research Engineer!
---- show more ----
|
Research
My research interests lie at the intersection
of Computer Vision and Machine Learning, specifically in the areas of 3D scene understanding and pose estimation.
|
|
SGAligner : 3D Scene Alignment with Scene Graphs
Sayan Deb Sarkar, Ondrej Miksik, Marc Pollefeys, Daniel Barath, Iro Armeni
arXiv /
Project Page /
Video /
Code
International Conference on Computer Vision (ICCV), 2023
We focus on the fundamental problem of aligning pairs of 3D scene graphs whose overlap can range from
zero to partial and can contain arbitrary changes. We propose SGAligner, the first method for aligning
pairs of 3D scene graphs that is robust to in-the-wild scenarios.
|
|
Keypoint Transformer: Solving Joint Identification in Challenging Hands and Object Interactions for Accurate 3D Pose Estimation
Shreyas Hampali, Sayan Deb Sarkar, Mahdi Rad, Vincent Lepetit
Computer Vision and Pattern Recognition (CVPR), 2022 Oral
arXiv /
Project Page /
Video /
Code
We propose an efficient network architecture for estimating pose of two hands and object during complex interaction. We also release the challenging H2O-3D dataset, which contains two hands interacting with YCB objects.
|
|
Monte Carlo Scene Search for 3D Scene Understanding
Shreyas Hampali*, Sinisa Stekovic*, Sayan Deb Sarkar, Chetan Srinivasa Kumar, Friedrich Fraundorfer, Vincent Lepetit
Computer Vision and Pattern Recognition (CVPR), 2021
arXiv /
Project Page /
Video /
Code
We propose a Monte-Carlo Tree Search (MCTS) based analysis-by-synthesis method to recover complete scene (3D layout+objects) from a RGB-D scan of the environment.
*Equal contribution
|
|
General 3D Room Layout from a Single View by Render-and-Compare
Sinisa Stekovic, Shreyas Hampali, Mahdi Rad, Sayan Deb Sarkar, Friedrich Fraundorfer, Vincent Lepetit
European Conference on Computer Vision (ECCV), 2020
arXiv /
Project Page /
Video /
Code
We propose an analysis-by-synthesis method to estimate a 3D layout of the room - walls, floors, ceilings - from a single perspective view. The method recovers complex non-cubiod layouts by solving a constrained discrete optimization problem.
|
Misc
Workshop Organisation: CV4AEC@CVPR
Conference Review: CVPR, ICCV, ECCV
|
|