Research
|
My current research interests include computer vision and machine learning.
Specifically, I am focusing on Human-centric 3D vision and intelligent agent that can interact with different scenes.
|
Preprints
|
* indicates equal contribution, and ✉ indicates communication author (last author as default).
|
|
InteractAnything: Zero-shot Human Object Interaction Synthesis via LLM Feedback and Object Affordance Parsing
Jinlu Zhang ,
Yixin Chen,
Zan Wang,
Jie Yang,
Yizhou Wang,
Siyuan Huang
CVPR 2025
Project page/
Paper
Abstract
In this work, we propose a novel zero-shot 3D HOI generation framework without training on specific datasets, leveraging the knowledge from large-scale pre-trained models.
|
|
Move as You Say Interact as You Can: Language-guided Human Motion Generation with Scene Affordance
Zan Wang,
Yixin Chen,
Baoxiong Jia,
Puhao Li,
Jinlu Zhang ,
Jingze Zhang,
Tengyu Liu,
Yixin Zhu,
Wei Liang,
Siyuan Huang
CVPR 2024 Highlight
Project page/
Paper/
Code
Abstract
In this work, we introduce a novel two-stage framework that employs scene affordance as an intermediate representation, effectively linking 3D scene grounding and conditional motion generation. Our framework comprises an Affordance Diffusion Model (ADM) for predicting explicit affordance map and an Affordance-to-Motion Diffusion Model (AMDM) for generating plausible human motions.
|
|
PHRIT: Parametric Hand Representation with Implicit Template
Zhisheng Huang, Yujin Chen, Di Kang, Jinlu Zhang , Zhigang Tu
ICCV 2023
Paper
Abstract
We propose PHRIT, a novel approach for parametric hand mesh modeling with an implicit template that combines the advantages of both parametric meshes and implicit representations. Our method represents deformable hand shapes using signed distance fields (SDFs) with part-based shape priors, utilizing a deformation field to execute the deformation.
|
|
MixSTE: Seq2seq Mixed Spatio-Temporal Encoder for 3D Human Pose Estimation in Video
Jinlu Zhang ,
Zhigang Tu ,
Jianyu Yang ,
Yujin Chen ,
Junsong Yuan
CVPR 2022
Paper/
Video/
Code/
Bibtex
Abstract
We propose MixSTE (Mixed Spatio-Temporal Encoder), which has a temporal transformer to separately model the temporal motion of each joint and a spatial transformer to learn inter-joint spatial correlation.
|
|
Uncertainty-Aware 3D Human Pose Estimation from Monocular Video
Jinlu Zhang ,
Yujin Chen ,
Zhigang Tu
ACM MM 2022
Paper
Abstract
We propose an uncertainty-aware method to quantify and optimize the depth and 2D detection input respectively.
|
Experience
|
Research Intern at BigAI, working with Yixin Chen and Siyuan Huang.
|
Sep, 2023 - Feb, 2025 |
Research Intern at Tencent, working with Wei Zhuo.
|
Jan, 2022 - Jan, 2023 |
Cooperation Project of WHU & Tencent in real-time super-resolution.
|
Dec, 2021 - May, 2022
|
Research Intern at Huawei (Suzhou).
|
Jul, 2019 - Oct, 2019
|
Services&Activities
|
Conference Reviewer of:
CVPR, ICCV, ECCV, ICML, NeurIPS.
|
Journal Reviewer of:
TPAMI, TIP.
|
Talk:
I share the talk of our CVPR 2022 paper on VALSE Webinar. [Link]
|
May, 2022 |
Prize
|
National Scholarship of Wuhan University
|
Oct, 2022 |
1st Runner-up in ICCV2021 MMVRAC Challenge Track 2&3. [Link]
|
Sep, 2021 |
Top 10 first-year graduate students scholarship of WHU LIESMARS.
|
Nov, 2020 |
|
Doctor of Computer Applied Technology
CFCS, School of Computer Science, Peking University, China.
Advisor: Prof. Yizhou Wang
2023 - Now
|
|
Master of Computer Applied Technology
State Key Laboratory at Wuhan University, China.
Advisor: Prof. Zhigang Tu and Prof. Junsong Yuan
2020 - 2023
|
|
Bachelor Degree of Computer Science and Technology
Computer Science, Shandong University, China.
2016 - 2020,
with Honours degree
|
|