Research
I'm broadly interested in deep learning, neural networks and its applications. In the past, I have worked on projects involving LLMs and multi-modal LLMs.
|
|
SegLLM: Interactive Multi-Round Reasoning Segmentation
Xudong Wang*,
Shaolun Zhang*,
Shufan Li*,
Konstantinos Kallidromitis,
Kehan Li,
Yusuke Kato,
Kazuki Kozuka,
Trevor Darrell
[ArXiv]
|
[Project Page]
Interactive multi-round reasoning segmentation model that enhances LLM-based segmentation by exploiting conversational memory of both visual and textual outputs.
|
|
Training Dynamics of Reversal Curse
Hanlin Zhu*,
Baihe Huang*,
Shaolun Zhang,
Michael Jordan,
Jiantao Jiao,
Yuandong Tian,
Stuart Russell
[ArXiv]
Theoretical analysis and empirical results demonstrating asymmetry in model weights leading to the Reversal Curse, where LLMs trained on "A is B" fails to learn "B is A".
|
|