Yiyang Du, Zhanqiu Guo, Xin Ye, Liu Ren, Chenyan Xiong
arXiv preprint 2026
A mid-training framework that selects VLA-aligned data from broader VLM corpora to improve downstream vision-language-action policy learning.
Yiyang Du, Zhanqiu Guo, Xin Ye, Liu Ren, Chenyan Xiong
arXiv preprint 2026
A mid-training framework that selects VLA-aligned data from broader VLM corpora to improve downstream vision-language-action policy learning.
Faria Huq, Zora Zhiruo Wang, Zhanqiu Guo, Venu Arvind Arangarajan, Tianyue Ou, Frank Xu, Shuyan Zhou, Graham Neubig, Jeffrey P. Bigham
arXiv preprint 2026
Introduces CowCorpus, a dataset of real-user web navigation trajectories with interleaved human and agent actions, and trains intervention-aware web agents.
Faria Huq, Zora Zhiruo Wang, Zhanqiu Guo, Venu Arvind Arangarajan, Tianyue Ou, Frank Xu, Shuyan Zhou, Graham Neubig, Jeffrey P. Bigham
arXiv preprint 2026
Introduces CowCorpus, a dataset of real-user web navigation trajectories with interleaved human and agent actions, and trains intervention-aware web agents.

Peiqi Liu, Zhanqiu Guo, Mohit Warke, Soumith Chintala, Chris Paxton, Nur Muhammad Mahi Shafiullah, Lerrel Pinto
IEEE International Conference on Robotics and Automation (ICRA) 2025 pp. 13346-13355
An online dynamic spatio-semantic memory system for open-world mobile manipulation, enabling robots to search for, localize, and recover objects in changing environments.
Peiqi Liu, Zhanqiu Guo, Mohit Warke, Soumith Chintala, Chris Paxton, Nur Muhammad Mahi Shafiullah, Lerrel Pinto
IEEE International Conference on Robotics and Automation (ICRA) 2025 pp. 13346-13355
An online dynamic spatio-semantic memory system for open-world mobile manipulation, enabling robots to search for, localize, and recover objects in changing environments.
arXiv preprint 2024
A context-aware mixture-of-experts extension of Neural Whittle Index Networks for restless multi-armed bandits, with convergence analysis and applications to dynamic decision making.
arXiv preprint 2024
A context-aware mixture-of-experts extension of Neural Whittle Index Networks for restless multi-armed bandits, with convergence analysis and applications to dynamic decision making.