2026

EmbodiedMidtrain: Bridging the Gap between Vision-Language Models and Vision-Language-Action Models via Mid-training

Yiyang Du, Zhanqiu Guo, Xin Ye, Liu Ren, Chenyan Xiong

arXiv preprint 2026

A mid-training framework that selects VLA-aligned data from broader VLM corpora to improve downstream vision-language-action policy learning.

Modeling Distinct Human Interaction in Web Agents

Faria Huq, Zora Zhiruo Wang, Zhanqiu Guo, Venu Arvind Arangarajan, Tianyue Ou, Frank Xu, Shuyan Zhou, Graham Neubig, Jeffrey P. Bigham

arXiv preprint 2026

Introduces CowCorpus, a dataset of real-user web navigation trajectories with interleaved human and agent actions, and trains intervention-aware web agents.

2025

DynaMem: Online Dynamic Spatio-Semantic Memory for Open World Mobile Manipulation

Peiqi Liu, Zhanqiu Guo, Mohit Warke, Soumith Chintala, Chris Paxton, Nur Muhammad Mahi Shafiullah, Lerrel Pinto

IEEE International Conference on Robotics and Automation (ICRA) 2025, pp. 13346-13355

An online dynamic spatio-semantic memory system for open-world mobile manipulation, enabling robots to search for, localize, and recover objects in changing environments.

2024

ContextWIN: Whittle Index Based Mixture-of-Experts Neural Model For Restless Bandits Via Deep RL

Zhanqiu Guo, Wayne Wang

arXiv preprint 2024

A context-aware mixture-of-experts extension of Neural Whittle Index Networks for restless multi-armed bandits, with convergence analysis and applications to dynamic decision making.
