Jialin Wu

Research Scientist

Google Deepmind

Biography

I am a research scientist at Google Deepmind. Prior to that, I received my Ph. D. from UTCS advised by Raymond J. Mooney. I received my BEng. degree from the Department of Automation supervised by Prof. Xiangyang Ji at Tsinghua University in 2017.

My most recent research interests is building large multimodal models that (1) are explainable generalists and (2) performan well on geographically (culturally) diversed tasks. I am also interested in few-shot learning, parameter efficient learning and continual learning.

Interests

Fewshot (In-Context) Learning
Language and Vision
Explainable AI

Education

PhD in Artificial Intelligence, 2017 - 2022
UT Austin
BEng in Automation, 2013 - 2017
Tsinghua University

Selected Publications

Please see my google scholar for full publications and preprints.

Yue Zhao, Long Zhao, Xingyi Zhou, Jialin Wu, Chun-Te Chu, Hui Miao, Florian Schroff, Hartwig Adam, Ting Liu, Boqing Gong, Philipp Krähenbühl, Liangzhe Yuan (2024). Distilling vision-language models on millions of videos. CVPR 2024.

PDF

Jialin Wu, Xia Hu, Yaqing Wang, Bo Pang, Radu Soricut (2024). Omni-SMoLA:Boosting Generalist Multimodal Models with Soft Mixture of Low-rank Experts. CVPR 2024 (Highlight Poster).

PDF

Nan Ding, Tomer Levinboim, Jialin Wu, Sebastian Goodman, Radu Soricut (2023). CausalLM Is Not Optimal for In-Context Learning. ICLR 2024.

PDF

Xi Chen, Josip Djolonga, Piotr Padlewski, Basil Mustafa, Soravit Changpinyo, Jialin Wu, Et. Al., (43 Authors) (2023). PaLI-X:On Scaling up a Multilingual Vision and Language Model. CVPR 2024.

PDF

Brianna Zitkovich, Tianhe Yu, Sichun Xu, Peng Xu, Ted Xiao, Fei Xia, Jialin Wu, Et. Al., (54 Authors) (2023). RT-2:Vision-language-action models transfer web knowledge to robotic controling. CoRL 2023.

PDF

See all publications