Wei Ping 平伟

Distinguished Research Scientist
NVIDIA
Santa Clara, CA

Email: wping at nvidia dot com
weiping.thu at gmail dot com

Google Scholar arXiv
LinkedIn Twitter

About me

I am a distinguished research scientist at Nvidia, working on LLMs with a focus on post-training and multimodality.

Prior to this, I worked on speech synthesis and generative models at Baidu’s Silicon Valley AI Lab, which was founded by Andrew Ng, from 2017 to 2020.

Before that, I received my PhD in Computer Science from UC Irvine in 2016, where I worked on machine learning and probabilistic graphical models.

Selected Work

Find the full list on Google Scholar.

* Equal contribution. ⁺ My student intern.

AceReason-Nemotron: Advancing math and code reasoning through reinforcement learning. [checkpoints and data] 2025.
Yang Chen*, Zhuolin Yang*, Zihan Liu, Chankyu Lee, Peng Xu, Mohammad Shoeybi, Bryan Catanzaro, Wei Ping.

Nemotron-H: A family of accurate and efficient hybrid mamba-transformer models. [blog] [checkpoints] 2025.
NVIDIA: Wei Ping (core contributor).

Cosmos-Reason1: From physical common sense to embodied reasoning. [project] [blog] 2025.
NVIDIA: Wei Ping (core contributor).

AceMath: Advancing frontier math reasoning with post-training and reward modeling. [project] [checkpoints and training data] ACL 2025.
Zihan Liu*, Yang Chen*, Mohammad Shoeybi, Bryan Catanzaro, Wei Ping.

NVLM: Open frontier-class multimodal LLMs. [project] [checkpoints] [training code] 2024.
Wenliang Dai*, Nayeon Lee*, Boxin Wang*, Zhuolin Yang*, Zihan Liu, Jon Barker, Tuomas Rintamaki, Mohammad Shoeybi, Bryan Catanzaro, Wei Ping*.

NV-Embed: Improved techniques for training LLMs as generalist embedding models. [checkpoints] ICLR 2025.
Chankyu Lee, Rajarshi Roy, Mengyao Xu, Jonathan Raiman, Mohammad Shoeybi, Bryan Catanzaro, Wei Ping.
[NV-Embed-v1 ranked No. 1 on the MTEB Benchmark as of May 24, 2024. NV-Embed-v2 reclaimed No.1 as of Aug 30, 2024.]
[The multimodal extention MM-Embed (ICLR 2025) based on NV-Embed-v1]

ChatQA: Surpassing GPT-4 on conversational QA and RAG. [project] [checkpoints] NeurIPS 2024.
Zihan Liu, Wei Ping, Rajarshi Roy, Peng Xu, Chankyu Lee, Mohammad Shoeybi, Bryan Catanzaro.

RankRAG: Unifying Context Ranking with Retrieval-Augmented Generation in LLMs. NeurIPS 2024.
Yue Yu⁺, Wei Ping, Zihan Liu, Boxin Wang, Jiaxuan You, Chao Zhang, Mohammad Shoeybi, Bryan Catanzaro.

InstructRetro: Instruction tuning post retrieval-augmented pretraining. [code] [model] ICML 2024.
Boxin Wang⁺, Wei Ping, Lawrence McAfee, Peng Xu, Bo Li, Mohammad Shoeybi, Bryan Catanzaro.

Audio Flamingo: A novel audio language model with few-shot learning and dialogue abilities. [code] ICML 2024.
Zhifeng Kong, Arushi Goel, Rohan Badlani, Wei Ping, Rafael Valle, Bryan Catanzaro.

VILA: On Pre-training for visual language models. CVPR 2024.
Ji Lin, Hongxu Yin, Wei Ping, Pavlo Molchanov, Mohammad Shoeybi, Song Han.

Retrieval meets long context large language models. ICLR 2024.
Peng Xu, Wei Ping, Xianchao Wu, Lawrence McAfee, Chen Zhu, Zihan Liu, Sandeep Subramanian, Evelina Bakhturina, Mohammad Shoeybi, Bryan Catanzaro.

BigVGAN: A universal neural vocoder with large-scale training. [demo] [code] ICLR 2023.
Sang-gil Lee⁺, Wei Ping, Boris Ginsburg, Bryan Catanzaro, Sungroh Yoon.

Factuality enhanced language models for open-ended text generation. [code] NeurIPS 2022.
Nayeon Lee⁺, Wei Ping, Peng Xu, Mostofa Patwary, Pascale Fung, Mohammad Shoeybi, Bryan Catanzaro.

DiffWave: A versatile diffusion model for audio synthesis. [project] ICLR 2021 (Oral).
Zhifeng Kong⁺, Wei Ping, Jiaji Huang, Kexin Zhao, Bryan Catanzaro.

WaveFlow: A compact flow-based model for raw audio. [project] ICML 2020.
Wei Ping, Kainan Peng, Kexin Zhao, Zhao Song.

ClariNet: Parallel wave generation in end-to-end text-to-speech. [demo] ICLR 2019.
Wei Ping, Kainan Peng, Jitong Chen.

Deep Voice 3: Scaling text-to-speech with convolutional sequence learning. [demo][more samples] ICLR 2018.
Wei Ping, Kainan Peng, Andrew Gibiansky, Sercan O. Arik, Ajay Kannan, Sharan Narang, Jonathan Raiman, John Miller.

Neural voice cloning with a few samples. [demo] NeurIPS 2018.
Sercan O. Arik*, Jitong Chen*, Kainan Peng*, Wei Ping*, Yanqi Zhou.

Deep Voice 2: Multi-speaker neural text-to-speech. [demo] NeurIPS 2017.
Sercan O. Arik, Gregory Diamos, Andrew Gibiansky, John Miller, Kainan Peng, Wei Ping, Jonathan Raiman, Yanqi Zhou.