AceReason-Nemotron: Advancing math and code reasoning through reinforcement learning. [checkpoints] 2025.
Yang Chen*, Zhuolin Yang*, Zihan Liu, Chankyu Lee, Peng Xu, Mohammad Shoeybi, Bryan Catanzaro, Wei Ping.
Nemotron-H: A family of accurate and efficient hybrid mamba-transformer models. [blog] [checkpoints] 2025.
NVIDIA: Wei Ping (core contributor).
Cosmos-Reason1: From physical common sense to embodied reasoning. [project] [blog] 2025.
NVIDIA: Wei Ping (core contributor).
NVLM: Open frontier-class multimodal LLMs. [project] [checkpoints] [training code] Technical Report. 2024.
Wenliang Dai*, Nayeon Lee*, Boxin Wang*, Zhuolin Yang*, Zihan Liu, Jon Barker, Tuomas Rintamaki, Mohammad Shoeybi, Bryan Catanzaro, Wei Ping*.
NV-Embed: Improved techniques for training LLMs as generalist embedding models. [checkpoints] ICLR 2025.
[NV-Embed-v1 ranked No. 1 on the MTEB Benchmark as of May 24, 2024. NV-Embed-v2 reclaimed No.1 as of Aug 30, 2024.]
[We built multimodal retriver MM-Embed (ICLR 2025) based on NV-Embed-v1]
Chankyu Lee, Rajarshi Roy, Mengyao Xu, Jonathan Raiman, Mohammad Shoeybi, Bryan Catanzaro, Wei Ping.
ChatQA: Surpassing GPT-4 on conversational QA and RAG. [project] [checkpoints] NeurIPS 2024.
Zihan Liu, Wei Ping, Rajarshi Roy, Peng Xu, Chankyu Lee, Mohammad Shoeybi, Bryan Catanzaro.
InstructRetro: Instruction tuning post retrieval-augmented pretraining. [code] [model] ICML 2024.
Boxin Wang+, Wei Ping, Lawrence McAfee, Peng Xu, Bo Li, Mohammad Shoeybi, Bryan Catanzaro.
Retrieval meets Long Context Large Language Models. ICLR 2024.
Peng Xu, Wei Ping, Xianchao Wu, Lawrence McAfee, Chen Zhu, Zihan Liu, Sandeep Subramanian, Evelina Bakhturina, Mohammad Shoeybi, Bryan Catanzaro.
BigVGAN: A Universal Neural Vocoder with Large-Scale Training. [demo] [code] ICLR 2023.
Sang-gil Lee+, Wei Ping, Boris Ginsburg, Bryan Catanzaro, Sungroh Yoon.
Factuality Enhanced Language Models for Open-Ended Text Generation. [code] NeurIPS 2022.
Nayeon Lee+, Wei Ping, Peng Xu, Mostofa Patwary, Pascale Fung, Mohammad Shoeybi, Bryan Catanzaro.
DiffWave: A versatile diffusion model for audio synthesis. [project] ICLR 2021 (Oral).
Zhifeng Kong+, Wei Ping, Jiaji Huang, Kexin Zhao, Bryan Catanzaro.
WaveFlow: A compact flow-based model for raw audio. [project] ICML 2020.
Wei Ping, Kainan Peng, Kexin Zhao, Zhao Song.
ClariNet: Parallel wave generation in end-to-end text-to-speech. [demo] ICLR 2019.
Wei Ping, Kainan Peng, Jitong Chen.
Deep Voice 3: Scaling text-to-speech with convolutional sequence learning. [demo][more samples] ICLR 2018.
Wei Ping, Kainan Peng, Andrew Gibiansky, Sercan O. Arik, Ajay Kannan, Sharan Narang, Jonathan Raiman, John Miller.
Neural voice cloning with a few samples. [demo] NeurIPS 2018.
Sercan O. Arik*, Jitong Chen*, Kainan Peng*, Wei Ping*, Yanqi Zhou.
Deep Voice 2: Multi-speaker neural text-to-speech. [demo] NeurIPS 2017.
(α-β) Sercan O. Arik, Gregory Diamos, Andrew Gibiansky, John Miller, Kainan Peng, Wei Ping, Jonathan Raiman, Yanqi Zhou.
Decomposition bounds for marginal MAP. NeurIPS 2015.
Wei Ping, Qiang Liu, Alexander Ihler.