Mostofa Patwary

Director of Large Foundational Language Model, Applied Deep Learning Research

Mostofa Patwary is the Director of Large Foundational Language Model, Applied Deep Learning Research at NVIDIA, where they lead a team focused on pretraining large language models and developing innovative tools for data quality and model accuracy. They previously held roles as a Principal Research Scientist and Senior Engineering Manager at NVIDIA, as well as a Senior Research Scientist at Baidu USA. Mostofa holds a PhD in Computer Science from the University of Bergen and has extensive experience in parallel algorithms, high-performance computing, and deep learning across various prestigious institutions. Their notable publications include work on the Megatron-Turing NLG 530B model and advancements in natural language processing and generative dialog modeling.

Location

South San Francisco, United States

Links


Org chart

This person is not in the org chart


Teams

This person is not in any teams


Offices

This person is not in any offices