Xiaohan Zhang is a research engineer at Databricks Mosaic Research, focusing on developing and maintaining StreamingDataset, which streams training data efficiently to enable training of large generative AI models. Previously, Xiaohan was a machine learning engineer at Salesforce, contributing to content moderation and toxicity detection modules and leading several machine learning product initiatives. Xiaohan was also an artificial intelligence fellow at Insight Data Science, working on improvements to image classifier training. Xiaohan's academic background includes a postdoctoral research position at Stanford University, centered on semiconductor defect control, and a PhD from Carnegie Mellon University, specializing in multiscale modeling of dislocation plasticity. Xiaohan also holds a Master of Engineering from Tsinghua University and a Bachelor of Science in Hydraulic Engineering from China Agricultural University.