James Fleckenstein is a Site Reliability Engineer (SRE) specializing in AI clusters at AMD, where they focus on improving the reliability of large-scale AI and HPC cluster environments. With experience dating back to 2014, James has held various roles, including Sr. DevOps Engineer and IT Systems Engineer, broadening their expertise in automation, network design, and systems reliability. They earned a Bachelor’s Degree in Economics and Computer Science from the University of Colorado Colorado Springs. James is adept in using Python for automation and has played a key role in scaling production environments and optimizing workflows for multiple development teams.
This person is not in any teams
This person is not in any offices