Principal Software Engineer
Microsoft | |
United States, Washington, Redmond | |
Nov 20, 2024 | |
OverviewMicrosoft is a company where passionate innovators come to collaborate, envision what can be and take their careers further. This is a world of more possibilities, more innovation, more openness, and the sky is the limit thinking in a cloud-enabled world. The AI Platform organization at Microsoft builds the end-to-end Azure AI stack/PaaS and is core to Azure's innovation and differentiation, as well as all of Microsoft's flagship products, from Office to Teams, to Xbox. We are looking for a Principal Software Engineer to join the team building Azure OpenAI, Azure ML, Cognitive Services, and the global Azure AI infrastructure for running the largest AI workloads on the planet. We do not just value differences or different perspectives. We seek them out and invite them in so we can tap into the collective power of everyone in the company. As a result, our customers are better served. Within AI Platform, the Azure ML team enables data scientists and developers to quickly and easily build, train, deploy, manage, and consume machine learning models. Microsoft's mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond. In alignment with our Microsoft values, we are committed to cultivating an inclusive work environment for all employees to positively impact our culture every day.
ResponsibilitiesEngage directly with key partners to understand and implement complex routing, rate limiting and load balancing capabilities for state-of-the-art LLMs and Diffusion models to optimize for utilization, throughput and latencyWork with cutting edge hardware stacks and a fast-moving software stack to deliver best of class inference and optimal costAnticipate, identify, assess, track, and mitigate project risks and issues in a fast-paced start up like environmentMotivated to build constructive and effective relationships and solve problems collaborativelySupport production inference SLAs for core AI scenarios on one of the largest GPU fleets in the worldOtherEmbody our culture and values |