Critical Environment Mechanical Engineer
United States, Texas, San Antonio
Overview

Microsoft's Cloud Operations & Innovation (CO+I) is the engine that powers our cloud services. As a CO+I Critical Environment Mechanical Engineer, you will play a key role in delivering the core infrastructure and foundational technologies for Microsoft's online services, including Bing, Office 365, Xbox, OneDrive, and the Microsoft Azure platform. Our infrastructure comprises a large global portfolio of more than 200 datacenters in 32 countries and millions of servers. Our foundation is built upon and managed by a team of subject matter experts working to support services for more than 1 billion customers and 20 million businesses in over 90 countries worldwide. With environmental sustainability and optimization at the forefront of our datacenter design and operations, we continue to grow and evolve as we meet the ever-changing business demands that hold Microsoft as a world-class cloud provider.

Microsoft's mission is to empower every person and every organization on the planet to achieve more. As employees, we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.
Responsibilities

- Energy Efficiency Optimization - Develop and implement strategies to improve energy efficiency by optimizing airflow management, integrating high-efficiency cooling technologies, and reducing overall energy consumption without compromising performance.
- Incident Management - Provide 24/7 availability for incident response, serving as the primary technical lead for mechanical-related issues within assigned buildings, ensuring rapid resolution and minimal impact on operations.
- Root Cause Analysis - Lead the development, execution, and approval of Root Cause Analyses (RCAs) for mechanical system failures, ensuring follow-up actions are effectively managed to prevent recurrence.
- Availability & Capacity Planning - Serve as the Critical Environment (CE) lead for assigned building pairs, overseeing mechanical infrastructure and ensuring optimal cooling capacity to support rack deployments and operational demands.
- Thermal Management - Continuously monitor and regulate temperature and humidity levels to maintain optimal conditions for data center equipment, ensuring compliance with operational and safety standards.
- Cooling System Upgrades - Assess, recommend, and oversee the implementation of cooling system upgrades and enhancements to improve efficiency and reliability.
- Facility Infrastructure Management - Oversee the installation, maintenance, and replacement of major mechanical infrastructure, including generators, chillers, HVAC systems, pumps, and cooling towers, ensuring peak performance and operational resilience.
- Disaster Recovery & Risk Management - Develop, review, and implement contingency plans for mechanical system failures, ensuring redundancy and emergency cooling measures are in place to maintain continuous operations.
- Environmental Compliance - Collaborate with site operations teams to ensure adherence to environmental regulations, including water usage effectiveness (WUE), energy consumption, emissions control, and refrigerant handling.
- Monitoring & Automation - Manage and optimize Building Management System (BMS) operations, ensuring configurations are updated to enhance efficiency, reliability, and system performance.
- Collaboration with Engineering Teams - Work closely with electrical engineers and site operations teams to ensure a comprehensive and integrated approach to data center operations, focusing on reliability and efficiency.
- Site Transition Program (STP) - Own the transition program for new site acceptance into operations, including working with the DC Field Ops Build Program Manager to ensure all punch list items are captured and lined up for correction and closure.
- Mechanical Sensor & Meter Audit - Conduct regular audits of sensors and meters to identify calibration needs, ensuring accurate data collection for system monitoring and performance analysis, with the support of the site CE team.
- Seasonal Readiness - Lead site preparation for seasonal operational demands, ensuring mechanical systems are optimized for varying environmental conditions, such as winterization protocols and storm readiness measures.
- End of Life (EOL) Management - Drive the planning and execution of mechanical equipment lifecycle transitions, maintaining compliance with EOL standards and coordinating with global and regional teams for timely replacement.
- Preventative Maintenance Oversight - Provide on-site supervision for high-risk maintenance activities, ensuring adherence to safety and operational protocols.
- Change Advisory Board (CAB) Participation - Actively engage in CAB meetings to ensure compliance, procedural accuracy, and the seamless execution of mechanical system changes.
- Global & Regional Support - Serve as a technical liaison for external teams, providing critical site data and supporting regional and global operational initiatives.
- Operational Procedure Review - Perform secondary reviews of maintenance and operational procedures to uphold industry best practices and regulatory standards, and author technical procedures on demand for first-of-kind work, diagnostic work, or critical activities.
- Utility Consumption & Billing Validation - Monitor and validate utility usage data against invoices, ensuring billing accuracy and efficiency.
- Site Drawings Management - Maintain the accuracy and compliance of mechanical system drawings, ensuring proper version control and accessibility.
- Technical Service Bulletin (TSB) Implementation - Acknowledge, prioritize, and execute TSBs based on impact and risk assessment, coordinating with relevant teams for effective resolution.
- Training & Development - Lead mechanical training programs, develop technical content, and support professional development initiatives to enhance team expertise and standardization.
- Embody the Microsoft One culture and values.