AMVETS Jobs

Job Information

Microsoft Corporation Software Engineer (Networking) in Shanghai, China

The Live Site Engineering team in Microsoft AI is dedicated to ensuring the reliable operation of production services. Our team collaborates closely with service owner teams worldwide to conduct live site readiness reviews, manage incidents, and measure and improve production service quality. You will have the opportunity to work on cutting-edge technology and solutions in Microsoft Copilot, Bing Search, Ads, Maps, the Edge browser, and more, serving hundreds of millions of users and delivering high availability and performance to customers.

We are seeking an engineer to work with MS AI internal teams and Azure on a graceful outage handling project.

Responsibilities

  • Collaborate proactively with service owners to assess Business Continuity Planning (BCP) states and analyze potential impacts resulting from data center or deployment unit losses.

  • Engage in disaster recovery drills to evaluate service impacts and recovery processes and to identify areas for enhancement.

  • Monitor and track service metrics for Microsoft AI across global data centers, encompassing services such as Bing, F&V, Edge, Ads, Skype, and more.

  • Contribute to internal service development focused on enhancing service quality monitoring, debugging capabilities, and status audits.

Qualifications

  • Bachelor's Degree in Computer Science or related technical field AND 1+ year(s) of technical engineering experience in infrastructure or networking administration/support

  • OR Master's Degree in Computer Science or related technical field with proven experience in infrastructure or networking administration/support

  • OR equivalent experience.

  • Dedication to acquiring a deep understanding of the complexity, insights, and best practices arising from data center infrastructure tasks, network administration tasks, and incident resolutions in order to enhance system, platform, and network operations. Proactive in proposing potential solutions to address and prevent recurring challenges, ensuring they are communicated effectively to the team.

  • Foundational understanding of distributed systems design, cloud technology layers and components, and basic dependencies at scale. Ability to contribute to the codebase that defines components or features of systems or cloud technologies, enhancing the reliability and operability of supported products under the guidance of experienced engineers.

  • Proficiency in using existing tools to troubleshoot issues affecting availability, reliability, performance, and efficiency, with the support of other engineers. Skilled in collecting, classifying, and analyzing data across a range of metrics to inform engineering decisions and improve product features.

  • Commitment to developing an understanding of key learnings, insights, and best practices from code/design reviews, incident drills, debriefs, and regular meetings to improve system, platform, and product development and operations. Eagerness to suggest potential solutions to resolve and prevent recurring issues, bringing them to the attention of the team.

  • Experience in SRE or a related area is highly preferred; a CCNA or higher certification is a plus.

Microsoft is an equal opportunity employer. Consistent with applicable law, all qualified applicants will receive consideration for employment without regard to age, ancestry, citizenship, color, family or medical care leave, gender identity or expression, genetic information, immigration status, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran or military status, race, ethnicity, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable local laws, regulations and ordinances. If you need assistance and/or a reasonable accommodation due to a disability during the application process, read more about requesting accommodations (https://careers.microsoft.com/v2/global/en/accessibility.html).
