Join Microsoft as a Site Reliability Engineer II to enhance the performance and security of Microsoft Teams services. Collaborate with engineering teams to design and implement scalable solutions in a dynamic environment.
Are you interested in working for one of the most exciting teams at Microsoft? Then look no further than Microsoft Teams SRE team. You will be building solutions that leverage state-of-the-art technologies to deliver the next evolution in collaboration and teamwork. What is a Software Reliability Engineer (SRE)? SRE is what you get when you treat operations as if it is a software engineering problem. Our mission is to improve the availability, latency, performance, and security of the Microsoft Teams services. Like traditional operations, we keep important revenue-critical systems up and running, even when natural disasters, bandwidth outages and configuration problems occur. Unlike traditional operations groups, we identify and address these software problems directly through software improvements, innovative technologies, and systems automation. As a Site Reliability Engineer II in Teams, you will provide leadership, direction and accountability for networking, infrastructure design, end to end implementation and security for Teams services. Proficient collaboration skills will be required working closely with other engineering teams to ensure services/systems are highly stable and performant and meet the expectations of internal stakeholders and external customers and users. This opportunity will allow you to learn what it takes to deploy and run software as a 24x7 enterprise grade cloud service, hone your security expertise and become an expert in webservices optimization. Microsoft's mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond. Responsibilities: • Design, write and deliver software to improve the availability, scalability, latency, and efficiency of Microsoft's Identity services. • Help define the next generation of Teams services infrastructure and routing design and drive its implementation. • Troubleshoot complex infrastructure and network issues and proactively implement methods to reduce reoccurrence and impact of future incidents. • Develop code, scripts, systems, or platforms that automate complex operations processes (e.g., monitoring, alerting, routing, debugging) at scale. • Identify security issues and recommends potential mitigation strategies to address underlying causes. • Develops security guidance and models to address issues and to contribute to the definition of best practices. • Suggest and drives appropriate guidance, models, response, and remediation for issues. • Participate in regular on-call rotations and share details related to incidents and their resolution through post-mortem reports and regular review meetings. Qualifications: Required Qualifications: Master's Degree in Computer Science, Information Technology, or related field AND 1+ year(s) technical experience in software engineering, network engineering, or systems administration • OR Bachelor's Degree in Computer Science, Information Technology, or related field AND 2+ years technical experience in software engineering, network engineering, or systems administration OR equivalent experience. • Fundamental understanding of TCP/IP concepts, load balancing, CDN, ACL, routing, TLS. IP network analysis and performance and application issues using standard tools. • Fundamental understanding of security practices for native applications, web applications, distributed and database systems. • Understanding of security issues for large scale cloud services and network infrastructures. Other Requirements: Ability to meet Microsoft, customer and/or government security screening requirements are required for this role. These requirements include but are not limited to the following specialized security screenings: • Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years thereafter. Preferred Qualifications: Master's Degree in Computer Science, Information Technology, or related field AND 3+ years technical experience in software engineering, network engineering, or systems administration OR Bachelor's Degree in Computer Science, Information Technology, or related field AND 5+ years technical experience in software engineering, network engineering, or systems administration. • 2+ years technical experience running large-scale service on Linux. • 3+ years experience in scripting languages such as bash, python, and PowerShell, or compiled languages such as C#. • Demonstrated solid working knowledge on cloud computing / Azure / AAD. • Experience with with Docker and Kubernetes. Site Reliability Engineering IC3 - The typical base pay range for this role across the U.S. is USD $100,600 - $199,000 per year. There is a different range applicable to specific work locations, within the San Francisco Bay area and New York City metropolitan area, and the base pay range for this role in those locations is USD $131,400 - $215,400 per year. Certain roles may be eligible for benefits and other compensation. Find additional benefits and pay information here: https://careers.microsoft.com/us/en/us-corporate-pay Microsoft will accept applications for the role until September 5,2025 Microsoft is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to age, ancestry, color, family or medical care leave, gender identity or expression, genetic information, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran status, race, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable laws, regulations and ordinances. We also consider qualified applicants regardless of criminal histories, consistent with legal requirements. If you need assistance and/or a reasonable accommodation due to a disability during the application or the recruiting process, please send a request via the Accommodation request form . Benefits/perks listed below may vary depending on the nature of your employment with Microsoft and the country where you work. #sre #teams #microsoftteams #security #sitereliability #production #network
RNR IT Solutions, Inc. is seeking a DevOps / Site Reliability Engineer (SRE) in Dallas, Texas, to design and maintain CI/CD pipelines and manage cloud infrastructure. The ideal candidate will have extensive experience in DevOps practices and cloud technologies.
Join Microsoft as a Site Reliability Engineer II to enhance the performance and security of Microsoft Teams services. Collaborate with engineering teams to design and implement scalable solutions in a dynamic environment.
UBS Financial Services Inc. is seeking a Site Reliability Engineer (Azure) to support communication and marketing applications in Chicago, Illinois. The role involves incident management, automation, and ensuring the reliability of digital products.
UBS is seeking a Senior DevOps / Cloud Site Reliability Engineer to enhance application deployment and monitoring in cloud environments. The role involves collaborating with cross-functional teams to ensure system stability and performance.
The Boston Red Sox are seeking a DevOps and Site Reliability Engineer to enhance their Baseball Operations systems through cloud operations and automation. This full-time hybrid role focuses on Azure infrastructure and CI/CD pipeline development.
Paradyme Management is seeking a DevOps/Site Reliability Engineer (SRE) with Secret Clearance to manage and optimize Kubernetes clusters and cloud infrastructure. The role involves collaboration across teams to ensure reliability and scalability of AI solutions.
RNR IT Solutions, Inc. is seeking a DevOps / Site Reliability Engineer (SRE) in Dallas, Texas, to design and maintain CI/CD pipelines and manage cloud infrastructure. The ideal candidate will have extensive experience in DevOps practices and cloud technologies.
Join Microsoft as a Site Reliability Engineer II to enhance the performance and security of Microsoft Teams services. Collaborate with engineering teams to design and implement scalable solutions in a dynamic environment.
UBS Financial Services Inc. is seeking a Site Reliability Engineer (Azure) to support communication and marketing applications in Chicago, Illinois. The role involves incident management, automation, and ensuring the reliability of digital products.
UBS is seeking a Senior DevOps / Cloud Site Reliability Engineer to enhance application deployment and monitoring in cloud environments. The role involves collaborating with cross-functional teams to ensure system stability and performance.
The Boston Red Sox are seeking a DevOps and Site Reliability Engineer to enhance their Baseball Operations systems through cloud operations and automation. This full-time hybrid role focuses on Azure infrastructure and CI/CD pipeline development.
Paradyme Management is seeking a DevOps/Site Reliability Engineer (SRE) with Secret Clearance to manage and optimize Kubernetes clusters and cloud infrastructure. The role involves collaboration across teams to ensure reliability and scalability of AI solutions.
RNR IT Solutions, Inc. is seeking a DevOps / Site Reliability Engineer (SRE) in Dallas, Texas, to design and maintain CI/CD pipelines and manage cloud infrastructure. The ideal candidate will have extensive experience in DevOps practices and cloud technologies.
Join Microsoft as a Site Reliability Engineer II to enhance the performance and security of Microsoft Teams services. Collaborate with engineering teams to design and implement scalable solutions in a dynamic environment.
Join Microsoft as a Site Reliability Engineer II to enhance the performance and security of Microsoft Teams services. Collaborate with engineering teams to design and implement scalable solutions in a dynamic environment.