Site Reliability Engineer II Job at Microsoft Corporation, United States

R01hTHVLVmE1enBXOUJVQks2dCtMVVlGWUE9PQ==
  • Microsoft Corporation
  • United States

Job Description

Interested in a start-up like environment whilst helping extend Azure's core enterprise capabilities for mission critical workloads? Passionate about Cloud Computing technology and driving growth and maturity for very visible and ambitious programs? Then the Azure Specialized team is the right place for you. The Site Reliability Engineering Team (SRE) in Azure Specialized is directly implanted into the product engineering team and you will work closely with engineers, operations, industry vendors and workload partners to ensure mission critical systems continue to work optimally for our customers. Customers around the world depend on us to run their mission critical workload and place their trust in us to deliver the services they need, to work every day. In order to make this work for our growing customer base, we need continual effort to make Azure highly reliable. Join a growing team, owning reliability of Azure Specialized.  Our SRE team, represents a deep investment in improving the availability, reliability, operational efficiency of our systems and services. We are hiring highly motivated site reliability engineers to help drive our Azure special projects focused on enabling global scale offerings. In this role you will help Microsoft and Azure become a world leader at running and operating mission-critical workloads like AI supercomputers, Payment systems for Fintech, in memory HANA databases, all running on dedicated hardware. We're a small, agile, nimble team in Azure focused on bringing the state of the art of mission-critical software into Microsoft and providing bare-metal machines in the Azure Cloud. Come join us and be part of this platform and help us scale massively in the coming years. Microsoft's mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.  **Responsibilities** + Work closely with product engineering to ensure that the right set of service capabilities are being built to manage the service end to end. Examples include deployment systems, diagnostic capabilities and run time operational insights into key service behaviors. + Identify monitoring gaps and drive implementation. + Consume and extend telemetry using queries, dashboards, alerts to monitor reliability. + Be a part of on-call rotation and monitor all customer reported incidents (CRI), triage them, participate in root-cause analysis, track monitoring gaps, help drive work to ensure these incidents are auto-detected in the future and have reduced time to mitigation and resolution. + Coordinate large scale fleet wide maintenance and updates using safe deployment practices. Identify impact of these system changes, coordinate closely with customer facing teams and customers directly to plan maintenance windows and downtime. + Work with customer support team for updated trouble shooting guides. + Work closely with 3rd party HW vendors and appliance providers to ensure quality and reliability of systems provided to Microsoft. **Qualifications** **Required Qualifications:** + 4+ years technical experience in software engineering, network engineering, or systems administration + OR Bachelor's Degree in Computer Science, Information Technology, or related field AND 1+ year(s) technical experience in software engineering, network engineering, or systems administration + OR Master's Degree in Computer Science, Information Technology, or related field. + 2+ years of experience with managing reliability of mission critical workloads which requires coordinating with number of partners and teams + 2+ years of experience with Networking and Network Protocols **Other Qualifications:** + Ability to meet Microsoft, customer and/or government security screening requirements are required for this role. These requirements include, but are not limited to the following specialized security screenings:  + Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud Background Check upon hire/transfer and every two years thereafter. **Preferred Qualifications:** + 5+ years technical experience in software engineering, network engineering, + OR systems administration + OR Bachelor's Degree in Computer Science, Information Technology, + OR related field AND 2+ years technical experience in software engineering, network engineering, + OR systems administration + OR Master's Degree in Computer Science, Information Technology, + OR related field AND 1+ year(s) technical experience in software engineering, network engineering Site Reliability Engineering IC3 - The typical base pay range for this role across the U.S. is USD $100,600 - $199,000 per year. There is a different range applicable to specific work locations, within the San Francisco Bay area and New York City metropolitan area, and the base pay range for this role in those locations is USD $131,400 - $215,400 per year. Certain roles may be eligible for benefits and other compensation. Find additional benefits and pay information here: Microsoft will accept applications for the role until June 3, 2025. \#azurecorejobs Microsoft is an equal opportunity employer. Consistent with applicable law, all qualified applicants will receive consideration for employment without regard to age, ancestry, citizenship, color, family or medical care leave, gender identity or expression, genetic information, immigration status, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran or military status, race, ethnicity, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable local laws, regulations and ordinances. If you need assistance and/or a reasonable accommodation due to a disability during the application process, read more about requesting accommodations ( .

Job Tags

Local area,

Similar Jobs

Hyatt

Dishwasher Job at Hyatt

 ...travelers around the world in a supportive, friendly, and beautiful work environment. As a Dishwasher at HVC, you will assist in...  ...Seasonal-Part Time; 2nd; must be available to work weekends and holidays. Requirement: Valid Driver's License required. Where great... 

Advantive

Finance Manager Job at Advantive

 ...We are seeking a strategic, detail-oriented Corporate Finance Manager to drive high-impact financial analysis, systems improvements...  ...policies, and financial regulations. Partner with Accounting, Treasury, and other departments to align financial strategies... 

DW Simpson

Actuary II - Hybrid Job at DW Simpson

 ...Our client has over 60 years of being a reliable life insurer! They are looking for student actuary with 2+ years of experience to become an Actuary II. This role will primarily work on analyzing & adjusting models along with identifying different trends. The models this... 

Mack Logistics LLC

Non-CDL Package Delivery Driver - Amazon Delivery Service Partner - OVER $600 IN MONTHLY BONUSES Job at Mack Logistics LLC

 ...opportunities. Job Description Company Vehicle Provided! NO CDL REQUIRED! Approximate hours are 11:00 am-9:00 pm. Shifts are 1...  ...Compensation & Benefits ~$22.25 / Hour ~ Paid Training ~ Paid Overtime ~ Over $650 in bonuses available monthly ~... 

NYU Langone Health

Lab Technologist - Per Diem Job at NYU Langone Health

 ...be a place where our exceptionally talented faculty, staff, and students of all identities can thrive. We embrace inclusion and...  ...and proper procedures for handling hazardous substances. Those working in the Histology Department are trained in additional procedures...