Have you wondered what it is like to work on the digital production line for one of the largest, most complex software projects on the planet? It’s kind of like a physical production line, except that we use a number of services to bring a massive amount of compute and storage together to compile, link, package, and sign the Windows family of products, and then push those updates and releases out to the world, to over a billion users.
We are the Core OS and Intelligent Edge Servicing and Delivery (CSD) team, working in the Windows Engineering System, and we are responsible for keeping the digital production line running efficiently. We’re on a mission to modernize these business-critical services. This includes optimizing the workflows and building telemetry pipelines to report on progress, alert on and auto-mitigate issues in real time, and detect and categorize both functional and performance regressions.
Our team is multidisciplinary, and its core responsibilities include building tools, defining processes, and tuning telemetry signals to manage our large-scale distributed build, packaging, and release systems. This is a complex juggling act: moving legacy systems to cloud-based elastic services while keeping up the daily rhythm of servicing over a billion users. As a Software Engineer, you will be responsible for all aspects of service execution and life cycle (design, development, quality assurance, production deployment, service availability, reliability, and live site monitoring).
Responsibilities
Primary Responsibilities:
- Design, write and deliver software to improve network automation, availability, reliability, scalability, security, resiliency, and efficiency of highly complex distributed release services.
- Contribute to debugging and troubleshooting the complete stack of build services, drive analysis of issues and outages, and write software and automation to resolve production problems with durable system fixes.
- Design and implement real-time telemetry pipelines and monitoring solutions on world-class big data technologies in Azure to automatically discover problems at their source.
- Design and implement reporting solutions to identify and visualize system trends that inform the need for engineering investment.
- Build auto recovery/mitigation solutions for costly production problems based on reliable telemetry signals.
- Improve release line production efficiency through monitoring, reporting and recovery tools.
- Conduct periodic on-call duties and live site support.
Other Responsibilities include:
- Build high-quality metrics and monitoring logic to enable capacity planning, performance analysis, automated incident creation, and scale-out of distributed, multi-service workflows.
- Create high-quality, actionable signals out of vast amounts of constantly evolving data in real time.
- Participate actively in code reviews and bug/issue triage with the feature teams, and support well-informed decisions toward business and engineering goals.
- Review and influence ongoing design, architecture, standards, and methods for operating software-defined network services and systems; assess design and code changes to drive improvements.
Qualifications
Required Qualifications:
- Bachelor’s degree or higher in Computer Science, Computer Engineering, Mathematics, or another technical/engineering field, OR 3+ years of software development experience.
Preferred Qualifications:
- Understanding of how to use concepts like abstraction and inversion of control to ship testable, reliable, and maintainable code (C#, C/C++, Python, or Java).
- Familiarity with creating metrics and monitoring dashboards and using them to continuously deliver improvements to service performance and reliability.
- Familiarity with SQL and Data Analysis.
- Familiarity with big data technologies like Azure Data Lake, HDInsight, Apache Spark, CosmosDB or their equivalents.
- Capable of technical deep dives into service and systems design, data processing and data movement pipelines, networking, storage, monitoring, security, and capacity planning.
- Great design and problem-solving skills, with a strong emphasis on quality and engineering excellence.
- Strong sense of ownership and accountability to deliver projects and results of increasing technical complexity, and the ability to handle ambiguity.
- Excellent communication skills, both verbal and written, and the ability to work well in a diverse, team-focused, collaborative environment.
- Passionate about sharing your experiences and knowledge, growing and mentoring your peers.
- Continuous learner driven by curiosity and a desire to understand the business impact of the work.
Ability to meet Microsoft, customer, and/or government security screening requirements is required for this role. These requirements include, but are not limited to, the following specialized security screenings:
- Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years thereafter.
Microsoft is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to age, ancestry, color, family or medical care leave, gender identity or expression, genetic information, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran status, race, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable laws, regulations and ordinances. We also consider qualified applicants regardless of criminal histories, consistent with legal requirements. If you need assistance and/or a reasonable accommodation due to a disability during the application or the recruiting process, please send a request via the Accommodation request form.
Benefits/perks listed below may vary depending on the nature of your employment with Microsoft and the country where you work.
