Principal Mobile Site Reliability Engineer
Oracle, Australia. Update time: January 19, 2023
Job Description

Do you have a passion for high-scale services and working with Oracle's most critical customers? We are looking for a Principal Mobile Site Reliability Engineer who enjoys applying cutting-edge advances in technology to complex, mind-blowing-scale enterprise systems that help solve real-world problems. You will contribute to the architecture, design, development, implementation, and operation of critical and complex systems. You are someone who enjoys learning and shaping the newest industry trends and technologies. You foster and contribute to a creative and collaborative culture that delivers results. You embrace ambiguity and enjoy exploring new technologies to deliver robust, scalable solutions.
 
Who are we?    

We are a world-class team of high-caliber mobile assurance services site reliability engineers. We are an inclusive and diverse team with a full spectrum of experience, distributed globally. We have the resources of a large enterprise and the energy of a start-up, working on a critical greenfield software assurance project collaboratively with our cloud and mobile engineering teams. The Software Assurance organisation has the mission to make application security and software assurance, at scale, a reality. We are a dedicated team, leveraging each other’s insights and abilities to produce cutting-edge solutions to difficult problems through automation and CI/CD. Join us to grow your career and create the future of software assurance at scale together.


As a member of our global team, you will:

  • Deploy and operate large-scale mobile build pipelines in a cloud-native environment
  • Improve our offerings through performance and reliability analysis
  • Diagnose and resolve issues in the build pipeline
  • Participate in system design consulting, platform management, and capacity planning
  • Anticipate future needs and turn those concepts into reality
  • Participate in a global break-fix rotation

What you'll bring:

  • Experience in operating CI/CD pipelines that build and deliver mobile applications (Android and iOS)
  • Skill in operating and analysing complex cloud-deployed solutions
  • Familiarity with mobile build tools, platforms, and artifact repositories
  • A history of working with CI/CD-related systems (Kubernetes, Terraform, Jenkins, Maven, Gradle, Ant or similar)
  • A mind focused on systems reliability, automation, and improvement
  • Scripting finesse in languages like Python, Ruby, or Bash
  • Motivation to collaborate with your local and global teams
  • Experience with Linux
  • 5+ years’ experience in Systems Engineering, DevOps or SRE roles running large-scale infrastructure, cloud or web services
  • Eligibility to work in Australia or New Zealand without sponsorship is essential

Nice to Have:

  • Experience in designing CI/CD pipelines that build and deliver mobile applications (Android and iOS)
  • Bonus points for development experience of Android or iOS mobile applications
  • Experience with Oracle Cloud Infrastructure (OCI) or other cloud services
  • Comfort with microservices architecture operating in a Kubernetes environment
  • Integration experience with database systems

What we'll give you:

  • Exposure to mind-blowing, large-scale, cutting-edge systems
  • The resources of a large, global operation with the start-up feel of a small team day to day
  • New skills and competencies working with our vast cloud product offerings
  • Ongoing extensive training and skills development to further your career aspirations
  • Incredible benefits and company perks
  • An organisation filled with smart, enthusiastic, and motivated colleagues
  • Opportunity to impact and improve our systems and delight our customers

Solve complex problems related to infrastructure cloud services and build automation to prevent problem recurrence. Design, write, and deploy software to improve the availability, scalability, and efficiency of Oracle products and services. Design and develop architectures, standards, and methods for large-scale distributed systems. Facilitate service capacity planning and demand forecasting, software performance analysis, and system tuning.

Work with the Site Reliability Engineering (SRE) team on the shared full stack ownership of a collection of services and/or technology areas. Understand the end-to-end configuration, technical dependencies, and overall behavioral characteristics of production services. Responsible for the design and delivery of the mission-critical stack, with a focus on security, resiliency, scale, and performance. Authority for end-to-end performance and operability. Partner with development teams in defining and implementing improvements in service architecture. Articulate technical characteristics of services and technology areas and guide development teams to engineer and add premier capabilities to the Oracle Cloud service portfolio. Understand and communicate the scale, capacity, security, performance attributes, and requirements of the service and technology stack. Demonstrate a clear understanding of automation and orchestration principles. Act as the ultimate escalation point for complex or critical issues that have not yet been documented as Standard Operating Procedures (SOPs). Utilize a deep understanding of service topologies and their dependencies to troubleshoot issues and define mitigations. Understand and explain the effect of product architecture decisions on distributed systems. Professional curiosity and a desire to develop a deep understanding of services and technologies.

A BS or MS in Computer Science, or equivalent. Identifies and implements complex solutions, applying knowledge of server hardware and software configuration, networking, standard internet services, scripting languages, cloud computing patterns, and technology security and compliance. Experience running large-scale customer-facing web services. Understanding of load balancing technologies and experience with development in programming languages, databases and big data stores, and container technologies. Work involves defining and documenting the technical architecture of complex and highly scalable products. A minimum of 8 years' experience running large-scale customer-facing web services.
