NOTE: We are unable to provide visa sponsorship for this role at this time. No candidates requiring visa sponsorship will be considered.
Job Description
Applications Developer 4 – HGBU SRE
Oracle Hospitality Cloud Site Reliability Engineering
The Hospitality Cloud SRE team is focused on maximizing service reliability for our hotel product service offerings across global Oracle data centers. Our team runs with a start-up-like approach, leaving room for creative freedom. We have worked to assemble the smartest people in the industry to build and grow this revolutionary and disruptive team.
We are looking to add new members to this dynamic team and are seeking experienced software developers and engineers to continuously improve reliability for all components within our solution portfolio while we deconstruct the monolith and move to OCI.
About The Job
As part of the SRE team, you will be continually challenged and will contribute directly, every day, to the success of our Oracle Hospitality cloud service offerings, working closely with product and infrastructure partners.
As an SRE, you will solve interesting technical challenges by defining, designing, deploying and troubleshooting key HGBU products, Oracle Cloud services, platforms, and infrastructure, always thinking about reliability, scalability, resilience, security, and performance.
In this role, which is a mix of software, architecture and operational readiness, you will be responsible for the following:
Service Ownership – You will be part of the SRE team, whose mission is the shared full stack ownership of a collection of services and/or technology areas, with our Core Development partners.
Ownership Scope – As an SRE, you will understand the end-to-end configuration, technical dependencies, and overall behavioural characteristics of the production services you own. In partnership with your Core Development partners, you will have the responsibility to ensure that services are designed, delivered and deployed to be mission critical with focus on security, resiliency, scale, and performance. SREs are accountable for the end-to-end performance and operability of the services they own.
Service Design – As Oracle Hospitality Cloud Services continually evolve, you will partner with development teams in defining and implementing improvements in service architecture, both current and future. As an SRE, you will be an expert at articulating technical characteristics of your services and the dependencies between services, and will guide Development teams to engineer and add premier capabilities to the Oracle Cloud service portfolio.
Operations Engineering – You will understand and be able to communicate the scale, capacity, security, performance attributes and requirements of the services you own. This includes every characteristic of your service stack, such as:
- degradation and behaviour under load of the services and their dependencies
- end-to-end tuning needs, optimizing resource utilization, as load patterns fluctuate
- instrumentation and metrics that clearly describe the service behaviours
- scaling requirements and patterns
- resiliency and recoverability, ensuring that backup / restore and disaster recovery capabilities are implemented, tested and maintained
Automation – You will have a clear understanding of automation and orchestration principles, and will be eager to automate, wherever and whenever the possibility arises, while simultaneously eliminating technical debt. Automation must be part of your DNA.
Broad Interests – SREs are a rare mix of sysadmins and software development engineers, and as such are able to understand and explain the effect of product architecture decisions on the ability to run as distributed systems. They are driven by professional curiosity and a desire to develop a deep understanding of their services and the technologies they depend upon.
Ideal Qualification/ Experience
- BS or MS in Computer Science, or equivalent work experience
- Self-taught developers with proven hands-on software development experience are also welcome
- 7+ years developing software in an enterprise or start-up environment
- Insight into Java and JEE internals (class loading, memory management, transaction management, etc.)
- Proficiency in Oracle Fusion Middleware stack
- Experience in the Spring Framework
- Experience with test-driven development
- Knowledge of the JVM, including the ability to monitor performance using profilers, APM tools and Flight Recorder, and to offer guidance on improvement
- Knowledge of full stack, from front end to database and all in between.
- Understanding of Cloud Native technologies and appreciation of the Cloud Native Computing Foundation (CNCF) charter.
- Experience using and implementing a full CI/CD pipeline, from push to release.
- Knowledge of containers, developing software to work in containers, and container orchestration technologies
- Git experience and Git flow knowledge, with the ability to work in a team with many different developers committing code.
- Experience in unit testing (JUnit preferable) and E2E testing
- Understanding of the concept of true observability and the fact that it goes far beyond monitoring, tracing and logging.
- Ensure the best possible performance, quality, and responsiveness of the applications
- Identify bottlenecks and bugs, and devise solutions to these problems
- Experience in developing highly available software that handles any service interruption without customer impact.
- Knowledge of secure coding practices, secure software practices and OWASP, and the ability to help other developers adopt practices such as static code analysis.
- Analyse software components and recommend modifications that will enhance system reliability, availability and scalability.
- Knowledge of Agile methods, and SAFe (Scaled Agile Framework) if possible
Solve complex problems related to infrastructure cloud services and build automation to prevent problem recurrence. Design, write, and deploy software to improve the availability, scalability, and efficiency of Oracle products and services. Design and develop architectures, standards, and methods for large-scale distributed systems. Facilitate service capacity planning and demand forecasting, software performance analysis, and system tuning.
Work with Site Reliability Engineering (SRE) team on the shared full stack ownership of a collection of services and/or technology areas. Understand the end-to-end configuration, technical dependencies, and overall behavioral characteristics of production services. Responsible for the design and delivery of the mission critical stack, with focus on security, resiliency, scale, and performance. Authority for end-to-end performance and operability. Partner with development teams in defining and implementing improvements in service architecture. Articulate technical characteristics of services and technology areas and guide Development Teams to engineer and add premier capabilities to the Oracle Cloud service portfolio. Understand and communicate the scale, capacity, security, performance attributes, and requirements of the service and technology stack. Demonstrate clear understanding of automation and orchestration principles. Act as ultimate escalation point for complex or critical issues that have not yet been documented as Standard Operating Procedures (SOPs). Utilize a deep understanding of service topology and dependencies to troubleshoot issues and define mitigations. Understand and explain the effect of product architecture decisions on distributed systems. Professional curiosity and a desire to develop a deep understanding of services and technologies.
A BS or MS in Computer Science, or equivalent. Strategic and comprehensive knowledge of server hardware and software configuration, networking, standard internet services, scripting languages, cloud computing patterns, technology security and compliance. Comprehensive understanding of load balancing technologies and experience with development in programming languages, databases and big data stores, and container technologies. Work involves defining and documenting technical architecture of complex and highly scalable products. A minimum of 12 years of experience running large-scale, customer-facing web services.
