Senior Cloud System Debug Engineer
Intel CorporationTaipeiUpdate time: March 8,2022
Job Description

DPEA Cloud Engineering is seeking an experienced system debug engineer with strong background in the cloud/datacenter domain. In this position, you will play a key role in realizing Intel's vision to "unleash the potential of data" and work closely with Intel's biggest customers: Market Makers and Next Wave CSPs to enable and scale out Intel's latest server platforms and technologies. This position will have a chance to touch cutting-edge silicon, server hardware technologies and emerging firmware and OS solutions, and apply them to the CSPs. Responsibilities include: � Mastering the latest Intel hardware and platform features. Tracking the enabling status of those features in silicon, low-level firmware/software and Operating System (e.g. Windows /Linux/VMware) � Debug customer sightings related to platform enabling and customization. Deep dive and root cause silicon enabling and system integration issues to sub system or source code level. � Closely collaborate with Intel silicon enabling, firmware development, and various component teams (e.g., memory, storage, network, power and performance, thermal/mechanical, I/O, etc.) and silicon debug teams to troubleshoot and debug cross-discipline and complex integration issues on server platforms. Drive debug taskforce cross functions to ensure timely issue resolution. � Contribute to the definition of new platforms with software architecture and development teams, support platform bring-up activities, review designs and code changes � Contribute to validation team on improving test plan/method to validate features and verify fixes � Put new technology into practice in the fastest manner, explore all possible alternatives for better solution, and pursue constant improvement on debug methodology and tool � Define and drive the system debug process implementation and ingredient owner engagement and alignment


Qualifications

The candidate should possess a Bachelor of Science in Electrical Engineering, Computer Science or relevant technology (advanced degree is preferred) with 10+ years of applicable industrial experience in the following: � Solid understanding of x86/IA with design experience or working knowledge on CPU, DIMM, Chipset and Platform � Strong low-level debugging skills that enable the root causing of issues across hardware, firmware and OS levels � Experience with ITP and Cscript development. Experience with PythonSV and/or programming with Python. In-depth knowledge of CPU flows and experience on silicon level debug (e.g. AFD - Array Freeze and Dump, and its analysis) are preferred � Solid understanding and hands-on development/validation experience of popular server/PC technologies including PCI/PCI-E, USB, SAS/SATA, i2C/SMBUS, IPMI, BIOS/EFI and DIMM, Storage, Networking, Virtualization, Manageability, Security, RAS, etc. � Understanding of Operating System, Driver, BIOS and firmware fundamentals. Programming skills (e.g. C/C++) that enable the source code level debug and issue fix is highly preferred. � Experience at model-based problem solving that enable the effective investigation and narrow-down of complex issues; � Demonstrated capability to work within a team environment facing fast-changing requirements and complicated stakeholders. Depending on candidate's domain background, following additional skills are needed: � For BIOS domain candidate, good x86 server BIOS development background and debug experience with Intel XDP is a must. Experience of ACPI, PCIe, RAS, security, NVRAM etc is a plus; � For OS domain candidate, experience on Linux kernel debug is a must; Experience of debugging/fixing Linux system power and performance issues is a plus; Experience of Linux kernel upstream development is a plus � For Hardware and I/O domain candidate, good knowledge and 3+ years enabling experience of hardware I/O or devices is a must, including but not limited to Intel RSTe, DCPMM, SSD technology, SAS and/or Networking (IB, 10Gb/40Gb Ethernet etc) ; server baseboard design experience is a plus � For Server Management domain candidate, good understanding and development experience of IPMI, redfish, NCSI, Node Manager and data center management philosophy is expected

Inside this Business Group

The Data Center & Artificial Intelligence Group (DCAI) is at the heart of Intel’s transformation from a PC company to a company that runs the cloud and billions of smart, connected computing devices. The data center is the underpinning for every data-driven service, from artificial intelligence to 5G to high-performance computing, and DCG delivers the products and technologies—spanning software, processors, storage, I/O, and networking solutions—that fuel cloud, communications, enterprise, and government data centers around the world.



Work Model for this Role

This role will be eligible for our hybrid work model which allows employees to split their time between working on-site at their assigned Intel site and off-site.

TWExperienced HireJR0209503TaipeiData Center & Artificial Intelligence Group

Get email alerts for the latest"Senior Cloud System Debug Engineer jobs in Taipei"