Data Engineer
Bayer
Update time: September 30, 2020
Job Description

YOUR TASKS AND RESPONSIBILITIES

 

The primary responsibilities of this role, Data Engineer, are to: 

 

  • Work on development, deployment, and support of systems of data pipelines and computing solutions;
  • Collaborate with interdisciplinary scientists to gather requirements for data pipelines;
  • Optimize algorithms and data workers to scale horizontally and contribute to the development of new algorithms and capabilities that will enable connected pipeline analytics for all pipelines;
  • Work on all aspects of the design, development, validation, scaling and delivery of analytical or pipeline solutions;
  • Collaborate with analytics and discovery teams to design and plan data engineering solutions;
  • Develop and align roadmaps, delivery dates and integration efforts;
  • Provide reliable estimates for large-scale projects;
  • Implement, configure, and maintain critical third-party solutions related to engineering work, including compute environments, BI platforms, and cloud systems;
  • Design and maintain Extract, Transform, Load (ETL) workflows;
  • Integrate proactive strategies and best practices to ensure security of stored data;
  • Design, build, and maintain integrated data solutions such as "data lakes" and "data warehouses";
  • Design and maintain data storage systems and access patterns;
  • Have the ability to operate independently on work assignments with minimal guidance;
  • Be responsible for making decisions that impact business value;
  • Have the ability to work with and set priorities with both on-site partner teams and teams at remote (U.S. and international) sites;
  • Help the team establish and improve processes and methodologies, like Scrum or Kanban, and/or lead the piloting of new ones;
  • Facilitate and participate in code reviews, retrospectives, functional and integration testing and other team activities focused on improving quality of delivery;
  • Ensure the success of digital strategies within Plant Biotechnology by designing, creating, and operating engineered data solutions;
  • Partner with teams in data science, reporting, software engineering, and operations to accelerate strategies by ensuring the availability and quality of required data systems;
  • Have particular focus on bringing value through unique expertise in the creation and optimization of "big data" solutions and the design and execution of distributed computing workflows.

 

WHO YOU ARE

 

Your success will be driven by your demonstration of our LIFE values.  More specifically related to this position, Bayer seeks an incumbent who possesses the following:

 

Required Qualifications:

 

  • Bachelor’s degree in Computer Science, Electrical Engineering, or a closely related field with at least five years of industry experience; or Master’s degree in Computer Science, Electrical Engineering, or a closely related field with at least two years of industry experience; or Doctorate in Computer Science, Electrical Engineering, or a closely related field;
  • Technical knowledge and at least two years of experience in at least two of the following: Structured Query Language (SQL) and NoSQL databases (data warehousing, data modeling, etc.);
  • Experience with big data tools (Spark, Kafka, Flink, Hadoop, etc.);
  • Deep understanding of algorithms and data structures;
  • Experience with tools for authoring workflows and pipelines (Airflow, AWS Step Functions, Kubeflow, etc.);
  • Experience with cloud services (EMR, S3, Redshift, EC2, etc.);
  • Experience with distributed systems;
  • Experience with Python, Java, R, or Scala.

 

Preferred Qualifications:

 

  • Network and database administration experience;
  • Proven systems administration and operations experience;
  • Proven ability to plan, schedule, and deliver quality software using DevOps methodology;
  • Experience in running production cloud systems and diagnosing and fixing problems.
