Hexegic is a leading technical consultancy providing agile multi-disciplinary teams to high performing organisations. The company promises exciting, engaging and rewarding projects for those that are keen to develop and build a successful career.
The Role
The Data Pipeline Engineers ensure that our customer work with updated and correct data in our data analytics tool. You are the first to respond to data health failures on key pipelines. You will be comfortable reading code to identify fixes and making changes.
Core Responsibilities
- Maintain build schedules so that pipelines run effectively.
- Setting up and maintaining health checks on different data pipelines
- Respond, Triage, and debug the data pipeline when broken.
- Reading code and writing code changes and/or modifying the monitoring set-up
- Knowing and understanding how to navigate the data pipelines and documentation.
- Following Standard Operating Procedures to contact other teams and data providers when data is incorrect or not received on time.
- Communicating outages with the end users of a data pipeline
What We Value
- Comfortable reading and writing code in Python, Pyspark
-
Basic understanding of Spark and interested in learning the basics of tuning Spark jobs.
- Data pipeline monitoring team members should be able to use and navigate pipeline development tools.
- Experience supporting data integration technologies.