Data Engineer
Closing date for applications: Friday 3rd July 2026
Purpose of Job
To design, develop, and maintain data pipelines, architecture, and analytics platforms that enable real-time decision-making, operational efficiency, and strategic planning across railway functions.
The role supports key business areas including fleet reliability, customer insight, financial modelling, and predictive maintenance
Principal Accountabilities
Build scalable data pipelines using ETL/ELT frameworks to ingest structured and unstructured data from operational and enterprise systems
Develop and optimise data models for performance, clarity, and downstream analytics alignment
Maintain secure and compliant data warehouses, lakes, and marts supporting diverse business domains
Connect railway systems (e.g. engineering logs, depot systems, CRM, ticketing platforms) into unified data environments
Automate data flows and refresh schedules using orchestration tools to support real-time monitoring and reporting
Collaborate with IT and system owners to resolve integration issues, data gaps, or quality inconsistencies
Apply data validation, cleansing, and transformation rules to ensure accuracy and reliability
Support metadata management, lineage tracking, and audit protocols in line with governance standards (e.g. GDPR, ISO 27001)
Collaborate with compliance, safety, and finance teams to ensure integrity of regulatory and commercial reporting outputs
Work closely with data analysts, BI developers, and data scientists to deliver usable datasets, feature engineering, and model-ready structures
Assist in the development of dashboards, forecasting tools, and machine learning platforms
Contribute to rail-specific use cases including fault prediction, delay analysis, customer segmentation, and energy optimisation
Manage cloud and on-premises data environments using platforms such as Azure, AWS, GCP, and Snowflake
Develop using programming languages and tools
Support API development, containerisation and CI/CD pipelines as needed.
Experience, Knowledge, Qualifications & Training
Essential:
Proficiency in SQL and Python, along with familiarity with cloud-based data platforms
Strong understanding of ETL processes, data architecture, and performance optimisation techniques
Exposure to transport, logistics, or infrastructure sectors with IoT or telematics integration
Experience working with large-scale operational data sets (e.g. reliability logs, telemetry feeds, maintenance systems)
Diversity & Inclusion
At CrossCountry, we believe our people should reflect the diverse communities we serve. Inclusion isn’t just part of our strategy — it’s central to our success. We’re committed to creating a workplace where everyone feels valued, respected, and supported to be their best.
Click Apply Now to take the next step in your journey with CrossCountry Trains.
Did you know you can set up alerts for new job opportunities?
Simply navigate to 'My Job Alerts' in the Jobs Hub to get started.