Our client is expanding its Advanced Analytics capability to drive data-powered improvements across the Belgian railway ecosystem. You will join a mixed team of data scientists and data engineers to build scalable Azure-based data science platforms, design complex ingestion pipelines, and operationalize ML/GenAI (LLMs) for impactful use cases (punctuality, stations, HR, security, and more).
Collaborate with infrastructure to set up, improve, and maintain scalable data science platforms on Azure.
Design and implement complex data ingestion pipelines from diverse sources.
Work with data scientists to operationalize machine learning models and put LLMs into production for varied use cases.
Tackle strategic and operational analytics use cases that impact railway services.
Collaborate with data governance, security, and performance teams; contribute to a collaborative, innovative culture.
≥3 years in data engineering and MLOps.
Proven experience setting up data science platforms for 1,000+ employee organizations.
≥3 years with cloud services for advanced analytics (Azure used).
Proficiency in PySpark, Python, SQL with ≥3 successful projects.
Experience with CI/CD, Infrastructure as Code (e.g., Terraform), and DevOps (≥3 implemented projects).
Experience maintaining data pipelines for large organizations.
Fluent English (written and spoken).
Ability to set up a data platform for advanced analytics in a large organization.
Experience bringing data science and GenAI models to production and ensuring run & maintenance.
Strong coding skills in the aforementioned technologies.
Hands-on with Infrastructure as Code.
Capacity to contribute to the development of data science use cases; solid communication skills.
N/A