Senior Data Engineer

Holcim España
Madrid · On-site · Competitive salary · Posted 2 months ago · Internship
🇬🇧 English required

SUMMARY OF THE JOB

We are seeking a seasoned Senior Data Engineer to design, build, and optimize our next-generation data platform. You will be responsible for architecting scalable data pipelines, managing large-scale distributed systems, and ensuring our data infrastructure in AWS and Databricks is robust and efficient. The ideal candidate is a Spark expert with a deep understanding of the AWS ecosystem and a passion for automation.

MAIN ACTIVITIES / RESPONSIBILITIES

  • Pipeline Architecture: Design and implement complex batch and streaming ETL/ELT pipelines using Python, SQL, and Spark to process massive datasets.

  • Cloud Infrastructure: Leverage AWS Data Analytics services to build scalable, secure, and cost-effective data solutions.

  • Orchestration & DevOps: Manage and automate data workflows using Airflow, while utilizing Docker and ECS for containerized application deployment.

  • System Optimization: Monitor and tune the performance of distributed systems (Spark clusters) to ensure high availability and low latency.

  • Infrastructure as Code: Utilize AWS CloudFormation or Terraform to manage data infrastructure, ensuring repeatable and version-controlled environments.

  • Cost Optimization: Monitor and optimize AWS spend by selecting appropriate instance types (Spot vs. On-Demand) and refining data storage strategies.

  • Security & Compliance: Implement IAM roles, bucket policies, and encryption (KMS) to ensure data is secure at rest and in transit.

  • Collaboration: Work within an Agile framework to deliver iterative value, collaborating closely with Data Scientists and Stakeholders to translate business needs into technical reality.

JOB DIMENSIONS

List of direct reports:

  • Up to 2 direct reports and around 15 externals

Key interfaces, stakeholders and relationships:

  • Internal:

    • GDS: product manager, application manager, data & analytics & AI team

    • Country business stakeholders

  • External: third-party vendors

PROFILE REQUIRED

  • Experience: At least 4 years of hands-on experience in active Big Data environments and at least 2 years specializing in Data Analytics within AWS.

    • Compute & Processing:

      • Amazon EMR: Architecting and managing Spark clusters for large-scale distributed processing.

      • AWS Glue: Developing serverless ETL jobs, managing the Data Catalog, and implementing Glue Crawlers.

    • Storage & Warehousing:

      • Amazon S3: Implementing "Data Lake" best practices, including partitioning, compression (Parquet/Avro), and lifecycle policies.

      • Amazon Redshift: Designing star/snowflake schemas and optimizing query performance for high-volume data warehousing.

      • Amazon Athena: Performing ad-hoc SQL analysis directly on S3 data.

      • Open table formats: Experience with Apache Iceberg or Delta Lake.

    • Orchestration & Integration:

      • Amazon MWAA (Managed Workflows for Apache Airflow): Deploying and scaling Airflow environments.

      • AWS Lambda: Building event-driven data triggers and micro-services.

    • Streaming (Advantage): Amazon Kinesis or MSK (Managed Streaming for Kafka) for real-time data ingestion.

  • Core Engineering: Expert-level proficiency in Spark, Python, and SQL.

  • Infrastructure & Tooling: Proven experience with Airflow for orchestration and Docker/ECS for containerization.

  • Databricks & Lakehouse: Good knowledge of Databricks and data mesh architectures. Good understanding of how to implement and maintain Lakehouse data models (bronze / silver / gold layers) using Delta Lake for reliability, ACID transactions, time travel, and schema evolution.

  • Solid software engineering practices: Git, CI/CD for data pipelines, automated testing, code quality and documentation.

  • Communication: Excellent written and oral English communication skills, with the ability to explain complex technical concepts to non-technical audiences.

  • Degree in Computer Science, Engineering, Mathematics or related field, or equivalent practical experience.

PREFERRED "PLUS" QUALIFICATION

  • Real-time Processing: Experience with streaming and distributed messaging applications like Flink and Kafka.

  • Core Tech: Java programming.

  • Machine Learning: Experience industrialising ML use cases.

  • Data Visualization: Experience with QlikView or QlikSense to support BI initiatives.

  • Agile: Experience working in a fast-paced Scrum or Kanban environment.

  • Certifications: AWS Certified Data Engineer - Associate/Professional, AWS Certified Solutions Architect, or Databricks Certified Data Engineer (Associate/Professional).

  • DevOps: Experience with OpenShift, GitHub Actions, or Jenkins for CI/CD of data workflows.

Application managed by Holcim España