Francisco Sánchez Noguera

Senior Data Platform Engineer

📍 Spain (Remote)

Software Engineer with 6+ years of experience in data engineering, platform engineering, and distributed systems. Expert in Python, Spark, Terraform, and Azure — from optimizing jobs that saved clients 100k+ EUR/year to provisioning 300+ data platforms at scale.

Work History

Senior Software Engineer

BASF

January 2025 Present

Internal hire leading platform engineering for a new data platform combining Azure, AKS, Databricks, and infrastructure as code with CDKTF and Terraform.

  • Leading Databricks and Unity Catalog deployment on a new CDKTF-based platform, contributing the largest share of infrastructure constructs
  • Designed zero-downtime state migration process enabling resource imports across 300+ platforms without user impact
  • Resolved a critical production incident within 1 hour by auditing Unity Catalog tables and reverting configuration changes, preventing data processing paralysis across all platforms
  • Defined role-based access control architecture through ADRs and implemented permissions across portal and infrastructure layers
  • Collaborated with architects on network topology design for the new platform
  • Implementing logs and metrics ingestion across multiple clusters using Grafana OSS, configuring Alloy collectors, receivers, and scrape pipelines
TerraformCDKTFTypeScriptPythonKubernetesAzureDatabricksUnity CatalogGitHub Actions

Senior Data Engineer

Capitole Consulting (Client: BASF)

March 2023 December 2024

Consultant leading two major engagements at BASF: delivering a cosmetics analytics data platform, then becoming lead developer for Azure data platform provisioning serving 300+ workspaces.

  • Delivered a 12-month cosmetics analytics release in under 6 months using Medallion architecture with Delta Lake processing
  • Led refactoring of the platform provisioning codebase (Terraform + Python), improving pipeline success rate from 20% to 90% and reducing execution time from 1.5 hours to 30 minutes
  • Reduced platform provisioning from 4 hours to 20 minutes by orchestrating GitHub Actions workflows with Terraform for infrastructure and Python for configuration
  • Built a REST API with FastAPI and Container App Jobs for federated Databricks Account operations
  • Orchestrated Unity Catalog rollout across 300+ Databricks workspaces with only 1% incident rate
  • Led a team of 5 engineers, increasing test coverage from 10% to 85% through code reviews, pair programming, and Python best practices
  • Conducted ~30 technical interviews for Senior Data Engineer positions, resulting in 6 hires
  • Managed the Data Community at Capitole: biweekly newsletter, meetups, and tech talks reaching 100+ data engineers
PythonTerraformAzure DevOpsAzure DatabricksDelta LakeUnity CatalogFastAPIDockerGitHub Actions

Big Data Developer

Kenmei Technologies

January 2022 March 2023

Promoted to the Big Data department, processing massive telecom datasets with Scala and PySpark on Google Dataproc and Azure Databricks.

  • Optimized a PySpark data pipeline from 20 hours to 4 hours (-80%), saving the client 100k+ EUR annually in cluster costs
  • Productionized the Uplink Interference detection algorithm using distributed PySpark, reducing execution from 60 minutes (single node) to 6 minutes (-90%)
  • Built a geospatial classification product with Scala and Sedona, categorizing tiles as indoor/outdoor across entire countries
  • Maintained and extended CallTraces — a complex Scala/Spark real-time trace processing system with a very small contributor base, contributing to the 5G implementation
ScalaPySparkGoogle DataprocAzure DatabricksDelta LakeAzure Blob StorageBigQuerySedona

Junior Innovation Engineer

Kenmei Technologies

June 2019 January 2022

R&D engineer working directly with the CTO on individual projects for telecom clients, building high-performance Python services and signal processing algorithms.

  • Built a real-time drone geolocation service achieving 100+ geolocations/second (16× the client target) using NumPy vectorization and async MQTT
  • Developed a high-throughput TCP trace server processing ~550 Mb/s with Cython and Numba, achieving throughput beyond the network interface capacity
  • Created distributed coverage maps with Dask spatial partitioning, reducing country-level processing from 4 hours to 35 minutes (-85%)
  • Designed an Uplink Interference detection algorithm using FFT and RSRQ/SINR metrics for PIM pattern identification — later productionized at scale in Big Data
  • Received recognition from a major telecom client's CTO for project delivery quality
PythonNumPyNumbaCythonDaskPySparkMQTTAsyncioPostgreSQL

Skills

Languages

Pythonexpert
Scalaadvanced
TypeScriptadvanced
Goadvanced
Bashadvanced

Data Engineering

Apache Spark / PySparkexpert
Delta Lakeexpert
Databricksexpert
Unity Catalogexpert
Daskadvanced
BigQueryadvanced

Cloud & Infrastructure

Azureexpert
Terraform / CDKTFexpert
Kubernetesadvanced
Dockeradvanced
GitHub Actionsadvanced
Azure DevOps Pipelinesadvanced

Backend & APIs

FastAPIadvanced
PostgreSQLadvanced
REST API Designadvanced
Asyncioadvanced

Observability & Quality

Prometheus / Grafanaadvanced
Pytest / Unit Testingexpert
Clean Code / Architectureadvanced
CI/CD Pipelinesadvanced

Certifications

Databricks Certified Data Engineer Professional

Databricks

Issued: 2026-01 · Expires: 2028-01

Verify credential →

Certified Kubernetes Application Developer (CKAD)

The Linux Foundation

Issued: 2025-06 · Expires: 2027-06

Databricks Certified Associate Developer for Apache Spark 3.0

Databricks

Issued: 2024-09

AWS Certified Cloud Practitioner

Amazon Web Services

Issued: 2024-06 · Expires: 2027-06

Microsoft Certified: Azure Fundamentals (AZ-900)

Microsoft

Issued: 2024-03

Education

Master's Degree (Enabling) in Telecommunications Engineering

Universitat Politècnica de València

Graduated: 2019-06-15

Bachelor's Degree in Telecommunications Engineering

Universitat Politècnica de València

Graduated: 2018-06-15

CubeSats Concurrent Engineering Workshop in Satellite Systems Design

European Space Agency (ESA)

Graduated: 2019-06-15

Side Projects

Pizza API & Terraform Provider

REST API built with Go and Gin framework with authentication, plus a custom Terraform provider using the Terraform Plugin Framework for managing resources through the API.

GoGinTerraform Plugin Framework
View Project

Contact