01/07/2026 updated


100 % available
Lead/Senior Data Engineer
Tbilisi, Georgia Applied Mathematics to Programming - UNAM
About me
Senior Data Engineer with 10+ years building and operating production data platforms across energy trading, finance, health, insurance, and regulated industries. Hands-on with Spark/PySpark, Airflow, Databricks, Snowflake, AWS & Azure, delivering scalable ETL/ELT pipelines for remote setups.
Controles de AccesoApplication Programming Interfaces (APIs)Apache AirflowAmazon Web ServicesAmazon S3Analytical ProceduresData AnalysisArchitectureAutomationMicrosoft AzureBash ShellBig DataGoogle BigQueryCloud ComputingComputer Programming
Core Data Engineering Skills
Big Data & Processing
Workflow Orchestration
Cloud & Platforms
Data Warehousing & Analytics
Programming & Querying
DevOps & Engineering Practices
Governance, Security & Compliance
Delivery & Contract Experience
Data Engineering & Data Platform Architecture
ETL / ELT Pipeline Design & Implementation
Batch & Streaming Data Processing
Data Modeling (Dimensional, Analytical, Domain-Driven)
Data Quality, Validation & Observability
Performance Optimisation & Scalability
Apache Spark / PySpark
Databricks (Jobs, Delta Lake, Unity Catalog)
Kafka / Event-Driven Pipelines (where applicable)
Large-Scale Distributed Data Processing
Apache Airflow
Cloud-native schedulers (ADF pipelines, managed workflows)
Dependency management, retries, SLAs, monitoring
AWS: S3, Glue, EMR, Athena, Lambda
Azure: Data Factory, Synapse, Data Lake, Databricks
Cloud-native data pipeline design
Snowflake
Azure Synapse
BigQuery / Redshift (conceptual + practical)
SQL performance tuning & optimisation
Python (data pipelines, automation, APIs)
SQL (advanced analytics, transformations, optimisation)
Bash / scripting
CI/CD for data pipelines
Git-based workflows
Docker (basic to intermediate)
Infrastructure-as-Code exposure
Data Governance & Lineage
GDPR / regulated-industry data handling
Access control, auditability, data protection
Remote-first & contract engagements
Stakeholder communication
Agile / Scrum environments
Production support & on-call ownership
Languages
EnglishFluentSpanishNative speaker
Project history
Built and maintained large-scale energy trading data pipelines supporting electricity, gas, and green products. Implemented Databricks-based transformation layers using PySpark. Contributed to pricing, risk(VaR, STRM), and MTM data workflows. Implemented Unity Catalog for data governance and lineage.
Designed and implemented scalable ETL pipelines for energy data platforms. Built asynchronous ingestion services using Azure Durable Functions. Migrated workloads to Azure Databricks and Azure Fabric. Implemented governance and lineage via Unity Catalog. Contributed to pricing and risk data workflows for Green Energy Trading.
Delivered dbt-based, Snowflake-based ETL batch-based pipelines and data models. Built event-driven ingestion pipelines. Implemented dbt transformations and CI/CD workflows. Supported cost-efficient, production-grade data orchestration.