12/26/2024 updated
IS
100 % available
Data Engineer with Cloud Expertise
Boulogne, France Master's Degree in Business Intelligence
Java (Programming Language)JavaScript (Programming Language)AirflowAmazon Web ServicesBig DataBigQueryCloud ComputingCloud Computing SecurityDatabasesContinuous IntegrationData IntegrationDevOpsAmazon DynamoDBElasticsearchInfrastructure ManagementPython (Programming Language)PostgreSQLMongoDBMySQLOracle ApplicationsLogstashAnsibleTalendWorkflowsData/Record LoggingData ProcessingApache SparkPysparkKibanaTerraformDockerElk StackJenkinsProgramming Languages
Cloud Computing
Expertise in GCP and AWS cloud platforms, with certifications in GCP Professional Cloud Security Engineer and AWS Certified Solutions Architect.
Data Engineering
Proficiency in designing and implementing scalable data pipelines, working with various databases including BigQuery, MySQL, Oracle, PostgreSQL, MongoDB, and DynamoDB.
Programming Languages
Strong skills in Python, Java, PySpark, JavaScript, and Scala for developing data solutions and applications.
DevOps & CI/CD
Experience with Jenkins for continuous integration and deployment, and Docker for containerization.
Big Data Technologies
Proficiency in Apache Spark for large-scale data processing and analysis.
Data Orchestration
Skilled in using Cloud Composer (Airflow) and Talend for workflow management and data integration.
Infrastructure as Code
Expertise in Terraform and Ansible for automating infrastructure provisioning and management.
Monitoring and Logging
Experience with ELK stack (Elasticsearch, Logstash, Kibana) for monitoring and log analysis.
Expertise in GCP and AWS cloud platforms, with certifications in GCP Professional Cloud Security Engineer and AWS Certified Solutions Architect.
Data Engineering
Proficiency in designing and implementing scalable data pipelines, working with various databases including BigQuery, MySQL, Oracle, PostgreSQL, MongoDB, and DynamoDB.
Programming Languages
Strong skills in Python, Java, PySpark, JavaScript, and Scala for developing data solutions and applications.
DevOps & CI/CD
Experience with Jenkins for continuous integration and deployment, and Docker for containerization.
Big Data Technologies
Proficiency in Apache Spark for large-scale data processing and analysis.
Data Orchestration
Skilled in using Cloud Composer (Airflow) and Talend for workflow management and data integration.
Infrastructure as Code
Expertise in Terraform and Ansible for automating infrastructure provisioning and management.
Monitoring and Logging
Experience with ELK stack (Elasticsearch, Logstash, Kibana) for monitoring and log analysis.
Languages
FrenchNative speaker
Project history
PROJECT : Data Warehouse Migration
Domain : Retail
Objective: Migration of an on-premise data warehouse to Google Cloud Platform (GCP)
Achievements:
Domain : Retail
Objective: Migration of an on-premise data warehouse to Google Cloud Platform (GCP)
Achievements:
- Conducted identification and analysis of storage architecture and data processing needs.
- Designed and implemented scalable data pipelines and data models to support business requirements across multiple systems, ensuring data integrity and optimal performance for enterprise data warehouses.
- Coordinated prototype reviews to validate that solutions met business requirements and adhered to service-level agreements.
- Deployed and maintained data pipelines in the cloud environment, ensuring reliability and scalability.
- Authored comprehensive application documents to facilitate knowledge transfer within and beyond the entity.
- Deliverables: code, documentation, code review
- Source code
- Documentation
- Code reviews
- Programming Languages: Python, SQL, Terraform
- Cloud Platform: Google Cloud Platform (GCP)
- Orchestration Tool: Apache Airflow
Worked on various data projects in the real estate domain, including building ETL workflows, data monitoring, and developing REST APIs for asset management.
Automated data acquisition, processing, and storage pipelines, implemented reporting tools, and set up API gateways for database management.