03/09/2026 updated


100 % available
Software Data Engineer - Cloud & Big Data Specialist
Tunis, Tunisia
Worldwide
Engineering degree in Information and communication technologiesAbout me
Palantir Foundry Data Engineer specializing in ingesting, modelling, and large-scale data pipelines using PySpark, SQL, and distributed compute. Skilled in translating business requirements into robust data contracts and reliable pipelines powering analytics, automation, and AI use cases.
Artificial IntelligenceApache AirflowAmazon Web ServicesData AnalysisBusiness IntelligenceBig DataGoogle BigQueryC++ (Programming Language)Cloud ComputingCode ReviewInformation EngineeringData VisualizationDevOpsElasticsearchInfrastructure Management
Data Engineering & Pipeline Development
Advanced expertise in designing and implementing scalable data pipelines using Apache Spark, Kafka, and Airflow for real-time data processing and orchestration
Machine Learning & AI Implementation
Comprehensive knowledge in machine learning algorithms, deep learning models, MLOps deployment, and AI-driven analytical solutions
Cloud Technologies & Big Data
Proficient in cloud platforms including AWS, Google Cloud, containerization with Docker, and big data technologies for distributed processing
Programming Languages
Strong proficiency in Python, C++, TypeScript, SQL for data analysis and software development
Database Management
Experience with PostgreSQL, MySQL, BigQuery, Redis, and various database optimization techniques
Data Visualization & Analytics
Skilled in creating interactive dashboards using Tableau, Kibana, Elasticsearch, and business intelligence tools
DevOps & Infrastructure
Knowledge of CI/CD pipelines, Terraform, Git, code reviews, and virtual machine management
Web Technologies
Understanding of web services, API development, and modern web application architectures
Advanced expertise in designing and implementing scalable data pipelines using Apache Spark, Kafka, and Airflow for real-time data processing and orchestration
Machine Learning & AI Implementation
Comprehensive knowledge in machine learning algorithms, deep learning models, MLOps deployment, and AI-driven analytical solutions
Cloud Technologies & Big Data
Proficient in cloud platforms including AWS, Google Cloud, containerization with Docker, and big data technologies for distributed processing
Programming Languages
Strong proficiency in Python, C++, TypeScript, SQL for data analysis and software development
Database Management
Experience with PostgreSQL, MySQL, BigQuery, Redis, and various database optimization techniques
Data Visualization & Analytics
Skilled in creating interactive dashboards using Tableau, Kibana, Elasticsearch, and business intelligence tools
DevOps & Infrastructure
Knowledge of CI/CD pipelines, Terraform, Git, code reviews, and virtual machine management
Web Technologies
Understanding of web services, API development, and modern web application architectures
Languages
EnglishFluentFrenchFluentItalianBasic knowledge
Project history
Designed, built, and optimized scalable Spark data pipelines using Code Workbooks and Pipeline Builders, enabling seamless data ingestion, transformation, and delivery. Provided technical support for Palantir Foundry applications by resolving incidents, monitoring pipeline performance, and ensuring the reliability and stability of data operations across global teams.
Built a Python-based StreamLit ALGOFA analytical tool enabling correlation analysis between semiconductor PCM datasets and DOE parameters. Engineered scalable data processing workflows using Dask to handle large EFF semiconductor datasets, improving compute efficiency.
Created a deep learning model to address the NILM (energy disaggregation) problem. Applied MLOps best practices, such as experiment tracking, model management using MLflow, and workflow orchestration using Prefect.