03/16/2026 updated


100% available
Senior C++ Performance Engineer - HPC & Distributed Systems Specialist
Madrid, Spain
Worldwide
MSc & BSc in Industrial and Electronics Engineering
About me
C++ Performance Engineer | 10+ years building high-performance systems & data pipelines. HPC, multithreading, distributed systems, GPU acceleration, embedded Linux. Applied to 3D asset processing, real-time simulation & commercial embedded products. 4+ years fully remote with EU/US teams.
3D Computer Graphics Software, Architectural Design, C++ (Programming Language), Communications Protocols, Linux, DevOps, Distributed Systems, Embedded Systems, Linux On Embedded Systems, Hardware Design, Infrastructure Management, Python (Programming Language), PostgreSQL, Linux Administration, MongoDB
High-Performance Computing & Distributed Systems
Advanced expertise in building scalable high-performance systems and data pipelines with deep knowledge of HPC, distributed computing, multithreading, and Linux systems engineering
C++ Performance Optimization
Specialized skills in C++17/20 performance optimization, multithreading, and concurrency, applied systematically across full application stacks
3D Graphics & Real-Time Rendering
Comprehensive experience in 3D asset processing pipelines, real-time rendering, mesh optimization, texture compression, and GPU acceleration technologies
Data Pipeline Architecture
Design and implementation of mesh processing, texture compression, format conversion, and GPU acceleration pipelines
Infrastructure & DevOps
Containerized microservices, Docker, CI/CD pipelines, Linux system administration, and Google Cloud platform management
Backend Development
Python Flask development, REST APIs, PostgreSQL, MongoDB, WebSockets, and microservices architecture
Embedded Systems Programming
Complete software stack development for commercial weighing systems on embedded Linux with real-time hardware integration
Network Programming
TCP/IP communication protocols, real-time data synchronization, and distributed embedded device networking
Languages
English: Fluent
Spanish: Native speaker
Project history
SaaS platform for photorealistic 3D model generation and product visualization.
- Architected C++ asset processing pipelines for 3D model integration, handling mesh optimization, texture conversion, and format validation for production-scale photorealistic rendering workflows
- Led systematic performance optimization across the full application stack: C++/Qt desktop tools, backend rendering pipelines, and real-time OpenGL webviews — profiling, identifying bottlenecks, and delivering measurable throughput improvements
- Integrated AI-generated content into the C++ asset processing system, enabling automated 3D asset creation and reducing manual artist intervention in the production pipeline
- Enhanced in-house artist tooling (C++/Qt), streamlining asset ingestion and scene-building workflows
- Modernized build infrastructure (CMake, QMake) and drove codebase modularity through systematic code reviews
100% remote - Paris, France
Fashion-tech SaaS for real-time 3D fabric simulation and texture digitalization in the browser.
- Designed and implemented high-performance C++ HPC services powering real-time 3D simulation workloads, enabling browser-based fabric rendering with production-level fidelity and sub-second response times
- Identified and resolved performance bottlenecks in distributed processing pipelines, including early-stage evaluation of GPU-accelerated computation (CUDA) for parallelizable simulation tasks
- Delivered scalable backend microservices and asynchronous job processing using Python/Flask, PostgreSQL, and RabbitMQ, supporting high-concurrency simulation requests
- Built internal tooling, shared C++ libraries, and Docker-based CI/CD pipelines that standardized development workflows and reduced build/deploy friction across the engineering team
- Drove code quality through systematic refactoring, comprehensive test coverage (Google Test, Pytest), and enforcement of SOLID design principles across the C++ and Python codebases
100% remote - NY, US
Industrial weighing systems manufacturer for retail environments.
- Designed and built the complete software stack for a commercial weighing system on embedded Linux from scratch: real-time weight acquisition from proprietary hardware, thermal printer and label printer integration, networked database synchronization across multiple devices per store, and a full graphical user interface (GTK)
- Implemented custom TCP/IP communication protocols enabling real-time data synchronization between distributed embedded devices on local store networks
- Developed low-level Linux system services, hardware drivers, and deployment automation for production embedded environments running on constrained hardware
- Owned full product lifecycle — requirements, prototyping, implementation, testing, and field deployment — maintaining and evolving the platform across multiple hardware revisions

