Profileimage by Anonymous profile, Senior Data Engineer, Python Software Engineer and DevOps
available

Last update: 26.11.2023

Senior Data Engineer, Python Software Engineer and DevOps

Graduation: Diplom (equiv. M. Sc.) Computer Science/Informatik
Hourly-/Daily rates: show
Languages: German (Native or Bilingual) | English (Full Professional)

Attachments

profile-robert-buchholz-20231123_261123.pdf

Skills

With over 20 years shaping software solutions, Robert excels across diverse landscapes – from startups to corporate titans, tackling challenges both onsite and remotely. He's led the charge in Machine Learning, AI, MLOps, Data Engineering, and DevOps and Python Developments. As a Senior AI Engineer, he steered Autonomous Driving initiatives; as an Engineering Lead, he powered
a Video Streaming Platform. Robert's native languages are German and Python.

Technology Focus
Machine Learning: MLOps and ML Software Engineering, in AI/Neural Networks (CV, NLP), Probabilistic Models, Predictive Analytics
Data Engineering: ETL, Data Pipelines, Big Data, Data Lake, Scalability
Backend development: REST, Python, API Design, Release Management, Security
DevOps: Multi-Cloud, AWS, Azure, Docker/Containers, Systems Engineering, Linux
Cross-Cutting Concerns: Automated Testing and Reliability (CI/CD, Infrastructure as Code, Code Pipelines, Test Driven Development), Scrum/Agile (Scrum Master, Iterative and Incremental, Lean Management), Open Source/Free Software

Project history

07/2021 - 11/2023
Senior AI Python Engineer for Python/AI Engineering / Autonomous Vehicle Big Loop Research
Automotive Software/Technology Company (DAX) (Automotive and vehicle construction, >10.000 employees)

Architecture, Software Engineering and AI/ML Engineering for production grade Machine Learning models to be deployed in vehicles. Development of model and data interfaces, Pipelines to train and evaluate models on the Azure cloud. Conduct ML experiments, generate metrics and track artifacts using Weights and Biases. Quantization and Release for Qualcomm-based embedded platform. Integration within larger group to develop an Active Learning Pipeline for the Big Data Loop using multiple models and to detect and reduce model uncertainty. Quality assurance using
automated tools and pipelines.

Keywords: Python 3, Pytorch, Pytorch lightning, torchvision, Qualcomm/QNN, Weights&Biases (wandb), albumentations, Semantic Segmentation, Object Detection, YoloX, torchmetrics , ONNX, OnnxRuntime, OpenCV, Tensorflow, scikit-learn, NumPy, pandas, pytest, Mlflow, Conda, Anaconda, Mamba/Micromamba, Nvidia, CUDA, Azure Cloud, Azure DevOps, Azure Pipelines, Docker, terraform, packer, git, LFS, pre-commit, jupyter

01/2019 - 11/2023
Lead Data Engineer for Machine Learning / B2B Weather Forecasting Product
Weather Forecasting Platform / Mass Media & Digital Company (MDAX), Munich (Media and Publishers, 5000-10.000 employees)

Architecture, development and deployment of a large-scale data analytics and forecasting product in a B2B scope (green field). Ingestion of large data sets using Serverless technologies on AWS, definition and training of machine learning models and subsequent application of models to generate insights. Customer-facing REST API, Performance Engineering for High Scalability, CI/CD setup and monitoring in a high availability setting. Data Pipelines and distributed computation on Databricks and AWS EMR (Spark/Pyspark). Production of Weather Indices as subscription service for Business Intelligence and Web Analytics.

Keywords: Python 3, REST, JSON, scikit-learn, NumPy, Scipy, pandas, pytest, DynamoDB, MongoDB, serverless, AWS Lambda, ECS/Docker, SageMaker, EMR, PySpark, Spark, Databricks, Delta Lake, Mlflow, Athena, Hive, Glue, API Gateway, EC2, S3, Atlassian (JIRA, Confluence, BitBucket), git

11/2020 - 12/2021
Senior MLOps Engineer for Digital Farming Research Organisation
Digital Farming Solutions / Agronomic Company (DAX), Cologne (Industry and mechanical engineering, >10.000 employees)

Architecture and implementation of MLOps process and framework using AWS SageMaker Suite, Step Functions, Lambda, EC2, Hosted Jupyter Notebooks and Docker. The platform is in production today and enables ML model researchers to integrate directly with product teams to reduce deployment time-to-market using automation by reducing overhead and relying on defined model interfaces while improving confidence in and inspectability of deployed models using A/B testing and audit trails (using traceable multi-account pipeline strategies). Development of ETL pipeline with Data Lake to enable Feature Store for faster model experiment iteration.

Keywords: AWS, SageMaker, Studio, Model Registry, Hosted Models, Step Functions, Lambda, S3, SQS, API Gateway, CodePipeline, CodeBuild, ECS, ECR, Fargate, CloudTrail, Terraform, Python 3, Docker, pandas, pytest, boto, moto, Atlassian (JIRA, Confluence, BitBucket), git

07/2011 - 12/2019
Senior Software Engineer for Educational Predictive Analytics Platform Development and Deployment
Educational Platform Provider (B2B market), Berlin (Internet and Information Technology, 10-50 employees)

Full-stack product development for an education platform from its inception into production (development team of 3-6 people). Real-time interaction (social) and gamification using asynchronous paradigms and rich JavaScript client, implementation of data models to measure and control learning progress (Maximum Likelihood Estimation, Clustering), service to service communication. Release management and QA using Automated Unit and Acceptance Tests (CI/CD), operation (DevOps) of the platform as a private cloud for several thousand parallel users.

Keywords: Python 2/3, Tornado, Pyramid, Flask, SQLAlchemy, scipy, pandas, jupyter, Bokeh, Plotly, Jinja2, R, nosetests, Selenium/webdriver, JS, jQuery, Backbone, Marionette, Bootstrap, SCSS/SASS, HTML5, Ansible, Puppet, nginx, haproxy, Jenkins, Docker, Red Hat/CentOS, MySQL, redis, Mercurial, git, SVN

07/2017 - 12/2018
Lead Engineer for Data Analytics and Statistical Modelling
FinTech Start-Up, Berlin (Banks and financial services, 10-50 employees)

Contracting for a start-up in the FinTech sector during its early phase. Development of machine learning models for market predictions and quantitative analysis in close cooperation with researchers, design and development of data pipelines that process external events, design and deployment of infrastructure for development and production (AWS cloud environment).

Keywords: Python 3, pandas, NumPy, scikit-learn, statsmodels, PostgreSQL, jupyter, pytest, AWS, GitLab, APIs, Scraping, git

07/2011 - 12/2017
Lead Engineer for Music Video Platform
Music Video Streaming Platform, Denmark (Media and Publishers, < 10 employees)

Contracting during early and critical phases of a start-up/spin-off, Software integration and feature development of video streaming platform (user facing) using MediaDrop, Systems Architecture and deployment (Hybrid Cloud), Continuous Deployment using Ansible, File distribution and video delivery. Development of a reliable Content Ingestion Pipeline using microservices with “Big Three” Music Labels and integration with label’s Controlling and Reporting Systems.

Keywords: Python 2, MediaDrop, SQLAlchemy, flask, Pylons, RQ, OAuth, SOLR, ElasticSearch, Ansible, nginx, haproxy, Red Hat/CentOS, XML, ffmpeg, PostgreSQL, microservices, AWS, git

07/2013 - 02/2014
Software Engineer for Doctor’s Appointment Service
International Health Start-Up, Cambridge, USA (Pharmaceuticals and medical technology, 10-50 employees)

Bringing a web platform from prototype stage into production. Major feature development, redesign and reliability improvements on existing codebase.

Keywords: Django, CalDAV, Selenium, Mailchimp, Mandrill, Twilio, AWS, git

04/2010 - 06/2011
Scrum Master, Software and Infrastructure Engineer (Full time position) for Agile Management Tool
Agile/Scrum Consulting Company, Berlin (Internet and Information Technology, 10-50 employees)

Software Engineer on a tool to manage Agile Processes (Scrum, Kanban), frontend and backend development. Scrum Master in a development team of 7. Development, deployment and Operation of Cloud service (Hosted product) and public services (Licensing server, Company CMS) as well as internal services (CI). Customer Services Engineer.
Keywords: Trac, Django, django CMS, JavaScript, jQuery, CentOS, Bitten, SVN

12/2006 - 08/2010
Security Team Lead and Developer (pro bono) for Linux Distribution
Major Linux Distribution (Gentoo) (Internet and Information Technology, 500-1000 employees)

Developer on Gentoo Linux distribution, a non-profit organization. Leading the Gentoo Security project, Security Engineer and Coordinator for Emergency Response (vulnerability management and responsible disclosure management). Engineering on multiple feature projects including virtualization (Xen/libvirt), network management and infrastructure. Package maintenance, stability and QA, upstream communication. Event/Exhibition Coordinator and Public Relations.

01/2009 - 09/2009
Google Summer of Code Mentor
Google (Internet and Information Technology, >10.000 employees)

Mentoring a Google Summer of Code project to create and release a Gentoo stats server/client. Review of project roadmap and milestones, code review and facilitation of production deployment.

Local Availability

Only available in these countries: Germany, Austria und Switzerland
Booked until 2023-12-31, available 2024-01-15 at the earliest. Located in Berlin, available for remote work and limited on-site engagements. Currently only taking on part-time remote projects.
Profileimage by Anonymous profile, Senior Data Engineer, Python Software Engineer and DevOps Senior Data Engineer, Python Software Engineer and DevOps
Register