01/27/2026 updated

**** ******** ****
100 % available

Senior AI Engineer & Data Science Consultant, Developer, Architect, and Project Lead

München, Germany
Worldwide
M.Sc. Applied Stochastics, M.Sc. Cognitive Science, B.Sc. Mathematik/Informatik, B.Sc. Cognitive Science
München, Germany
Worldwide
M.Sc. Applied Stochastics, M.Sc. Cognitive Science, B.Sc. Mathematik/Informatik, B.Sc. Cognitive Science

Profile attachments

CV Stephan Sahm 2025.11.28.pdf

10+ years experience in Data Science and Data Engineering.
5+ years experience in Data Consultancy.

Selected Programming Languages:
Julia, Python, R, SQL, Scala, Matlab, Java, C++, Haskell, ROS, JavaScript, HTML, CSS (Web Stack)

Selected Industries:
Telecommunication, Automotive, Retail, Bonus Program, Media, Manufacturer

Selected Data Science Fields:
Statistics, Machine Learning, Deep Learning, Computer Vision, NLP Natural Language Processing, Analytics, Anomaly Detection, Time Series Prediction, Recommendation, Object recognition, ETL, Data Pipelines, Data Lake, Visualization, Dashboards

Selected Big Data:
Julia, Dask, PySpark, Spark, Hadoop MapReduce, Data Lake Setup, Yarn, HDFS, Hive, HBase

Selected Database:
PostgreSQL, MongoDB, MySQL, Oracle, Microsoft, Hive, HBase

Cloud:
AWS, Azure, Infrastructure-as, code, terraform, cloudformation, sceptre

AWS:
S3, SNS, Kubernetes, AWS VPC, ETL, CRM, API, pandas, AWS SNS, AWS SQS, PostgreSQL, MongoDB, AWS DocumentDB, AWS API Gateway, AWS Cognito, AWS Lambda, infrastructure-as-code cloudformation, Lambda, AWS Transit Gateway, AWS Networking, EC2, AWS Session Manager, AWS CloudWatch

Azure:
Azure Machine Learning, Azure App Service, Azure AD, infrastructure-as-code terraform

Methodology: 
Scrum, Waterfall

Languages

DeutschNative speakerEnglischGoodNiederländischBasic knowledgeSpanischBasic knowledge

Project history

Lead Developer & Architect

Automotive

Automotive & Vehicle Manufacturing

>10.000 team member

Supporting Usecase Development on Datalake

Guidance was provided for architectural decisions, adapting access policies, and debugging routing issues. A specific GDPR treatment ingestion processes was implemented and rolled-out. In production.

Duration: 6 months
Team setting: Team Lead, Team of 2, remote
Technologies: Infrastructure-as-code, cloudformation, sceptre, python, boto3, PySpark, scala, Spark, AWS Glue, AWS Secrets,

Senior Data Science Consultant & Technical Lead

Machine Learning Reply

Internet & IT

10-50 team member

Everything around data science consultancy:
- recruiting new colleagues
- pitching new projects
- request for proposals
- requirements engineering
- team setup
- team lead
- conceptualization of data science or data engineering solution
- development of data science or data engineering solutions
- giving workshops, trainings
- auditing customers solutions
- architecting data lakes and cloud data infrastructure
- ...

Lead Developer & Architect

Automotive

Automotive & Vehicle Manufacturing

>10.000 team member

20 ETL Pipelines on AWS

Replacing an CRM required the development of about 20 ETL pipelines to replace existing systems with new data-flows. Including one REST API. In production.

Team setting: Team Lead, Team of 3, remote
Technologies: AWS Glue, PySpark, python, boto3, pandas, AWS SNS, AWS SQS, PostgreSQL, MongoDB, AWS DocumentDB, AWS API Gateway, AWS Cog

Lead Developer & Architect

Automotive

Automotive & Vehicle Manufacturing

>10.000 team member

Building Multitenant Datalake on AWS

Implementing from scratch a datalake platform on AWS which is deployed in several countries using InfrastructureAsCode as the key technology. A key focus was GDPR conformity. In production.

Team setting: Team Lead, Team of 2, remote with a few on-side workshops
Technologies: Infrastructure-as-code, cloudformation, sceptre, python, boto3, PySpark, scala, Spark, AWS Glue, AWS Secrets, AWS IAM, S3, SNS, Lambda, Kubernetes, AWS VPC, AWS Transit Gateway, AWS Networking, AWS EC2, AWS Session Manager, AWS CloudWatch, AWS Sagemaker

Core Developer

Telecommunication

Telecommunications

>10.000 team member

Unification of Existing Time Series Analytics

Several custom anomaly detection solutions on time series were refactored and unified into a generic framework which can be easily deployed to new usecases and new infrastructures (AWS tested). In production.

Team setting: Team of 15, on-site, Scrum
Technologies: Python, PySpark, (PL)SQL, Hive, HBase, Oracle, Tableau

Senior Data Science & Engineering Consultant

Data Reply

Internet & IT

50-250 team member

Everything around data science consultancy:
- recruiting new colleagues
- pitching new projects
- request for proposals
- requirements engineering
- conceptualization of data science or data engineering solution
- development of data science or data engineering solutions
- giving workshops, trainings
- auditing customers solutions
- ...

Data Science Developer

Bonus Program Company

Marketing, PR and design

500-1000 team member

Recommender System

Designed, implemented, and deployed Big Data recommendation system, now running in production for Millions of daily customers. In production.

Team setting: Team of 1, on-site, weekly reviews
Technologies: On-premise, R, Scala, SBT, Spark, Yarn, HDFS

Quality Assurance & Adviser

Manufacturer

Goods & Retail

>10.000 team member

Review: Custom Datascience Framework

Infrastructure review and code review of a framework implemented build by one of our customers.

Team setting: Team of 1, mixed remote & on-site
Technologies: R, AWS

Teacher

Goods & Retail

>10.000 team member

Workshop: Developing with Apache Spark

Four one-day workshops at customers, two introductory, the other two advanced. Contents: Performance optimization, monitoring, interfacing Scala-R-Python, best practices

Setting: Group of 15 persons, sole presenter
Technologies: R, Python, Spark

Data Science Developer

Bonus Program Company

Marketing, PR and design

5000-10.000 team member

Fraud Detection

Draft, development, implementation, evaluation and deployment of an anomaly detection system to detect previously unkown types of fraud.

Team Setting: Team of 1, on-site, review once every three months
Technologies: R, Scala, Spark, Yarn

Data Science Consultant

Data Reply

Internet & IT

10-50 team member

Everything around data science consultancy:
- recruiting new colleagues
- pitching new projects
- request for proposals
- requirements engineering
- conceptualization of data science or data engineering solution
- development of data science or data engineering solutions
- giving workshops, trainings
- auditing customers solutions
- ...

Data Science Developer

Telecommunication

Telecommunications

>10.000 team member

Callcenter and Webcontent Optimization using Speech Analytics.

A 3 dimensional content detection system was setup for written conversations. Given only plain text, it identifyed customer specific product entities, services, and problems.

Team Setting: Team of 3, on-site, reviews every week
Technologies: Python, NLP, spacy

Python Developer

Trufflebit

Internet & IT

< 10 team member

Data Parsing

Build parser to extract time series data from customer specific text data formats

Team Setting: Team of 1, remote, steady exchange with CEO
Technologies: Python, PyParsing, Cython

Web Developer

Trufflebit

Internet & IT

< 10 team member

Web Visualization

Build Django based web-dashboard with Bokeh based interactive data analysis visualization.

Team Setting: Team of 1, remote, steady exchange with CEO
Technologies: Python, Django, Bokeh

Computer vision & Object recognition

University of Osnabrück

Industry & Mechanical Engineering

10-50 team member

Building an Autonomous Robot

Programmed robot with wheels and arms to grab a muffin from the receptionist on first floor, take the elevator, and bring it to the robotics lab.

Team Setting: Team of 14, on-site, Scrum
Technologies: ROS, Gazebo, SCRUM, Python, C++, OpenCV

Contact form

Log in to get in touch

You need to be logged in to use the contact form.

Sign upLog in