Profileimage by Xian Yang Senior Cloud Data Engineer und Data Scientist from Lyon

Xian Yang

not available until 04/30/2023

Last update: 17.01.2023

Senior Cloud Data Engineer und Data Scientist

Graduation: M. Sc. Data Science
Hourly-/Daily rates: show
Languages: Chinese (Native or Bilingual) | German (Full Professional) | English (Full Professional) | French (Full Professional) | Italian (Elementary)

Attachments

YANGXianCV2022v1_191222.pdf

Skills

Cloud Engineer, ML DevOps, Graphdatenbank, Kubernetes, Java, Python, NET, DevOps, test driven development, Confluence, Jira, Azure, AWS, Lambda, Azure Pipelines, AWS EMR, AWS Glue, Azure Data Factory, Azure Machine Learning, AKS / EKS, ADLS Gen2 / AWS S3, Data(base), SQL, Neo4j, Cosmos DB, Databricks / Spark, Kafka, DVC, SSDT, Delta Lake, Cassandra, NET 5, Scala, R, Orchestration, versioning and tracking, Terraform, Kubernetes, Docker, Ansible, MLFlow, Argo, Luigi, Airflow, git, Github, Github Actions, Visualization, Power BI, Grafana, Machine learning, algorithm, Regressions, SVM, ensemble methods, XGboost, RM, clustering, KNN, naïve Bayes, graphic models, Deep learning, Neural Network, NLP, Autoencoder, Bayesian, MCMC, correspondence analysis, Spark, data ingestion, data visualization, Databricks, algorithms, DB, Azure Functions, AKS, big data, DWH, batch processing, Hadoop, backend, Excel

Project history

09/2021 - 07/2022
Data and Cloud Engineer
Halbauer GmbH
Automotive and vehicle construction
In order to migrate a classical machine learning model to Azure, which aims to predict the vehicle repair rate, I interpreted the Python project into Spark and used Databricks, MLflow and Azure Data Factory to automate the data ingestion and to parallelize the model training. Front-end data visualization uses Grafana. Tools: Azure Data Factory, Databricks, Spark, Delta Lake, Azure Pipelines, Grafana.

02/2020 - 08/2021
Data Scientist and Cloud Engineer
Island Labs GmbH (10-50 employees)
Architecture and civil engineering
Kitchen furniture blueprints sometimes contain errors which should be avoided before being sent to construction. As a
technical team working with a kitchen provider, we received thousands of kitchen plans on a daily basis. I designed a graph model to represent the furniture of a kitchen, geometric and graph algorithms to learn and look for errors in them, and a pipelined workflow on Azure to process the data. Kitchen blueprints as data are first fed into the workflow by Kafka in real time, these are then validated and sent to a Neo4j cluster by an Azure function. After the kitchen blueprints get ingested and analyzed by Neo4j, the result is sent to the Cosmos DB and returned to the client, in case an error is detected or a warning occurred. Tools: knowledge graph, Kafka, Neo4j, Azure Functions, AKS, Python and .NET.

10/2018 - 01/2020
Data scientist and engineer
Astronomical data can be immense and requires a big data solution to process it. Working together with astronomers, I
designed a Spark application for an on premise DWH to process the data and read in from and write out to its different layers. The application includes both batch processing to deal with existent data and streaming to deal with new data coming in. I also optimized the Spark application at different levels which has improved its performance remarkably. Tools: Spark 2.x, Kafka, Hadoop, Java.

04/2018 - 09/2018
Biostatistics and AI Developer
Firalis SA (50-250 employees)
Pharmaceuticals and medical technology
The combination of omic data and machine learning / deep learning is one of the cutting-edge areas of biopharma research. In this project where the company aims to get a better understanding of the high-throughput RNA sequencing data of patients undergoing different medical treatments after a myocardial infarction, my machine learning and deep learning algorithms came to help. Tools: Python and R.

07/2017 - 08/2017
Psychometry and AI Developer
Centre Hospitalier de Rouffach (50-250 employees)
Public service
In the psychomotor therapy the diagnosis of schizophrenia had been to that date carried out by the observation and the subjective judgment of medical professionals. With the hope to develop an AI assistant helping physicians make better and more accurate clinical decisions based on more quantitative indicators, I built an application using machine learning and statistical models to predict schizophrenia. Tools: R.

02/2016 - 08/2016
AI backend developer
AI-Developer
Pharmaceuticals and medical technology
As a backend developer I participated in the development of a diagnosis assistant system, the goal of which is to detect among others pulmonary sarcoidosis using AI. Tools: .NET.

11/2015 - 02/2016
Risk analyst
Bank of Communications (5000-10.000 employees)
Banks and financial services
Functions: As a risk analyst I ensured the role of developing statistical tools to monitor credit risk, market risk, liquidity risk and operational risk of the bank. Tools: R and Excel.

Local Availability

Available worldwide
Bis zu 100%
Profileimage by Xian Yang Senior Cloud Data Engineer und Data Scientist from Lyon Senior Cloud Data Engineer und Data Scientist
Register