05/29/2023 updated


100% available
Data analyst, NLP Industry Linguist Scientist, Translator EN>RU
France
Worldwide
Application Programming Interfaces (APIs), Data Analysis, Cluster Analysis, Computer Programming, Data Visualization, Text Editing, Python (Programming Language), LaTeX, Linear Regression, Machine Learning, MongoDB, Natural Language Processing, NLTK (NLP Analysis), NumPy, SciPy
APIs, clustering, programming, data analyst, data analytics, data analysis, data visualization, Docker, Jupyter, LaTeX, LDA, linear regression, Looker, machine learning, Excel, Office, Word, MongoDB, NLTK, natural language processing, NumPy, Pandas, Plotly, Python 3, Python, SQL, SciPy, Tableau, text analysis, text processing, tokenization, topic modeling
Languages
German: Basic knowledge
English: Fluent
French: Fluent
Russian: Native speaker
Project history
English to Russian translation for the language learning application Duolingo
August 2021 - July 2022
Development of semantic analysis projects (customer verbatims, news feeds,
comments on social networks):
planning, extraction, cleaning and integration of data;
data analysis (semantic analysis, correlations);
data visualization;
development of technical and business ontologies;
programming in Python.
All four projects were successfully completed and presented to clients.
Data processing of Reddit and Twitter corpora within the LUE OLKi-funded
LEG-COD project:
corpus building (Twitter, Reddit);
corpus processing (cleaning, formatting, tokenization, lemmatization, POS
tagging, etc.);
topic modeling.