05/29/2023 updated

**** ******** ****
100 % available

Data analyst, NLP Industry Linguist Scientist, Translator EN>RU

France
Worldwide
France
Worldwide

Profile attachments

CV - Nikolay Chepurnykh

Application Programming Interfaces (APIs)Data AnalysisCluster AnalysisComputer ProgrammingData VisualizationText EditingPython (Programming Language)LaTeXLinear RegressionMachine LearningMongoDBNatural Language ProcessingNLTK (NLP Analysis)NumPySciPySQL DatabasesTableau (Software)TokenizationLatent Dirichlet AllocationTopic ModelingJupyterPandasPlotlyText AnalysisLooker AnalyticsDocker
APIs, clustering, programming, Data analyst, Data Analytics, data analysis, data visualization, Docker, Jupyter, LaTeX, LDA, linear regression, Looker, Machine Learning, Excel, Office, word, MongoDB, NLTK, Natural Language Processing, Numpy, Pandas, Plotly, Python 3, Python, SQL, Scipy, Tableau, text analysis, text processing, tokenization, topic modeling

Languages

GermanBasic knowledgeEnglishFluentFrenchFluentRussianNative speaker

Project history

Translator EN>RU Duolingo

English to Russian translation for the language learningAugust 2021 - July 2022

application Duolingo.

NLP Industry Linguist Scientist Specialist

Dassault Systèmes
Development of semantic analysis projects (customer verbatims, news feeds,
comments on social networks):
planning, extraction, cleaning and integration of data ;
data analysis (semantic analysis, correlations) ;
data visualization ;
development of technical and business ontologies;
programming in Python;
All four projects were successfully completed and presented to clients.

Corpus data processing engineer

CNRS, ATILF. Nancy
Data processing of Reddit and Twitter corpora within the LUE OLKi funded
LEG-COD project:
corpora building (Twitter, Reddit) ;
corpus processing (cleaning, formatting, tokenization, lemmatization, POS
tagging, topic modeling, etc.) ;
topic modeling.

Contact form

Log in to get in touch

You need to be logged in to use the contact form.

Sign upLog in