More than 4 years of Machine Learning Software Engineering (Speech AI, NLP, LLMs). Skilled in SRE, MLOps, ML Infrastructure development.

Experience

Microsoft
October 2024 - Present
As a part of Applied Sciences Group, I contribute to the LLM platform and NLP scenarios, enabling language models across Microsoft products.

Yandex Technologies
February 2023 - October 2024
Alice voice assistant. Enhanced natural language understanding and automatic speech recognition services.

HSE University
February 2022 - January 2023
NLP research. Contributed to 2 publications, one of which is published in Q1-level journal in computational linguistics.

Rubbles (SBDA Group)
October 2021 - January 2023
Built a model serving product. Open-source stack based on Kubernetes, Terraform, Seldon Core, MLFlow, Redis, PostgreSQL.

BestPlace.AI
April 2021 - October 2021
ML engineering on Python for the predictive geo-analytical tool that provides end-to-end visibility of local shopper patterns.

Huawei Technologies
July 2020 - November 2020
Research in lossless compression with computer vision for Huawei Cloud.

Education

HSE University
September 2017 - June 2022
BSc in Applied Math and Information Science. Specialization in ML.

Yandex School of Data Analysis
2020 - 2022
MSc program in Data Science.
Selected only several courses out of the 2-year program.

Projects

Artificial Text Detection Github
My BSc thesis. I studied the ability of language models to distinguish artificial texts from human-written. As well as BSc, this project correlates with my research at CPLab.

Publications

CoAT: Corpus of artificial texts.
The paper introduces CoAT (Corpus of Artificial Texts), a large-scale dataset for Russian language that includes both human-written and AI-generated texts.
Cambridge

LUNA: A Framework for Language Understanding and Naturalness Assessment.
LUNA introduces a unified interface for 20 NLG evaluation metrics, addressing the growing need for comprehensive assessment of generated text quality.
Arxiv

Findings of the The RuATD Shared Task 2022 on Artificial Text Detection in Russian.
We present the shared task on artificial text detection in Russian, which is organized as a part of the Dialogue Evaluation initiative, held in 2022.
Arxiv