About

Hello, I am Marat! I am a versatile and dedicated engineer with a strong focus on ML product development for natural languages and human speech.

I work at the Applied Sciences Group at Microsoft. As part of this group, I contribute to the LLM platform and Text Intelligence scenarios, enabling language models across Microsoft products, including Windows OS.

The fastest way to reach me is either via 𝕏 (Twitter) or Telegram.

Publications

Follow me on Google Scholar.

CoAT: Corpus of artificial texts.
The paper introduces CoAT (Corpus of Artificial Texts), a large-scale dataset for the Russian language that includes both human-written and AI-generated texts.
Cambridge

LUNA: A Framework for Language Understanding and Naturalness Assessment.
LUNA introduces a unified interface for 20 NLG evaluation metrics, addressing the growing need for comprehensive assessment of generated text quality.
Arxiv

Findings of the RuATD Shared Task 2022 on Artificial Text Detection in Russian.
We present the shared task on artificial text detection in Russian, which is organized as a part of the Dialogue Evaluation initiative, held in 2022.
Arxiv

Projects

Artificial Text Detection Github
My BSc thesis. I studied the ability of language models to distinguish artificial texts from human-written ones. Beyond the thesis, this project ties in with my research at CPLab.