About
Hello, I am Marat! I am a versatile and dedicated engineer with a strong focus on ML product development for natural languages and human speech.
I work in the Applied Sciences Group at Microsoft. As part of this group, I contribute to the LLM platform and Text Intelligence scenarios, enabling language models across Microsoft products, including Windows.
The fastest way to reach me is via 𝕏 (Twitter) or Telegram.
Publications
Follow me on Google Scholar.
CoAT: Corpus of artificial texts.
The paper introduces CoAT (Corpus of Artificial Texts), a large-scale dataset for the Russian language that includes both human-written and AI-generated texts.
Cambridge
LUNA: A Framework for Language Understanding and Naturalness Assessment.
LUNA introduces a unified interface for 20 NLG evaluation metrics, addressing the growing need for comprehensive assessment of generated text quality.
arXiv
Findings of the RuATD Shared Task 2022 on Artificial Text Detection in Russian.
We present the shared task on artificial text detection in Russian, organized as part of the Dialogue Evaluation initiative held in 2022.
arXiv
Projects
Artificial Text Detection GitHub
My BSc thesis. I studied the ability of language models to distinguish artificial texts from human-written ones. Beyond the thesis, this project ties in with my research at CPLab.