OpenTalks.AI 2026: Production-Ready Adapters for On-Device Language Models

Posted Mar 7, 2026

By Marat Saidov

1 min read

I spoke at OpenTalks.AI in Belgrade, Serbia. My talk covered the end-to-end journey of building production-ready adapters for on-device language models, from tuning through evaluation to real-world shipping.

The session walked through several core areas:

Adapter tuning for on-device models. How LoRA adapters are used to specialize a small base model (Phi Silica) for distinct skills — each skill/flavor gets its own lightweight adapter on top of frozen base weights.
Shipping Rewrite and Summarize. Practical lessons from taking these Text Intelligence skills from prototype to production on Copilot+ PCs — including Rewrite with multiple tone flavors (Auto, Casual, Formal, Concise) and Summarize for condensing long-form content.
Windows App SDK availability. These Text Intelligence capabilities are available through the Windows App SDK, enabling any developer building apps for Copilot+ PCs to integrate on-device language skills directly into their applications.

The Text Intelligence platform is designed so that new skills can be added by training new adapters without retraining the base model.

It was great connecting with the AI community in Belgrade and seeing how rich the advancements in Natural Language Processing in both academia and industry are in 2026.

This post is licensed under CC BY 4.0 by the author.