Exceeds - Team AI Productivity Dashboard

Dmitrii Troitskii

PROFILE

Dmitrii Troitskii

During November 2024, Mitra Fantos developed the foundational Representation Surgery feature for steering functions in language models within the davidbau/sidn-handbook repository. Leveraging machine learning and natural language processing expertise, Mitra formalized a mathematical framework to align representation statistics, specifically means and covariances, to guide model outputs. The implementation included initial experiments in HTML that demonstrated measurable reductions in gender bias and toxicity, while improving the efficiency of the steering approach. This work provided an end-to-end solution from concept to experimental validation, enabling safer and more controllable language model behavior and supporting the deployment of steerable models in user-facing features.

PROFILE

Dmitrii Troitskii

Shared Repositories

1 Commits • 1 Features

1 Commits • 1 Features

davidbau/sidn-handbook

Languages Used

Technical Skills

PROFILE

Dmitrii Troitskii

Overall Statistics

Feature vs Bugs

Repository Contributions

Your Network

Shared Repositories

Work History

1 Commits • 1 Features

1 Commits • 1 Features

Activity

Quality Metrics

Skills & Technologies

Programming Languages

Technical Skills

Repositories Contributed To

davidbau/sidn-handbook

Languages Used

Technical Skills