Exceeds - Team AI Productivity Dashboard

Petar Milojevic

PROFILE

Petar Milojevic

Worked on the tt-inference-server repository to deliver a hardware-aware optimization feature for Galaxy devices, focusing on model inference performance. Developed a new device model specification for the Galaxy6U, enabling data parallelism with DP=4 and tuning memory cache settings to improve throughput and latency. Integrated the DP4 configuration into the continuous integration pipeline, aligning tests and deployment paths for streamlined release readiness. The work was implemented using Python and machine learning techniques, with an emphasis on model development and deployment optimization. No bug fixes were recorded during this period, reflecting a targeted effort on feature delivery and system integration.

PROFILE

Petar Milojevic

Same Organization

Shared Repositories

1 Commits • 1 Features

1 Commits • 1 Features

tenstorrent/tt-inference-server

Languages Used

Technical Skills

PROFILE

Petar Milojevic

Overall Statistics

Feature vs Bugs

Repository Contributions

Your Network

Same Organization

Shared Repositories

Work History

1 Commits • 1 Features

1 Commits • 1 Features

Activity

Quality Metrics

Skills & Technologies

Programming Languages

Technical Skills

Repositories Contributed To

tenstorrent/tt-inference-server

Languages Used

Technical Skills