
Milan Stojkovic contributed to the tenstorrent/tt-forge-fe repository by engineering core compiler and backend features that advanced model support and reliability. He migrated key tensor operations and operator implementations from Python to C++ to improve performance and determinism, while integrating MLIR-based code generation for hardware-aware optimizations. Milan enhanced test infrastructure and CI/CD pipelines using Python and YAML, ensuring reproducible and robust model validation. His work included dynamic system descriptor selection, advanced attribute mapping, and embedding support, addressing both correctness and maintainability. Through careful code refactoring and containerization, Milan enabled scalable, future-proof development workflows and reduced integration risk across releases.

October 2025 monthly summary focusing on key accomplishments for TT projects. This month centered on enabling MLIR uplift in the tt-forge-fe repository by upgrading the Docker build environment to include the xxd utility, ensuring the required toolchain is available for MLIR-related builds. This change improves build reproducibility, accelerates MLIR integration, and sets the foundation for future optimizations and tooling improvements.
October 2025 monthly summary focusing on key accomplishments for TT projects. This month centered on enabling MLIR uplift in the tt-forge-fe repository by upgrading the Docker build environment to include the xxd utility, ensuring the required toolchain is available for MLIR-related builds. This change improves build reproducibility, accelerates MLIR integration, and sets the foundation for future optimizations and tooling improvements.
August 2025: Focused on performance-driven migration, correctness hardening, and test reliability across Forge repositories. Completed a substantial C++ migration of core operators to Tenstorrent Forge, improved backward pass correctness for ReduceAvg, and hardened tests for deterministic outcomes, establishing a stronger foundation for Forge-scale workloads and future feature delivery.
August 2025: Focused on performance-driven migration, correctness hardening, and test reliability across Forge repositories. Completed a substantial C++ migration of core operators to Tenstorrent Forge, improved backward pass correctness for ReduceAvg, and hardened tests for deterministic outcomes, establishing a stronger foundation for Forge-scale workloads and future feature delivery.
July 2025 monthly summary for tenstorrent/tt-forge-fe: Delivered a major shift of core tensor operations to the C++ backend, enabling more deterministic and faster execution, improved autograd integration with constant tensor creation, and increased CI reliability through cleanup efforts. The work lowers Python back-end latency, reduces framework overhead, and sets the stage for broader performance optimizations in subsequent releases.
July 2025 monthly summary for tenstorrent/tt-forge-fe: Delivered a major shift of core tensor operations to the C++ backend, enabling more deterministic and faster execution, improved autograd integration with constant tensor creation, and increased CI reliability through cleanup efforts. The work lowers Python back-end latency, reduces framework overhead, and sets the stage for broader performance optimizations in subsequent releases.
June 2025 — Focused on MLIR generation enhancements and report output correctness in the tt-forge-fe project. Delivered MLIR-format report output, hardened MLIR attribute naming by replacing a hard-coded string with a constant, and added dynamic system descriptor selection to support current and future hardware architectures (e.g., wormhole, blackhole). These changes improve reliability, enable hardware-aware optimizations, and reduce maintenance burden.
June 2025 — Focused on MLIR generation enhancements and report output correctness in the tt-forge-fe project. Delivered MLIR-format report output, hardened MLIR attribute naming by replacing a hard-coded string with a constant, and added dynamic system descriptor selection to support current and future hardware architectures (e.g., wormhole, blackhole). These changes improve reliability, enable hardware-aware optimizations, and reduce maintenance burden.
Month: 2025-05 | Tenstorrent tt-forge-fe contributed improvements concentrated on test reliability and model accuracy gating for Detr. The work enhanced CI signal fidelity and traceability for model-related changes, supporting higher confidence in production readiness.
Month: 2025-05 | Tenstorrent tt-forge-fe contributed improvements concentrated on test reliability and model accuracy gating for Detr. The work enhanced CI signal fidelity and traceability for model-related changes, supporting higher confidence in production readiness.
April 2025 monthly summary focusing on key accomplishments across tt-tvm and tt-forge-fe, delivering core features, indexing and shape handling improvements, and platform readiness for production deployment. The work centered on embedding lookup enhancements for PaddlePaddle, expanded advanced indexing capabilities, support for negative dimensions, dynamic shapes handling, and robust softmax attribute handling to ensure correctness across TVM-backed models.
April 2025 monthly summary focusing on key accomplishments across tt-tvm and tt-forge-fe, delivering core features, indexing and shape handling improvements, and platform readiness for production deployment. The work centered on embedding lookup enhancements for PaddlePaddle, expanded advanced indexing capabilities, support for negative dimensions, dynamic shapes handling, and robust softmax attribute handling to ensure correctness across TVM-backed models.
Monthly summary for 2025-03 focusing on business value and technical achievements for tenstorrent/tt-forge-fe. Highlighted work includes delivering new features, stabilizing the MLIR backend, and updating dependencies to strengthen backend compatibility and model support.
Monthly summary for 2025-03 focusing on business value and technical achievements for tenstorrent/tt-forge-fe. Highlighted work includes delivering new features, stabilizing the MLIR backend, and updating dependencies to strengthen backend compatibility and model support.
January 2025 focused on strengthening Forge-FE lowering reliability and expanding model support. Key outcomes include introducing an AttributeMapper with MLIRGenerator integration for flexible attribute renaming and type conversion during Forge-FE to MLIR lowering; extending the MLIR generator with repeat_interleave support to enable usage with models like Llama-3.2-1B; stabilizing compatibility by reverting the 'reduce_avg' attribute rename and updating op_mapping; expanding test coverage by removing an xfail in the embedding test to exercise meta-llama/Llama-3.2-1B across configured models; and documenting Pytest usage to standardize testing practices. Collectively these changes reduce integration risk, broaden model compatibility, and improve developer efficiency and test reliability.
January 2025 focused on strengthening Forge-FE lowering reliability and expanding model support. Key outcomes include introducing an AttributeMapper with MLIRGenerator integration for flexible attribute renaming and type conversion during Forge-FE to MLIR lowering; extending the MLIR generator with repeat_interleave support to enable usage with models like Llama-3.2-1B; stabilizing compatibility by reverting the 'reduce_avg' attribute rename and updating op_mapping; expanding test coverage by removing an xfail in the embedding test to exercise meta-llama/Llama-3.2-1B across configured models; and documenting Pytest usage to standardize testing practices. Collectively these changes reduce integration risk, broaden model compatibility, and improve developer efficiency and test reliability.
Monthly summary for 2024-12: Delivered key CI/CD and dependency management improvements for tt-forge-fe, and strengthened test suite reliability and code organization. These changes enhanced model compatibility and reliability of builds, reduced flaky tests across environments, and improved maintainability.
Monthly summary for 2024-12: Delivered key CI/CD and dependency management improvements for tt-forge-fe, and strengthened test suite reliability and code organization. These changes enhanced model compatibility and reliability of builds, reduced flaky tests across environments, and improved maintainability.
November 2024 performance summary for tenstorrent/tt-forge-fe: Delivered foundational MLIR generation improvements and targeted codebase cleanup that strengthen the product's reliability, performance, and maintainability. Key focus areas included expanding operation support for cosine and sine, stabilizing Llama 3b compilation, and removing deprecated graph-building primitives to streamline the compilation path and reduce maintenance overhead. The work accelerates model deployment readiness and reduces risk in future MLIR lowering changes.
November 2024 performance summary for tenstorrent/tt-forge-fe: Delivered foundational MLIR generation improvements and targeted codebase cleanup that strengthen the product's reliability, performance, and maintainability. Key focus areas included expanding operation support for cosine and sine, stabilizing Llama 3b compilation, and removing deprecated graph-building primitives to streamline the compilation path and reduce maintenance overhead. The work accelerates model deployment readiness and reduces risk in future MLIR lowering changes.
Overview of all repositories you've contributed to across your timeline