
Christian Sigg engineered advanced GPU backend features across Intel-tensorflow/xla and ROCm/tensorflow-upstream, focusing on modernizing GEMM fusion and scan operations for high-performance machine learning workloads. He consolidated legacy and nested GEMM fusion paths, refactored emitters, and integrated the Triton library to streamline GPU tensor compilation. Using C++, MLIR, and Python, Christian improved autotuning robustness, enhanced test reliability, and introduced new HLO opcodes such as kScan, enabling efficient prefix-sum computations. His work emphasized maintainability by cleaning up deprecated code, aligning cross-repo APIs, and strengthening legality checks, resulting in more robust, performant, and maintainable GPU and XLA backend pipelines.
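The kScan opcode mentioned above implements prefix-sum (scan) semantics: each output element combines all inputs up to and including its position under an associative operator. A minimal illustration in plain Python (this sketches the semantics only, not the XLA implementation):

```python
from itertools import accumulate
import operator

def inclusive_scan(xs, op=operator.add):
    """Inclusive scan: out[i] = xs[0] op xs[1] op ... op xs[i]."""
    return list(accumulate(xs, op))

# Prefix sums of [1, 2, 3, 4] yield [1, 3, 6, 10];
# any associative op (e.g. multiplication) works the same way.
```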

February 2026 performance summary highlighting key features delivered and bugs fixed across Intel-tensorflow/tensorflow and Intel-tensorflow/xla, focusing on HLO Scan robustness and test stability.
January 2026 achievements focused on scalable scan operations across MLIR/HLO ecosystems, enabling cross-IR portability and performance improvements for prefix-sum computations.
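Scan operations scale well on parallel hardware because an associative operator can be applied in log-depth rounds rather than a sequential sweep. A Hillis-Steele-style sketch in plain Python (illustrative only; the real lowerings operate on MLIR/HLO, not lists):

```python
def hillis_steele_scan(xs, op):
    """Inclusive scan in ceil(log2(n)) rounds; on a GPU, every
    element in a round would update in parallel."""
    out = list(xs)
    n = len(out)
    d = 1
    while d < n:
        prev = out[:]  # snapshot models the parallel read phase
        for i in range(d, n):
            out[i] = op(prev[i - d], prev[i])
        d *= 2
    return out
```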
December 2025: Delivered targeted testing realignment and legality improvements for GEMM fusion paths and FuncOp validation across ROCm/tensorflow-upstream and Intel-tensorflow/xla. These changes accelerate validation of Triton GEMM fusions, reduce legacy code debt, and improve maintainability and reliability of the test suites.
November 2025 overview: Concluded a major modernization of the GPU GEMM pathway through nested GEMM fusion, extended across Intel-tensorflow/xla and ROCm/tensorflow-upstream, with focused work on emitter updates, autotuning safety, and backend maintenance. The result is faster, more robust GPU GEMM operations, simplified maintenance, and a clearer upgrade path for future GPU backends.
Key features delivered:
- Triton GEMM nested fusion backend modernization: Consolidated effort to adopt nested GEMM fusion across the Triton backend, including enabling nested GEMM fusion in the emitter, removing legacy GEMM paths, updating autotuning, adding bounds checks, refactoring, and cleaning up tests and configurations to improve the performance and robustness of GPU GEMM operations.
- Triton library integration for GPU backends: Integrated the Triton library for GPU tensor operations to enhance GPU compilation capabilities and optimize performance for tensor workloads.
- Autotuning robustness for GEMM fusion: Hardened the autotuning flow to skip GEMM fusion configurations when nested GEMM fusion is not achieved, and added safety bounds checks, preventing misrouted configurations and out-of-bounds errors in the GEMM fusion emitter.
- Backend cleanup, MLIR refactors, and test config updates: Code cleanup and refactors to support the Triton/GPU backend, including MLIR operation-creation helpers, test configuration simplifications, and removal of outdated paths.
Major bugs fixed:
- Autotuning robustness: skip autotuner configs if nested GEMM fusion fails; prevent routing to the legacy emitter.
- Bounds checks: added in the Triton fusion emitter to guard against out-of-bounds access in tile/parameter calculations.
- Misc: removed legacy paths and deprecated emitter components to align with the nested GEMM fusion model.
Overall impact and accomplishments:
- Improved GPU GEMM performance and stability by enforcing a single, modern nested GEMM fusion path, reducing divergence between backends.
- Decreased risk from legacy code paths, enabling faster iteration on kernel optimizations.
- Improved maintainability with MLIR/C++ cleanup and streamlined test configurations.
- Strengthened business value by delivering faster tensor ops and more predictable autotuning for GPU workloads.
Technologies/skills demonstrated:
- Triton integration and nested GEMM fusion concepts
- GPU backends (Intel-tensorflow/xla, ROCm/tensorflow-upstream)
- MLIR-based operation creation, code cleanup, and test refactoring
- Autotuning strategies and safety checks
- Cross-repo collaboration and change management for performance upgrades
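The hardened autotuning flow described above can be sketched as a config loop that skips any candidate the nested-fusion rewrite rejects, rather than falling back to a legacy emitter. The names `try_nest_fusion` and `measure` are hypothetical stand-ins for the real rewriter and profiler:

```python
def pick_best_config(configs, try_nest_fusion, measure):
    """Sketch of a hardened autotuning loop: configurations that
    fail nested-GEMM-fusion rewriting are skipped instead of being
    misrouted to a removed legacy path."""
    best, best_time = None, float("inf")
    for cfg in configs:
        fused = try_nest_fusion(cfg)
        if fused is None:  # nesting failed: skip this config entirely
            continue
        t = measure(fused)
        if t < best_time:
            best, best_time = cfg, t
    return best
```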
October 2025 performance-focused delivery across TensorFlow, XLA, and JAX with emphasis on GPU GEMM performance, fusion reliability, and hermetic builds. Key outcomes include enabling the generic Triton emitter by default for all GEMMs, introducing 16-byte Split-K padding to support pipelining, relaxing nested GEMM fusion constraints, and modernizing vendored dependencies into hermetic rules with a clear tf_vendored path parameter. These changes uplift GPU compute efficiency, reduce build fragility, and improve reproducibility for production deployments.
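The 16-byte Split-K padding amounts to rounding each K slice up to a 16-byte boundary so pipelined loads stay aligned. A hypothetical arithmetic sketch (not the actual XLA helper; names and rounding policy are assumptions for illustration):

```python
def pad_k_for_split_k(k, split_k, bytes_per_element):
    """Round K up so each of the split_k slices is a whole number
    of 16-byte units, as aligned, pipelined loads require."""
    assert 16 % bytes_per_element == 0
    elems_per_16b = 16 // bytes_per_element               # e.g. 8 for f16
    slice_k = -(-k // split_k)                            # ceil(k / split_k)
    slice_k = -(-slice_k // elems_per_16b) * elems_per_16b  # round up to alignment
    return slice_k * split_k

# f16 (2 bytes), K=1000 split 4 ways: each 250-element slice is
# rounded to 256, giving a padded K of 1024.
```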
September 2025 performance summary: Delivered substantive XLA and TensorFlow backend improvements across Intel-tensorflow/xla, Intel-tensorflow/tensorflow, and jax-ml/jax. The core work focused on Triton XLA backend pipeline optimizations, GPU indexing/reshape correctness fixes, and build-system/toolchain enhancements enabling raft-based distributed workloads. Reverted an unstable select_k GPU path to restore stable TopK behavior, and implemented API/build-cleanup changes to reduce surface area. A targeted JAX cleanup removed an obsolete repository rule. The month yielded higher GPU performance, more reliable releases, and a stronger foundation for distributed workloads in production.
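The restored TopK behavior follows the standard semantics: return the k largest values in descending order. A reference-level sketch in plain Python (illustrative only; the GPU kernel reverted here is of course a different implementation):

```python
import heapq

def topk(values, k):
    """Reference TopK: the k largest values, sorted descending."""
    return heapq.nlargest(k, values)
```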
August 2025 monthly summary focused on delivering high-impact GPU and Triton XLA back-end improvements across multiple repositories, driving performance, reliability, and maintainability. Highlights include expanded fused GEMM capabilities with broadcast support, enhanced transpose folding for codegen efficiency, and hardened memory operand handling in Triton XLA, along with upstream alignment and stability fixes.
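Transpose folding rests on the identity (A·B)ᵀ = Bᵀ·Aᵀ: a transpose of a GEMM result can be absorbed into the GEMM by swapping and transposing its operands, removing a copy. A reference-level check in plain Python (illustrative only, using nested lists rather than codegen):

```python
def matmul(a, b):
    """Plain row-major matrix multiply on nested lists (reference only)."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)]
            for row in a]

def transpose(m):
    return [list(r) for r in zip(*m)]

# transpose(matmul(a, b)) == matmul(transpose(b), transpose(a))
# is the identity that makes the folding legal.
```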
July 2025 performance summary focusing on backend optimization, stability, and build reliability across multiple repos. Key work centered on Triton XLA squeeze-dims pass implementations and refinements, alongside infrastructure refinements and build-system improvements that enhance GPU codegen, developer productivity, and pipeline stability.
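A squeeze-dims pass removes size-1 dimensions from tensor shapes so downstream codegen sees lower-rank operations. A minimal sketch of the shape rewrite (illustrative; the actual pass operates on Triton XLA IR, not Python lists):

```python
def squeeze_dims(shape):
    """Drop size-1 dimensions; also return the indices of the kept
    axes so downstream indexing can be remapped to the new rank."""
    kept = [i for i, d in enumerate(shape) if d != 1]
    return [shape[i] for i in kept], kept

# A [1, 128, 1, 64] tensor becomes [128, 64], with original
# axes 1 and 3 surviving the rewrite.
```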
June 2025 performance summary: Delivered substantial GPU fusion and Triton integration work across the XLA and ROCm stacks, improving robustness and performance for ML workloads. Key initiatives include NestGemmFusion bitcast hoisting and shape handling improvements with support for non-default data layouts; Triton integration upgrades (branch-1.8) and GPU pipeline enhancements; cross-repo alignment to support Blackwell, Hopper, and AMD GPUs; Triton integration in jaxlib; and continued optimization of nested GEMM fusion. These changes translate to higher fusion coverage, improved GPU throughput, and broader hardware compatibility, enabling faster model training and inference with fewer layout/shape edge-case issues.
May 2025 performance summary: Delivered substantial int4 support and fusion improvements across ROCm/xla, Intel-tensorflow/xla, and ROCm/tensorflow-upstream. Key work includes stabilizing the int4 data path in GPU backends, enhancing the Triton fusion emitter, and consolidating MLIR/int4 testing. These changes improve performance and correctness for low-precision workloads on GPUs, reduce regression risk, and lay groundwork for broader int4 adoption.
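int4 support hinges on a packed storage layout: two 4-bit values per byte. A minimal sketch of one such layout, low nibble first (illustrative only; the XLA packing conventions may differ):

```python
def pack_int4(values):
    """Pack unsigned 4-bit values two per byte, low nibble first."""
    assert all(0 <= v < 16 for v in values) and len(values) % 2 == 0
    return bytes(values[i] | (values[i + 1] << 4)
                 for i in range(0, len(values), 2))

def unpack_int4(packed):
    """Inverse of pack_int4: recover the original 4-bit values."""
    out = []
    for b in packed:
        out += [b & 0xF, b >> 4]
    return out
```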
April 2025 monthly summary for performance review. Across ROCm/xla, triton-lang/triton, jax-ml/jax, ROCm/jax, google/xls, google/heir, and ROCm/tensorflow-upstream, the team delivered significant build-system modernization, Triton/XLA integration improvements, and build configuration cleanups that reduce maintenance burden and enable faster iteration on performance-critical workloads.
March 2025 monthly summary focusing on delivering GPU/XLA features, cleaning up sparsity paths, and improving code health across Triton integrations. Business value was achieved through performance-oriented feature delivery, reduced maintenance burden, and more reliable builds and integrations across XLA GPU, Triton, and JAX backends.
February 2025 monthly summary for ROCm/xla and OpenXLA Triton integration. Focused on stabilizing GPU fusion handling, refactoring for maintainability, and aligning workspace and build configurations with Triton/OpenXLA updates. The work delivered stronger GPU fusion correctness, improved test coverage, and groundwork for broader OpenXLA compatibility across TritonGPU and AMDGPU backends.
January 2025 performance summary focusing on stability, correctness, and expanded Triton/XLA integration across three repos. Key outcomes include targeted bug fixes in linear algebra operations, header dependency reductions, safer memory-management improvements, and broader codegen/test support that enable more reliable production usage and faster development cycles.
November 2024 focused on stability and reliability of JAX tests on Ampere GPUs for Triton sparsity extensions in ROCm/jax. Implemented targeted test guards, adjusted assertion semantics, and re-enabled tests after addressing root issues. All changes improve CI reliability, user confidence, and hardware-specific behavior visibility.
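The targeted test guards follow a common pattern: skip hardware-specific tests unless the required compute capability is available. A hedged sketch using Python's unittest (the probe and threshold here are hypothetical; real suites query the GPU runtime):

```python
import unittest

def gpu_compute_capability():
    # Hypothetical probe; a real suite queries the driver/runtime.
    return (8, 0)  # Ampere-class for this sketch

class SparsityTest(unittest.TestCase):
    @unittest.skipUnless(gpu_compute_capability() >= (8, 0),
                         "Triton sparsity extensions require Ampere (sm_80)+")
    def test_sparse_matmul(self):
        self.assertTrue(True)  # placeholder for the real sparsity check
```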
Overview of all repositories contributed to across the timeline.