EXCEEDS logo
Exceeds
zhihe-wang

PROFILE

Zhihe-wang

Developed ASCEND NPU training support for Qwen3-8B and Qwen3-14B models in the volcengine/verl repository, focusing on scalable large-model training workflows. Leveraged Direct Alignment Policy Optimization (DAPO) to enable efficient model training on ASCEND hardware, and created shell scripts to automate job configuration, including model paths, data inputs, and hardware-specific performance settings. Documented the setup and parameterization process to enhance reproducibility and streamline onboarding for future contributors. Utilized deep learning and machine learning expertise, along with shell scripting and RST for documentation, to deliver a robust, hardware-optimized training pipeline that prepares the team for further scalable iteration.

Overall Statistics

Feature vs Bugs

100%Features

Repository Contributions

1Total
Bugs
0
Commits
1
Features
1
Lines of code
282
Activity Months1

Work History

July 2025

1 Commits • 1 Features

Jul 1, 2025

Concise monthly summary for 2025-07 focusing on business value and technical achievements for volcengine/verl. Delivered ASCEND NPU training support for Qwen3-8B and Qwen3-14B using DAPO, along with automation scripts to streamline training workflows. This month’s efforts improved large-model training capability on ASCEND hardware and prepared the team for scalable iteration with optimized performance settings.

Activity

Loading activity data...

Quality Metrics

Correctness80.0%
Maintainability80.0%
Architecture80.0%
Performance80.0%
AI Usage20.0%

Skills & Technologies

Programming Languages

RSTShell

Technical Skills

Deep LearningMachine LearningModel TrainingNPU AccelerationShell Scripting

Repositories Contributed To

1 repo

Overview of all repositories you've contributed to across your timeline

volcengine/verl

Jul 2025 Jul 2025
1 Month active

Languages Used

RSTShell

Technical Skills

Deep LearningMachine LearningModel TrainingNPU AccelerationShell Scripting