EXCEEDS logo
Exceeds
Henrik Gramner

PROFILE

Henrik Gramner

Over a three-month period, gramner@twoorioles.com developed and optimized VP9 video codec features for the FFmpeg and ossrs/ffmpeg-webrtc repositories, focusing on low-level performance improvements. They implemented AVX-512ICL and AVX2 assembly optimizations for VP9 decoding, including sub-pixel motion compensation, intra prediction, and inverse transforms, targeting both 8-bit and 10-bit video paths. Their work introduced new decoding paths, helper functions, and initialization routines in C and assembly, enabling higher throughput and reduced CPU usage on modern x86 hardware. By removing obsolete code and refining SIMD instruction usage, they improved maintainability and decoding efficiency for real-time and streaming applications.

Overall Statistics

Feature vs Bugs

100%Features

Repository Contributions

6Total
Bugs
0
Commits
6
Features
4
Lines of code
6,297
Activity Months3

Your Network

319 people

Work History

September 2025

3 Commits • 2 Features

Sep 1, 2025

September 2025 (FFmpeg/FFmpeg) focused on performance optimization for 8-bit VP9 decoding on AVX2-capable CPUs, plus targeted maintenance to simplify the codebase. Delivered two AVX2-based optimizations for 8-bit VP9 intra prediction and inverse transforms and removed an obsolete 8-bit AVX2 VP9 inverse transform implementation to reduce code size and compilation time. These changes improve decoding throughput on supported hardware, lower maintenance burden, and illustrate robust low-level optimization and refactoring capabilities.

August 2025

1 Commits • 1 Features

Aug 1, 2025

August 2025: FFmpeg/FFmpeg delivered a high-impact performance optimization for VP9 sub-pixel motion compensation using AVX-512ICL. The change introduces AVX-512ICL assembly optimizations for 8-bit-per-pixel sub-pixel interpolation, along with new helper functions/macros and updates to initialization routines to cover multiple sub-pixel scenarios. Expected throughput improvements on AVX-512-capable CPUs for VP9 workloads; commit referenced below. This work strengthens encoding/decoding efficiency and contributes to better streaming performance on modern hardware.

May 2025

2 Commits • 1 Features

May 1, 2025

May 2025 — ossrs/ffmpeg-webrtc: Delivered VP9 AVX-512ICL optimization, targeting 16x16 and 32x32 blocks for 8-bit and 10-bit decoding. The change includes new decoding paths and inverse transforms, enabling faster VP9 decode on AVX-512ICL-capable hardware and improving real-time WebRTC throughput.

Activity

Loading activity data...

Quality Metrics

Correctness100.0%
Maintainability83.4%
Architecture80.0%
Performance93.4%
AI Usage20.0%

Skills & Technologies

Programming Languages

AssemblyC

Technical Skills

AVX-512AVX-512ICLAVX2 OptimizationAssembly LanguageAssembly Language ProgrammingC ProgrammingFFmpegFFmpeg DevelopmentLow-level OptimizationSIMD InstructionsVP9Video Codec DevelopmentVideo Codec OptimizationVideo Codecsx86 Assembly

Repositories Contributed To

2 repos

Overview of all repositories you've contributed to across your timeline

FFmpeg/FFmpeg

Aug 2025 Sep 2025
2 Months active

Languages Used

AssemblyC

Technical Skills

AVX-512Low-level OptimizationVideo Codec Optimizationx86 AssemblyAVX2 OptimizationAssembly Language

ossrs/ffmpeg-webrtc

May 2025 May 2025
1 Month active

Languages Used

AssemblyC

Technical Skills

AVX-512ICLAssembly Language ProgrammingC ProgrammingFFmpeg DevelopmentLow-level OptimizationSIMD Instructions