Hacker News new | ask | show | jobs
by mratsim 2828 days ago
Very impressive, kudos. I'd like screenshots rather than PSNR and SSIM because those don't translate into human perception of a good encode.

The only grip I have is the 13k lines of C + intrinsics.

3 comments

I've add screenshot of first intra frame and x264.264 to compare in players.
That's most DSP code. Take a look at FFTW or SOKit.
> The only grip I have is the 13k lines of C + intrinsics.

How are you supposed to get acceptable performance without intrinsics?

Especially if it's meant to be an educational and in any case non-realtime encoder, intrinsics are not really necessary and make the code less portable.

I also wonder how much the compiler can do autovectorisation on code like this --- it's pretty much exactly the type of code that autovectorisation is intended for.

Edit: I noticed in the benchmark that it compressed the 10s foreman.cif (demo video) in half a second, so it's already 20x faster than realtime on that small resolution.

It's also single core comparison. Multi-threaded encoding is supported too: with 4 threads I get 0,190s for minih264 and 0,135s for x264.