Sure, I understand the idea that a GIL is faster than fine-grained locking for single-threaded apps. That's not how I read the article, though (maybe that's what they meant...).
In most cases, the ordering from fastest to slowest is: single-threaded code with no locking at all, single-threaded code under a GIL, single-threaded code with fine-grained locks.
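To make that ordering concrete, here is a rough sketch that counts lock acquisitions instead of timing anything (timings vary too much between machines to be worth quoting). Everything in it is made up for illustration: `CountingLock` and the `run_*` functions are hypothetical helpers, and holding one big lock for the whole run is only a loose stand-in for a GIL, since a real interpreter releases and reacquires its GIL periodically.

```python
import threading


class CountingLock:
    """Hypothetical helper: a threading.Lock that counts its acquisitions."""

    def __init__(self):
        self._lock = threading.Lock()
        self.acquisitions = 0

    def __enter__(self):
        self._lock.acquire()
        self.acquisitions += 1
        return self

    def __exit__(self, exc_type, exc, tb):
        self._lock.release()
        return False


def run_unlocked(counters, rounds):
    # Plain single-threaded code: no locking cost at all.
    for _ in range(rounds):
        for i in range(len(counters)):
            counters[i] += 1


def run_global_lock(counters, rounds, big_lock):
    # One coarse lock around the whole workload (loosely GIL-like).
    with big_lock:
        for _ in range(rounds):
            for i in range(len(counters)):
                counters[i] += 1


def run_fine_locks(counters, rounds, locks):
    # One lock per object, taken on every single update.
    for _ in range(rounds):
        for i in range(len(counters)):
            with locks[i]:
                counters[i] += 1


def lock_acquisitions(n_counters=4, rounds=1000):
    big_lock = CountingLock()
    fine_locks = [CountingLock() for _ in range(n_counters)]

    run_unlocked([0] * n_counters, rounds)
    run_global_lock([0] * n_counters, rounds, big_lock)
    run_fine_locks([0] * n_counters, rounds, fine_locks)

    return {
        "none": 0,
        "global": big_lock.acquisitions,
        "fine": sum(lock.acquisitions for lock in fine_locks),
    }


if __name__ == "__main__":
    print(lock_acquisitions())
```

The three variants do the same work on one thread; the only difference is how often a lock is taken, which is where the extra cost of fine-grained locking comes from even when there is no contention.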