Because if it really would use something like malloc, I don't really see how it should be faster than the C version.