I don't know if I would call a math library that uses templates so liberally "high performance". High performance also includes compile time in my opinion.
I get the template hate, they take a while to wrap your head around and can create cryptic bugs. Nonetheless they can be extremely powerful and enable performance and reduced complexity by being a bit complex upfront.