| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by grayclhn 1913 days ago
	Long expressions with matrix operations is a pretty standard example. When people talk about operator overloading in data science, they usually mean “standard operations on various arrays of numbers,” which are defined in common libraries or the programming language. Not “I need to define my own ad hoc equalities.”

1 comments

marcus_holmes 1913 days ago

yeah, I get this. If there are standard definitions of operations that everyone understands, that's fine.

But I always think that maybe we should be using new operators for this, instead of overloading existing ones that have other, different, meanings in different contexts.

link

grayclhn 1913 days ago

In a data science context, the key operations are math, so overloading makes a lot of sense and is massively helpful in implementing algorithms and equations. I go back and forth on the wisdom of some of the other common uses — filtering, etc. In addition to the problems that have been mentioned, there are often hidden and infrequent but painful performance issues.

link

marcus_holmes 1913 days ago

I often think that maths could use the same slap around the chops. Less arcane operators and symbols, more explicit function names please!

link

grayclhn 1913 days ago

I consider it kind of important that the notation for expressions like “A²” doesn’t depend on whether A is an integer, real number, complex number, matrix, random variable, etc., (even if the results do) or what the specific domain is, but if you feel like it’s important to embed all of that context in the exponent operator... give it a try :)

(And whether “2” is integer, real, rational, complex, etc)

link

marcus_holmes 1913 days ago

Yeah, but operator overloading doesn't say any of that. You have no idea what the "^" operator does, depending on the operands

link

FridgeSeal 1913 days ago

Ehhh, it would seem that way, but the compactness of the syntax functions to get out of the way and help you understand the overall structure. Having longer function names ends up getting in your way more often than not in my experience.

link