Hacker News new | ask | show | jobs
Woodwork – An Open Source Python Library for Rich Semantic Data Typing (innovation.alteryx.com)
2 points by gsheni 1497 days ago
2 comments

I’ve been creating an open source library for rich semantic data typing, which helps you managing and communicating data typing information. If you’re curious what this library can do, check out our new blog post!
This is really interesting!

Have you given much thought to either type annotations or decorators. One or both of them could potentially be useful for attaching schema information to the DataFrame inputs/outputs of functions and for reducing boilerplate when you might want to validate those inputs/outputs before or after calling the functions.

I haven't given much thought to type annotations.

pandas recommends using accessors to extend DataFrames (as we do in Woodwork), which are actually just decorators: https://pandas.pydata.org/pandas-docs/stable/development/ext...