Hacker News new | ask | show | jobs
by crashabr 65 days ago
The visual cleaning idea is really interesting. Would you mind sharing more details?
1 comments

It's nothing revolutionary.

It's essentially a table layout with a plus button at the bottom. When you click it adds a new step as a row, then you pick the operation, the input columns and output column name.

If you want to add another step you click the plus again and add another row the same way. Each row can access any table field or output field defined above it in the DAG.

Then in Python a for loop runs over the steps in order and updates the data frame in place (well, in function, returning the new one). It uses a dictionary of function mappings and resolves input fields with kwargs.