|
|
|
|
|
by ethink
1952 days ago
|
|
Hello guys, As cleaning data takes most of our time in data science tasks I've created an ebook to make the command line as easy as possible to do that task. The ebook includes code snippets using the terminal dealing with lots of data from the COVID Tracking Project, Reddit users, a scientific paper discussing clickbait and non-clickbait article headlines, and more. Used some GNU, BSD commands and command-line utilities like csvkit and the fastest tool: xsv. Some benchmark results included as well. Be one of the first 10 who gets this ebook for free: How to Clean Data at the Command Line Would love to see your feedback, Thanks! |
|