Hacker News new | ask | show | jobs
by sirfz 3335 days ago
Utilizing multiprocessing for reading and processing jsons (or any type of data) then feeding the output into a shuffle_batch* op works great for me.