|
|
|
|
|
by StrangeDoctor
2654 days ago
|
|
I built tools to parse the compressed XML dumps. My computer was pretty underpowered at the time (MacBook air) so I had to very careful to make everything a streaming algorithm. Looking back I basically recreated a shitty map reduce in Python. |
|