by giantrobot
831 days ago
I've had the same problem with Fandom née Wikia dumps. Just gigabytes of XML with questionable adherence to schemas. Fandom also has a ton of custom-to-Fandom tags which are a further pain to handle. Pulling useful content out of the dumps has been an exercise in frustration. I'm sure I could figure something out if I had a bunch of time to dedicate to the effort. If I just had sqlite dumps they'd be trivial to work with and I'd be much happier with them.
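For illustration, here is a rough sketch of what a first pass at turning such a dump into SQLite might look like. It assumes the dump follows the usual MediaWiki export layout (`<page>` elements containing `<title>` and `<revision><text>`), which may not hold for every Fandom dump. The function name `load_dump` and the `pages` table are made up for this example. It ignores XML namespaces on purpose, since the export version can differ between dumps, and it does nothing about the Fandom-specific tags: the wikitext is stored raw.

```python
import sqlite3
import xml.etree.ElementTree as ET


def local(tag):
    """Drop any '{namespace}' prefix so differing export versions still match."""
    return tag.rsplit("}", 1)[-1]


def load_dump(xml_source, db_path=":memory:"):
    """Stream pages from a MediaWiki-style XML dump into a SQLite table.

    Hypothetical helper: xml_source is a filename or a binary file object.
    """
    conn = sqlite3.connect(db_path)
    conn.execute("CREATE TABLE IF NOT EXISTS pages (title TEXT, text TEXT)")
    # iterparse yields elements as they close, so the whole file is never
    # held in memory as one tree.
    for _event, elem in ET.iterparse(xml_source, events=("end",)):
        if local(elem.tag) != "page":
            continue
        title, text = None, None
        for child in elem.iter():
            name = local(child.tag)
            if name == "title":
                title = child.text
            elif name == "text":
                # With several revisions, the last one in the page wins.
                text = child.text or ""
        conn.execute("INSERT INTO pages VALUES (?, ?)", (title, text))
        elem.clear()  # drop the page's children to keep memory use low
    conn.commit()
    return conn
```

Once the rows are in, something like `conn.execute("SELECT title FROM pages WHERE text LIKE ?", ("%Infobox%",))` is enough to poke around. A badly malformed dump would still make the parser raise `ParseError`, so this only helps with the well-formed part of the problem.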