Y
Hacker News
new
|
ask
|
show
|
jobs
by
cdubzzz
3499 days ago
It is actually mbox. I should have provided more detailed numbers - in my Takeout file, for example, there are 91,360 chat messages and only 23,407 email messages .
1 comments
dmd
3499 days ago
So there's 23407 email messages but only one file containing all of them?
link
cdubzzz
3499 days ago
Yep! All messages are in a single mbox file and it's 3.2GB.
cdubz@professor-farnsworth ~/data $ du -h Mail-chris.mbox 3.2G Mail-chris.mbox
link
dmd
3499 days ago
Wow. I stand corrected and that's awful. Yet another reason to use gmvault!
link
icebraining
3499 days ago
Why it awful? As an archive, seems decent.
link
dmd
3499 days ago
(1) inconsistent escaping rules (dealing with the literal string \nFrom)
(2) easy to corrupt
link
creeble
3499 days ago
Worth noting that Google provides some Python sample code for parsing the file which works great.
link