by sixhobbits 29 days ago

I do some similar charting with Telegram data dumps, which you can still get from the "Telegram Lite" desktop app even though they removed the export functionality from the main app.

For removing noise you might want to look into TF-IDF instead of the manual method described in the post, which I didn't understand. It basically treats words that are common across the whole corpus as noise, and words that appear much more often within a specific chat than in the whole dataset as interesting.
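A minimal sketch of that idea, assuming each chat is treated as one "document" and is already tokenised into a list of lowercase words. The function name `tfidf_per_chat` and the sample data are made up for illustration, and the weighting is the plain `tf * log(N / df)` variant rather than anything from the post:

```python
import math
from collections import Counter

def tfidf_per_chat(chats):
    """chats: dict of chat name -> list of tokens (hypothetical input shape).
    Returns dict of chat name -> {word: tf-idf score}."""
    n_chats = len(chats)
    # Document frequency: in how many chats does each word appear at least once?
    doc_freq = Counter()
    for tokens in chats.values():
        doc_freq.update(set(tokens))

    scores = {}
    for name, tokens in chats.items():
        counts = Counter(tokens)
        total = len(tokens) or 1  # avoid dividing by zero for an empty chat
        scores[name] = {
            # A word present in every chat gets log(1) = 0, i.e. it is treated as noise.
            word: (count / total) * math.log(n_chats / doc_freq[word])
            for word, count in counts.items()
        }
    return scores

# Toy data, not real chat content.
chats = {
    "alice": "the cat sat on the mat".split(),
    "bob": "the dog chased the ball".split(),
    "carol": "the cat and the dog".split(),
}
scores = tfidf_per_chat(chats)
top_for_alice = sorted(scores["alice"], key=scores["alice"].get, reverse=True)[:3]
```

Words that occur in every chat score zero, while words concentrated in one chat float to the top of that chat's ranking.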

You can also do some fun stuff by finding phrases that are used asymmetrically, e.g. more by one person in the convo than the other, or whose usage shifts over time.
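One way to sketch the asymmetry idea is a smoothed log-ratio of each word's relative frequency between the two people. This assumes single words rather than phrases (n-grams could be fed in the same way), and `usage_log_ratio` is a hypothetical helper name:

```python
import math
from collections import Counter

def usage_log_ratio(tokens_a, tokens_b):
    """Return {word: log ratio}; positive means person A uses it relatively more,
    negative means person B does. Add-one smoothing keeps unseen words finite."""
    counts_a, counts_b = Counter(tokens_a), Counter(tokens_b)
    vocab = set(counts_a) | set(counts_b)
    total_a = len(tokens_a) + len(vocab)
    total_b = len(tokens_b) + len(vocab)
    return {
        word: math.log(
            ((counts_a[word] + 1) / total_a) / ((counts_b[word] + 1) / total_b)
        )
        for word in vocab
    }

# Toy messages, already tokenised.
ratios = usage_log_ratio(
    "haha ok haha sure haha".split(),
    "ok sure fine ok sure".split(),
)
most_a_heavy = max(ratios, key=ratios.get)
```

Running the same comparison per month or per year would give the "over time" view.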

Wordclouds per person are also fun!

valzevul 28 days ago

Hi, OP here!

TF-IDF was the first thing I tried - it works great for stopwords, but it doesn't handle cross-language bleed of filler words well, and the short life-event messages ("he died", etc.) use common words and so get aggressively down-weighted.

I had some asymmetry analysis when looking at directional sentiment and per-person question rates - that's fun indeed!

I also went with the Jaccard convergence and the endearment categories instead of wordclouds, so that I could see how word choices change over time.
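For anyone unfamiliar with the term, a rough sketch of a Jaccard-style convergence measure: compare the two people's vocabularies per time period and watch whether the overlap grows. This is an assumption about the general technique, not the post's actual code; the function names and period labels are hypothetical:

```python
def jaccard(tokens_a, tokens_b):
    """Jaccard similarity of two vocabularies: |A ∩ B| / |A ∪ B|."""
    set_a, set_b = set(tokens_a), set(tokens_b)
    union = set_a | set_b
    return len(set_a & set_b) / len(union) if union else 0.0

def convergence_by_period(periods):
    """periods: list of (label, tokens_a, tokens_b) tuples, in time order.
    Returns a list of (label, similarity) pairs."""
    return [(label, jaccard(ta, tb)) for label, ta, tb in periods]

# Toy data: the two vocabularies overlap more in the later period.
trend = convergence_by_period([
    ("2020", "hi there friend".split(), "hello pal".split()),
    ("2021", "hi pal friend".split(), "hi pal mate".split()),
])
```

A rising similarity across periods would suggest the two people's word choices are converging.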

sunaookami 29 days ago

Huh, the export function in Telegram Desktop (https://github.com/telegramdesktop/tdesktop/) is still there. Click on the three dots while in a chat and then Export.

valzevul 28 days ago

In the same app, if you go down Settings -> Advanced -> scroll to the bottom -> Export Telegram data, there is an option to export all chats at once, including some very handy controls like getting only your own messages for large group chats.

sixhobbits 28 days ago

Ah right, I think that might be what's called "Telegram Lite" on the App Store.

These are the two options I see to download https://i.ritzastatic.com/static/1e133ef5057a949b7ddd92e5668...

And the 'main' one that I usually use doesn't have any export settings that I can find:

https://i.ritzastatic.com/static/18db23448a373338766bf419fa0...

sunaookami 26 days ago

Ah yeah Telegram Lite is also an official program, but has fewer features.

dotancohen 29 days ago

I'm not at the desktop right now. Check if the option to do a full (all chats) export still exists.

dotancohen 29 days ago

Was any explanation given as to why the export functionality was removed?
