|
|
|
|
|
by internetter
381 days ago
|
|
storage is cheap, but if you wanted to improve this: 1. find a way to dedup media 2. ensure content blockers are doing well 3. for news articles, put it through readability and store the markdown instead. if you wanted to be really fancy, instead you could attempt to programatically create a "template" of sites you've visited with multiple endpoints so the style is retained but you're not storing the content. alternatively a good compression algo could do this, if you had your directory like /home/andrew/archive/boehs.org.tar.gz and inside of the tar all the boehs.org pages you visited are saved 4. add fts and embeddings over the pages |
|