| HN Mirror

Y	Hacker News new \| ask \| show \| jobs

by hildaman 2758 days ago

I'll tell you why - try to update one column in 100GB worth of row data.

Postgress makes a copy of _every_ row and you need 100GB of extra space on the hard-drive until you commit the transaction. Now extrapolate to a 1TB table that needs updating.

Oracle has a way of doing this w/o copying the entire row.

3 comments

snuxoll 2758 days ago

> Postgress makes a copy of _every_ row and you need 100GB of extra space on the hard-drive until you commit the transaction.

This only happens if the column is indexed, heap-only-tuples will allow in-place updates otherwise. This doesn't dismiss it as a potential problem entirely, but depending on your needs you may never run into this.

link

djd20 2758 days ago

I would argue thats an issue with your architecture at that point - you may want to use table partitioning at sizes that big, or have some other mechanism in place to be able to lock access while updating such large data sets in one go.

link

snuxoll 2758 days ago

I would generally agree, though there are unfortunately limitations imposed on you the moment you start using partitioning with PostgreSQL (foreign keys remain to be a big one).

link

Tostino 2758 days ago

Not as of PG 11

link

snuxoll 2758 days ago

You still can’t make FK references TO partitioned tables, unfortunately.

link

nly 2758 days ago

So what's the workaround? Drop the index, update, then re-establish the index?

link

gopalv 2758 days ago

> Postgress makes a copy of _every_ row