Table maintenance: order of operations important?
Jeff Boes
2004-05-20 13:48:34 UTC
We have a number of tables from which we expire (delete) rows nightly.
Is there any value to ordering the operations, in terms of either table
physical organization or elapsed time?

E.g.,

DELETE FROM foo WHERE date_expires < now();
VACUUM ANALYZE foo;
CLUSTER foo;
REINDEX TABLE foo;

How would you choose to order these (under 7.4.x) for fastest
turn-around? Does it matter?
--
It may not always be easy, convenient, or politically correct to stand
for truth and right, but it is the right thing to do. Always.
________ --M. Russell Ballard
Jeffery Boes <>< ***@qtm.net
Tom Lane
2004-05-20 17:52:12 UTC
Post by Jeff Boes
DELETE FROM foo WHERE date_expires < now();
VACUUM ANALYZE foo;
CLUSTER foo;
REINDEX TABLE foo;
How would you choose to order these (under 7.4.x) for fastest
turn-around? Does it matter?
If you are going to CLUSTER then the VACUUM and the REINDEX are both
utterly redundant. The ANALYZE is still useful but should be done after
CLUSTER since its physical-order-correlation stats will be quite wrong
if done beforehand. In other words there is only one sane way to do
this and it is

DELETE FROM foo WHERE date_expires < now();
CLUSTER foo;
ANALYZE foo;

You could possibly make a case for

DELETE FROM foo WHERE date_expires < now();
CLUSTER foo;
VACUUM ANALYZE foo;

The VACUUM won't do anything useful in terms of reclaiming space (there
being none to reclaim just after a CLUSTER) but it would ensure that all
rows in the table are marked as committed-good, rather than leaving that
work to be done by the first transaction that happens to hit each row.

regards, tom lane
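
A minimal sketch of the sequence recommended above, written out as a nightly script. The index name foo_date_expires_idx is hypothetical and not from the thread; it assumes an index on date_expires exists. On 7.4.x, the bare form CLUSTER foo reuses whichever index the table was last clustered on, so the index has to be named once beforehand. The final query is optional and only shows where the physical-order correlation statistic can be inspected after ANALYZE.

```sql
-- One-time setup (hypothetical index name, assumed to exist on foo.date_expires):
-- records which index later "CLUSTER foo" calls will reuse.
CLUSTER foo_date_expires_idx ON foo;

-- Nightly maintenance, in the order given above.
DELETE FROM foo WHERE date_expires < now();

-- Rewrites the table in index order and rebuilds its indexes,
-- which is why a separate VACUUM and REINDEX add nothing here.
CLUSTER foo;

-- Run after CLUSTER so the correlation statistics reflect the new physical order.
ANALYZE foo;

-- Optional check: per-column correlation between physical and logical order.
SELECT attname, correlation
FROM pg_stats
WHERE tablename = 'foo';
```

If the variant ending in VACUUM ANALYZE is preferred, only the ANALYZE line changes; the order of DELETE and CLUSTER stays the same.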

Gaetano Mendola
2004-05-20 18:53:51 UTC
Post by Tom Lane
Post by Jeff Boes
DELETE FROM foo WHERE date_expires < now();
VACUUM ANALYZE foo;
CLUSTER foo;
REINDEX TABLE foo;
How would you choose to order these (under 7.4.x) for fastest
turn-around? Does it matter?
If you are going to CLUSTER then the VACUUM and the REINDEX are both
utterly redundant.
Without CLUSTER, on 7.4.2, is a REINDEX redundant after a VACUUM FULL?

Regards
Gaetano Mendola
