Pantek Library
Hosting Provided By
CybrHost
High Speed Hosting

Re: [TLM] Re: [GENERAL] How to insert on duplicate key?

From: Samantha Atkins <sjatkins(at)mac.com>
Date: Thu Dec 27 2007 - 12:23:34 EST

On Dec 24, 2007, at 11:15 PM, Greg Smith wrote:

>>
>
> This may be better because it isn't doing the query first. You may
> discover that you need to aggressively run one of the VACUUM
> processes (I'd guess regular and ANALYZE but not FULL) in order to
> keep performance steady as the number of records grows. Anytime you
> update a row, that becomes a dead row that's still taking up space,
> and if you do a lot of those they get in the way of finding the rows
> that are still live. Take a look at http://www.postgresql.org/docs/current/interactive/routine-vacuuming.html
> to get an idea of the process.

Whoa. I am going to have to dig into the implementation. What is wrong with update in place, concurrency issues? The dead row presumably is no longer indexed, right? Since it is known to be dead is it automatically removed when there are no live transaction that reference or may reference it and its data page space marked available for new rows? If not, why not? I'm dredging my mind for stuff from my RDBMS implementation grad course a very long time ago. I would imagine that vacuuming often in a huge insert update would be a pretty poor performer depending on implementation. How is this implemented? Why would it be heavy IO if a list of pointers effectively is being kept to the dead rows to simply be added to the free list? What else is vacuum doing? Lookup implemented removal from indices, something else?

  • samantha
---------------------------(end of broadcast)---------------------------
TIP 5: don't forget to increase your free space map settings Received on Thu Dec 27 12:37:42 2007

This archive was generated by hypermail 2.1.8 : Mon Jun 16 2008 - 22:58:25 EDT


Contact Us  Legal Notices  Order Services Online 
Pantek Home  Privacy Policy  IT news  Site Map  Pantek Library