|
|||||||||||
|
Re: Usage of journal in Bayesian Filtering.
From: Matt Kettler <mkettler_sa(at)verizon.net>
Date: Thu Aug 30 2007 - 09:43:27 EDT
If it read bayes_journal while scanning, it would defeat the purpose of the journal. The journal exits to be more readily writable. This is possible only because it is rarely read from. If you read from the journal during scans, the write lock wouldn't be any more available than the write lock for the main tokens database, so you might as well use that for all your writes. Data is merged from the journal into the tokens at regular intervals as a part of SA's automatic sync process (once a day), when you run sa-learn --sync, or sa-learn --force-expire.
This in general means data in the journal doesn't "go live" until a sync
kicks off. This is why bayes_learn_to_journal defaults to 0. It improves
learning performance, but also introduces a "lag" where the results
don't take effect until there's a sync.
This archive was generated by hypermail 2.1.8 : Fri Oct 26 2007 - 03:23:51 EDT |
||||||||||
|
|||||||||||