;doc: import: deduplication: edits
This commit is contained in:
parent
080184d2e5
commit
3f184a848a
@ -36,18 +36,17 @@ And after a successful import, it updates the `.latest.FILE`(s) for next time (u
|
|||||||
|
|
||||||
This is a limited kind of deduplication, let's call it "date skipping".
|
This is a limited kind of deduplication, let's call it "date skipping".
|
||||||
Within each input file, it avoids reprocessing the same dates across successive runs.
|
Within each input file, it avoids reprocessing the same dates across successive runs.
|
||||||
This is a simple system that works fairly well for transaction data.
|
This is a simple system that works for most real-world CSV files;
|
||||||
It assumes:
|
it assumes these are true, or true enough:
|
||||||
|
|
||||||
1. new items always have the newest dates
|
1. new items always have the newest dates
|
||||||
2. item dates are stable across successive downloads
|
2. item dates are stable across successive downloads
|
||||||
3. the order of same-date items is stable across downloads
|
3. the order of same-date items is stable across downloads
|
||||||
4. the name of the input file is stable across downloads
|
4. the name of the input file is stable across downloads
|
||||||
|
|
||||||
These are true of most CSV files representing transactions, or true enough.
|
If you have a bank whose CSV dates or ordering occasionally change,
|
||||||
If you have a bank whose CSV dates or ordering change occasionally,
|
you can reduce the chance of this happening in new transactions by importing more often,
|
||||||
you can reduce the chance of this happening in new transactions by importing more often
|
and in old transactions it doesn't matter.
|
||||||
(and in old transactions it doesn't matter).
|
|
||||||
And remember you can use CSV rules files as input, which is one way to ensure a stable file name.
|
And remember you can use CSV rules files as input, which is one way to ensure a stable file name.
|
||||||
|
|
||||||
`import` doesn't detect other kinds of duplication, such as duplicate transactions within a single run.
|
`import` doesn't detect other kinds of duplication, such as duplicate transactions within a single run.
|
||||||
|
|||||||
Loading…
Reference in New Issue
Block a user