hledger

Author	SHA1	Message	Date
Hans-Peter Deifel	ae73c525d8	Fix csv rules parsing (#407 ) * csv rules: Show prettier parsing errors This goes from hledger: user error ("ParseError {errorPos = SourcePos {sourceName = \"foo.csv.rules\", sourceLine = Pos 20, sourceColumn = Pos 1} :\| [], errorUnexpected = fromList [Tokens (' ' :\| \"\")], errorExpected = fromList [Label ('b' :\| \"lank or comment line\"),EndOfInput], errorCustom = fromList []}") to hledger: user error (foo.csv.rules:20:1: unexpected space expecting blank or comment line or end of input ) * csv rules: Fix parsing of empty field values A single line containing `account1 ` (note the space at the end) should parse as assignment of the empty string to account1. At least it did until commit `4141067`. The problem is that megaparsec's `space` parses multiple space characters as opposed to parsec. So in the example above it would incorrectly consume the newline. This commit also adds a new test case for this bug.	2016-09-25 12:56:28 -07:00
Simon Michael	72c39470d6	lib: non-journal formats now produce transaction ids #394 Transactions are now numbered consistently during journal finalisation, rather than just in the journal reader. Also transaction knot-tying has been moved out of journalBalanceTransactions.	2016-08-14 12:44:19 -07:00
Simon Michael	4022f5cb61	lib, web: fix some warnings after megaparsec change	2016-07-29 09:55:02 -07:00
Moritz Kiefer	4141067428	Replace Parsec with Megaparsec (see #289 ) (#366 ) * Replace Parsec with Megaparsec (see #289) This builds upon PR #289 by @rasendubi * Revert renaming of parseWithState to parseWithCtx * Fix doctests * Update for Megaparsec 5 * Specialize parser to improve performance * Pretty print errors * Swap StateT and ParsecT This is necessary to get the correct backtracking behavior, i.e. discard state changes if the parsing fails.	2016-07-29 08:57:10 -07:00
Simon Michael	f3bf98bfae	lib: parentheses trying to resolve IDE warning	2016-05-26 15:51:59 -07:00
Simon Michael	90c9735b7a	lib: textification: descriptions & codes Slightly higher (with small files) and lower (with large files) maximum residency, and slightly quicker for all. hledger -f data/100x100x10.journal stats <<ghc: 42858472 bytes, 84 GCs, 193712/269608 avg/max bytes residency (3 samples), 2M in use, 0.000 INIT (0.015 elapsed), 0.016 MUT (0.042 elapsed), 0.011 GC (0.119 elapsed) :ghc>> <<ghc: 42891776 bytes, 84 GCs, 190816/260920 avg/max bytes residency (3 samples), 2M in use, 0.000 INIT (0.004 elapsed), 0.017 MUT (0.025 elapsed), 0.010 GC (0.015 elapsed) :ghc>> hledger -f data/1000x1000x10.journal stats <<ghc: 349575240 bytes, 681 GCs, 1396425/4091680 avg/max bytes residency (7 samples), 11M in use, 0.000 INIT (0.000 elapsed), 0.137 MUT (0.146 elapsed), 0.050 GC (0.057 elapsed) :ghc>> <<ghc: 349927568 bytes, 681 GCs, 1397825/4097248 avg/max bytes residency (7 samples), 11M in use, 0.000 INIT (0.000 elapsed), 0.126 MUT (0.133 elapsed), 0.050 GC (0.057 elapsed) :ghc>> hledger -f data/10000x1000x10.journal stats <<ghc: 3424029496 bytes, 6658 GCs, 11403141/41077288 avg/max bytes residency (11 samples), 111M in use, 0.000 INIT (0.000 elapsed), 1.278 MUT (1.310 elapsed), 0.493 GC (0.546 elapsed) :ghc>> <<ghc: 3427418064 bytes, 6665 GCs, 11127869/37790168 avg/max bytes residency (11 samples), 109M in use, 0.000 INIT (0.001 elapsed), 1.212 MUT (1.229 elapsed), 0.466 GC (0.519 elapsed) :ghc>> hledger -f data/100000x1000x10.journal stats <<ghc: 34306546248 bytes, 66727 GCs, 77030638/414617944 avg/max bytes residency (14 samples), 1012M in use, 0.000 INIT (0.000 elapsed), 12.965 MUT (13.164 elapsed), 4.771 GC (5.447 elapsed) :ghc>> <<ghc: 34340246056 bytes, 66779 GCs, 76983178/416011480 avg/max bytes residency (14 samples), 1011M in use, 0.000 INIT (0.008 elapsed), 12.666 MUT (12.836 elapsed), 4.595 GC (5.175 elapsed) :ghc>>	2016-05-24 19:00:58 -07:00
Simon Michael	770dcee742	lib: textification: comments and tags No change. hledger -f data/100x100x10.journal stats <<ghc: 42859576 bytes, 84 GCs, 193781/269984 avg/max bytes residency (3 samples), 2M in use, 0.000 INIT (0.001 elapsed), 0.016 MUT (0.020 elapsed), 0.009 GC (0.011 elapsed) :ghc>> <<ghc: 42859576 bytes, 84 GCs, 193781/269984 avg/max bytes residency (3 samples), 2M in use, 0.000 INIT (0.001 elapsed), 0.015 MUT (0.018 elapsed), 0.009 GC (0.013 elapsed) :ghc>> hledger -f data/1000x1000x10.journal stats <<ghc: 349576344 bytes, 681 GCs, 1407388/4091680 avg/max bytes residency (7 samples), 11M in use, 0.000 INIT (0.000 elapsed), 0.124 MUT (0.130 elapsed), 0.047 GC (0.055 elapsed) :ghc>> <<ghc: 349576280 bytes, 681 GCs, 1407388/4091680 avg/max bytes residency (7 samples), 11M in use, 0.000 INIT (0.000 elapsed), 0.126 MUT (0.132 elapsed), 0.049 GC (0.058 elapsed) :ghc>> hledger -f data/10000x1000x10.journal stats <<ghc: 3424030664 bytes, 6658 GCs, 11403359/41071624 avg/max bytes residency (11 samples), 111M in use, 0.000 INIT (0.000 elapsed), 1.207 MUT (1.228 elapsed), 0.473 GC (0.528 elapsed) :ghc>> <<ghc: 3424030760 bytes, 6658 GCs, 11403874/41077288 avg/max bytes residency (11 samples), 111M in use, 0.000 INIT (0.002 elapsed), 1.234 MUT (1.256 elapsed), 0.470 GC (0.520 elapsed) :ghc>> hledger -f data/100000x1000x10.journal stats <<ghc: 34306547448 bytes, 66727 GCs, 76805504/414629288 avg/max bytes residency (14 samples), 1009M in use, 0.000 INIT (0.003 elapsed), 12.615 MUT (12.813 elapsed), 4.656 GC (5.291 elapsed) :ghc>> <<ghc: 34306547320 bytes, 66727 GCs, 76805504/414629288 avg/max bytes residency (14 samples), 1009M in use, 0.000 INIT (0.009 elapsed), 12.802 MUT (13.065 elapsed), 4.774 GC (5.441 elapsed) :ghc>>	2016-05-24 19:00:57 -07:00
Simon Michael	c89c33b36e	lib: textification: parse stream 10% more allocation, but 35% lower maximum residency, and slightly quicker. hledger -f data/100x100x10.journal stats <<ghc: 39327768 bytes, 77 GCs, 196834/269496 avg/max bytes residency (3 samples), 2M in use, 0.000 INIT (0.010 elapsed), 0.020 MUT (0.092 elapsed), 0.014 GC (0.119 elapsed) :ghc>> <<ghc: 42842136 bytes, 84 GCs, 194010/270912 avg/max bytes residency (3 samples), 2M in use, 0.000 INIT (0.009 elapsed), 0.016 MUT (0.029 elapsed), 0.012 GC (0.120 elapsed) :ghc>> hledger -f data/1000x1000x10.journal stats <<ghc: 314291440 bytes, 612 GCs, 2070776/6628048 avg/max bytes residency (7 samples), 16M in use, 0.000 INIT (0.000 elapsed), 0.128 MUT (0.144 elapsed), 0.059 GC (0.070 elapsed) :ghc>> <<ghc: 349558872 bytes, 681 GCs, 1397597/4106384 avg/max bytes residency (7 samples), 11M in use, 0.000 INIT (0.004 elapsed), 0.124 MUT (0.133 elapsed), 0.047 GC (0.053 elapsed) :ghc>> hledger -f data/10000x1000x10.journal stats <<ghc: 3070026824 bytes, 5973 GCs, 12698030/62951784 avg/max bytes residency (10 samples), 124M in use, 0.000 INIT (0.002 elapsed), 1.268 MUT (1.354 elapsed), 0.514 GC (0.587 elapsed) :ghc>> <<ghc: 3424013128 bytes, 6658 GCs, 11405501/41071624 avg/max bytes residency (11 samples), 111M in use, 0.000 INIT (0.001 elapsed), 1.343 MUT (1.406 elapsed), 0.511 GC (0.573 elapsed) :ghc>> hledger -f data/100000x1000x10.journal stats <<ghc: 30753387392 bytes, 59811 GCs, 117615462/666703600 avg/max bytes residency (14 samples), 1588M in use, 0.000 INIT (0.000 elapsed), 12.068 MUT (12.238 elapsed), 6.015 GC (7.190 elapsed) :ghc>> <<ghc: 34306530696 bytes, 66727 GCs, 76806196/414629312 avg/max bytes residency (14 samples), 1009M in use, 0.000 INIT (0.010 elapsed), 14.357 MUT (16.370 elapsed), 5.298 GC (6.534 elapsed) :ghc>>	2016-05-24 19:00:57 -07:00
Simon Michael	2538d14ea7	lib: textification begins! account names The first of several conversions from String to (strict) Text, hopefully reducing space and time usage. This one shows a small improvement, with GHC 7.10.3 and text-1.2.2.1: hledger -f data/100x100x10.journal stats string: <<ghc: 39471064 bytes, 77 GCs, 198421/275048 avg/max bytes residency (3 samples), 2M in use, 0.000 INIT (0.001 elapsed), 0.015 MUT (0.020 elapsed), 0.010 GC (0.014 elapsed) :ghc>> text: <<ghc: 39268024 bytes, 77 GCs, 197018/270840 avg/max bytes residency (3 samples), 2M in use, 0.000 INIT (0.002 elapsed), 0.016 MUT (0.022 elapsed), 0.009 GC (0.011 elapsed) :ghc>> hledger -f data/1000x100x10.journal stats string: <<ghc: 318555920 bytes, 617 GCs, 2178997/7134472 avg/max bytes residency (7 samples), 16M in use, 0.000 INIT (0.001 elapsed), 0.129 MUT (0.136 elapsed), 0.067 GC (0.077 elapsed) :ghc>> text: <<ghc: 314248496 bytes, 612 GCs, 2074045/6617960 avg/max bytes residency (7 samples), 16M in use, 0.000 INIT (0.003 elapsed), 0.137 MUT (0.145 elapsed), 0.067 GC (0.079 elapsed) :ghc>> hledger -f data/10000x100x10.journal stats string: <<ghc: 3114763608 bytes, 6026 GCs, 18858950/75552024 avg/max bytes residency (11 samples), 201M in use, 0.000 INIT (0.000 elapsed), 1.331 MUT (1.372 elapsed), 0.699 GC (0.812 elapsed) :ghc>> text: <<ghc: 3071468920 bytes, 5968 GCs, 14120344/62951360 avg/max bytes residency (9 samples), 124M in use, 0.000 INIT (0.003 elapsed), 1.272 MUT (1.349 elapsed), 0.513 GC (0.578 elapsed) :ghc>> hledger -f data/100000x100x10.journal stats string: <<ghc: 31186579432 bytes, 60278 GCs, 135332581/740228992 avg/max bytes residency (13 samples), 1697M in use, 0.000 INIT (0.008 elapsed), 14.677 MUT (15.508 elapsed), 7.081 GC (8.074 elapsed) :ghc>> text: <<ghc: 30753427672 bytes, 59763 GCs, 117595958/666457240 avg/max bytes residency (14 samples), 1588M in use, 0.000 INIT (0.008 elapsed), 13.713 MUT (13.966 elapsed), 6.220 GC (7.108 elapsed) :ghc>>	2016-05-24 19:00:49 -07:00
Simon Michael	0f5ee154c4	lib: simplify parsers; cleanups (#275 ) The journal/timeclock/timedot parsers, instead of constructing (opaque) journal update functions which are later applied to build the journal, now construct the journal directly (by modifying the parser state). This is easier to understand and debug. It also removes any possibility of the journal updates being a space leak. (They weren't, in fact memory usage is now slightly higher, but that will be addressed in other ways.) Also: Journal data and journal parse info have been merged into one type (for now), and field names are more consistent. The ParsedJournal type alias has been added to distinguish being-parsed and finalised journals. Journal is now a monoid. stats: fixed an issue with ordering of include files journal: fixed an issue with ordering of included same-date transactions timeclock: sessions can no longer span file boundaries (unclocked-out sessions will be auto-closed at the end of the file). expandPath now throws a proper IO error (and requires the IO monad).	2016-05-23 00:44:19 -07:00
Simon Michael	7f5e09096f	lib: rename JournalContext to JournalParseState	2016-05-18 20:57:34 -07:00
Simon Michael	84097b75c7	journal: can now include timeclock/timedot files (#320 ) journal files can now include journal, timeclock or timedot files (but not yet CSV files). Also timeclock/timedot files no longer support default year directives. The Hledger.Read.* modules have been reorganised for better reuse. Hledger.Read.Utils has been renamed Hledger.Read.Common and holds low-level parsers & utilities; high-level read utilities have moved to Hledger.Read.	2016-05-17 19:46:54 -07:00
Simon Michael	bc43036117	lib: use consistent p suffix for parsers	2015-10-17 11:51:45 -07:00
Simon Michael	42d452f99c	abstract parsec's SourcePos so as to derive NFData The NFData instance helps us time things with criterion.	2015-08-13 12:56:15 -07:00
Simon Michael	d1f63334ee	handle pending status correctly, add --pending (#250 ) A transaction/posting status of ! (pending) was effectively equivalent to * (cleared). Now it's a separate state, not matched by --cleared. The new Ledger-compatible --pending flag matches it, and so does --uncleared. The equivalent search queries are now status:*, status:! and status: (the old status:1 and status:0 spellings are deprecated). Since we interpret --uncleared and status: as "any state except cleared", it's not currently possible to match things which are neither cleared nor pending.	2015-05-16 11:51:35 -07:00
Simon Michael	70d87613f2	some cleanup of debug trace helpers	2015-05-14 13:01:49 -07:00
Simon Hengel	964a410b24	hledger-lib: Update for base-compat-0.8.0 (see #245 )	2015-04-23 15:41:59 +08:00
Simon Michael	f8a24ccead	fix parseTime warnings with time 1.5+ (#239 )	2015-03-29 16:12:54 -07:00
Simon Michael	f75849cdd6	fix ghc 7.10 Applicative import warnings (#239 ) Still needed CPP, despite using base-compat.	2015-03-29 16:09:41 -07:00
Simon Michael	8e50395b7c	ErrorT -> ExceptT, handle mtl <2.2.1 && >=2.2.1 (#239 )	2015-03-29 14:16:42 -07:00
Simon Michael	e60eb71467	adapt to GHC-7.10's time-1.5 (#239 )	2015-03-27 15:42:32 -07:00
Julien Moutinho	af56ced3b0	lib: add eof parsing checks	2015-01-11 09:45:55 -08:00
Simon Michael	9c68944c79	journal, csv: comment lines can also start with * As in Ledger. This means you can embed emacs org/outline-mode nodes in your journal file and manipulate it like an outline.	2014-12-27 14:41:28 -08:00
Simon Michael	1708f0b441	csv: try to preserve order of same-day transactions If the CSV records appear to have been in reverse date order, we'll now reverse them all before also sorting by transaction date, so that the original order of same-day transactions is preserved. We detect this using a simple heuristic: if the first converted transaction's date is later than the last's.	2014-12-02 11:16:51 -08:00
Simon Michael	733a7b12ef	csv: include path is relative to current (close #198 )	2014-12-02 10:50:31 -08:00
Julien Moutinho	cf28985cf2	lib: move from Text.ParserCombinators.Parsec to Text.Parsec NOTE: required to use liftIO in includedirective SEE: http://www.vex.net/~trebla/haskell/parsec-generally.xhtml#IO	2014-11-20 10:08:30 +01:00
Simon Michael	bfedf367c4	export Regexp types, disambiguate CsvReader's	2014-10-24 14:30:49 -07:00
Simon Michael	d0ad571321	fix manual url in default CSV rules file	2014-08-07 13:15:40 -07:00
Julien Moutinho	a6190420b2	data: add source location to transactions	2014-08-07 16:38:44 +02:00
Simon Michael	40ab1e17f6	amounts cleanups, and support zeros with commodity	2014-07-28 18:45:13 -07:00
Simon Michael	3a16e6cfc7	mostly replace slow regexpr with regex-tdfa (fixes #189 )	2014-07-06 14:03:28 -07:00
Simon Michael	0c3148ac7b	add an --ignore-assertions flag Can be helpful when reading Ledger files, where assertions may have different semantics; or for getting some answers from your journal to help you fix your assertions. Could be called --no-assertions, but this might create surprise when it has an effect contrary to --no-new-accounts. I had to add another flag throughout the parsers & journal read functions, ok for now.	2014-07-01 18:26:37 -07:00
Simon Michael	cf3d21afef	csv and general reader fixes, cleanups - The CSV reader no longer writes a "(stdin).rules" file when reading from stdin. - Selection of reader(s) is now smarter when input is coming from stdin. Previously, all readers were considered applicable for stdin. This meant that when reading a CSV file from stdin, the journal and timelog readers were always tried first, and if the CSV file was unparseable, you'd see the first (journal) reader's error instead of the CSV reader's. Now, the readers do some basic content sniffing when reading stdin, so it generally tries only the one right reader and we'll see the right errors. - The read system now has more debug output.	2014-05-09 17:55:32 -07:00
Simon Michael	4740c7082e	csv: allow an empty first name in fields list (fixes #178 )	2014-05-03 15:05:35 -07:00
Simon Michael	dedd26bbf5	csv: don't count fields in skipped lines (fixes #177 )	2014-05-03 14:54:15 -07:00
Simon Michael	3cf53661f3	new debug helpers; --debug=N sets debugLevel The debug level set by `--debug[=N]` is now available to pure and startup code as debugLevel, using unsafePerformIO. `dbg LABEL ...` is now the go-to helper for tracing values on the console; it produces output when the debug level is non-zero. `dbgExit` is similar but exits immediately, avoiding further output. The `dbgshow`, `dbgppshow` and `dbgpprint` variants allow control over the pretty-printing method and required debug level, allowing more control over what is displayed when. Other cleanups: lstrace -> ltrace, pdbgAt -> pdbg, tracewith -> traceWith.	2013-12-06 13:35:50 -08:00
Simon Michael	eff1d3f1a5	csv reader: add the `include` directive, useful for factoring out common rules used with multiple CSV files	2013-08-03 20:53:41 -07:00
Dmitry Astapov	ed58d815d6	Fix for multiple field assignments in CSV parsing	2013-06-19 08:30:33 +01:00
Simon Michael	080eb866ec	web: clean up language extensions a bit, make autoweb works again	2013-06-04 18:23:55 -07:00
Simon Michael	44545d6ec7	parsing: update a csv reader error message	2013-06-01 12:38:58 -07:00
Simon Michael	a26ab926d8	parsing: don't fail when a csv amount has trailing whitespace (fixes #113 )	2013-06-01 12:38:13 -07:00
Simon Michael	78837c66a6	parsing: fix test breakage due to new csv rules format (fixes #102 )	2013-04-12 14:59:28 -07:00
Simon Michael	616a25979a	CSV reader version 2 with new rules syntax At long last. The main change is a new rules file format that aims to be more powerful and more intuitive than v1 (hledger 0.19.x and older). Existing rules files will need to be adapted manually to the new format.	2013-03-29 22:56:55 +00:00
Simon Michael	621a91807e	rename actual/effective dates to primary/secondary The command-line flag is now --date2. Alternate spellings --effective and --aux-date are accepted for compatibility.	2012-12-06 04:43:41 +00:00
Simon Michael	4aafeb32e6	refactor: clean up Posting construction	2012-12-06 00:03:07 +00:00
Simon Michael	6eda8c4bbf	csv reader: append ".rules" to the original file name instead of replacing its extension	2012-11-26 01:56:39 +00:00
Simon Michael	afb4fb0356	csv reader: parse parenthesised amounts as negative	2012-11-26 01:56:01 +00:00
Simon Michael	8b4a99c4d5	79: convert: add a skip-lines directive (Magnus Henoch)	2012-11-18 18:21:52 +00:00
Simon Michael	64180b18ef	refactor: clarify that price amounts have only a single commodity	2012-11-19 23:17:55 +00:00
Simon Michael	4567e91409	refactor: move amount display settings out of commodity, simplify amount construction	2012-11-19 21:20:10 +00:00
Simon Michael	2a4d89bb27	expose more utilities from CsvReader	2012-05-29 21:00:49 +00:00
Simon Michael	776ad2a098	remove ensureRulesFile debug trace	2012-05-30 08:36:34 +00:00
Simon Michael	1062e2f9a4	clean up reader selection, don't write a csv rules file on journal parse error	2012-05-28 18:40:36 +00:00
Simon Michael	2fb2aea056	rename metadata fields to tags	2012-05-27 22:59:06 +00:00
Simon Michael	88212f26e8	simplify journal parser names	2012-05-09 15:34:05 +00:00
Simon Michael	8492f6cae4	fix unicode handling on GHC >= 7.2, unify utf8 IO compatibility layer tests pass again from GHC 6.12.3 to 7.4.1	2012-03-29 19:06:31 +00:00
Simon Michael	d4451ce5e3	read system cleanup, require conversion rules from a file to simplify API	2012-03-24 18:08:11 +00:00
Simon Michael	e396c0dc8d	push csv rule and format string types down	2012-03-24 01:58:34 +00:00
Simon Michael	6eb7ad28e1	refactor/beef up readJournal/readJournalFile	2012-03-23 16:21:41 +00:00
Simon Michael	4d7a809c4a	cleanups and early code for csv reader based on convert	2012-03-10 21:55:48 +00:00

1 2 3 4 5

210 Commits