244478 - Logging truncates lines at 512 chars

Description

21 years ago

NSPR logging has a hardcoded line length, beyond which it simply truncates data. We ran into this problem in bug 236772, where the log simply doesn't contain the data we require, making the log useless for debugging purposes.

Bug 46630 increased the limit from 200 to 512 chars, but I think we need to move to an allocated buffer here. larryh made the argument in that bug that doing so would incur malloc overhead for every PR_LOG call, but I don't see why that's a problem. Logging is generally only enabled when people want to debug things, and under that circumstance perf is not critical.

Cookie code can pass very long lines to PR_LOG. There's really no limit, since cookies can be up to 4 KB in length each, and we can have multiple cookies in a single header. Henrik Gemal also points out that IMAP code can have very long lines. I think there's enough motivation here to bite the bullet and fix PR_LOG to be sane with its buffer handling, which is certainly better than forcing consumers to feed NSPR bite-size chunks.

The code in question is at: http://lxr.mozilla.org/seamonkey/source/nsprpub/pr/src/io/prlog.c#436

(As a side note, there's a bug at http://lxr.mozilla.org/seamonkey/source/nsprpub/pr/src/io/prlog.c#256 - the sense of that |if| check needs to be reversed.)

Roland Mainz

Comment 1

21 years ago

What about allocating a single, static buffer at logging initialisation and only realloc it (i.e. grow it) when the current buffer size is too small? That would avoid the malloc overhead for each call (which is a BAD idea in cases where something fails due to memory shortage).
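
The grow-on-demand idea in this comment could look roughly like the sketch below. It is not NSPR code: `log_buf_init`, `log_format`, `g_buf`, and `g_cap` are hypothetical names, and the sketch assumes a C99 `vsnprintf` that returns the required length. Locking is deliberately left out, so as written it is only safe from a single thread.

```c
#include <assert.h>
#include <stdarg.h>
#include <stdio.h>
#include <stdlib.h>

/* Hypothetical shared buffer; a real logger would guard it with a lock. */
static char  *g_buf = NULL;
static size_t g_cap = 0;

/* Allocate the buffer once, at logging initialisation. Returns 0 on success. */
static int log_buf_init(size_t initial)
{
    assert(initial > 0);
    g_buf = malloc(initial);
    if (g_buf == NULL)
        return -1;
    g_cap = initial;
    return 0;
}

/* Format into the shared buffer, growing it only when the line does not fit.
 * Returns the formatted line, or NULL on a formatting error. If growing
 * fails, the truncated line that already fits is returned instead. */
static const char *log_format(const char *fmt, ...)
{
    va_list ap;
    int needed;

    assert(g_buf != NULL);

    va_start(ap, fmt);
    needed = vsnprintf(g_buf, g_cap, fmt, ap);
    va_end(ap);
    if (needed < 0)
        return NULL;

    if ((size_t)needed >= g_cap) {
        size_t new_cap = (size_t)needed + 1;
        char *p = realloc(g_buf, new_cap);
        if (p == NULL)
            return g_buf;          /* out of memory: keep the truncated line */
        g_buf = p;
        g_cap = new_cap;

        va_start(ap, fmt);
        vsnprintf(g_buf, g_cap, fmt, ap);   /* second pass now fits */
        va_end(ap);
    }
    return g_buf;
}
```

After the first long line the buffer stays large, so later lines of similar length need no further allocation. The cost is that the formatting step now touches shared state.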

Darin Fisher

Comment 2

21 years ago

Or, perhaps just keep the 512 buffer, and if a PR_LOG call needs more than that, do the malloc for that call only.
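
A minimal sketch of the approach suggested here, assuming a C99 `vsnprintf`: format into a 512-byte stack buffer first and call `malloc` only for the lines that overflow it. `log_line` and `LINE_BUF_SIZE` are hypothetical names for illustration and are not part of NSPR or of any patch attached to this bug.

```c
#include <assert.h>
#include <stdarg.h>
#include <stdio.h>
#include <stdlib.h>

#define LINE_BUF_SIZE 512   /* the fixed size discussed in this bug */

/* Hypothetical helper: writes one formatted line to `out`.
 * Short lines never allocate; long lines allocate for that call only.
 * Returns the number of characters written, or -1 on a formatting error. */
static int log_line(FILE *out, const char *fmt, ...)
{
    char stack_buf[LINE_BUF_SIZE];
    char *buf = stack_buf;
    va_list ap;
    int needed;

    assert(out != NULL && fmt != NULL);

    va_start(ap, fmt);
    needed = vsnprintf(stack_buf, sizeof stack_buf, fmt, ap);
    va_end(ap);
    if (needed < 0)
        return -1;

    if ((size_t)needed >= sizeof stack_buf) {
        buf = malloc((size_t)needed + 1);
        if (buf == NULL) {
            /* Out of memory: fall back to the truncated stack copy. */
            buf = stack_buf;
            needed = (int)sizeof stack_buf - 1;
        } else {
            va_start(ap, fmt);
            vsnprintf(buf, (size_t)needed + 1, fmt, ap);
            va_end(ap);
        }
    }

    fwrite(buf, 1, (size_t)needed, out);
    if (buf != stack_buf)
        free(buf);
    return needed;
}
```

With this shape, a caller logging a 4 KB cookie header pays for one allocation on that call, while ordinary short lines behave exactly as they do with the fixed buffer.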

Roland Mainz

Comment 3

21 years ago

Darin Fisher wrote:
> Or, perhaps just keep the 512 buffer, and if a PR_LOG call needs more than
> that, do the malloc for that call only.

If you have many of these long lines you'll always run |malloc()| for them. And running into problems on memory shortage is much more likely with that design than with growing the buffer. Maybe we should even think about rounding the buffer size up to the value of |MIN(16384, getpagesize())-1|, i.e. round up to the system pagesize but clamp the value at 16k, since newer Solaris can have variable pagesizes ranging from 8KB-16GB. :)
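
One possible reading of the rounding suggestion, sketched for POSIX systems: round the requested size up to a multiple of the page size, but never use a rounding unit larger than 16 KB. `round_up_log_buf` and `MAX_UNIT` are hypothetical names, `sysconf(_SC_PAGESIZE)` is used in place of `getpagesize()`, and the `-1` from the comment is left out because its intent is not spelled out there.

```c
#include <assert.h>
#include <stddef.h>
#include <unistd.h>

#define MAX_UNIT 16384   /* clamp: never round to more than 16 KB at a time */

/* Hypothetical helper: round `wanted` bytes up to a multiple of
 * MIN(page size, 16 KB). Falls back to 16 KB if the page size is unknown. */
static size_t round_up_log_buf(size_t wanted)
{
    long page = sysconf(_SC_PAGESIZE);
    size_t unit = (page > 0 && page < MAX_UNIT) ? (size_t)page : MAX_UNIT;

    assert(wanted > 0);
    return ((wanted + unit - 1) / unit) * unit;
}
```

On a system with 4 KB pages, a request for 513 bytes would become one 4 KB page; on a system with very large pages the unit stays at 16 KB.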

Brendan Eich [:brendan]

Comment 4

21 years ago

Comment 0 says "Logging is generally only enabled when people want to debug things, and under that circumstance perf is not critical." That's false in general. Logging is useful to debug race conditions, and any serialization on a global malloc heap lock will frustrate such use-cases. Logging also may help find memory bugs, and malloc'ing will interfere with memory allocation. In law school they teach that hard cases make bad law. How many logging use-cases need malloc'd dynamic range of buffer size? Only a few, I bet. That favors the approach darin suggests in comment 2. /be

Roland Mainz

Comment 5

21 years ago

Brendan Eich [:brendan]

Comment 6

21 years ago

gisburn: http://lxr.mozilla.org/mozilla/source/nsprpub/pr/src/io/prlog.c#117. There are several options. Also, you missed my point about serialized writes. prlog.c itself supports that, of course -- otherwise several threads logging to the same file might find their log entries mixed up or overwritten in a buffer. Also, stdio does not incur a malloc on every fwrite (or whatever) call. So you are not being accurate. The serialization I want to avoid is exactly the one implied by malloc'ing on every call, which comment 0 asserted was not a problem. /be

timeless

Comment 7

21 years ago

brendan: cookie and imap cases as listed already are going to happen a lot. for bonus points, imap at least in theory could happen from multiple threads (i'm not so sure about cookie, perhaps it lives on the main thread, perhaps it lives on the http thread, in the former case it can contend w/ imap, otherwise it'll have to contend w/ itself). i've hit this limit with my logging too, which I use a lot (although nowhere near as much as imap would if it has to actually log my inbox, which it does when i use mozillamail). atm my consumers pretty much have to use alternate logging systems (mostly jsconsole or console), and then my coworkers get annoyed by the debug spew/churn. as for file locking, um... nspr logging already tramples itself if two nspr apps use the same log file and it isn't WinDebug. and wrt file locking, i think i've had spidermonkey crash because it wasn't properly locking (f)printf. anyway, at a certain point, nspr logging does get a lock. It happens to grab the lock, conveniently enough, in PR_LogPrint. Note that right now nspr is not particularly good at deciding if a printf will fit (see bug 229662).

Brendan Eich [:brendan]

Comment 8

21 years ago

What was unclear in what I wrote in comment 6? I'm against adding a malloc per log call. If the existing #ifdef'd locking is not correct, or not enabled where it should be, file other bugs. SpiderMonkey doesn't use fprintf or printf except for debugging to stdout/err. Are those kinds of calls not threadsafe by default on Linux? /be

timeless

Comment 9

21 years ago

it looked like the msvcrt's functions aren't threadsafe by default (this would make sense, it's the performant way to do things, you can probably ask for threadsafe versions, but doing the locking yourself is probably better overall), but i'm not certain, it was a random hit among my many many crashes :). anyway, your argument should be against comment 2 since for imap/cookies that is a malloc for every call instead of comment 1 which is only a malloc on size misses. there's one minor problem, which is that right now, the locking is only on the actual io, if you try to share a buffer, then you're locking a formatting call too.

implement approach in comment 2 (patch, 21 years ago, dwitte@gmail.com (deleted))
slightly better patch (patch, 21 years ago, dwitte@gmail.com (deleted)); timeless: review-
v3 (patch, 21 years ago, dwitte@gmail.com (deleted)); dwitte: review+, wtc: superreview-
v4 (patch, 18 years ago, dwitte@gmail.com (deleted))
v5 (patch, 17 years ago, Wan-Teh Chang (deleted))
v6 (patch, 17 years ago, dwitte@gmail.com (deleted))
v7 (checked in) (patch, 17 years ago, Wan-Teh Chang (deleted))