<a class="header-button" href="https://bugzilla-dev.allizom.org/home" title="Go to home page"> Bugzilla

Updated

•

16 years ago

Severity: normal → major

Component: General → Database

Keywords: perf

Product: Thunderbird → MailNews Core

QA Contact: general → database

Summary: Long pauses (windows hourglass 100% cpu) switching folders while indexing → Long pauses (windows hourglass 100% cpu) switching folders while indexing gloda

Reporter

Comment 1

•

16 years ago

Some additional thoughts: This might be just an amplification of a problem that I'm seeing generally. That is, a delay of a few seconds when switching folders as described above. I've never seen more that 3-4 seconds delay normally, and I can't seem to reproduce or figure out the exact circumstances of the delays.

Comment 2

•

16 years ago

I attempted to replicate this on a T40 laptop but was not able to do so, as noted in Bug 465353 Attempting same o a Gateway P4 3.2GHz which I remote desktop into, but no dice. BUT, that might be because indexing there is on a steady, flat line of performance - CPU is not going above 20-25%. So indexing is not driving IO up high and not forcing appreciable CPU contention. I'm not seeing the high CPU on this machine that I saw in bug 465353.

Comment 3

•

16 years ago

Joe, what gloda preferences do you have enabled in regards to logging? Additionally, are you running thunderbird with a console window open? The fact that disk activity ceased to happen and the window turned grey suggest there is something at work more than just gloda. Gloda's asynchronous mode of operation requires that it constantly surrender time-slices, so if the window goes grey, gloda is likely not running. Likewise, if the database is doing something (which should not be able to block the main thread), we should be seeing all kinds of disk activity. If the problem were bug 465122 (failure to close msf files), I would expect disk swapping.

Reporter

Comment 4

•

16 years ago

I was not logging at the time, but: https://bugzilla.mozilla.org/show_bug.cgi?id=465122#c1 May have some relevance. I had an unusually high number of windows going and only 512m memory on this machine. I'll take a closer look at memory and processes if I can reproduce.

Reporter

Comment 5

•

16 years ago

Well it seems I can reproduce this. At least I did this time around. Thunderbird.exe hits 99% and stays there for 10 minutes or more. I have been seeing a delay in switching to/from the newsgroup mozilla.support.thunderbird in normal usage (3-4) seconds delay. What I think is relevant is the number of headers in this group 72,713 Thats seventy two thousand. Attempting to open that group while indexing seems to key the incident. Not sure if the other system usage has anything to do with it. Memory seems to churn ranging from 157k--194k with nothing much else happening. Occasional disc activity, but very little. Attaching screenshot. Anything more I can do, please ask, but it gets me a little nervous hanging too long at 100% CPU

Reporter

Comment 6

•

16 years ago

Attached image TB at 99% (deleted) — Details

Reporter

Comment 7

•

16 years ago

After giving this bug some further thought (dangerous sometimes) I think it might boil down to fact that the TB UI freezes when attempting to open a folder while gloda is currently using that msf. In the case I am seeing, that newsgroup I'm trying to open has a large number of headers, and could explain the amount of time the UI freezes. If that is the case , then copying the msf to a temp file, and using that for gloda indexing, might be an answer.

Comment 8

•

16 years ago

Gloda has no problem sharing the msf. The problem that explains a pause of a few seconds is likely that the msf is huge and anyone opening the msf is going to block the UI. The multi-minute thing is very concerning. In theory, gloda should not be doing anything with your newsgroup folder at all. In the past at least, I think it might get tricked into performing event-driven indexing in a newsgroup, and that may still be true. Is your newsgroup marked for offline storage? If so, are all the messages already downloaded? When the multi-minute thing happens, do you know if any heavy network traffic is going on?

Reporter

Comment 9

•

16 years ago

(In reply to comment #8) > Gloda has no problem sharing the msf. The problem that explains a pause of a > few seconds is likely that the msf is huge and anyone opening the msf is going > to block the UI. > > The multi-minute thing is very concerning. In theory, gloda should not be > doing anything with your newsgroup folder at all. In the past at least, I > think it might get tricked into performing event-driven indexing in a > newsgroup, and that may still be true. > > Is your newsgroup marked for offline storage? If so, are all the messages > already downloaded? When the multi-minute thing happens, do you know if any > heavy network traffic is going on? No offline storage is selected. I don't see any network activity at all during the incident, as a matter of fact, my anti-virus (automatic update) failed to execute during the incident. Just popped up as a coincidence during the incident. What can I do to elaborate. Run with console..whatever I can do to help.

Updated

•

16 years ago

Priority: -- → P1

Updated

•

16 years ago

Depends on: 466438

Updated

•

16 years ago

Whiteboard: [likely resolved by patch on bug 466438]

Comment 10

•

16 years ago

The fix for bug 466438 is now in the trunk; it stops gloda from indexing newsgroup messages. The next set of nightlies should include the fix. Please try and duplicate with such a build. This is the only problem I'm really able to conceive of that could be directly gloda's fault and might explain the observed problem. Long load times for folders with tons of messages are not a gloda-specific problem, though they are something we want to improve. If you are able to reproduce, please do run with console after having enabled the gloda "dump" preference and try and reproduce there. (Search on "global" in the advanced pref config and enable the "dump" one.) I'm advising trying to reproduce without the logging because the logging can be apt to generate a lot of CPU load itself and might confuse the issue.

Severity: major → normal

Priority: P1 → P4

Whiteboard: [likely resolved by patch on bug 466438] → [believed fixed by bug 466438]

Reporter

Comment 11

•

16 years ago

The situation seems to be much improved. With one test run (lasted 1 hr 53 mins to complete the indexing) I detected 1 period of about 3 minutes with a frozen UI The console was not too helpful, since it was flooded with hundreds of messages: 2008-11-25 18:33:55 gloda.indexer WARN Have aMimeMsg but not bodyPlain? These actually wrapped the console storage, so much info was over-written. BTW, probably 90% of my message storage is HTML hence not bodyPlain. I'll try a few more runs when I get time, (1 run=almost 2 hours)

Comment 12

•

16 years ago

improved for me also using compose, but I didn't do extensive testing. but I suspect there are other usecases that get impacted, IO intensive like search for example (iirc I tried search - I should have documented it at the time)

Comment 13

•

16 years ago

Joe, are you using a debug build by any chance? Also, any chance you have a profiler available? Wayne, do you know if there are easy instructions to run a profiler against a windows build of thunderbird?

David Ascher (:davida)

Comment 14

•

16 years ago

I'm going to deny blocking status on this one until it can be verified/duplicated. This isn't to deny the problem, just that it's hard to have it impact drivers until we can have more actionable STRs.

Flags: blocking-thunderbird3? → blocking-thunderbird3-

Comment 15

•

16 years ago

(In reply to comment #13) > Wayne, do you know if there are easy instructions to run a profiler against a > windows build of thunderbird? http://www.hacksrus.com/~ginda/venkman/profiles/ Perhaps this issue is worth putting out to the testing community.

Comment 16

•

16 years ago

(In reply to comment #15) > (In reply to comment #13) > > Wayne, do you know if there are easy instructions to run a profiler against a > > windows build of thunderbird? > > http://www.hacksrus.com/~ginda/venkman/profiles/ I don't think Venkman (JavaScript) profiling would be sufficient. We need to know what the C++ code is up to. > Perhaps this issue is worth putting out to the testing community. Sounds reasonable.

Comment 17

•

16 years ago

Joe, does thunderbird window go "(not responding)" when this happens?

Reporter

Comment 18

•

16 years ago

(In reply to comment #17) > Joe, does thunderbird window go "(not responding)" when this happens? I'm about ready to try this again. I'll test 2 ways With my existing (large-everyday test profile) And with a more limited profile that I built for for beta testing) WRT "not responding" it certainly wasn't, nor was it telling me anything. Oh yes in regard to the troubling slow responding folder, part of that was due to a mix of read and unread messages in threaded view (newsgroups) Marking all-read speeded it up quite a bit.

Reporter

Comment 19

•

16 years ago

Further testing with: Mozilla/5.0 (Windows; U; Windows NT 5.1; en-US; rv:1.9.1b3pre) Gecko/20090223 Lightning/1.0pre Shredder/3.0b3pre ID:20090223031225 I deleted the database and started the indexing process. During Gloda indexing I could not reproduce the symptoms originally noted in this bug report. Switching folder, as well as normal TB usage seemed to work fine, but I have the following observations to note: Indexing seemed to take longer, but with less impact on UX The one issue that I did note was that scrolling large messages was jerky. Also scrolling down the folder pane. To try to quantify that I used the URL ref in bug 372039 to test. Here are the results (average of 4 runs via the start page) Gloda running 7240 No Gloda 5850 I could close this bug as worksforme, but I think I'll leave it open for periodic evaluation in case a profile specific condition was to blame.