78148 - clean up directory listing stream converters

Reporter

Description

•

24 years ago

Currently the stream converters take one ascii representation, parse it, convert it into another, and pass it off to something else which parses that, and then spits out the result. I got sick of hacking arround this while getting gopher to output html directory listings. So I'm fixing it. Doug, do we need to support nsFTPDirListingConverter::Convert? Nothing actually uses it AFAICS, and the indexedToHTML stuff doesn't support it either (ditto for gopher - I don't think that the gopher code has even been tested....). Currently, I have an nsIDirIndex interface which contains attributes for the various things we want to support, and an nsIDirIndexListener, which inherits off nsIRequestObserver, and provides the addition method: void onIndexAvailable(in nsIRequest aRequest, in nsISupports aCtxt, in nsIDirIndex aIndex); I haven't finished hooking this all up yet (I still have to connect the new FTPDirListingConverter to nsIndexedToHTML), but: [bbaetz@banana netwerk]$ cvs diff streamconv/converters/nsFTPDirListingConv.cpp | diffstat nsFTPDirListingConv.cpp | 307 +++++++------------------- 1 file changed, 93 insertions, 214 deletions [bbaetz@banana netwerk]$ cvs diff streamconv/converters/nsIndexedToHTML.cpp | diffstat nsIndexedToHTML.cpp | 242 ++++++++---------------------- 1 files change, 72 insertions, 170 deletions (I also NS_LITERAL_STRING'd nsIndexedToHTML while I was at it) Of course, I've added the new interfaces and implementation files, (but those just consists of getters and setters, + the license blurb). So I come out adding 23 more lines than I remove. I haven't pulled all the parsing stuff out of nsDirectoryViewer yet, though, or touched gopher. Or done anything more than verify that it all compiles - I know it won't work as it currently stands. Even if we do decide to scrap the XUL directory viewer, this still: a) Cleans stuff up b) Provides a common interface between ftp and gopher. c) Avoids uneeded parsing/escaping/unescaping (although I'm still escaping - I decided to try and check that it works, and change as little as possible, first) d) May give us a way to easily produce html listings for file:///, although the nsString vs nsCString stuff would have to be cleaned up - currently I've just done what the directory viewer did. The disadvantage of doing this is that it is theoretically now possible (since my directoryViewer changes from a few weeks ago) for a web server to send us application/http-index-format data, and have mozilla display the data as the XUL tree. I haven't tested that though yet. These changes would remove the parser, and so remove that feature - I don't know if anyone cares though. I still have a couple of streamconverter/nsIInputStream questions though. I'll code a bit more to try and get something actually displaying, and see what I discover, or I'll find some necko people on IRC and nag them :)

Doug Turner (:dougt)

Comment 1

•

24 years ago

supporting Convert() allows for SYNC stream converting. Is anyone doing this: no. Will anyone want to? Don't know. Attach some diffs so we can see some of this great sounding work. :-)

Bradley Baetz

Reporter

Comment 2

•

24 years ago

Is there some sort of wrapper class which will make an async converter synchronous? I can imagine one, but I don't know if it already exists. I'll attach diffs once I get something displaying - probably later today, after I finish some other stuff.

Status: NEW → ASSIGNED

Target Milestone: --- → mozilla0.9.1

Judson Valeski

Comment 3

•

24 years ago

I believe one of the test stream converters I provide wraps a sync converter in async callbacks. In real-world situations though this defeats the whole purpose of asynchronous function calls because they just wind up blocking. It's best to provide async parsing. bradley writes: "Currently the stream converters take one ascii representation, parse it, convert it into another, and pass it off to something else which parses that, and then spits out the result. I got sick of hacking arround this while getting gopher to output html directory listings. So I'm fixing it." what's wrong w/ the stream converter model? and what/how were you hacking around whatever you found to be wrong w/ it?

Bradley Baetz

Reporter

Comment 4

•

24 years ago

> I believe one of the test stream converters I provide wraps a sync converter > in async callbacks. I meant the other way arround - given an async stream converter, wrap it so that it becomes a sync converter. I know how to do it, I was just wondering if there was something in the tree already. Nothings wrong with the stream converter model. However, it only passes text streams arround, and so we end up parsing and unparsing the same data multiple times. To expand on my summary: What currently happens (for ftp) is that nsFtpDirListingConv takes a text/ftp-dir-<servertype> stream, and parses it, putting it into an indexEntry (a local class defined in nsFtpDirListingConv.cpp). At the end of each line, it takes that structure, and puts out a application/http-index-format entry. Depending on the settings in the prefs, the ftp protocol may have arranged to get a chain of converters, so this data will be passed to nsIndexedToHTML, and html produced, or it will be left as application/http-index-format, where nsDirectoryViewer.cpp will parse it. That parser is seveal hundred lines long, and makes several assumptions about gopher and ftp. It also puts everything into a structure which looks remarkably like the indexEntry structure we started out with. The nsIndexedToHTML parser isn't really a parser for the entire format - it makes assumptions about the layout of the data lines. I could make it understand gopher as well, using a few ifs, and hardcode more stuff in. Thats what I meant by "hacking arround" - not the stream converter model, but the duplication of code, and the needless parsing/unparsing/etc. The onIndexAvailable is there because, AFAIK, there isn't an nsIInputStream which I can attach nsISupport objects to, rather than char*'s. I'm not sure that something like that would really fit the model of the input stream stuff. I'll attach some code later tonight.

Bradley Baetz

Reporter

Comment 5

•

24 years ago

Attached patch work-in-progress patch (obsolete) (deleted) — Details — Splinter Review

Bradley Baetz

Reporter

Comment 6

•

24 years ago

I've attached a work in progress patch. With that patch gopher and ftp will use the html directory index. I haven't updated the XUL viewer yet - I'll do that tomorrow. The API changes are straight forward, and mainly involve removing lots of code. Any design/API comments?

Judson Valeski

Comment 7

•

24 years ago

hmmm, I'm a bit concerned here because http dir listing format is sort of standard internally (HTTP can use it), and I see some benefit to cononicalizing various listing formats into it as it gives us some common ground. have we isolated this extra parsing step for ftp as a performance hit?

Bradley Baetz

Reporter

•

24 years ago

Does that feature work with ns6? If thats the case, then I'll leave it in - I'll have the parser convert to nsIDirIndex, instead (and move it into netwerk). This will still have the advantage that the 4 directory input methods (application/http-index-format, file, ftp, and gopher) will be able to use either output format without problems.

Judson Valeski

Comment 12

•

24 years ago

after thinking this over some more, and talking this over w/ rpotts, I'm a bit concerned over this goal. again, the whole idea of http_index is that it is a cononical dir listing format that *any* protocol can generate and subsequently we'll be able to convert this connonicial format into another. this adds a nice layer between protocol writers and content delivery. it also allows for new converters, say text/xml, to come in and take *all* dir listings and turn them into some new format. I'd really prefer we maintain this std format.

Bradley Baetz

Reporter

•

24 years ago

OK, so I've though about this a bit more, and now agree with valeski. Does anyone have any objections to just killing the XUL/rdf directory viewer, and moving everything to the HTML output? This would lose the ability to do the rdf ftp bookmarks, but that doesn't really bother me. dougt? Any objections? The bookmarks URI is really sucky, and the directory viewer is slow. According to waterson: <waterson> it was a research project gone bad I can then hook the application/http-index-format datatype directly up to that convertor, and remove the pref-testing code from the ftp directory listing convertor. If there's only one output format, then theres no need for all this abstraction. valeski and/or dougt - would a patch to do that meet your approval?

Judson Valeski

Comment 17

•

24 years ago

lemmee ask some embedding customer's what they think about losing the XUL listing. the last time I checked, there were some that actually liked it :-).

Bradley Baetz

Reporter

Comment 18

•

24 years ago

----------- PLEASE IGNORE ALL COMMENTS ABOVE THIS LINE ------------ OK, so I've redone this, and the only change is the abstracting of the parser. If you have the network.dir.generate_html pref set to true (as it is by default now), then ftp, gopher, and file will all appear as html. The dates are also reported using the current locale, as well, and with the current time (as opposed to GMT). file:// has the wrong dates - nsDirectoryIndexStream is generating dates in 1970. I'm also not sure how well it behaves with non-ASCII file names. Because the intermediate application/http-index-format is still generated, the code isn't any smaller - if you ignore the copyright lines in the new files, its about the same size. All protocols generate application/http-index-format, and then the viewer factory decides whether to generate html or XUL based on the pref. I want to fix the file:/// stuff first, but are there any obbjections to the new patch (which I'll attach now)?

Bradley Baetz

Reporter

•

24 years ago

Attached patch new patch (obsolete) (deleted) — Details — Splinter Review

Judson Valeski

Comment 23

•

24 years ago

the usage of the http-index only converter in FTP seems fine to me, as do the makefile changes (who's going to do the mac build changes?), and factory registration mods. however, I'm going to have to defer on the index/parser changes; dougt maybe?

Bradley Baetz

Reporter

Comment 24

•

24 years ago

Attached patch patch against current tree (obsolete) (deleted) — Details — Splinter Review

Bradley Baetz

Reporter

Comment 25

•

24 years ago

pushing off - this is too late for 0.9.1

Target Milestone: mozilla0.9.1 → mozilla0.9.2

benc

Comment 26

•

23 years ago

mass move, v2. qa to me.

QA Contact: tever → benc

Bradley Baetz

Reporter

Updated

•

23 years ago

Target Milestone: mozilla0.9.2 → mozilla1.0

work-in-progress patch 24 years ago Bradley Baetz (deleted), patch		Details \| Diff \| Splinter Review
new patch 24 years ago Bradley Baetz (deleted), patch		Details \| Diff \| Splinter Review
new patch 24 years ago Bradley Baetz (deleted), patch		Details \| Diff \| Splinter Review
patch against current tree 24 years ago Bradley Baetz (deleted), patch		Details \| Diff \| Splinter Review
patch v0.9 23 years ago Bradley Baetz (:bbaetz) (deleted), patch		Details \| Diff \| Splinter Review
new patch 23 years ago Bradley Baetz (:bbaetz) (deleted), patch		Details \| Diff \| Splinter Review
updated for cvs conflicts/build system changes 23 years ago Bradley Baetz (:bbaetz) (deleted), patch	darin.moz : review+	Details \| Diff \| Splinter Review
...and update for license changes, too! 23 years ago Bradley Baetz (:bbaetz) (deleted), patch	bbaetz : review+ dougt : superreview+	Details \| Diff \| Splinter Review
screenshot - test ftp directory in XUL view 23 years ago jbetak@netscape.com (away - not reading bugmail) (deleted), image/jpeg		Details
zipped screenshots from Japanese W2K, they confirm Bradley's hunch that HTML view might be broken for non-Latin-1 file and directory names 23 years ago jbetak@netscape.com (away - not reading bugmail) (deleted), application/x-zip-compressed		Details