Open
Bug 115107
Opened 23 years ago
Updated 2 years ago
CSS not fixed up by webbrowserpersist ("save page as, complete" omits background images)
Categories: Firefox :: File Handling, defect
Tracking: NEW
People: Reporter: dd1079+bugzilla; Unassigned
References: Blocks 1 open bug
Details: Keywords: helpwanted, smoketest, testcase; Whiteboard: [Halloween2011Bug][lang=c++]
Attachments
(8 files, 3 obsolete files)
(deleted), application/x-zip-compressed
(deleted), patch
(deleted), patch (dbaron: review-)
(deleted), image/tiff
(deleted), image/tiff
(deleted), patch
(deleted), image/png
(deleted), image/png
"Save complete webpage" works great, but not 100% correctly. Try to download this page:
http://phpbb.sourceforge.net/phpBB2/viewforum.php?f=1
It seems to forget the table background images that are referenced from the CSS in the header.
Comment 1•23 years ago
To ben. This has been mentioned in the fora on mozillazine.org too.
Assignee: asa → ben
Status: UNCONFIRMED → NEW
Ever confirmed: true
Comment 2•23 years ago
This applies to non-css backgrounds too.
Component: Browser-General → File Handling
QA Contact: doronr → sairuh
Summary: CSS table/cell backgrounds not saved with save complete → table/cell backgrounds not saved with save complete
Comment 3•23 years ago
*** Bug 115532 has been marked as a duplicate of this bug. ***
Comment 4•23 years ago
See bug 115532 for more on background attributes in plain HTML.
The webbrowserpersist object doesn't save anything from the CSS, whether inline
or not.
I don't know if it is possible to walk through, fix up and regenerate externally
linked and inline CSS (with a minimum of effort), but it's something I will
take on for the time being.
Assignee: ben → adamlock
Summary: table/cell backgrounds not saved with save complete → CSS not fixed up by webbrowserpersist
Adding "(background images not saved)" to summary.
Summary: CSS not fixed up by webbrowserpersist → CSS not fixed up by webbrowserpersist (background images not saved)
Comment 7•23 years ago
*** Bug 116660 has been marked as a duplicate of this bug. ***
Comment 8•23 years ago
Seen this on Linux 2001-12-20 (Slackware 8) changing OS to All
OS: Windows 2000 → All
Hardware: PC → All
Comment 9•23 years ago
This is regarding the <td background=''> attribute.
I have a page I saved today from www.stomped.com, using the 12-27 build. The
source has a <td background="images/trans.gif"> and Mozilla renders it
correctly. However, 'Save page as' doesn't create an images subdirectory
(for example 'thedefaultsavedir'/www.stomped.com_files/images), and it doesn't
parse this tag attribute to retrieve the image 'trans.gif' and save it to the
'/www.stomped.com_files/images' subdirectory. The saved page source still
contains the original tag as above, and when viewing the saved file and using
context menu > View Background Image, the alert box shows it trying to access
just 'thedefaultsavedir'/images/trans.gif.
There is also no doctype declared on this page.
Comment 10•23 years ago
just spoke w/ sarah about this. minusing.
Comment 11•23 years ago
*** Bug 120859 has been marked as a duplicate of this bug. ***
Comment 12•23 years ago
*** Bug 126307 has been marked as a duplicate of this bug. ***
Updated•23 years ago
Blocks: @importSave
Comment 13•23 years ago
*** Bug 128843 has been marked as a duplicate of this bug. ***
Comment 14•23 years ago
I suspect there may actually be two issues at work here; I've made a few
testcases to illustrate my point.
- Pages with <td background="foo"> aren't saved completely.
http://pantheon.yale.edu/~al262/mozilla/115107/tables/index.html
- Pages with foo{background-image:url(bar);} in the CSS aren't saved completely,
even if foo != td. http://pantheon.yale.edu/~al262/mozilla/115107/css/index.html
- However, pages with simple <body background="foo"> tags work fine.
http://pantheon.yale.edu/~al262/mozilla/115107/plain/index.html
Or maybe I'm missing something. Regardless, going to zip up the whole lot and
attach it.
Comment 15•23 years ago
Testcases above, packaged up (brown paper packages tied up in string / these
are a few of my favorite things...)
Comment 16•23 years ago
Andrew, the <td background="foo"> thing should be a separate bug. See
http://lxr.mozilla.org/seamonkey/source/embedding/components/webbrowserpersist/src/nsWebBrowserPersist.cpp#1807
-- we basically never check for background attrs on <td> nodes. That should be
simple to fix. If we don't have a bug on that already, file one on me.
Comment 17•23 years ago
Boris, at the moment I don't read C++, so I'm going to take your word for it.
There was a bug (Bug 115532) that seems to address your issue but it was marked
as a duplicate of this one (see this bug 115107 comment 3).
Comment 18•23 years ago
Neither TABLE nor TD elements have a background attribute (in HTML 4.0) which is
why it isn't fixed up. I could add fixup for such quirks if people feel the
practice is prevalent enough.
As for inline/external styles with url declarations such as this:
BODY {
color: black;
background: url(http://foo.com/texture.gif);
}
I don't believe there is much that can be done until we have a way to serialize
CSS from its in-memory representation (DOM).
Comment 19•23 years ago
Adam, we _do_ have such a way. The CSSOM should allow you to walk the
document.styleSheets array, walk the .cssRules array for each sheet, check
whether a rule has a background image set in it, change the URL if so, save the
.cssText for each rule, recurse down @import rules, etc. I can't actually
think of any bugs we have blocking that sequence of actions.... If you _do_ run
into bugs in that code that block this, please let me know.
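In script terms, the walk described above might look roughly like this (a sketch only, using the standard CSSOM; fixupURI() is a hypothetical callback that maps an original URI to a saved local filename, standing in for webbrowserpersist's own fixup, and nested rules inside @media are not descended into):

function rewriteRules(ruleList, fixupURI, out) {
  for (let i = 0; i < ruleList.length; i++) {
    const rule = ruleList[i];
    if (rule instanceof CSSImportRule) {
      // Recurse into the imported sheet first, then emit a rewritten @import.
      rewriteRules(rule.styleSheet.cssRules, fixupURI, out);
      out.push('@import url("' + fixupURI(rule.href) + '");');
    } else if (rule instanceof CSSStyleRule) {
      const bg = rule.style.getPropertyValue("background-image");
      const m = bg && bg !== "none" && bg.match(/url\(["']?([^"')]+)["']?\)/);
      if (m) {
        rule.style.setProperty("background-image", 'url("' + fixupURI(m[1]) + '")');
      }
      out.push(rule.cssText);
    } else {
      out.push(rule.cssText);  // @media, @namespace, @charset, ... kept verbatim
    }
  }
}

function serializeSheets(fixupURI) {
  const out = [];
  for (let i = 0; i < document.styleSheets.length; i++) {
    // Note: reading cssRules throws for cross-origin sheets.
    rewriteRules(document.styleSheets[i].cssRules, fixupURI, out);
  }
  return out.join("\n");
}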
Comment 20•23 years ago
Boris, do you know where the CSS serialization code lives? It's not being done
the way other DOMs are serialized via the content encoder.
Comment 21•23 years ago
Heh. There is no pre-built serializer yet. Each CSSRule object has a
GetCssText() method that returns an nsAString holding a serialization of the
rule (per DOM2 Style). The CSSOM-walking has to get done by hand,
unfortunately....
Comment 22•23 years ago
*** Bug 133725 has been marked as a duplicate of this bug. ***
Comment 23•22 years ago
*** Bug 157708 has been marked as a duplicate of this bug. ***
Comment 24•22 years ago
I have seen this bug as well. In my case, the CSS for a page contains:
body {
background-color: #ffffff;
margin: 1em 1em 1em 40px;
font-family: sans-serif;
color: black;
background-image: url(/images/logo-left.gif);
background-position: top left;
background-attachment: fixed;
background-repeat: no-repeat;
}
The background image (/images/logo-left.gif) is not saved, when it clearly
ought to be.
Rich.
Updated•22 years ago
QA Contact: sairuh → petersen
Comment 25•22 years ago
Not only does the @import CSS not get saved to disk, but when you later try to open
the saved HTML while the original server is unavailable, it takes forever to
load the local file while Mozilla waits for the CSS that never comes.
Comment 26•22 years ago
*** Bug 187590 has been marked as a duplicate of this bug. ***
Comment 27•22 years ago
Patch is work in progress but it attempts to fix up style properties that
typically contain a url(), e.g. background-image. I have most of the rule
searching and url extraction / fixup down but I have to clean up the node
cloning function.
I also have a horrible feeling that just asking a node for its inline style
drags a bunch of -moz styles into existence even if they weren't there to start
with. This means you get a mess of extra styles in the output document. I'll
have to examine this issue a bit more.
Comment 28•22 years ago
So this is based on a totally drive-by skim of the patch:
1) NS_ARRAY_LENGTH is a nice macro. ;)
2) "content" and "cursor" can have URI values
3) The parser you wrote will fail to parse something like:
content: "url(foo)" url(foo);
correctly. You're right that GetPropertyCSSValue would be nice here...
4) "foo: bar" in CSS is a declaration, not a rule (the "// Test if the inline
style contains rules that" comment)
5) The code for setting the URL value will not work for "content" and "cursor"
because they can include things other than the URL.
6) I'm not sure how putting @import in the same list as property names will
work -- the two don't even live in the same places...
Comment 29•22 years ago
Patch addresses most of the issues but is still work in progress. Uses
NS_ARRAY_LENGTH, adds support for various "cursor-*" props but not "content",
skips quoted text in values, fixes the broken comment, removes @import.
Patch actually fixes up content now, but is busted in a couple of ways:
1. The call to cssDeclOut->SetProperty(propName, newValue, propPriority) during
fixup results in a mutant URL caused by the CSS resolving the supplied relative
url into an absolute URL using the base address of the original location. So
"url(bg.gif)" becomes "url(http://foo.com/original_path/local_path/bg.gif)".
I'll have to figure out how it is getting its base address and fix it.
2. Just touching the .style property on an element causes all kinds of
nasty extra styles to leap into existence. For example:
<body style="background: yellow url(bg.gif)">
Becomes:
<body style="background: yellow url(126309_files/bg.gif) repeat scroll 0%;
-moz-background-clip: initial; -moz-background-inline-policy: initial;
-moz-background-origin: initial;">
Finally, the patch will have to be able to extract and fixup multiple urls that
the likes of "content" could contain. This is likely to complicate string
parsing matters a lot.
Attachment #121701 -
Attachment is obsolete: true
Comment 30•22 years ago
> I'll have to figure out how it is getting its base address and fix it.
For inline style, this is coming from GetBaseURL on the document object. For
style in stylesheets, this is the stylesheet's URI, which the sheet stores.
It's set at creation time via nsICSSStyleSheet::Init().
"background" is a shorthand property. So it will set all sorts of stuff. As
long as you only work with "background-image" you don't really have to worry
about it...
The cursor property is "cursor". Those things you have in the list are just
possible values. The problem is, you can have something like:
cursor: url(foo) crosshair;
Are there useful changes that could be made to the style system APIs that would
make this easier? Should some of those changes become part of DOM2 CSS?
Comment 31•22 years ago
Good lord. You really don't want to be doing actual parsing of CSS here. We
already have one CSS parser, having two just means that we'll have twice as many
bugs (probably more than twice, since this code won't be tested much).
Does it deal with CSS escapes? CSS comments? etc.
Surely there's a better way to do this.
Comment 32•22 years ago
It sorta deals with CSS escapes (probably to the extent that it's needed in this
code). Comments are a non-issue, since the only CSS we have to parse are
property values gotten via getPropertyValue(). We're basically parsing
canonicalized values here...
But yes, it would be nice if we could ask the CSS decl for that info, since it
already has it all broken down and has to trouble itself to produce CSS just so
we can reparse it....
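For what it's worth, the "parse the canonicalized value" approach could be sketched like this (an illustration only; it skips quoted strings so that content: "url(a.png)" url(b.png) yields only the real reference, and it does not handle full CSS escaping):

function extractUrls(propertyValue) {
  // Match quoted strings first so text like "url(a.png)" inside a string is
  // never treated as a reference; then match real url(...) tokens.
  const tokens = propertyValue.match(
    /"(?:[^"\\]|\\.)*"|'(?:[^'\\]|\\.)*'|url\(\s*[^)]*\)/g) || [];
  const urls = [];
  for (const tok of tokens) {
    if (tok.startsWith("url(")) {
      urls.push(tok.slice(4, -1).trim().replace(/^["']|["']$/g, ""));
    }
  }
  return urls;
}

// extractUrls('"url(fake.png)" url(real.png)')  ->  ["real.png"]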
Comment 33•22 years ago
I thought we already could? How does "view background image" work?
Comment 34•22 years ago
That gets the computed style, not the declared style. Computed style supports
getPropertyCSSValue() to a certain extent (not live, but better than nothing).
Note that we don't implement computed "content" either... ;)
Comment 35•22 years ago
Well in that case I would recommend writing a quick internal hack of an API. In
fact probably the simplest API would be on the stylesheet or rule interfaces,
which just serialises the entire stylesheet or rule using a new base URI.
The reason I say that, as opposed to "implement DOM2 CSS", is that DOM2 CSS is
likely to be junked in favour of a more sensible API.
Comment 36•22 years ago
Yes, I realize that. That's why we haven't implemented it. ;)
Serializing with a new base URI is nontrivial, since we only store absolute
URIs. We would need to duplicate some fun code that webbrowserpersist already
has. Further, for inline style there is no "sheet" -- there is just a
CSSDeclaration, and that implements no useful interfaces
(nsIDOMCSSStyleDeclaration/nsIDOMCSS2Properties are the closest it gets).
I suppose we could add another interface that you could QI these beasties to...
as long as we're careful, it should not be too expensive to do that...
Adam, I assume you're trying to fix this for 1.4?
Comment 37•22 years ago
I'll fix it for 1.4 if I can get it working reasonably at least for the CSS 1
stuff. I don't care so much about CSS 2 as background images are the most
visible breakage this patch tries to fix.
As I mentioned, there are a few issues to be sorted out before I would call it
ready for review. I'd like to know why these extra styles suddenly materialised
for example. I can probably live with forcing an absolute uri into the fixed up
style to make it work for now.
Comment 38•22 years ago
Comment on attachment 122047 [details] [diff] [review]
Work in progress 2
>+ "cursor-crosshair",
>+ "cursor-default",
>+ "cursor-pointer",
>+ "cursor-move",
>+ "cursor-e-resize",
>+ "cursor-ne-resize",
>+ "cursor-nw-resize",
>+ "cursor-n-resize",
>+ "cursor-se-resize",
>+ "cursor-sw-resize",
>+ "cursor-s-resize",
>+ "cursor-w-resize",
>+ "cursor-text",
>+ "cursor-wait",
>+ "cursor-help",
I've never heard of any of these.
Comment 39•22 years ago
Adam, those properties are just set by the "background" shorthand... Or are you
saying that something in your patch is making them appear? (I see nothing there
that should cause that to happen....)
Comment 40•22 years ago
I was confused about the cursor thing. I was taking the CSS2 spec to mean there
were cursor, cursor-wait, cursor-pointer etc. properties just as there is a
background and a more specific background-image. I can cut those too for the
time being.
I was trying to avoid having to deal with lines like this:
cursor: url(foo.gif), url(foo2.gif), url(foo3.gif), text;
As for those other properties, I still don't understand why they suddenly appear
like they do. They are not in the original source so why should they just
materialise when I ask for the style object? If the styles are harmless then
okay, but I suspect if I checked in a fix which has this behaviour, I'd soon see
another bug open complaining about them.
Comment 41•22 years ago
Don't forget -moz-binding: url(...); either, while you're at it.
Regarding the property explosion. Assuming the original stylesheet reads:
background: green;
...then it is exactly equivalent to:
background-color: green; background-image: none; background-position: 0% 0%;
background-attachment: scroll; background-repeat: repeat;
-moz-background-clip: initial; -moz-background-origin: initial;
/* and a few more that i've forgotten */
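A quick way to see the expansion being described, in any CSSOM-capable browser (the exact set of longhands and the serialized form of the style attribute vary between engines and releases):

const el = document.createElement("div");
el.style.background = "green";                               // set only the shorthand

console.log(el.style.getPropertyValue("background-color"));  // "green"
console.log(el.style.getPropertyValue("background-image"));  // "none"
console.log(el.style.getPropertyValue("background-repeat")); // "repeat"
// Serializing the declaration back out may therefore expose longhands (and, in
// the Gecko of this era, vendor-prefixed ones) that the author never wrote.
console.log(el.getAttribute("style"));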
Comment 42•22 years ago
Adam, those properties appear when you serialize the style attr. Your patch has
nothing to do with that....
Comment 43•22 years ago
I know they have nothing to do with my patch but expect another bug nonetheless.
If someone publishes a page with an inline style and sees these new things
appear, they're not going to understand why.
I'm not sure I do either. I can appreciate that saying "background:
url(foo.gif)" might internally imply some other styles but I am not sure that
they should be serialized when they were not explicitly defined in the first place.
I can raise another bug on that issue and revise my patch to concentrate on
background images for the time being.
Comment 44•22 years ago
Boris Zbarsky in comment 36 says:
>Serializing with a new base URI is nontrivial, since we only store absolute
>URIs. We would need to duplicate some fun code that webbrowserpersist already
>has.
Which "fun code" in webbrowserpersist would you need to duplicate? If you want
to get a relative url from an absolute url (relative to another location), you
can do so with nsIURL's getRelativeSpec:
AUTF8String getRelativeSpec(in nsIURI aURIToCompare)
Does that help?
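For readers unfamiliar with getRelativeSpec, this is roughly the computation it performs, sketched with the standard URL API rather than the XPCOM interface (simple same-directory/subdirectory case only; the real helper covers more ground):

function relativeSpec(absolute, base) {
  const a = new URL(absolute);
  const b = new URL(base);
  if (a.origin !== b.origin) return a.href;           // cannot be made relative
  const baseDir = b.pathname.replace(/[^/]*$/, "");   // strip the filename
  const path = a.pathname.startsWith(baseDir)
    ? a.pathname.slice(baseDir.length)
    : a.pathname;                                     // fall back to root-relative
  return path + a.search + a.hash;
}

// relativeSpec("http://foo.com/dir/bg.gif", "http://foo.com/dir/page.html") -> "bg.gif"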
Comment 45•21 years ago
*** Bug 206401 has been marked as a duplicate of this bug. ***
Comment 46•21 years ago
Encountering the bug when trying to save the URL for bug 216573.
Comment 47•21 years ago
*** Bug 221532 has been marked as a duplicate of this bug. ***
Comment 48•21 years ago
Is there a fix-for version for this yet?
With the increasing use of CSS (the latest Dreamweaver makes much better use of it)
this is going to bite more and more people!
I think it should be treated as a more major issue, as potentially navigation elements etc.
on webpages may vanish for no good reason.
Comment 49•21 years ago
See the target milestone. No one is working on this right now or will in the
immediate future. There is a work-in-progress patch here, and now that the
style system has been changed to also store non-absolute URIs things should be a
bit easier (issue 1 in comment 29 should not be present anymore). So someone
who cares about this bug needs to pick it up and make it work.
I'd say that a first cut could skip cursor and content and just do @import,
backgrounds, and -moz-binding...
Keywords: helpwanted
Comment 50•21 years ago
> No one is working on this right now or will in the immediate future.
Hey Boris... Where did you get that... This is one of the bugs I want to
see fixed for Nvu. Please don't reassign yet, I have plenty of stuff on
my plate right now.
Comment 51•21 years ago
glazou, I got it from earlier comments from Adam (the assignee), from the target
milestone, and from general knowledge of what I know people are working on.
Since the details of what you're doing have been shrouded in secrecy and since I
can't read minds (well, not ones that far away, at least), I hardly had a way of
knowing you were thinking of working on this... ;)
Comment 52•21 years ago
*** Bug 224801 has been marked as a duplicate of this bug. ***
Comment 53•21 years ago
I'm nominating this for 1.6 because I think saving web pages is a fairly common
task and as CSS becomes more prominent, we're seeing more and more pages that
don't save completely. dbaron thinks this might be more 1.7alpha material but it
does cause one of our smoketests, B.27, (and I suspect an increasingly
representative smoketest) to partially fail.
Flags: blocking1.6b?
Keywords: smoketest
Comment 54•21 years ago
The way to fix this is to implement saving web pages the way that MacIE does it
-- saving the files unchanged, annotated with their original URI, so that links
between dependent resources still work, even if they are done via obscure ways
like built from JavaScript (which we could never solve otherwise).
Comment 55•21 years ago
Jump ball. Who wants to try to tackle this? Glazman, can you take this?
Comment 56•21 years ago
*** Bug 226925 has been marked as a duplicate of this bug. ***
Comment 57•21 years ago
doesn't look like this is happening for 1.6
Flags: blocking1.6b? → blocking1.6b-
Comment 58•21 years ago
http://www.alistapart.com/articles/customunderlines/
Yet another good testcase for this bug, and for bug 126309.
The whole page is broken in Firebird 0.7, Firefox 0.8. (But I am not familiar
with Mozilla 1.6, so you'll have to test it there for yourselves. Sorry.)
Comment 59•21 years ago
http://www.alistapart.com/articles/customunderlines/ WFM
Mozilla/5.0 (X11; U; Linux i686; en-US; rv:1.6) Gecko/20040207 Firefox/0.8
Comment 60•21 years ago
Tom Chiverton, does http://www.alistapart.com/articles/customunderlines/ work
for you even after being saved (File, Save page as, Web page, complete) and then
opened from local drive?
Comment 61•21 years ago
Sergey- No, sorry, doesn't work after save.
My bad - I thought this bug was just about on-line pages, not saved ones too.
FWIW, it looks like the saved page hasn't had a style sheet applied, which I
guess fits with what seems to be wrong with Mozilla 'fixing' URLs when it saves
pages.
Is there any chance of this bug being assigned a non-future milestone, or at
least having severity increased - this bug is going to bite a lot more people as
other comments have indicated, as saving is fairly common.
Comment 62•20 years ago
*** Bug 245799 has been marked as a duplicate of this bug. ***
Comment 63•20 years ago
(In reply to comment #62)
> *** Bug 245799 has been marked as a duplicate of this bug. ***
(sorry, searched but couldn't find this report
from my report:)
Reproducible: Always
Steps to Reproduce:
1. Go the example URL http://dromoscopio.net/
2. Menu:File/Save As.../Save As Type: Web Page, complete
3. look at saved file in a file explorer
Actual Results:
none of the images are saved, eg
#uno div{
background-image: url(uno.gif);
}
Expected Results:
all images saved, even those declared in the CSS
default config
workaround:
1. hover mouse over desired CSS background image
2. right-mouseclick, View Background Image
3. right-mouseclick, Save Image As...
This is not critical, but it may render off-line viewing of saved webpages
impossible without editing the HTML/CSS, if the page is written with the font and
background colour the same. (The same issue occurs when browsing some sites with
images turned off: e.g. if the font color is white and the background
image is dark, but the background-color isn't set.)
Comment 64•20 years ago
*** Bug 251815 has been marked as a duplicate of this bug. ***
Comment 65•20 years ago
*** Bug 254306 has been marked as a duplicate of this bug. ***
Comment 66•20 years ago
*** Bug 255838 has been marked as a duplicate of this bug. ***
Comment 67•20 years ago
This problem will appear much more often in the future - see eg.
http://plone.org and any comments on how to be XHTML+CSS2 compliant and achieve
accessibility (AAA, S508 etc).
It breaks saving pages from plone.org right now.
Comment 68•20 years ago
An advocacy website that makes massive use of these techniques, and which
advocates them with good arguments, is
http://www.csszengarden.com/
Comment 69•20 years ago
The problem will appear more and more often. For example, now that mozilla.org
has a new style with a pretty background image: http://www.mozilla.org :(
Comment 70•20 years ago
*** Bug 268810 has been marked as a duplicate of this bug. ***
Comment 71•20 years ago
*** Bug 273218 has been marked as a duplicate of this bug. ***
Comment 72•20 years ago
*** Bug 274163 has been marked as a duplicate of this bug. ***
Updated•20 years ago
Flags: blocking-aviary1.1+
Comment 73•19 years ago
*** Bug 294976 has been marked as a duplicate of this bug. ***
Comment 74•19 years ago
Long silence here. I am not 100% sure: does this bug also cover the case where files
pulled in via includes in style files are not saved by "save page, complete"? If
so, the summary is quite misleading. Maybe someone who understands the
bug better can find better wording.
pi
Updated•19 years ago
Flags: blocking-aviary1.1+ → blocking-aviary1.1-
Comment 75•19 years ago
*** Bug 298819 has been marked as a duplicate of this bug. ***
Comment 76•19 years ago
*** Bug 299394 has been marked as a duplicate of this bug. ***
Comment 77•19 years ago
*** Bug 305438 has been marked as a duplicate of this bug. ***
Comment 78•19 years ago
*** Bug 305630 has been marked as a duplicate of this bug. ***
Comment 79•19 years ago
*** Bug 308489 has been marked as a duplicate of this bug. ***
Comment 80•19 years ago
(In reply to comment #74)
> Long silence here. I am not 100% sure. Does this bug here include that files
> called by include in style files are not included into save page,
Yeah, that's spot on Boris.
Updated•19 years ago
Flags: blocking1.9a1?
Flags: blocking-aviary2.0?
Comment 81•19 years ago
Not in the scope for Fx2, which is not going to include significant changes to Gecko.
Flags: blocking-aviary2? → blocking-aviary2-
Comment 82•19 years ago
Windows 2000, Firefox 1.5
For simplicity, here is an example. Save the page http://www.mozilla.org/ to your local disk, then open the saved page. You will see that the "mozilla.org" inscription and the dinosaur image at the top of the page have disappeared.
Sometimes this significantly disfigures pages, especially forums.
Comment 83•19 years ago
*** Bug 322817 has been marked as a duplicate of this bug. ***
Updated•19 years ago
Flags: blocking-firefox2-
Updated•19 years ago
Summary: CSS not fixed up by webbrowserpersist (background images not saved) → CSS not fixed up by webbrowserpersist ("save page as, complete" omits background images)
Comment 84•19 years ago
*** Bug 328588 has been marked as a duplicate of this bug. ***
Comment 85•19 years ago
Oh my, this bug was reported 4 years ago and it still isn't fixed? Why is it so hard?
Comment 86•19 years ago
*** Bug 332899 has been marked as a duplicate of this bug. ***
Updated•18 years ago
Flags: blocking1.9a1? → blocking1.9-
Whiteboard: [wanted-1.9]
Comment 87•18 years ago
This bug needs someone to step up if it's going to get fixed. All the code involved is currently unowned... I can promise quick patch reviews on this one.
Assignee: adamlock → file-handling
QA Contact: chrispetersen → ian
Comment 88•18 years ago
So, for what it's worth, the way I think this ought to be fixed is roughly as follows:
To avoid having two separate bits of CSS serialization code, we should teach the style system to serialize with URI fixup.
For a start, this requires the nsWebBrowserPersist code to implement an interface a lot like nsIDocumentEncoderNodeFixup -- except for URI fixup. In other words, the interface would take a URI as input (probably nsIURI) and return a URI as output (probably string), potentially enqueueing the saving of the URI in question as part of the implementation.
Then we could add a Serialize method to nsIStyleSheet that took one of these as an argument, where a null object would mean that no URI fixup is needed. Furthermore, we would add Serialize methods to everything that has a GetCssText method, by renaming GetCssText and adding the new argument, and then adding a GetCssText that simply calls Serialize(null). This object would further have to be passed down through nsCSSDeclaration::ToString and the methods it calls -- where, again, null would mean that no fixup was needed.
I think this is highly preferable to reimplementing CSS parsing and serialization.
Boris, does this seem reasonable?
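To make the shape of that proposal concrete, here is the same idea sketched in JavaScript rather than the actual XPCOM interfaces (all names below are illustrative; the essential point is that serialization takes an optional fixup object, and a null fixup means "serialize as-is"):

function makeUriFixup(saveQueue) {
  let counter = 0;
  const mapping = new Map();
  return {
    // Take the original URI, queue it for saving, and return the local name
    // that should appear in the serialized CSS.
    fixup(originalUri) {
      if (!mapping.has(originalUri)) {
        const ext = (originalUri.match(/\.[A-Za-z0-9]+(?=$|[?#])/) || [""])[0];
        const localName = "file" + counter++ + ext;
        mapping.set(originalUri, localName);
        saveQueue.push({ from: originalUri, to: localName });
      }
      return mapping.get(originalUri);
    },
  };
}

function serializeSheet(sheet, fixup /* null means: no URI fixup */) {
  let out = "";
  for (let i = 0; i < sheet.cssRules.length; i++) {
    const text = sheet.cssRules[i].cssText;
    // A real implementation would thread `fixup` down into each rule's own
    // serializer; here we only rewrite url(...) tokens in the rule's cssText.
    out += (fixup
      ? text.replace(/url\(\s*(['"]?)([^'")]+)\1\s*\)/g,
                     (whole, quote, uri) => 'url(' + quote + fixup.fixup(uri) + quote + ')')
      : text) + "\n";
  }
  return out;
}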
Comment 89•18 years ago
(I plan to work on this, although not in the next few days.)
Comment 90•18 years ago
Yeah, the plan in comment 88 sounds pretty good.
In fact, it makes me wish that our content serialization worked more like that... Then fixup would work for, say, SVG too...
Comment 91•18 years ago
*** Bug 364036 has been marked as a duplicate of this bug. ***
Comment 94•18 years ago
As a temporary solution you might be interested in trying the Save Complete [1] extension. It supports saving images referenced in CSS files. We also discuss the problem in the « "File -> Save Page As" does not save images fro » [2] MozillaZine Forums topic.
Notes:
* [1] <https://addons.mozilla.org/en-US/firefox/addon/2925>
* [2] <http://forums.mozillazine.org/viewtopic.php?t=538458>
Comment 97•17 years ago
So I've been working on dbaron's suggestion for a week or two now. I'm leaving for the rest of the weekend in half an hour to run my first half marathon (eep!) so I figured I'd post what I have so far for preliminary review.
I've tried to avoid gratuitous refactoring of nsWebBrowserPersist, opting for small changes/hacks around an architecture not really aligned with the sequence of operations CSS serialization requires.
Right now, the biggest thing left to implement feature-wise is to make CSS and other content-serialized, not-downloaded files get the right file extension. Currently, wbp fixes up persisted files with nsIChannel-derived mime types; since we're not downloading the CSS files we persist from a remote server, that's not going to work. I haven't spent more than a few minutes looking at how much I have to work with, so I'm not sure of the best way to go about doing that. For some things it's easy (nsICSSStyleSheet should be saved as .css) but for things like background-image: url(blah), I'm not sure whether CSS knows what the mime type of blah is. Can't just take the extension from the URI, since that fails for dynamically-generated pages. For images I guess we do content-sniffing anyways, so we could save it as whatever we wanted, but still.
Anyways, I'll be back Monday and can hopefully post a more polished/complete patch later in the week. This patch deserves at least two r-'s in its current state. At least there's enough code there to make comments on.
Attachment #279129 -
Flags: review?(bzbarsky)
Comment 98•17 years ago
Oh, one other thing: we technically don't need to persist @import-ed stylesheets linked to from style elements as separate files, their child rules can be recursively serialized along with the main page stylesheet. Right now the code does both, I think, because it's in flux. Since that's the first thing in the diff I just wanted to comment on it.
Comment 99•17 years ago
I doubt I'll be able to get to this until after I get back in mid-to-late Sept.
> their child rules can be recursively serialized along with the main page
> stylesheet
Not if they contain @charset, @namespace, etc rules.
Comment 102•17 years ago
Comment on attachment 279129 [details] [diff] [review]
preliminary, incomplete implementation
>Index: layout/style/nsCSSStyleSheet.cpp
>+ aContent.Append(NS_LITERAL_STRING("/* Rules from linked stylesheet, original URL: */\n"));
As I said, this is no good if the linked stylesheets have @namespace rules (or @charset or @import, but skipping those won't necessarily break things, while skipping @namespace most definitely will). Please make sure to write tests for this case.
What you probably want to do instead is to serialize the rules, and have @import serialization start the serialization of the imported sheet. I believe that's what dbaron suggested too.
>+ for (i = 0; i < styleRules; ++i) {
>+ rv = GetStyleRuleAt(i, *getter_AddRefs(rule));
Declare |rule| here, not up at the beginning somewhere?
>Index: layout/style/nsHTMLCSSStyleSheet.cpp
>+ NS_IMETHOD Serialize(nsAString& aContent, nsIDocumentEncoderFixup *aFixup);
Why not just put this method on nsICSSStyleSheet, since those are all you serialize?
>Index: layout/style/nsICSSRule.h
And probably put this on nsICSSStyleRule.
I don't see any rule implementations of Serialize() in this patch. Some of the rules will require fixup too, of course (@document rules come to mind).
Of course such rules in user sheets won't work right once saving has happened, but getting that to work is food for another bug, I think.
>Index: embedding/components/webbrowserpersist/src/nsWebBrowserPersist.cpp
>+#include "nsIFormControl.h"
>+#include "nsIDOM3Node.h"
This looks like part of another patch, right?
>+ // Leave mCurrentDataPath alone so that CSS fixup knows where to save
>+ //mCurrentDataPath = oldDataPath;
This is pretty questionable. I'd have to dig to make sure, but I would bet it's wrong.
>@@ -2504,17 +2522,37 @@ nsWebBrowserPersist::EnumPersistURIs(nsH
>+ rv = cos->Init(outputStream, nsnull, 0, 0);
>+ NS_ENSURE_SUCCESS(rv, rv);
This is wrong if the sheet has an @charset rule. Unless you plan to strip those, in which case it's wrong if the main document is not UTF-8. Again, please add tests.
>+ PRBool wroteFullString = PR_FALSE;
>+ rv = cos->WriteString(data->mContents, &wroteFullString);
Why is wroteFullString being ignored? Perhaps this should be called something else?
>+ rv = cos->Flush();
>+ rv = outputStream->Flush();
>+ rv = cos->Close();
Why bother assigning to rv if you plan to ignore it?
>+ nodeAsLink->GetType(type);
>+ if (type.EqualsLiteral("text/css")) {
This is wrong. |type| could be an empty string for a CSS style sheet. Further, if type is "text/css" there could be no DOMStyleSheet attached to the node. You probably want to just GetSheet() and then null-check it. Again, add tests.
>+ nsAutoString content;
>+ ss->Serialize(content, mFixup);
...
>+ data->mContents.Assign(content);
Why not get the |data| first and just pass data->mContents to Serialize()?
I haven't reviewed the URI-munging details.
CSS fixup should probably also be applied to "style" attributes. Might be a separate bug.
To answer your questions in the bug:
> I've tried to avoid gratuitous refactoring of nsWebBrowserPersist
If doing said refactoring (in a separate patch prior to implementing this) would make things better, please go for it! Unless you think you want to get this into 1.9 and that would make it harder; in that case please file a followup on the refactoring.
> for things like background-image: url(blah), I'm not sure whether CSS knows
> what the mime type of blah is.
For images in particular, it does if they were used and we've gotten far enough in downloading them to know the type. More precisely, given an nsCSSValue::Image you can get its imgIRequest and get the type from that.
But really, if we're persisting those images (which we should be, right?), we'll know the type because those we _do_ get via an nsIChannel.
Attachment #279129 -
Flags: review?(bzbarsky) → review-
Comment 103•17 years ago
Updated patch attached. It correctly serializes CSS images. I actually had most of this work done before your review; I should have uploaded what I had earlier, since this resulted in some amount of wasted effort for you, bz. Mea culpa.
I think I removed all the changes related to bug 293834 in this patch.
(In reply to comment #102)
> (From update of attachment 279129 [details] [diff] [review])
> >Index: layout/style/nsCSSStyleSheet.cpp
> >+ aContent.Append(NS_LITERAL_STRING("/* Rules from linked stylesheet, original URL: */\n"));
>
> As I said, this is no good if the linked stylesheets have @namespace rules (or
> @charset or @import, but skipping those won't necessarily break things, while
> skipping @namespace most definitely will). Please make sure to write tests for
> this case.
>
> What you probably want to do instead is to serialize the rules, and have
> @import serialization start the serialization of the imported sheet. I believe
> that's what dbaron suggested too.
>
I believe the code should now do this, but haven't written tests yet.
> >+ for (i = 0; i < styleRules; ++i) {
> >+ rv = GetStyleRuleAt(i, *getter_AddRefs(rule));
>
> Declare |rule| here, not up at the beginning somewhere?
done
>
> >Index: layout/style/nsHTMLCSSStyleSheet.cpp
> >+ NS_IMETHOD Serialize(nsAString& aContent, nsIDocumentEncoderFixup *aFixup);
>
> Why not just put this method on nsICSSStyleSheet, since those are all you
> serialize?
>
> >Index: layout/style/nsICSSRule.h
>
> And probably put this on nsICSSStyleRule.
>
Done, though I had Serialize on nsICSSRule and not nsICSSStyleRule because the non-style rules in nsCSSRules.h require Serialize too. But it's not much different either way, I guess.
> I don't see any rule implementations of Serialize() in this patch.
Added.
> Some of the rules will require fixup too, of course
> (@document rules come to mind).
>
> Of course such rules in user sheets won't work right once saving has happened,
> but getting that to work is food for another bug, I think.
Haven't tested/accounted for this yet.
> >Index: embedding/components/webbrowserpersist/src/nsWebBrowserPersist.cpp
> >+#include "nsIFormControl.h"
> >+#include "nsIDOM3Node.h"
>
> This looks like part of another patch, right?
>
Yes, they're from bug 293834, which this bug depends on to correctly persist serialized style elements.
> >+ // Leave mCurrentDataPath alone so that CSS fixup knows where to save
> >+ //mCurrentDataPath = oldDataPath;
>
> This is pretty questionable. I'd have to dig to make sure, but I would bet it's
> wrong.
Yeah, that was leftover code from experimentation that I neglected to delete. Removed.
>
> >@@ -2504,17 +2522,37 @@ nsWebBrowserPersist::EnumPersistURIs(nsH
> >+ rv = cos->Init(outputStream, nsnull, 0, 0);
> >+ NS_ENSURE_SUCCESS(rv, rv);
>
> This is wrong if the sheet has an @charset rule. Unless you plan to strip
> those, in which case it's wrong if the main document is not UTF-8. Again,
> please add tests.
What would be the correct way of persisting @charset rules? Extract the @charset and write the stream with that encoding?
>
> >+ PRBool wroteFullString = PR_FALSE;
> >+ rv = cos->WriteString(data->mContents, &wroteFullString);
>
> Why is wroteFullString being ignored? Perhaps this should be called something
> else?
Er, because WriteString requires a boolean out param, and I couldn't think of what we might do if the full string wasn't written?
>
> >+ rv = cos->Flush();
> >+ rv = outputStream->Flush();
> >+ rv = cos->Close();
>
> Why bother assigning to rv if you plan to ignore it?
rv's removed.
>
> >+ nodeAsLink->GetType(type);
> >+ if (type.EqualsLiteral("text/css")) {
>
> This is wrong. |type| could be an empty string for a CSS style sheet.
> Further, if type is "text/css" there could be no DOMStyleSheet attached to the
> node. You probably want to just GetSheet() and then null-check it. Again, add
> tests.
Code fixed, tests later.
>
> >+ nsAutoString content;
> >+ ss->Serialize(content, mFixup);
> ...
> >+ data->mContents.Assign(content);
>
> Why not get the |data| first and just pass data->mContents to Serialize()?
done
>
> I haven't reviewed the URI-munging details.
>
> CSS fixup should probably also be applied to "style" attributes. Might be a
> separate bug.
Yeah, I think leaving that as a separate bug will be better than squeezing it in this one.
>
> To answer your quesions in the bug:
>
> > I've tried to avoid gratuitous refactoring of nsWebBrowserPersist
>
> If doing said refactoring (in a separate patch prior to implementing this)
> would make things better, please go for it! Unless you think you want to get
> this into 1.9 and that would make it harder; in that case please file a
> followup on the refactoring.
>
> For images in particular, it does if they were used and we've gotten far enough
> in downloading them to know the type. More precisely, given an
> nsCSSValue::Image you can get its imgIRequest and get the type from that.
>
> But really, if we're persisting those images (which we should be, right?),
> we'll know the type because those we _do_ get via an nsIChannel.
>
Yes, but the problem is that, due to the way webbrowserpersist is structured, we need to know the type of what we're going to download before we download it. The "normal" flow for webbrowserpersist is this:
SaveDocument
SaveDocumentInternal
OnWalkDOMNode results in calls to StoreURI
SaveGatheredURIs
EnumPersistURIs results in fixed-up filenames based on download channel MIME type
SaveDocuments results in calls to FixupURI, which maps original URIs to fixed-up filenames.
In essence, it boils down to webbrowserpersist expecting to do two passes, and to download files before determining the filename to replace the URI with, whereas dbaron's writeup is a one-pass system, where the URI is immediately replaced by a filename. It works if you do a little munging with MIME types, it's just not pretty (thus the musing about refactoring).
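Restating that two-pass flow in illustrative pseudo-JavaScript (the function names are stand-ins for the C++ methods listed above, not real APIs):

function persistTwoPass(doc, collectUris, download) {
  // Pass 1: walk the DOM and gather every URI that needs saving
  // (cf. OnWalkDOMNode / StoreURI).
  const gathered = collectUris(doc);

  // Pass 2: download each one; only now is the channel MIME type known,
  // and with it the final local filename (cf. EnumPersistURIs).
  const localName = new Map();
  for (const uri of gathered) {
    const { mimeType } = download(uri);
    localName.set(uri, pickFilename(uri, mimeType));
  }

  // The documents are then re-serialized, replacing each original URI with
  // its local filename (cf. SaveDocuments / FixupURI).
  return localName;
}

function pickFilename(uri, mimeType) {
  const base = uri.split("/").pop().split(/[?#]/)[0] || "index";
  if (base.includes(".")) return base;
  return base + (mimeType === "text/css" ? ".css" : "");  // simplified
}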
Attachment #279129 -
Attachment is obsolete: true
Attachment #282460 -
Flags: review?(bzbarsky)
Comment 104•17 years ago
> Done, though I had Serialize on nsICSSRule
Yeah, that's what I meant.
> What would be the correct way of persisting @charset rules? Extract the
> @charset and write the stream with that encoding?
Yes. For perfect fidelity, we'd have sheets know what charset they were parsed as, and serialize as that charset, modifying or inserting @charset rules as needed. That's what we do for documents...
> Er, because WriteString requires a boolean out param
How about calling the stack boolean "ignored" then? Or warning if it's false? Or something?
> In essence, it boils down to webbrowserpersist expecting to do two passes, and
> to download files before determining the filename to replace the URI with,
Yes, that's the right way to go about it.
I doubt I'll be able to review this any time soon. I definitely won't spend time reviewing until we have tests that this passes, but even then I really doubt that I'll be able to do it within a reasonable timeframe (measured in months).
Comment 105•17 years ago
>> In essence, it boils down to webbrowserpersist expecting to do two passes, and
>> to download files before determining the filename to replace the URI with,
>
> Yes, that's the right way to go about it.
Do you mean to say that you think that when CSS is being fixed up, FixupURI should block and not return until the original URI has been downloaded? Wouldn't forcing all the downloads to be made in serial like that have an adverse impact on performance?
Also, by "tests," do you mean automated tests, or just manually-verifiable testcases? I have no idea how to automate testing of webbrowserpersist...
Comment 106•17 years ago
Re: testing. I had a patch in bug 366072 to make use of reftest for WBP testing, but it bit-rotted since then and I didn't get to update it.
Comment 107•17 years ago
> Do you mean to say that you think that when CSS is being fixed up, FixupURI
> should block and not return until the original URI has been downloaded?
How do we handle <img> right now? We should handle CSS the same way, basically.
> Also, by "tests," do you mean automated tests
Yes, ideally automated tests. But even manually-verifiable ones would be a start....
Comment 108•17 years ago
Comment on attachment 282460 [details] [diff] [review]
mostly complete patch (actually not nearly complete)
Per comment 104, someone else should review this...
Attachment #282460 -
Flags: review?(bzbarsky)
Comment 111•17 years ago
Just test wikipedia.org: the master CSS file includes other CSS files, e.g.
@import url('./anothercss.css');
@import url('./anothercss-two.css');
Those must also be saved, along with the images and any further CSS they include.
Comment 112•17 years ago
Comment on attachment 282460 [details] [diff] [review]
mostly complete patch (actually not nearly complete)
On the principle that it's better off in somebody's review queue than nobody's, adding this to my review queue, although I also probably won't get to it for a bit.
Attachment #282460 -
Flags: review?(dbaron)
Updated•17 years ago
Assignee: file-handling → web+moz
Flags: wanted1.9+
QA Contact: ian → file-handling
Whiteboard: [wanted-1.9]
Target Milestone: Future → ---
Just FYI; added this bug # to https://litmus.mozilla.org/show_test.cgi?id=3952 for easier reference.
Flags: in-litmus+
Comment 115•17 years ago
The defect is still reproducible in version 3.0b5pre. Firefox doesn't save background images referenced from CSS when using Save Page As, Web Page, complete.
Comment 116•17 years ago
The defect is reproducible in version 3.0b5pre (XP, Vista, Mac OS X 10.4). Firefox doesn't save background images referenced from CSS when using Save Page As, Web Page, complete.
Comment 118•16 years ago
Bug is still there on the latest nightly (Mozilla/5.0 (Windows; U; Windows NT 5.1; ru; rv:1.9.1a2pre) Gecko/2008072803 Minefield/3.1a2pre).
BTW, bug 293834 was recently fixed. Will that help to fix this bug? All the blockers are gone now.
Updated•16 years ago
Flags: blocking1.9.1?
Comment 119•16 years ago
Adding qawanted keyword to reflect the request for automated tests for this.
Keywords: qawanted
Updated•16 years ago
Flags: in-testsuite?
Updated•16 years ago
Flags: blocking1.9.1? → blocking1.9.1-
Updated•16 years ago
Flags: wanted1.9.1?
Comment 122•16 years ago
So I've been thinking about this a little bit, and I'm wondering how tolerable the loss of accuracy in the CSS is. In particular, we lose properties we don't support, conditional comment hacks, etc.
An alternative approach might involve re-tokenizing the CSS but not parsing it, and then fixing up all the url() functions. That's not trivial either, though, given that we have some context-sensitive tokenization, especially for URLs.
I guess this approach probably is ok, though. But in the long run I think "Save Page As, Complete" is broken-by-design, and if we want to save complete Web pages we ought to support saving them into some package format.
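A rough illustration of that tokenization-level approach (fixupUri() is a placeholder for the persist code's URI mapping; real CSS tokenization, with escapes, bad-url handling and so on, is considerably more involved than this):

function fixupStylesheetText(cssText, fixupUri) {
  // Alternatives, tried in order at each position: comments, bare strings
  // (left untouched), url(...) tokens, and the string form of @import.
  const token = /\/\*[\s\S]*?\*\/|"(?:[^"\\]|\\.)*"|'(?:[^'\\]|\\.)*'|url\(\s*("(?:[^"\\]|\\.)*"|'(?:[^'\\]|\\.)*'|[^)]*)\s*\)|@import\s+("(?:[^"\\]|\\.)*"|'(?:[^'\\]|\\.)*')/g;
  return cssText.replace(token, (match, urlBody, importString) => {
    if (match.startsWith("/*") || match.startsWith('"') || match.startsWith("'")) {
      return match;                                   // comments and bare strings
    }
    if (importString !== undefined) {                 // @import "foo.css"
      const q = importString[0];
      return "@import " + q + fixupUri(importString.slice(1, -1)) + q;
    }
    const raw = urlBody.trim().replace(/^["']|["']$/g, "");
    return 'url("' + fixupUri(raw) + '")';            // url(foo) / url("foo")
  });
}

// @import url(...) is handled by the url(...) branch; @import "..." by the last one.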
Comment 123•16 years ago
A cross-browser package format standard would be more than welcome, but in the meantime what about integrating Scrapbook's (https://addons.mozilla.org/en-US/firefox/addon/427) excellent ability to save complete web pages into Firefox?
Comment 125•15 years ago
Will we have any progress on this bug in the near future?
Comment 126•15 years ago
Comment 127•15 years ago
My understanding is that this affects saving our current home page
http://www.google.com/firefox?client=firefox-a&rls=org.mozilla:en-US:official
I attached the before and after shots for the "Save Page..." as web page
Complete.
This is on Mozilla/5.0 (Windows; U; Windows NT 5.1; en-US; rv:1.9.0.13)
Gecko/2009073022 Firefox/3.0.13
STR:
1) Install firefox
2) See start page
3) Select File -> Save Page As...
4) Make sure "save as type:" is "Web Page, complete"
5) save it to some convenient place like your desktop
6) open the file on your desktop, called firefox by default.
Comment 128•15 years ago
Comment 129•15 years ago
I won't have time to work on this in the near future, so reverting to default owner.
Assignee: eschew → nobody
Comment 132•15 years ago
This is mostly merged to trunk (I haven't tried compiling yet, though). The aSerializedCloneKids change appears to have landed already. I merged the URI serialization code in nsCSSDeclaration. But I still need to add a replacement for the code that was patching TryBackgroundShorthand in the old patch.
Comment 133•15 years ago
So now that I'm attempting to compile this, I'm having trouble figuring out how *any* of the patches in this bug ever compiled (well, I can see how the first patch could compile... but it wouldn't link). They both have code in nsCSSStyleSheet.cpp that calls a Serialize method on either nsICSSRule or nsICSSStyleRule. In the first patch it was declared pure virtual on nsICSSRule but never implemented; in the second patch it's neither declared nor implemented.
And it makes far more sense to me for that method to be on nsICSSRule. The current serialization code simply omits things like @media rules, @namespace rules, etc.
So I'd thought this patch was really ready for review, but it seems like it has some pretty major gaps in it.
Or were there changes you had in your tree that you just didn't include in your diff?
Comment 134•15 years ago
Also, the patch renames nsEncoderNodeFixup to nsEncoderFixup in its implementation, but doesn't touch its declaration... which also clearly isn't going to compile, unless, again, many entire files are missing from the patch.
Comment 135•15 years ago
I redid a bunch of the guts of the CSS changes to be much more thorough, and thus hit all CSS properties with URLs in them. (Though it still doesn't hit downloadable fonts, but could once the issue below is fixed.)
I also filled in most of the missing pieces of code. However, the CSS rule serialize implementations are just "FIXME: WRITE ME".
Attachment #395446 -
Attachment is obsolete: true
Comment 139•15 years ago
Comment on attachment 282460 [details] [diff] [review]
mostly complete patch (actually not nearly complete)
Marking review- because this wasn't actually anywhere near complete. (And see also other comments above.)
Attachment #282460 -
Attachment description: mostly complete patch → mostly complete patch (actually not nearly complete)
Attachment #282460 -
Flags: review?(dbaron) → review-
Comment 142•13 years ago
Issue is still reproducible on the latest nightly:
Mozilla/5.0 (Windows NT 5.1; rv:7.0a1) Gecko/20110704 Firefox/7.0a1
Comment 143•13 years ago
Confirming on latest Seamonkey build
Build identifier: Mozilla/5.0 (Windows NT 6.1; WOW64; rv:10.0a1) Gecko/20111017 Firefox/10.0a1 SeaMonkey/2.7a1
Comment 144•13 years ago
One and a half more months and we will be able to celebrate the 10th birthday of this bug. ^___^
Comment 145•13 years ago
It seems the devs don't save pages at all.
They don't pay attention to important bugs such as this one and Bug 653522 - saving pages when offline (or using adblock) inundates the user with error dialogs:
https://bugzilla.mozilla.org/show_bug.cgi?id=653522
Comment 146•12 years ago
Why are major bugs in core functionality like this not fixed when instead we get garbage like microsummaries, geolocation, and DNS prefetching?
Comment 147•12 years ago
Wrt. comment #146: the problem is likely that you and I don't pay the devs to do so, and the people who do see more value in a read-only web and user tracking. It's also usually much easier to make something new than to fix old bugs.
Comment 148•12 years ago
The problem is really that there's no good approach here with the current "Save As, Complete" model. If we fix this, we'll break the CSS working in other browsers.
A better solution would be replacing Save As, Complete (which attempts to rewrite the pages to fix up URLs) with a cross-browser package format for saved documents (which could save resources as they were).
Comment 149•12 years ago
(In reply to David Baron [:dbaron] from comment #148)
> If we fix this, we'll break the CSS working in other browsers.
Only for hacks and browser-specific properties. This is not a big deal. The same problem exists anyway for things like IE's conditional comments, or any kind of user-agent selectivity on the server, or indeed any other type of hack or browser-specific feature.
But obviously it's SO much better with the current situation that it works in no browsers at all!
Are you seriously more worried about browser-specific CSS features, which are probably expected to degrade gracefully anyway, than, oh I don't know, background-image!?
> A better solution would be replacing Save As, Complete (which attempts to
> rewrite the pages to fix up URLs) with a cross-browser package format for
> saved documents (which could save resources as they were).
And you'd STILL need to rewrite the URLs. Come on, give it half a thimble of thought.
Comment 150•12 years ago
(In reply to Anonymous from comment #149)
> (In reply to David Baron from comment #148)
> > A better solution would be replacing Save As, Complete (which attempts to
> > rewrite the pages to fix up URLs) with a cross-browser package format for
> > saved documents (which could save resources as they were).
> And you'd STILL need to rewrite the URLs.
Let's suppose for the sake of argument that we used IE's .mht format. It doesn't rewrite the URLs. Instead, each entry in the file is associated with its original URL. (What I don't know is whether it would be possible to achieve this scheme in Gecko.)
Comment 151•12 years ago
(In reply to neil@parkwaycc.co.uk from comment #150)
>Instead, each entry in the file is associated with its original URL.
Oh I see. Fair enough. But unless anyone thinks that the feature to save a web page as individually manipulable files should be completely removed, the bug is still not avoidable, and the URLs still need rewriting.
Just for the record, I do have code in BlueGriffon achieving a full rewrite
of all stylesheets attached to a given document, tweaking all URIs.
I needed it to be able to turn an arbitrary document into a reusable
well-packaged template. That code is based on my parser/serializer JSCSSP.
In other words, a full rewrite is doable but requires considerable complexity
and footprint...
Comment 153•12 years ago
(In reply to David Baron [:dbaron] from comment #148)
> A better solution would be replacing Save As, Complete (which attempts to
> rewrite the pages to fix up URLs) with a cross-browser package format for
> saved documents
Modern pages have very big JS files - 500 KB or 1 MB per page. I sometimes delete the JS files from the folders of saved pages, and there are plenty of other reasons to open the folders of saved pages. I will never use a browser whose "Save As" offers only an archive format.
Please don't add reasons not to use Gecko browsers. (I currently use SeaMonkey 1.1.19 as my main browser only because of a regression in the Save As function; see Bug 653522.)
Comment 154•12 years ago
(In reply to David Baron [:dbaron] from comment #148)
> A better solution would be replacing Save As, Complete (which attempts to
> rewrite the pages to fix up URLs) with a cross-browser package format for
> saved documents (which could save resources as they were).
I use "Save As, Complete" to save complete pages so that I can examine their source code. So I need a working local copy of the page, not just a package.
Comment 155•12 years ago
I agree with Emin: save the source as seen in view-source and rewrite the URLs of resources to point to a local folder next to the saved page. Multiple requests are definitely allowed. There is no reason to optimize performance on this task, as it is user-invoked and the expectation is that the current document is saved as-is, for various reasons, for later use - not as Firefox reads the document, nor as quickly as possible, nor using as little space as possible; there is "Save document" (not complete) for that.
Comment 156•12 years ago
Actually that's not enough. A complete save must save the page as it is visible, with all elements inserted by AJAX (like comments); if some elements are missing (like advertisements removed by an active adblocker), then they should remain missing. Even any additional user CSS applied to elements on the page must be preserved as-is.
Otherwise it is not complete.
I mean it has to crawl the DOM and generate the page from it, instead of saving the page's source with rewritten links.
I do agree there is no need for fast performance, but when you open such a page it must resemble the page you saved as closely as possible. It cannot be considered complete if it downloads anything from the web on open. Also, I don't think preserving interactivity is expected behavior in this case. Users usually save a page to be able to read it later or to send it to someone else to read, so it must be preserved as they saw it when they decided to save it.
For investigation purposes there are separate applications, like the "DownThemAll!" extension. You can configure exactly how it saves things and how deep it goes. It does even more than Emin requests: it not only makes a local copy of the page, it can make a local copy of the whole site or a decent part of it.
Comment 157•12 years ago
@Lain_13 Do you mean something like PhantomJS for Gecko?
http://stackoverflow.com/questions/5490438/phantomjs-and-getting-modified-dom
Comment 158•12 years ago
Not exactly, but kind of. In your example the code generates a screenshot of the site, while a complete copy of the page has to remain as text and linked objects so you can scale it, select, copy and everything else. The only thing that shouldn't work is interactivity; I mean scripts must not run.
BTW, Scrapbook already does such a thing. I'd just like to see the ability to make a complete stand-alone static copy of a page in Firefox itself.
Comment 159•12 years ago
Well, the example also does that. As does this: http://code.google.com/p/phantomjs/wiki/QuickStart#DOM_Manipulation but it might not be so clear.
This is the interesting bit:
var page = require('webpage').create(),
    url = 'http://lite.yelp.com/search?find_desc=pizza&find_loc=94040&find_submit=Search';
page.open(url, function (status) {
    if (status !== 'success') {
        console.log('Unable to access network');
    } else {
        var html = page.evaluate(function() {
            return document.documentElement.outerHTML; // <- get all generated and styled HTML
        });
        console.log(html);
    }
    phantom.exit();
});
There is a similar project for Gecko called offscreen but I can't see how it's possible to get the generated HTML with this though.
Project Page: http://offscreengecko.sourceforge.net/
Source code: http://hg.mozilla.org/incubator/offscreen/
Comment 160•12 years ago
(In reply to David Baron [:dbaron] from comment #148)
> The problem is really that there's no good approach here with the current
> "Save As, Complete" model. If we fix this, we'll break the CSS working in
> other browsers.
>
> A better solution would be replacing Save As, Complete (which attempts to
> rewrite the pages to fix up URLs) with a cross-browser package format for
> saved documents (which could save resources as they were).
Consider how some browsers are closed source and written to the interests of their owners, and how they implement CSS or JS compared to the standards... duh.
On the other hand, I would strongly vote against such a method, one reason being the ability to examine the various resource files related to a page.
I use an extension called Web Developer Toolbar. It has a neat feature called "view generated source", which contains all content generated/modified by JS, as displayed. Based on this, one could easily reference each resource file (image/CSS/etc.) for saving.
Also, saving a webpage locally does not, in principle, imply that the entire webpage functionality will be preserved, for the simple reason that some features require online interactivity with the source server, which is impossible to replicate in a local-save and offline-viewing scenario.
Comment 162•12 years ago
It would be nice to have that feature back. I also like saving pages for offline inspection later ("coding on the lake" and that sort of thing :).
Comment 163•12 years ago
This bug report/ticket has existed for eleven years.
Is there really no one interested in fixing it?
Comment 164•12 years ago
Some remarks regarding creation of 'faithful offline snapshot of displayed page' from browser:
- Aforementioned "Save complete" addon [0] seems no longer maintained.
- There is "Mozilla Archive Format" addon [1][2] which promotes 'Faithful save system' or 'Exact snapshot' feature, seems to work quite well. Maybe worth checking.
- (Parity) Opera browser is able to successfully save page with imported resources as well.
[0] https://addons.mozilla.org/en-US/firefox/addon/save-complete-4723/
[1] http://maf.mozdev.org/
[2] https://addons.mozilla.org/en-US/firefox/addon/mozilla-archive-format/
Comment 165•11 years ago
I apologize for the "me too" voice here.
dbaron, are you saying that this bug is simply not fixable?
> If we fix this, we'll break the CSS working in other browsers.
But right now the CSS doesn't work in any browser. What would be the harm in just making it work in Firefox?
Comment 166•11 years ago
Well, it could break things that aren't broken today. Though now that prefixes are becoming less common that would be less of an issue, so I'm less worried about it than I was a few years ago.
Alternatively, we could do tokenization-level fixup, which actually might not be that bad, except it's a little tricky for @import, which takes strings in addition to url()s.
Comment 167•11 years ago
I feel inclined to develop a Firefox add-on for this. To me, this seems fairly simple (see the sketch after this list):
1. Just take the download tree (e.g. as seen in Firebug etc.) and redownload it all.
2. Save all files to a folder as-is (which is, if I remember correctly, the current behavior).
3. Parse all CSS/JS files and replace URIs with local paths.
4. Have heaps of fun with script loaders (AMD et al.) making no. 3 difficult.
There you have it: the page, saved in its entirety.
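The sketch referred to above, covering steps 1-3 with modern APIs (illustrative only; a real add-on would use the WebExtensions downloads API and handle far more cases, and every name here is made up):

async function savePageResources(resourceUrls, rewriteCss) {
  const localNames = new Map();                 // original URL -> local filename (step 2)
  let n = 0;

  // Step 1: re-download everything in the page's resource tree.
  const files = await Promise.all(resourceUrls.map(async (url) => {
    const response = await fetch(url);
    const body = await response.blob();
    const base = url.split("/").pop().split(/[?#]/)[0] || "file";
    const local = "res" + (n++) + "_" + base;
    localNames.set(url, local);
    return { url, local, body };
  }));

  // Step 3: rewrite url(...) references inside any CSS we fetched; step 4
  // (script loaders) is left as the promised "heaps of fun".
  for (const file of files) {
    if (file.local.endsWith(".css")) {
      const text = await file.body.text();
      file.body = new Blob([rewriteCss(text, localNames)], { type: "text/css" });
    }
  }
  return files;   // the caller writes these blobs into a folder next to the page
}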
Comment 168•11 years ago
Actually, the MAF add-on (https://addons.mozilla.org/en-us/firefox/addon/mozilla-archive-format/) does appear to do exactly that.
If you have it installed, Firefox Save complete page works much better.
Updated•11 years ago
Whiteboard: [Halloween2011Bug] → [Halloween2011Bug][mentor=dbaron][see comment 122]
Updated•11 years ago
Whiteboard: [Halloween2011Bug][mentor=dbaron][see comment 122] → [Halloween2011Bug][mentor=dbaron]
Updated•11 years ago
Whiteboard: [Halloween2011Bug][mentor=dbaron] → [Halloween2011Bug][mentor=dbaron][lang=c++]
Comment 169•11 years ago
@mkaply Yep, "Faithful to the original" is all I need.
Thanks
Comment 170•11 years ago
Removing "qawanted" since the need for an automated test is marked by "in-testsuite?".
Keywords: qawanted
Updated•10 years ago
Mentor: dbaron
Whiteboard: [Halloween2011Bug][mentor=dbaron][lang=c++] → [Halloween2011Bug][lang=c++]
Comment 171•10 years ago
Hello, I'd like to be assigned to this bug; please assign me.
Comment 172•10 years ago
Roughly what steps are you planning to take to fix this?
Flags: needinfo?(wknapek)
Comment 174•10 years ago
I read the whole thread, and since I'm new to the project I will try to implement the solution proposed by dotnetCarpenter. What do you think, David?
Comment 175•10 years ago
Do you mean comment 167? That sounds like a proposal to rewrite the entire "Save As, Complete" feature, of which this bug (fixing up URIs in the CSS) would be just a small part. But it doesn't tell me anything about what you plan to do for the part covered by this bug: fixing up URIs in the CSS.
Comment 176•10 years ago
Comment 177•10 years ago
This 13-year-old bug in basic functionality is surprising - why is this still broken?
I've attached a local test demonstration made from the Bugzilla main page, to help illustrate its status today.
Comment 178•10 years ago
UnMHT is under the GPL licence ( https://addons.mozilla.org/fr/firefox/addon/unmht/license/7.3.0.5 ); is it not possible to use part of its code? For me, UnMHT is a great tool for saving a complete page: HTML / CSS / JS / media.
Comment 179•9 years ago
Hi all, here are my two cents... I hope it helps!
To sum up:
- I tried to use the "save as, complete web page" with https://slate.adobe.com/a/NzR3A/ and as a result background images and other CSS effects were missing.
- I tried to save this web page with https://addons.mozilla.org/en-us/firefox/addon/mozilla-archive-format/ in either MAF or MHT formats, and it failed (only the first background image and text were displayed).
- I tried to save this web page with https://addons.mozilla.org/fr/firefox/addon/unmht/ in MHT format and it worked perfectly!
In conclusion: a million thanks to UnMHT developers! :-)
Updated•8 years ago
Product: Core → Firefox
Version: Trunk → unspecified
Comment hidden (metoo)
Updated•3 years ago
Mentor: dbaron
Updated•2 years ago
Severity: minor → S4
Comment 186•2 years ago
The severity field for this bug is relatively low, S4. However, the bug has 54 duplicates, 122 votes and 148 CCs.
:Gijs, could you consider increasing the bug severity?
For more information, please visit auto_nag documentation.
Flags: needinfo?(gijskruitbosch+bugs)
Comment 187•2 years ago
The last needinfo from me was triggered in error by recent activity on the bug. I'm clearing the needinfo since this is a very old bug and I don't know if it's still relevant.
Flags: needinfo?(gijskruitbosch+bugs)