<a class="header-button" href="https://bugzilla-dev.allizom.org/home" title="Go to home page"> Bugzilla

Comment 1

•

18 years ago

Hmm. :-( If we are going to rejig this interface, can I reopen the idea of changing the name? Publically and politically, we can't call it the "effective TLD" - the registrars will throw a fit and not cooperate with us. And we need their help to maintain the list. My current idea is the "public prefix service" - does that sound OK? Gerv

Comment 2

•

18 years ago

Surely that's "suffix" rather than prefix? I'd suggest "public domain" service perhaps. It's not even a very good interface from native code, since it can take an IDN domain name and return a length based on the ACE encoding which doesn't match the input. I'd like to get two string-returning scriptable methods, one returning the "public domain" (e.g. ".co.uk") and the other the best-guess effective domain (e.g. "amazon.co.uk" returned from "beta.www.amazon.co.uk"). consumers could always calculate the latter from the former, but I suspect it's going to be a common enough desire we might as well build it into the service and debug the code only once.

Boris Zbarsky [:bzbarsky]

Comment 3

•

18 years ago

The current API is as usable from javascript as from anywhere else, you just have to start with a nsIURI and use the hostASCII version to make sure it's converted the same way the effective service will convert it.

Comment 4

•

18 years ago

At the very least that should be documented...

Reporter

Comment 5

•

18 years ago

(In reply to comment #3) Yes, taking the detour through asciiHost will probably work. It doesn't make this a good interface however.

Comment 6

•

18 years ago

Dan: I guess it is a suffix, the way domains are normally written. So "Public Suffix Service". So your methods would be something like: getPublicSuffix() getBaseDomain() "Public Domain Service" has an unfortunate ambiguity because of the other meanings of "Public Domain" :-( Gerv

Comment 7

•

18 years ago

something like that. The length is useful only if we're really concerned about the perf/bloat impact of the string copies (which we may well be in places), and it's a little confusing that it may be a length relative to a string different from the one you passed into it. Then we have to figure out if the returned string is an IDN domain or punycoded, and is it always one or the other or does it follow the IDN whitelist?

Comment 8

•

18 years ago

Blocking at least for correct documentation, if not a better API.

Flags: blocking1.9? → blocking1.9+

Reporter

Comment 9

•

18 years ago

There is bug 368700 and bug 368702 depending on how this bug is dealt with - I didn't want to check in a consumer for nsIEffectiveTLDService if the interface is going to be refactored. But since I don't see any movement here - how should I proceed? I could check in anyway or change the patches to use the one-dot rule instead of nsIEffectiveTLDService as it is currently common in the source base.

Comment 10

•

18 years ago

Wladimir, can you take this bug?

Assignee: nobody → trev.moz

Reporter

Comment 11

•

18 years ago

So I guess what should be done here is adding getEffectiveTLD and getEffectiveDomain methods, as well as marking getEffectiveTLDLength as [noscript] because of its ambiguousness when called from JavaScript (I think it should be kept for performance reasons however). Unfortunately, I doubt that I will have time for this.

Reporter

Comment 12

•

18 years ago

Attached patch Proposed patch (obsolete) (deleted) — Details — Splinter Review

This does what I described in the previous comment. It also changes the testcase from bug 368702 to use getEffectiveTLD and getEffectiveDomain since getEffectiveTLDLength is no longer available.

Attachment #264676 - Flags: superreview?(dveditz)

Attachment #264676 - Flags: review?(darin.moz)

Reporter

Updated

•

18 years ago

Status: NEW → ASSIGNED

Steffen Wilberg

Comment 13

•

18 years ago

A similar patch is in bug 367446.

Comment 14

•

18 years ago

Wladimir: can you and Dave (bug 367446) cooperate to work out what we need to do? Gerv

Updated

•

18 years ago

Whiteboard: [has patch] needs r?darin,dveditz

Target Milestone: --- → mozilla1.9alpha5

Updated

•

18 years ago

Attachment #264676 - Flags: review?(darin.moz) → review?(cbiesinger)

Reporter

Comment 15

•

18 years ago

Dave agreed with me that development should continue in this bug. I will add truncation of output parameters that I forgot in my patch, otherwise the two patches are pretty much identical with exception of comments and variable naming. Bug 367446 comment 10 needs to be addressed as well.

Comment 17

•

18 years ago

Comment on attachment 264676 [details] [diff] [review] Proposed patch Clearing review requests awaiting an updated patch. 1) need to address bug 367446 comment 10 (getBaseDomain() needs a numeric "depth" argument. Current code assumes "1"). 2) renaming is now or never. I don't feel strongly about the service name--especially given the PITA of moving files--but the new methods need better names. I'm happy enough with Gerv's names from comment 6 3) It would make more sense for these methods to take a nsIURI instead of a string domain name. I suspect the vast majority of the callers will already have a nsIURI and have to call getHost, and then we have to worry that each callsite will correctly drill into the innermost URI in cases like jar: uri's (offline apps, etc) before reading the host. Centralizing would be safer. Plus the URIs should already be normalized and we could skip that step. If we think enough places might just have a string host then maybe we need two sets of interfaces, one normalizing a string, one taking the innermost host out of a URI, and both feeding into shared guts. 4) should we get rid of the GetEffectiveTLDLength method? I think it's just asking for trouble having people use it. 5) we might not actually need both getPublicSuffix() and getBaseDomain() -- if we add the "depth" argument they come out to the same thing. getBaseDomain(nsIURI, 0) is public, getBaseDomain(nsIURI, 1) is the basic private one. On second thought getPublicSuffix(nsIURI) would add readability to code that uses it, doesn't hurt to keep it.

Attachment #264676 - Attachment is obsolete: true

Attachment #264676 - Flags: superreview?(dveditz)

Attachment #264676 - Flags: review?(cbiesinger)

Dão Gottwald [:dao]

Comment 18

•

18 years ago

getEffectiveTLDLength should remain as a scriptable method. Otherwise this change will make it even harder to get the effective sub-domain (i.e. host minus base domain.) (In reply to comment #5) > (In reply to comment #3) > Yes, taking the detour through asciiHost will probably work. It doesn't make > this a good interface however. It does. See bug 367446 comment 7.

Comment 19

•

18 years ago

I would argue for a single method, as Dan outlines in comment #5, taking an nsIURI and an integer. It's rather hard to figure out what best to call it, though. I'd be happy with getPublicSuffix(), but we might consider e.g. getPublicSuffixPlus(1, nsIURI), which returns the public suffix + 1. If people want the effective TLD length, they can call length() on the string returned by getPublicSuffixPlus(0, nsIURI). The function which returns a number for the length is broken. Gerv

Reporter

Comment 20

•

18 years ago

Daniel Veditz: 3) I am not so sure about making parameter an nsIURI. The one existing consumer is document.domain setter - and it doesn't have an nsIURI, neither do I see a meaningful way to construct one. Same goes for cookie handlers once they start using this service. Adblock Plus will probably have the same problem when using this interface. This means having two methods with the only difference being the type of the input parameter. Is this really a good solution? 4) It should be fine if we have getBaseDomainLength marked as [noscript]. There is little point using this method from JavaScript anyway, but it is safe to use from C++, and it can improve performance there (saves you string copy/compare, see document.domain setter). 5) I don't think that "getPublicSuffixPlus" actually makes it more clear what the following parameter means. I am leaning towards getBaseDomain as the only method, this is the most consistent solution.

Reporter

Comment 21

•

18 years ago

Attached patch Patch v2 (obsolete) (deleted) — Details — Splinter Review

I decided not to change hostname parameter type here, I rather prefer not to make this patch more complicated than necessary. Please file a follow-up bug if you want this fixed. Other than that - methods have been renamed and an integer parameter for the number of additional domain parts has been added.

Attachment #266257 - Flags: superreview?(dveditz)

Attachment #266257 - Flags: review?(cbiesinger)