-
Notifications
You must be signed in to change notification settings - Fork 517
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Change Function.prototype.toString() to be ES6 compatible #1139
Conversation
Here's the current behavior in this branch, and a comparison to V8. For a native function:
So output for non-bound native functions is now the same. For Ecmascript functions:
Here there's a natural difference as V8 provides the source. Duktape output won't eval() parse anymore, as required by ES6. Bound functions behave differently:
The main difference is that V8 looks up the final non-bound function and shows its type (native or Ecmascript). For some reason V8 also drops the function name, showing neither "cos" (the target function's .name) nor "bound cos" (the bound function's .name). The current Duktape output re: function names is "broken" in the sense that invalid function names (like a space in the bound function case) are allowed. But that's actually OK because it just causes a SyntaxError earlier than the function body. I think I'll change the bound function behavior so that the native/ecmascript type would be shown and the bound status of the function would be lost. However, with that behavior it'd make most sense to show the target function's (non-bound function, that is) .name in the result. In other words, |
V8 behavior for bound Ecmascript function is quite interesting and probably leaks a bit of the internals:
|
1fb307b
to
842264d
Compare
There are some suggestions here (linked by @fatcerberus earlier): http://www.2ality.com/2016/08/function-prototype-tostring.html. Following that would mean that a function's name would be omitted unless it was an intrinsic, so that:
There's no internal distinction between "intrinsic" names and other names now, so this approach isn't possible right now. |
The way I read that proposal it seemed like it was just bound functions and native functions for which the name would be omitted. |
Sure - but only because the proposal calls for actual source code to be returned for Ecmascript functions? If that's not implemented, then something else must happen. |
So if you take these two:
Since that's not possible, at least right now:
So I would read this to mean Ecmascript functions would also use The proposal of course has no bearing re: compliance, and I'd like to see directly from the function .toString() whether it's a native or a script function, even if source code wasn't available. I'm not sure about bound functions though. If I had to choose, the current Duktape behavior is worse compared to looking up the target function and showing its type (hiding the bound status of the initial argument). Ideally both would be visible, but that's extra code and maybe not worth it. A hybrid option would be to show the initial function's name but look up the native/Ecmascript type from the non-bound target. That way e.g.
which simultaneously indicates it's a bound function (via the required ES6 name), but also shows that the non-bound target is a native function. The downside is that the name looks funny (with the space) but that has no practical compliance impact because the whole thing is only required to eval() to SyntaxError. It doesn't matter that the SyntaxError comes from the function name rather than the function body. |
Specifically what I don't like about the V8 behavior is this:
The name is hidden, and "native code" sounds wrong from an Ecmascript target function. I would Duktape to show at the very least:
or more preferably include the name of the target:
or maybe the name of the bound function (hybrid approach described above):
|
The intent of the proposal, at least the way I read it, is that Ecmascript functions MUST carry source code. Hence the bit that "in ES2016, whether to do so was left up to engines". Like you said though, the proposal has no bearing on compliance, and I agree that it's weird to call an Ecmascript function "native code". Anyway for ES6/7 compliance, the |
Yes, I understood that :) But if you don't do that, the proposal still tries to say that "In contrast, the proposal standardizes a placeholder: a function whose body is { [native code] }.". This is used for bound functions, even if they are Ecmascript functions. This is actually exactly what V8 does. So, all I was saying is that this same approach could be used for bound functions in Duktape (or perhaps even Ecmascript functions). But that doesn't make much sense to me. |
Re: that proposal document, note that it discusses two separate things. The first is a concrete proposal on what to return in each case. I'm not talking about that. The second is about how the ES6 requirement of returning something that's a SyntaxError is actually rather ill-defined because there's no way to know what syntax is added down the line. So they propose a standard placeholder. And that placeholder is used whenever you don't produce an actual source. This placeholder still has relevance even if you don't follow the other parts of the proposal because of the forward-compatibility issue. But in practical terms |
842264d
to
a860ba9
Compare
Ah, I see what you mean now, sorry :) Regarding V8's behavior for bound functions, I assume it doesn't return source code because that wouldn't meet the requirement of parsing to "a functionally equivalent object", so it just falls back on returning a SyntaxError-inducing stub. I also remember reading somewhere in the specification that you're not supposed to reveal the target of a bound function, but I don't recall where I saw that now. |
I don't recall offhand either - that might make sense for the actual function reference. But since the bound function .name already reveals the target's name it'd be quite odd if the target type (native or not) would be off limits. |
String output must either (a) eval() back as an equivalent object, OR (b) eval() to a SyntaxError. Previous Duktape output, conformant to ES5/ES5.1, matched FunctionDeclaration syntax but didn't parse back as an equivalent function.
a860ba9
to
4e3ee29
Compare
4e3ee29
to
ac2b79d
Compare
I'll merge this with the behavior above, specifically for bound functions:
That will fulfill the ES6 requirement and is analogous to what exists in master currently. This can be improved in follow-ups if there's a good, clear idea for e.g. bound functions. |
Emscripten compatibility isn't affected by the change. |
ac2b79d
to
04f62d4
Compare
In regards to real-world usage of |
There are indeed real world code bases relying on function .toString() - I just find that quite awkward and prone to break when new language features are introduced. But that's a topic for another pull: eventually Duktape should provide function source code too, but doing so is more than just tacking the compiler input as a property because of the variety of different syntaxes where functions may come from (object literal getters/setters, Function constructor, etc). So to make the source actually work, some processing is needed. That's the main reason I haven't yet implemented it. Also for low memory targets it'd be nice if the source code could be compressed using some simple text-aware algorithm. Maybe bit packed like the built-in strings. It could then be expanded on request. |
Interestingly, as mentioned in the comments of that article, SpiderMonkey used to reconstruct functions by decompiling then bytecode, but around FF 17 that was changed to just store the original source verbatim. I think if anything were done in Duktape towards supporting this, that latter method would make most sense. On the topic of this pull: It is apparently common to stringify bound functions as though they were native (the article mentions that too). I prefer |
I never considered anything else than using the original source - doing otherwise would be a huge footprint impact because bytecode-to-source translation would be very expensive. But the thing is, you can't just use the source because of the syntactic variants, so some processing is needed. It's not a huge undertaking though, just manufacturing a function expression around a plain body (which is already done to some extent by the Function constructor). |
Opened #1147 for discussing the source code issue; it's out of scope for this pull. |
ES5.1 requirement is that Function.prototype.toString() result must be parseable as a FunctionDeclaration with no further requirements (it doesn't have to compile into an equivalent function, for instance).
ES6 changes this so that the source code must either parse to an equivalent function, or must else cause a SyntaxError in parsing. See comments starting from: #1135 (comment).