WIP: revamping code generation #1024

evancz · 2015-08-24T23:10:08Z

There are a number of issues that it makes sense to tackle all at once:

The potential for name collision as described in Module name collision in generated code #826
Decent amount of code spent in the intro and outro of modules.
We add all modules to Elm such that they are public to the world.
Make it work with Closure Compiler better
We want to have nice support for node
The bad error messages when folks say something like Elm.fullscreen(Erm.Main) in JS
We rebuild a decent amount of stuff for each "instance" of Elm as described in Memoize module make functions in the embedded scenario #888

With @JoeyEremondi working on dead code elimination, it seems like a reasonable time to start addressing these things in one pass. The point of this issue is to describe the plan for this in a coherent way.

Format for Generated Elm Code

When elm-make spits out a hunk of JS, the general format I have in mind is like this:

var Elm = function() {
    var elm-lang$core$Basics$fst = ...
    var elm-lang$core$Basics$snd = ...
    var elm-lang$core$Basics$degrees = ...
    var evancz$elm-html$Html$div = ...
    var evancz$elm-html$Html$text = ...
    ...

    return {
        fullscreen: ...
        embed: ...
        ...
    };
}();

All top level declarations in the whole project are turned into fully qualified names. There are no more module boundaries in generated code. So when I want to refer to Html.div I refer to evancz$elm-html$Html$div in the generated JS. And when we do dead code elimination, it will just mean leaving out some subset of these definitions.

This will address points 1, 2, and 3 entirely (name collision, intro/outro, exposing too much). I suspect it will also go a very long way with point 4 about Closure Compiler. One of the big problems we had there was that if you expose an object and use one field from it, you gotta keep all the fields. This is no longer an issue with module exports, so I am hoping things will work a lot nicer. We could at least pass things through there to do variable renaming and get nicer names.

Initializing an Elm instance

Right now initializing Elm code has certain issues (points 5 and 6) that it'd be nice to improve. We can conditionally generate the headers needed to get things working with node, that's not too crazy. The big change here would be in how we initialize Elm widgets.

// generated JS from the Elm code

var app = Elm.fullscreen('Main');

Notice that we give a string version of the module name we want. This means if there is some misspelling, we can give an error like "Cannot find module Main, did you mean one of these? ..." and then point to some other resources on what might be going wrong.

We also only have a very small API coming off of the Elm object now.

Reducing overhead of many "instances"

In the vast majority of cases, it is safe to share values across every instance that is out there. This is true of all non-signal code. When it comes to Mouse.position we need to be more careful because it matters where you are embedded.

Right now I am purposely leaving the "native format" unspecified, but it would make sense to me to have certain kinds of native code. The categories might just need to be pure/impure and we need to duplicate the impure parts for each instance but nothing else. If things depend on impure things, they also need to get duplicated. It should not be too hard to have an impure thing infect its dependencies.

Plan

To get this chunk of stuff done, it makes sense to break it into chunks in this order:

Add package information to the AST.Module.Interface record. We need to know the user and package name for every module to make canonicalization truly canonical and solve Module name collision in generated code #826. This should be a PR of its own.
Make some tweaks to the canonicalization code to make uses of top-level values point to the fully qualified version, so Canonical Local "x" becomes Canonical (Module "Here") "x" and we generate the correct code.
Modify elm-compiler and elm-make to spit out code in the new format described above, with no module boundaries. The existing core and native stuff will not work with this. That is fine, we will rebuild them.
See how this interacts with Closure Compiler. Are there any bad problems?
Plan out how native code needs to fit into this.
Get core using the new format
Start doing dead code elimination on the whole thing

The text was updated successfully, but these errors were encountered:

mgold · 2015-08-24T23:57:28Z

Nitpick: You'll need to turn dashes into underscores, e.g. elm-lang into elm_lang.

Question: How does this handle unexported values in modules? Are they hidden somehow? Or is the hiding at name-resolution-time?

vilterp · 2015-08-25T15:40:23Z

What happens if there are multiple modules running in a page and they are using different versions of some library? Not sure how this interacts with the discussion of multiple instances, or if version numbers need to be added to the fully-qualified identifiers of values.

JoeyEremondi · 2015-08-25T16:17:33Z

I think that would be an error right now, since elm-make needs a consistent
set of dependencies? But it's definitely good to keep in mind.

On Tue, Aug 25, 2015 at 8:40 AM, Pete Vilter [email protected]
wrote:

What happens if there are multiple modules running in a page and they are
using different versions of some library? Not sure how this interacts with
the discussion of multiple instances, or if version numbers need to be
added to the fully-qualified identifiers of values.

—
Reply to this email directly or view it on GitHub
#1024 (comment)
.

laszlopandy · 2015-08-25T16:18:43Z

@vilterp That is not a problem because there is a closure around the whole app. This only removes the closure between modules, not between Elm and the outside world.

evancz · 2015-08-25T16:23:20Z

@vilterp, @laszlopandy is correct that it's all in a closure, so they won't interfere with anything else in the page. @JoeyEremondi is also correct that it'd be impossible to build a single closure that had multiple versions of the same thing.

@mgold, since everything will have a canonical name, it actually does not matter if "unexported values" are available. We know from the checks in the compiler that the module boundaries are respected. Those boundaries do not need to be replicated in the generated code because no one can write a new function that refers to a "hidden" value.

evancz · 2015-08-25T16:44:14Z

I opened #1025 about what to do with hyphens. I am not sure how to do that yet.

kmarekspartz · 2015-08-26T14:12:36Z

It looks like this does nothing about #873 (and may make it worse!). If codegen is changing this much, it may be possible to fix that bug at the same time.

rtfeldman · 2015-08-26T16:33:20Z

I opened #1029 about JS module compatibility.

JoeyEremondi · 2015-08-26T16:35:26Z

It doesn't solve #873, but I don't think it will make it worse. I've been
working on reference analysis for DeadCodeElimination, so extending that to
check for self-reference not across lambdas should be not to hard.

On Wed, Aug 26, 2015 at 9:33 AM, Richard Feldman [email protected]
wrote:

I opened #1029 #1029
about JS module compatibility.

—
Reply to this email directly or view it on GitHub
#1024 (comment)
.

rtfeldman · 2015-08-26T16:38:34Z

I believe @laszlopandy's suggestion of "specify main in elm-package.json" would be a better fix for "The bad error messages when folks say something like Elm.fullscreen(Erm.Main) in JS" because it eliminates that class of errors.

In that world you'd just call Elm.fullscreen() and it would work, because the notion of what main to run would have been already incorporated into the compiled output, and verified at compile time.

laszlopandy · 2015-08-26T18:25:37Z

Richard, not if you have multiple mains. Which apparently circuit hub needs.
That's why I promised elm-package.json has a list of main modules.

On Wednesday, August 26, 2015, Richard Feldman [email protected]
wrote:

I believe @laszlopandy https://github.com/laszlopandy's suggestion of
"specify main in elm-package.json" would be a better fix for "The bad
error messages when folks say something like Elm.fullscreen(Erm.Main) in
JS" because it eliminates that class of errors.

In that world you'd just call Elm.fullscreen() and it would work, because
the notion of what main to run would have been already incorporated into
the compiled output, and verified at compile time.

—
Reply to this email directly or view it on GitHub
#1024 (comment)
.

rtfeldman · 2015-08-26T19:01:12Z

I'm curious if multiple mains are a strict "must have" there, or just the easiest way to implement given current tools?

evancz · 2015-08-26T19:06:35Z

@rehno-lindeque, I know you were building multiple mains in one call to elm-make. Folks are discussing some changes to elm-package.json that might make this easier to specify. Everyone, please continue this discussion on elm-lang/elm-make#49.

@rtfeldman and others, please try to self police the curation stuff described here when possible.

rtfeldman · 2015-08-26T20:33:43Z

Argh, sorry about that. Should have made a separate issue.

evancz · 2015-08-26T21:54:33Z

No worries, it's not an immediately intuitive way to do things (took me 3 years to think of it) so I expect I'll be adding reminders like that for a while ;)

evancz · 2016-05-12T23:22:27Z

Done enough with 0.17. Makes sense to manage DCE as a separate thing as it is more complicated than "just doing it" as discussed with @justinmanley yesterday!

evancz mentioned this issue Aug 25, 2015

finding unambiguous names for generated code #1025

Closed

kmarekspartz mentioned this issue Aug 27, 2015

Unexpected behavior in recursive data structure declarations #999

Closed

evancz closed this as completed May 12, 2016

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

WIP: revamping code generation #1024

WIP: revamping code generation #1024

evancz commented Aug 24, 2015 •

edited

Loading

mgold commented Aug 24, 2015

vilterp commented Aug 25, 2015

JoeyEremondi commented Aug 25, 2015

laszlopandy commented Aug 25, 2015

evancz commented Aug 25, 2015

evancz commented Aug 25, 2015

kmarekspartz commented Aug 26, 2015

rtfeldman commented Aug 26, 2015

JoeyEremondi commented Aug 26, 2015

rtfeldman commented Aug 26, 2015

laszlopandy commented Aug 26, 2015

rtfeldman commented Aug 26, 2015

evancz commented Aug 26, 2015

rtfeldman commented Aug 26, 2015

evancz commented Aug 26, 2015

evancz commented May 12, 2016

WIP: revamping code generation #1024

WIP: revamping code generation #1024

Comments

evancz commented Aug 24, 2015 • edited Loading

Format for Generated Elm Code

Initializing an Elm instance

Reducing overhead of many "instances"

Plan

mgold commented Aug 24, 2015

vilterp commented Aug 25, 2015

JoeyEremondi commented Aug 25, 2015

laszlopandy commented Aug 25, 2015

evancz commented Aug 25, 2015

evancz commented Aug 25, 2015

kmarekspartz commented Aug 26, 2015

rtfeldman commented Aug 26, 2015

JoeyEremondi commented Aug 26, 2015

rtfeldman commented Aug 26, 2015

laszlopandy commented Aug 26, 2015

rtfeldman commented Aug 26, 2015

evancz commented Aug 26, 2015

rtfeldman commented Aug 26, 2015

evancz commented Aug 26, 2015

evancz commented May 12, 2016

evancz commented Aug 24, 2015 •

edited

Loading