Shortcut (type inferrence) for naming enum values #683

skyfex · 2018-01-11T13:02:26Z

Problem
In Zig we currently have to type out the full "path" to an enum value. I.e., for the following enum:

const Type = enum {
    Ok,
    NotOk,
};

We have to provide the namespace (if relevant), the enum and the value name: someNamespace.Type.Ok or Type.Ok

This can get very tedious. This is en example from my testing:

nrfZig.PinCnf {  .dir = nrfZig.PinCnfDir.Output,
                 .input = nrfZig.PinCnfInput.Disconnect,
                 .pull = nrfZig.PinCnfPull.Disabled,
                 .drive = nrfZig.PinCnfDrive.H0H1,
                 .sense = nrfZig.PinCnfSense.Disabled,
            };

The code can feel overly verbose and repetitive. It also discourages use of enums. People might use integers or booleans instead of a descriptive enum.

Proposal
Whenever Zig can infer the enum type from the context of the code, it should. Instead of writing Type.OK , you can just type OK or .OK (one or the other, which one is up for debate)

Examples

When declaring a const or variable, you still need the full name:
var foobar = myModule.Type.Ok
When assigning to an already declared varible, you can use the short form:
foobar = .Ok
When assigning to a field in a struct, or instantiting a struct , you can use the short form:

object.foobar = .Ok
object = ObjectType { .foobar = .Ok }

When calling functions, you can use the short form:

fn baz(t: Type) { ... }
baz(.Ok)

Is switch statements you can use the short form:

switch(foobar) {
  .Ok  => ...,
  .NotOk => ...
}

When returning from a function:

fn baz(t) -> Type { return .Ok }

It should also be possible to use with these proposals: #661 and #649

Discussion

Pros:

Encourages use of enum over boolean and magic integers
Makes it more feasible to use enums instead of special syntax/keywords (see use a builtin enum for calling conventions instead of keywords #661 and Add endianness as one of the pointer properties #649)
Not having to repeat yourself when writing code

Cons:

Can sometimes be more vague when reading code (example: baz(Active, Enabled, On) is not much more helpful than baz(true,true,true))
More than one way to do something

A related idea would be to infer namespace names for other things too (like function calls). This should probably be a separate proposal.

Edit: Changed the examples from Ok to .Ok syntax

The text was updated successfully, but these errors were encountered:

skyfex · 2018-01-11T13:15:55Z

I'll add my own opinions to the cons I could think of:

Can sometimes be more vague when reading code (baz(Active, Enabled, On) is not much more helpful than baz(true,true,true))

This is just part of the general tradeoff with function calls in most programming languages. Functions are our vocabulary, and you're expected to remember or intuit roughly what they do and what parameters they take. If we wanted it be clear when reading code what the parameters are, Zig should have used a Smalltalk/Objective-C style for functions. See #479 for a related discussion.

More than one way to do something

This is the case with variable/const declarations as well: var x: Type = value or just var x = value. I feel like this proposal is the exact same thing. Which way you should go is a stylistic choice. The programmer should make a judgment about when he wants to clarify for the reader what the types are.

Just as var frame: Frame = getFrame() is very reduntant, so is Gpio { .direction = GpioDirection.Input }

For something like var x = gargleBlast(), the programmer should probably add the type name. The same might go for something like baz(Enabled)

You could say something similar about baz(3) vs baz(u8(3)).

Hejsil · 2018-01-11T13:44:01Z

I think the only real use case is for long sequences of enum usage, such as switches, where writing the same Enum.* does not increase readability.

I think this problem also applies to structs with const members and functions, like the common init pattern. Should we infer these too?

fn takeList(a: ArrayList(u32)) { }

// Currently
takeList(ArrayList(u32).init(allocator));
// If we infer Enums from parameter types, should we then also infer const members of structs. Enums are just structs with const members (Kinda)?
takeList(init(allocator));

When I put it this way, the features sounds super scary, and I personally vote against. Besides, we already have facilities to mostly eliminate all these long names. Use a local alias:

const ArrU32 = ArrayList(u32)
var a = ArrU32.init(allocator);
var b = ArrU32.init(allocator);

const E = SomeLongEnumName;
switch (e) {
    E.I, E.II, E.III => {},
    E.IV, E.V => {},
}

I claim, that this keeps all the readability of long names as long as the alias is close to the usage.

However, if we really wanna eliminate these names, I propose extending the use keyword to be able to export any "namespace" (struct/union const members, enums, namespaces).

use ArrayList(u32)
var a = init(allocator);
var b = init(allocator);

use SomeLongEnumName;
switch (e) {
    I, II, III => {},
    IV, V => {},
}

Right now, we can only use use in global scope and on namespaces. I like this less than using the local alias.

skyfex · 2018-01-11T16:25:41Z

I tried looking into what other languages are doing. Both C++ and Nim seems to have some idea of "scoped" (marked with .pure. in Nim, "enum class" in C++) and "unscoped" enums. This seems like a bad compromise to me. How do you decide if an enum should be scoped or not?

C# and D seems to have only scoped enums, but infering enum type has been a requested feature in C# for a while.

I had a thought: these languages can have multiple functions with the same name, where the actual function is inferred from the type of the parameters. This can make inferring enum type complicated.

But Zig seems to go the way of C: one name for one function (in a given namespace). I think this is the right way for a language that aims to be as explicit, simple and close-to-hardware as Zig. But I think it would be wise to leverage this to make the language "nicer" in other areas, such as enum type inferrence, as long as it doesn't lead to bugs or too much confusion.

Hejsil: Can you elaborate what you mean by "scary"? To me, scary means that it can lead to bugs. I don't see how this is possible though. I can't see that you could infer the wrong type.

To me, the scariest thing is that programmers and library writers don't use enums. That is what will lead to bugs. Having a small fraction of code readers be confused for a few seconds while they look up a function or struct definition is not scary to me, just a bit annoying.

The question is how you divide the "annoying to uninformed reader"/"annying to informed reader and writer" ratio. I think the fact that Zig has type inferrence in declarations var x = foobar() sets the bar for Zig. The question is which side of the ratio this proposal falls on.

I don't agree that aliases or "use" helps much. Look at my example from the problem description for instance. I think aliases makes the code harder to read. Instead of knowing that it's doing inferrence, and knowing that you have to look at the function/struct definition for the answer, you now have to look for some random line in the code.

Good point about inferring struct types. I wouldn't say that inferring enum types implies inferring struct types, but they are definitely related. I would say there exists arguments for referring struct types if possible, but they're not nearly as strong. It'd be interesting to create a proposal though, just to see what the implications would be.

andrewrk · 2018-01-11T17:12:56Z

Here's an argument for limiting the scope of this proposal to enums with . syntax, like this:

const Foo = enum {A, B, C};
switch (foo) {
    .A => 1,
    .B => 1,
    .C => 1,
}

consider this use case:

const Endian = enum {Little, Big};
const NativeEndian = if (builtin.is_big_endian) Endian.Big else Endian.Little;
const ForeignEndian = if (builtin.is_big_endian) Endian.Little else Endian.Big;
fn foo(e: Endian) {
    switch (e) {
        NativeEndian => {
            // do something
        },
        ForeignEndian => {
            // do something
        },
    }
}

this use case works today. now consider if we were also wanting to not have the . but still look for enum values.

const Endian = enum {Little, Big};
fn foo(e: Endian) {
    const Little = Endian.Big;
    const Big = Endian.Little;
    switch (e) {
        Little => {
            // do something
        },
        Big => {
            // do something
        },
    }
}

Obviously you would not write this code, but the fact that you can is problematic. Let's dodge this complexity by designing it out of the language. If we rely on . it unambiguously says that we will be using the context of an enum value. It would only work in a context where an enum type is expected.

PavelVozenilek · 2018-01-11T17:24:18Z

About the . syntax: I suggested something similar long long ago, in #120, to avoid writing structure name again and again.

thejoshwolfe · 2018-01-11T17:29:02Z

I tried looking into what other languages are doing.

I'll also add that Java 5 allows (actually requires) you to omit the qualifiers on enum values in case statements, but everywhere else enum values must be referred to by qualifying them as usual.

Another con

(colliding with @andrewrk's comment above; we were typing at the same time.)

Introduces some weird namespace and shadowing cases:

const Endian = enum { Big, Little, };
const Big = Endian.Little;  // doesn't look like it should be an error.
const foo: Endian = Big;    // ambiguous.
const Little = "unrelated"; // definitely shouldn't be an error.
const bar: Endian = Little; // ambgiuous.

You'd expect that the enum member namespace should shadow the other declarations (meaning that foo would be Endian.Big and bar would be Endian.Little), but Zig is trying to avoid shadowing. Declaring a name that shadows another name is a compile error. This is because shadowing is always avoidable and too confusing (however see #678 which might change this.). So If we want to make something in the above example an error, what should the error be? This kind of question can certainly be answered, but it makes me uneasy.

See #678 for a possible solution to this problem. Consider adding this to the list of allowed cases for shadowing:

Enum value shorthand names shadowing any other name.

Then you would clear up the ambiguity like this:

const foo: Endian = Endian.Big;
const bar: Endian = Endian.Little;

If you really wanted to refer to the aliases Big and Little from the example, you'd need a way to qualify your reference to them, or else you simply can't refer to them.

On the pro side

Your list of examples is in harmony with #287, which is a major proposal that will change lots of subtle semantics in zig. You can rephrase this proposal in the language of #287 like this: "if an expression's result location has a type that is an enum, and the expression is a single identifier, then the enum's value namespace is pushed onto the namespace search stack." This is actually pretty elegant, understandable, and covers all your examples, and more.

The interaction with #661 and #649 is very compelling.

skyfex · 2018-01-11T18:04:06Z

Personally I don't have a big preference on .Ok or Ok. I was slightly biased towards not doing having . which is why I didn't use it in the examples. But @andrewrk raised some good points. Now I'm more in favor of .Ok

I changed the examples to get a better feel for what that would look like.

skyfex · 2018-01-11T18:32:24Z

Btw, this is what the example from the problem statement would look like.

nrf.gpio.pin_cnf[7] = 
   nrf.PinCnf {  .dir = .Output,
                 .input = .Disconnect,
                 .pull = .Disabled,
                 .drive = .High0High1,
                 .sense = .Disabled };

Look at how much more that reads exactly like you'd want it too. (I find the second "nrf." reduntant, but that's nitpicking)

To elaborate on why this is important: I think microcontroller firmware code is one of the most attractive use-cases for Zig. In these applications a lot of your code will be accessing memory mapped registers. This is generally a pain in the ass in C. Usually the microcontroller vendor will provide a thin C library to access these, but the documentation for these are of varying quality. Usually the datasheet documenting the registers is the best documentations, and you'll end up almost reverse engineering the C library to figure out how to generate the register writes you want.

If Zig could make doing direct register writes about as easy as calling functions, this would easily attract a lot of people interested in writing firmware code.

If it's as hard or harder than in C, you'll only attract those who have are interested in safety over anything else, which I'm sad to say isn't as many as there should be.

Hejsil · 2018-01-11T19:41:33Z

@skyfex Ye, scary is the wrong word to use, and now that I read more on how this discussion pans out, I'm starting to like the proposal to. My mindset was mostly that, if you can have Ok's enum type be inferred, it's hard for the reader to know if Ok is a variable or constant defined in this scope (or parent scope) or and inferred enum. .Ok fixes all of this, because .Ok states that it is inferred from the context, so no confusion.

And yea, let's have the "Infer everything" in some other issue.

andrewrk · 2018-01-11T19:45:56Z

@skyfex would you mind making an issue for the direct writes to register thing? I don't want to lose track of it.

see #683

See #683

andrewrk · 2019-03-24T05:01:00Z

In the above 2 commits I introduced the new type, updated zig fmt, and implemented implicit casting to enum types. Here's what's left before this issue can be closed:

grammar update langref
grammar update spec
update documentation to demonstrate the enum literal type
peer type resolution of enum and enum literal
test compile error "enum '%s' has no field named '%s'". add error note "enum declared here".
make switch statements allow enum literal types

See #683

Hejsil · 2019-05-11T18:27:15Z

Grammar updated in spec
Grammar updated in language ref

andrewrk added this to the 0.3.0 milestone Jan 11, 2018

andrewrk added the proposal This issue suggests modifications. If it also has the "accepted" label then it is planned. label Jan 11, 2018

thejoshwolfe mentioned this issue Jan 11, 2018

allow declaration shadowing but disallow ambiguous identifier references #678

Closed

skyfex changed the title ~~Shortcut for naming enum values~~ Shortcut (type inferrence) for naming enum values Jan 12, 2018

skyfex mentioned this issue Jan 12, 2018

anonymous struct literals #685

Closed

andrewrk modified the milestones: 0.3.0, 0.4.0 Feb 28, 2018

raulgrell mentioned this issue Mar 30, 2018

syntax flaw: return type #760

Closed

andrewrk mentioned this issue May 4, 2018

Proposal: Optional argument names in function calls #982

Closed

andrewrk added the accepted This proposal is planned. label May 7, 2018

This was referenced Jun 7, 2018

remove the concept of explicit casting as it currently exists #1061

Closed

when multiple union fields share the same type, allow them to share a body in a switch prong #1107

Closed

andrewrk mentioned this issue Aug 1, 2018

builtin function @reify to create a type from a TypeInfo instance #383

Closed

9 tasks

andrewrk mentioned this issue Sep 13, 2018

add documentation for atomics #1516

Open

Hejsil mentioned this issue Oct 16, 2018

remove var args and add anon list initialization syntax #208

Closed

andrewrk added a commit that referenced this issue Mar 24, 2019

introduce the enum literal type

d0551db

see #683

andrewrk added a commit that referenced this issue Mar 24, 2019

implement implicit cast from enum literal to enum

a736dfe

See #683

andrewrk added a commit that referenced this issue Mar 24, 2019

make switch expressions allow enum literal types

aff7b38

See #683

andrewrk mentioned this issue Mar 24, 2019

style guide: make enum and union fields snake_case the same as struct fields #2101

Open

andrewrk added a commit that referenced this issue Mar 24, 2019

implement peer type resolution for enum literals

da9d8a6

See #683

andrewrk added a commit that referenced this issue Mar 24, 2019

add compile error test for invalid enum literal implicit cast

3306e43

See #683

andrewrk added contributor friendly This issue is limited in scope and/or knowledge of Zig internals. docs labels Mar 24, 2019

andrewrk modified the milestones: 0.4.0, 0.5.0 Mar 24, 2019

andrewrk mentioned this issue Apr 10, 2019

solve the grammar ambiguity with enum literals inside array literals and struct literals #2235

Closed

andrewrk closed this as completed in c2cf040 Jul 4, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Shortcut (type inferrence) for naming enum values #683

Shortcut (type inferrence) for naming enum values #683

skyfex commented Jan 11, 2018 •

edited by andrewrk

Loading

skyfex commented Jan 11, 2018 •

edited

Loading

Hejsil commented Jan 11, 2018

skyfex commented Jan 11, 2018 •

edited

Loading

andrewrk commented Jan 11, 2018

PavelVozenilek commented Jan 11, 2018

thejoshwolfe commented Jan 11, 2018

skyfex commented Jan 11, 2018 •

edited

Loading

skyfex commented Jan 11, 2018 •

edited

Loading

Hejsil commented Jan 11, 2018

andrewrk commented Jan 11, 2018

andrewrk commented Mar 24, 2019 •

edited

Loading

Hejsil commented May 11, 2019

Shortcut (type inferrence) for naming enum values #683

Shortcut (type inferrence) for naming enum values #683

Comments

skyfex commented Jan 11, 2018 • edited by andrewrk Loading

skyfex commented Jan 11, 2018 • edited Loading

Hejsil commented Jan 11, 2018

skyfex commented Jan 11, 2018 • edited Loading

andrewrk commented Jan 11, 2018

PavelVozenilek commented Jan 11, 2018

thejoshwolfe commented Jan 11, 2018

Another con

On the pro side

skyfex commented Jan 11, 2018 • edited Loading

skyfex commented Jan 11, 2018 • edited Loading

Hejsil commented Jan 11, 2018

andrewrk commented Jan 11, 2018

andrewrk commented Mar 24, 2019 • edited Loading

Hejsil commented May 11, 2019

skyfex commented Jan 11, 2018 •

edited by andrewrk

Loading

skyfex commented Jan 11, 2018 •

edited

Loading

skyfex commented Jan 11, 2018 •

edited

Loading

skyfex commented Jan 11, 2018 •

edited

Loading

skyfex commented Jan 11, 2018 •

edited

Loading

andrewrk commented Mar 24, 2019 •

edited

Loading