lvalue: Inference rules for field expressions

Question

lvalue: Inference rules for field expressions

madmann91 opened this issue 8 years ago · 16 comments

In the lvalue branch, the type inference rules are broken for field expressions:

const Type* FieldExpr::check(InferSema& sema) const {
    auto ltype = sema.check(lhs());
    if (is_ptr(ltype)) {
        PrefixExpr::create_deref(lhs_.get());
        ltype = sema.check(lhs());
    }

    auto ref = split_ref_type(ltype);

    if (auto struct_type = ltype->isa<StructType>()) {
        if (auto field_decl = struct_type->struct_decl()->field_decl(symbol())) {
            if (ref)
                Ref2ValueExpr::create(lhs())->type();
            return sema.wrap_ref(ref, struct_type->op(field_decl->index()));
        }
    }

    return sema.wrap_ref(ref, ltype->is_known() ? sema.type_error() : sema.find_type(this));
}

Assuming that the structure type is known, but the field type is not, we will return a type such as reference to ?23 in the first type inference iteration. In the next iteration, we will have inserted a Ref2Value node, which means that lhs will now have a non-reference type. Hence, this iteration will return a non-reference type as well, which cannot be unified with the type coming from the previous iteration.

This is an example that triggers the bug:

fn iterate_rays(rays: &Ray) -> () {
    rays.org;
}

struct Ray {
    org: Vec4
}

struct Vec4 {
    w: f32
}

My suggestion to fix this issue: Do not create Ref2Value nodes here, as it makes little sense from a type checking perspective (you need a reference to a structure to get a reference to a structure field). This Ref2Value should be located at the usage site, when necessary.

Answer 1 · 2017-02-07T11:15:52.000Z

Actually, couldn't we type this through subtyping (i.e &T > T), and add a intermediate pass between type inference and type checking that adds these explicit casts between references and rvalues ?

Answer 2 · 2017-02-07T13:24:33.000Z

Does this fix the problem?

Answer 3 · 2017-02-07T14:18:59.000Z

No, the issue still prevails with more complex examples, like this one:

fn iterate_rays(rays: &Ray) -> () {
    rays.org.x;
}
struct Ray {
    org: Vec4
}
struct Vec4 {
    x: f32
}

Answer 4 · 2017-02-07T14:24:27.000Z

yes, you're right

Answer 5 · 2017-02-07T14:41:06.000Z

One option is to allow FieldExprs to work with references. That simplifies a bit the code, and we just need to fix TypeSema to split the reference before checking:

void FieldExpr::check(TypeSema& sema) const {
    auto type = sema.check(lhs());

    // split reference
    split_ref_type(type);

    // ...
}

Answer 6 · 2017-02-07T14:48:39.000Z

Shall I apply that patch?

Answer 7 · 2017-02-07T14:52:26.000Z

See pull request #50

Answer 8 · 2017-02-07T14:58:23.000Z

yes, this is also what I thought: having these Ref2Value nodes in-between doesn't really make sense. I'll keep this bug report opened. I want to double-check a few things. I think we have the same problem with MapExpr.

Answer 9 · 2017-02-07T14:59:08.000Z

We do indeed. I was about to create a new bug report for that...

Answer 10 · 2017-02-07T15:09:24.000Z

For MapExpr, the issue is slightly different. The semantics we have are incorrect. We should not automatically insert the * operator. The map operator, when applied on an array of references, should offset the pointer (some sort of GEP). Right now, when we have:

let a : &[i32] = /* ... */ ;
a(5)

this translates to:

let a : &[i32] = /* ... */ ;
(*a)(5)

Which is not correct. You cannot create a reference to an element of an array if you have already dereferenced the base pointer.

Answer 11 · 2017-02-07T15:13:46.000Z

This is actually correct, because the type of *a in this example is not [i32] but reference of [i32] which still has this GEP/LEA kind of thing going.

Answer 12 · 2017-02-07T15:19:09.000Z

I think this should do the trick, or do you have an example where this goes wrong?

Answer 13 · 2017-02-07T15:24:50.000Z

The following example still does not work on my machine:

fn test(array: &[i32]) -> () {
    array(0);
}

Answer 14 · 2017-02-07T15:26:59.000Z

You are correct though, I did not see that * actually creates a reference to the value. A simple fix for the bug is then simply to split the reference before a call to MapExpr::remit.

const Def* MapExpr::remit(CodeGen& cg, State state, Location eval_loc) const {
    auto ltype = lhs()->type();
    split_ref_type(ltype);

    // ...
}

Answer 15 · 2017-02-07T15:28:15.000Z

yes, just a slight issue in the codegen. I'm currently inspecting that.

Answer 16 · 2017-02-07T15:32:43.000Z

yes, that's the fix. Let me add some test case and then we can close this issue.