bug in ODEFnc() #11

jiangxinke · 2022-01-18T08:38:31Z

There is a bug in `ODEFnc()`: a `Parameter` cannot be overwritten by direct assignment, otherwise it raises the error "cannot assign ‘torch.cuda.FloatTensor’ as parameter ‘self.w’ (torch.nn.Parameter or None expected)".

It should be:

        self.w.data = (1 + self.beta) * self.w - self.beta * torch.mm(torch.mm(self.w, torch.t(self.w)), self.w)
        xw = torch.einsum('ijkl, lm->ijkm', x, w)

        d2 = torch.clamp(self.d2, min=0, max=1)
        w2 = torch.mm(self.w2 * d2, torch.t(self.w2))
        self.w2.data = (1 + self.beta) * self.w2 - self.beta * torch.mm(torch.mm(self.w2, torch.t(self.w2)), self.w2)
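
The failure and the fix can be reproduced in isolation. The sketch below is a minimal illustration, not the repository's code: `Toy`, `broken_step`, `fixed_step`, the matrix size, and the value of `beta` are all hypothetical. It also swaps the `.data` assignment above for an in-place `copy_` under `torch.no_grad()`, which is another common way to overwrite a parameter's values without rebinding the attribute.

```python
import torch
import torch.nn as nn


class Toy(nn.Module):
    """Hypothetical stand-in for the module discussed in this issue."""

    def __init__(self, dim=3, beta=0.6):
        super().__init__()
        # beta and the 0.5 * identity initialisation are assumptions for the demo.
        self.w = nn.Parameter(0.5 * torch.eye(dim))
        self.beta = beta

    def _updated_w(self):
        # Same expression as in the issue; the result is a plain Tensor,
        # not an nn.Parameter.
        w = self.w
        return (1 + self.beta) * w - self.beta * torch.mm(torch.mm(w, torch.t(w)), w)

    def broken_step(self):
        # Rebinding a registered parameter to a plain Tensor raises TypeError.
        self.w = self._updated_w()

    def fixed_step(self):
        # Overwrite the values in place; self.w stays the same nn.Parameter.
        with torch.no_grad():
            self.w.copy_(self._updated_w())


m = Toy()
try:
    m.broken_step()
except TypeError as err:
    print("broken_step failed:", err)

m.fixed_step()
print(type(m.w).__name__, m.w.detach().diagonal())
```

Either variant keeps `self.w` registered as a parameter, so the optimizer continues to see the same object.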


square-coder · 2022-01-18T09:48:17Z

You are right, but we found that this operation led to suboptimal convergence. A simple alternative is to delete this line, which means we no longer constrain `self.w` to be orthogonal, and that works in practice. We have fixed the bug this way.
Thanks for your attention.
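
For context on what the deleted line was doing, here is a minimal sketch of how repeating that update pulls a matrix toward orthogonality. The helper names, the 4×4 size, the random starting matrix, and `beta = 0.5` are assumptions for illustration only. The sketch does not reproduce the convergence behaviour reported above.

```python
import torch


def orthogonality_gap(w):
    """Frobenius norm of w^T w - I; zero when w is orthogonal."""
    eye = torch.eye(w.shape[1], dtype=w.dtype)
    return torch.linalg.norm(torch.mm(torch.t(w), w) - eye).item()


def update(w, beta):
    # Same expression as in the issue, applied to a plain tensor.
    return (1 + beta) * w - beta * torch.mm(torch.mm(w, torch.t(w)), w)


torch.manual_seed(0)
beta = 0.5  # hypothetical value, not taken from the repository
w = torch.eye(4) + 0.1 * torch.randn(4, 4)  # nearly, but not exactly, orthogonal

gap_before = orthogonality_gap(w)
for _ in range(10):
    w = update(w, beta)
gap_after = orthogonality_gap(w)

print(f"gap before: {gap_before:.4f}, after 10 updates: {gap_after:.2e}")
```

Without the update, nothing in the forward pass keeps `self.w` close to orthogonal, which is the trade-off described in the comment above.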

jiangxinke · 2022-01-18T10:39:55Z

Thanks.
