
Simplifying dynamic rnn #7509

Closed
emailweixu opened this issue Jan 14, 2018 · 9 comments


emailweixu commented Jan 14, 2018

The design should be such that the user does not need to worry about batching at all. The code in test_dyn_rnn.py is too complex; a user should not need to write code such as:

rank_table = fluid.layers.lod_rank_table(x=sent_emb)
sent_emb_array = fluid.layers.lod_tensor_to_array(x=sent_emb, table=rank_table)

mem = fluid.layers.shrink_memory(x=mem, i=i, table=rank_table)

fluid.layers.increment(x=i, in_place=True)

fluid.layers.less_than(x=i, y=seq_len, cond=cond)

Why can't we make it as easy as v2 recurrent_group or fluid StaticRNN?
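For context: the rank_table / shrink_memory boilerplate above implements the standard trick of sorting sequences by descending length so the effective batch can shrink as shorter sequences finish. A minimal pure-Python sketch of that idea (all names here are illustrative, not the fluid API):

```python
# Sketch of what lod_rank_table + shrink_memory achieve: sort sequences
# by descending length, then at each time step only the prefix of the
# batch whose sequences are still active is updated.

def rank_table(seq_lens):
    """Return batch indices sorted by descending sequence length."""
    return sorted(range(len(seq_lens)), key=lambda i: -seq_lens[i])

def run_steps(sequences):
    order = rank_table([len(s) for s in sequences])
    ordered = [sequences[i] for i in order]
    max_len = len(ordered[0]) if ordered else 0
    outputs = [[] for _ in ordered]
    mem = [0.0] * len(ordered)            # one memory slot per instance
    for t in range(max_len):
        # "shrink_memory": number of instances still active at step t
        active = sum(1 for s in ordered if len(s) > t)
        for b in range(active):
            mem[b] = mem[b] + ordered[b][t]   # stand-in for the RNN cell
            outputs[b].append(mem[b])
    # undo the length sort so outputs line up with the original batch
    result = [None] * len(sequences)
    for pos, i in enumerate(order):
        result[i] = outputs[pos]
    return result
```

Because the batch is length-sorted, the active instances at every step are always a contiguous prefix, which is what makes the shrinking cheap.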

@emailweixu emailweixu changed the title Making dynamic rnn more transparent to batch Simplifying dynamic rnn Jan 14, 2018
@emailweixu
Collaborator Author

For while_op, we should make it behave like if_else_op, meaning that the condition is a vector with size=batch_size. Each dimension of cond is responsible for one instance in the batch.
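One way to read this proposal: the loop runs while any entry of the batch-sized condition vector is true, and an instance whose entry has become false is no longer updated. A minimal pure-Python sketch of that semantics (illustrative only, not the fluid API):

```python
def batched_while(state, step_fn, cond_fn, max_steps=100):
    """Advance each batch instance while its own condition holds.

    state:   list with one entry per batch instance
    step_fn: state_i -> new state_i
    cond_fn: state_i -> bool, the per-instance continuation condition
    """
    for _ in range(max_steps):
        cond = [cond_fn(s) for s in state]   # vector of size batch_size
        if not any(cond):
            break
        # only instances whose condition is still true are advanced
        state = [step_fn(s) if c else s for s, c in zip(state, cond)]
    return state
```

For example, `batched_while([0, 5, 9], lambda s: s + 2, lambda s: s < 8)` keeps stepping each instance independently until its value reaches 8 or more.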

@emailweixu
Collaborator Author

We should have a DynamicRNN with the same usage as StaticRNN.

@reyoung
Collaborator

reyoung commented Jan 15, 2018

@emailweixu
We have the Python syntax sugar DynamicRNN to wrap them all. It is just like the earlier recurrent_group and uses the same API, as shown in https://github.com/PaddlePaddle/talks/blob/develop/paddle-gtc-china.pdf

The usage is shown in the unit test below.

rnn = fluid.layers.DynamicRNN()
with rnn.block():
    in_ = rnn.step_input(sent_emb)
    mem = rnn.memory(shape=[100], dtype='float32')
    out_ = fluid.layers.fc(input=[in_, mem], size=100, act='tanh')
    rnn.update_memory(mem, out_)
    rnn.output(out_)

The complex unit test is just a low-level test for the DynamicRNN syntax sugar. This is because we developed the low-level APIs first and made sure they work correctly; then we developed the syntax sugar and wrote the complete test using it.

@emailweixu
Collaborator Author

I see. But I think we still need a simpler while_op which can handle each sample in a batch independently.

@jacquesqiao
Member

jacquesqiao commented Jan 31, 2018

@emailweixu do you mean syntax like the one below, where each condition can have an independent branch to handle it?

cond = less_than(...)
ie = pd.if_else(cond)
with ie.true_block():
    d = pd.layer.add(x, y)
    ie.output(d, pd.layer.softmax(d))
with ie.false_block():
    d = pd.layer.fc(z)
    ie.output(d, d + 1)
o1, o2 = ie(cond)

like:

cond = less_than(sequence, condition)
rnn = pd.DynamicRNN(cond)
with rnn.true_block():
    in_ = rnn.step_input(sent_emb)
    mem = rnn.memory(shape=[100], dtype='float32')
    out_ = fluid.layers.fc(input=[in_, mem], size=100, act='tanh')
    rnn.update_memory(mem, out_)
    rnn.output(out_)
with rnn.false_block():
    in_ = rnn.step_input(sent_emb)
    mem = rnn.memory(shape=[100], dtype='float32')
    out_ = fluid.layers.fc(input=[in_, mem], size=200, act='relu')
    rnn.update_memory(mem, out_)
    rnn.output(out_)
o = rnn()

or

rnn = pd.DynamicRNN()
with rnn.block():
    in_ = rnn.step_input(sent_emb)
    cond = less_than(in_, condition)
    ie = pd.ifelse(cond)
    with ie.true_block():
        mem = rnn.memory(shape=[100], dtype='float32')
        out_ = fluid.layers.fc(input=[in_, mem], size=100, act='tanh')
        rnn.update_memory(mem, out_)
        rnn.output(out_)
    with ie.false_block():
        mem = rnn.memory(shape=[100], dtype='float32')
        out_ = fluid.layers.fc(input=[in_, mem], size=200, act='relu')
        rnn.update_memory(mem, out_)
        rnn.output(out_)
o = rnn()
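Both variants above amount to choosing one of two branches at each step, based on a condition computed from the step input. A pure-Python sketch of that control flow (names are illustrative, not the proposed API):

```python
def rnn_with_per_step_branch(seq, cond_fn, true_fn, false_fn, init=0.0):
    """At each step, update the memory with one of two branch functions.

    cond_fn:  step input -> bool, selects the branch
    true_fn:  (input, mem) -> new mem, the true branch
    false_fn: (input, mem) -> new mem, the false branch
    """
    mem = init
    outputs = []
    for x in seq:
        # per-step branch selection, like ie.true_block()/ie.false_block()
        mem = true_fn(x, mem) if cond_fn(x) else false_fn(x, mem)
        outputs.append(mem)
    return outputs
```

For example, with `cond_fn=lambda x: x > 0`, positive inputs take one update rule and non-positive inputs take the other, within a single pass over the sequence.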

@jacquesqiao
Member

I have a design for a scalar switch-case op (#8031); the interface for it could look like:

rnn = pd.DynamicRNN()
with rnn.block():
    in_ = rnn.step_input(sent_emb)
    cond1 = logic_op(in_, condition1)
    cond2 = logic_op(in_, condition2)
    switch = SwitchOp()
    with switch.case(cond1):
        mem = rnn.memory(shape=[100], dtype='float32')
        out_ = fluid.layers.fc(input=[in_, mem], size=100, act='tanh')
        rnn.update_memory(mem, out_)
        rnn.output(out_)
    with switch.case(cond2):
        mem = rnn.memory(shape=[100], dtype='float32')
        out_ = fluid.layers.fc(input=[in_, mem], size=200, act='relu')
        rnn.update_memory(mem, out_)
        rnn.output(out_)
o = rnn()
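Assuming the usual first-match-wins semantics for a switch (the exact semantics of #8031 may differ), the case selection above can be sketched in pure Python as:

```python
def switch(cases, default=None):
    """Evaluate the branch of the first case whose condition is true.

    cases:   list of (condition, branch_fn) pairs, checked in order
    default: fallback branch_fn run when no condition holds
    """
    for cond, branch in cases:
        if cond:
            return branch()        # only the selected branch executes
    return default() if default is not None else None
```

For example, `switch([(False, lambda: "tanh"), (True, lambda: "relu")])` evaluates only the second branch; wrapping branches in functions keeps the unselected ones from running at all.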

@emailweixu
Collaborator Author

emailweixu commented Feb 1, 2018

What I am thinking of is something like the following:

cond = calc_initial_condition()
loop = fluid.layers.While(cond=cond)
with loop.block():
    mem = memory(boot=x)
    out = layers.fc(input=mem, size=100)
    p = layers.fc(input=out, size=1, act='sigmoid')
    layers.less_than(p, 0.5, cond=cond)
    loop.update_memory(mem, out)
    loop.output(out)
o = loop()
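The key point of the proposal above is that the continuation condition is recomputed inside the loop body from the network's own output p. A pure-Python sketch of that control flow (the cell and score functions are stand-ins for the fc/sigmoid layers, not the fluid API):

```python
def while_with_learned_stop(mem, cell_fn, stop_score_fn, threshold=0.5,
                            max_steps=100):
    """Loop whose termination condition is recomputed every step.

    cell_fn:       mem -> out   (stand-in for layers.fc)
    stop_score_fn: out -> p     (stand-in for the sigmoid 'p' head)
    The loop continues while p < threshold, mirroring
    layers.less_than(p, 0.5, cond=cond) in the proposal.
    """
    outputs = []
    cond = stop_score_fn(mem) < threshold   # initial condition
    steps = 0
    while cond and steps < max_steps:
        out = cell_fn(mem)
        outputs.append(out)
        mem = out                           # update_memory(mem, out)
        cond = stop_score_fn(out) < threshold
        steps += 1
    return outputs
```

Unlike a fixed-length unroll, the number of iterations here is data-dependent: the loop stops as soon as the score crosses the threshold.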

@jacquesqiao
Member

Sorry, I am a little confused by the code above; I have written my questions as comments below.

cond = calc_initial_condition()
loop = fluid.layers.While(cond=cond)
with loop.block():
    mem = memory(boot=x)
    out = layers.fc(input=mem, size=100)
    p = layers.fc(input=out, size=1, act='sigmoid')
    # 1. p is not used in the following code; should y be p?
    # 2. what is less_than used for?
    layers.less_than(y, 0.5, cond=cond)
    loop.update_memory(mem, out)
    loop.output(out)
o = loop()

@emailweixu
Collaborator Author

Sorry, y should be p. less_than is for computing the termination condition.
