V2 API save and load param with V1 API #3574

tensor-tang · 2017-08-18T13:24:20Z

tensor-tang · 2017-08-18T14:21:34Z

ERROR: test_init_from_tar (main.TestParameters)
[13:53:24] : [Step 1/1] ----------------------------------------------------------------------
[13:53:24] : [Step 1/1] Traceback (most recent call last):
[13:53:24] : [Step 1/1] File "test_parameters.py", line 112, in test_init_from_tar
[13:53:24] : [Step 1/1] p2.init_from_tar(file1)
[13:53:24] : [Step 1/1] File "/paddle/build/python/build/lib-python/paddle/v2/parameters.py", line 377, in init_from_tar
[13:53:24] : [Step 1/1] tar_param = Parameters.from_tar(f)
[13:53:24] : [Step 1/1] File "/paddle/build/python/build/lib-python/paddle/v2/parameters.py", line 364, in from_tar
[13:53:24] : [Step 1/1] params.deserialize(param_name, f)
[13:53:24] : [Step 1/1] File "/paddle/build/python/build/lib-python/paddle/v2/parameters.py", line 306, in deserialize
[13:53:24] : [Step 1/1] self.set(name, arr.reshape(self.get_shape(name)))
[13:53:24] : [Step 1/1] ValueError: cannot reshape array of size 0 into shape (1,256)

Test_parameters failed since no gradient machine have initialed, so to_tar saved non tar.

Will update.

And maybe we need add more test case with gradient machine initialed

luotao1 · 2017-08-21T04:04:06Z

@reyoung 要在v2的parameter里面用header，而不是直接写死

f.write(struct.pack("IIQ", 0, 4, size))

请问你觉得什么方式比较好？

目前PR的方式，即调用c++端parameter
其他？比如在python端写个获取c++端header的接口？

qingqing01 · 2017-08-22T02:09:39Z

python/paddle/v2/parameters.py

-        while buf:  # f.write crashes with big data blog.
-            f.write(buf)
-            wrote_size += 65535
+        if len(self.__gradient_machines__) == 0:


len(self.__gradient_machines__) == 0 和 len(self.__gradient_machines__) != 0 分别是什么情况下？为啥要用这个判断？

一般来说，以DeepSpeech2为例，在真正保存参数之前，实际是已经有了self.__gradient_machines__，所以走进len(self.__gradient_machines__) != 0的分支。

在现有的单测里面，是会进入len(self.__gradient_machines__) == 0的分支。

qingqing01 · 2017-08-22T02:11:37Z

python/paddle/v2/parameters.py

+                param = __get_parameter_in_gradient_machine__(
+                    each_gradient_machine, name)
+                assert isinstance(param, api.Parameter)
+                filename = 'tmp_param_file'


filename是写死的，每次保存都会覆盖？

是写死的，但是每次用完后会删掉，所以只是一个临时的。
之所以要这么用，是因为已有的V1 API接口目前只提供了从文件名的save和load。

luotao1 · 2017-08-22T02:58:36Z

由于目前这样的写法和c++端更加紧密了，v2的目的是尽可能和c++端解耦。所以有几个问题：

这个header的format什么时候会填充呢？是paddle.init(use_mkldnn=True)的时候进行填充？默认填充都是0,4,size?
不同param间的format应该也可以不一样，这部分是在trainer/config_parser.py端可以确定，还是必须c++端确定？

tensor-tang · 2017-08-22T03:15:54Z

这个header的format什么时候会填充呢？是paddle.init(use_mkldnn=True)的时候进行填充？默认填充都是0,4,size?

MKLDNN的param里的header format会在MKLDNN layer里面到forward之前才能被确认。默认是0。

不同param间的format应该也可以不一样，这部分是在trainer/config_parser.py端可以确定，还是必须c++端确定？

不同param间的format是可以不一样的，但是format的确认首先还是在C++端，不过要从Python端取出来是需要另外加接口才可以。

luotao1 · 2017-08-22T03:26:02Z

如果只是header问题，那么可以写一个接口，这里从c++端获取header即可。

tensor-tang · 2017-09-12T03:18:01Z

Can refine later if necessary.

tensor-tang added 3 commits August 17, 2017 19:05

use v1 api save param

cda377e

use v1 api load param if have gradient machine

53c25d2

Merge remote-tracking branch 'upstream/develop' into v2saveheader

5d08456

tensor-tang requested a review from luotao1 August 18, 2017 13:34

tensor-tang changed the title ~~save and load param with V1 API~~ V2 API save and load param with V1 API Aug 18, 2017

keep v2 save param if have no gradient machine

fe415d9

qingqing01 reviewed Aug 22, 2017

View reviewed changes

tensor-tang closed this Sep 12, 2017

tensor-tang deleted the v2saveheader branch September 12, 2017 03:18

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

V2 API save and load param with V1 API #3574

V2 API save and load param with V1 API #3574

tensor-tang commented Aug 18, 2017

tensor-tang commented Aug 18, 2017

luotao1 commented Aug 21, 2017

qingqing01 Aug 22, 2017

tensor-tang Aug 22, 2017

qingqing01 Aug 22, 2017

tensor-tang Aug 22, 2017

luotao1 commented Aug 22, 2017

tensor-tang commented Aug 22, 2017

luotao1 commented Aug 22, 2017

tensor-tang commented Sep 12, 2017

V2 API save and load param with V1 API #3574

V2 API save and load param with V1 API #3574

Conversation

tensor-tang commented Aug 18, 2017

tensor-tang commented Aug 18, 2017

luotao1 commented Aug 21, 2017

qingqing01 Aug 22, 2017

Choose a reason for hiding this comment

tensor-tang Aug 22, 2017

Choose a reason for hiding this comment

qingqing01 Aug 22, 2017

Choose a reason for hiding this comment

tensor-tang Aug 22, 2017

Choose a reason for hiding this comment

luotao1 commented Aug 22, 2017

tensor-tang commented Aug 22, 2017

luotao1 commented Aug 22, 2017

tensor-tang commented Sep 12, 2017