You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
2018/04/02 01:23:39 raft.go:664: INFO: 2 received MsgVoteResp from 2 at term 301
2018/04/02 01:23:39 raft.go:651: INFO: 2 [logterm: 59, index: 1578263] sent MsgVote request to 1 at term 301
2018/04/02 01:23:39 raft.go:651: INFO: 2 [logterm: 59, index: 1578263] sent MsgVote request to 3 at term 301
2018/04/02 01:23:41 draft.go:653: Couldn't take snapshot, txn watermark: [1529797], applied watermark: [1578263]
2018/04/02 01:23:42 raft.go:749: INFO: 2 is starting a new election at term 301
2018/04/02 01:23:42 raft.go:580: INFO: 2 became candidate at term 302
2018/04/02 01:23:42 raft.go:664: INFO: 2 received MsgVoteResp from 2 at term 302
2018/04/02 01:23:42 raft.go:651: INFO: 2 [logterm: 59, index: 1578263] sent MsgVote request to 1 at term 302
2018/04/02 01:23:42 raft.go:651: INFO: 2 [logterm: 59, index: 1578263] sent MsgVote request to 3 at term 302
2018/04/02 01:23:45 raft.go:749: INFO: 2 is starting a new election at term 302
2018/04/02 01:23:45 raft.go:580: INFO: 2 became candidate at term 303
One of the three server is constantly emitting following logs as I see now:
2018/04/03 11:45:49 raft.go:749: INFO: 2 is starting a new election at term 41644
2018/04/03 11:45:49 raft.go:580: INFO: 2 became candidate at term 41645
2018/04/03 11:45:49 raft.go:664: INFO: 2 received MsgVoteResp from 2 at term 41645
2018/04/03 11:45:49 raft.go:651: INFO: 2 [logterm: 59, index: 1578263] sent MsgVote request to 1 at term 41645
2018/04/03 11:45:49 raft.go:651: INFO: 2 [logterm: 59, index: 1578263] sent MsgVote request to 3 at term 41645
2018/04/03 11:45:52 raft.go:749: INFO: 2 is starting a new election at term 41645
2018/04/03 11:45:52 raft.go:580: INFO: 2 became candidate at term 41646
2018/04/03 11:45:52 raft.go:664: INFO: 2 received MsgVoteResp from 2 at term 41646
2018/04/03 11:45:52 raft.go:651: INFO: 2 [logterm: 59, index: 1578263] sent MsgVote request to 1 at term 41646
2018/04/03 11:45:52 raft.go:651: INFO: 2 [logterm: 59, index: 1578263] sent MsgVote request to 3 at term 41646
2018/04/03 11:45:55 raft.go:749: INFO: 2 is starting a new election at term 41646
2018/04/03 11:45:55 raft.go:580: INFO: 2 became candidate at term 41647
2018/04/03 11:45:55 raft.go:664: INFO: 2 received MsgVoteResp from 2 at term 41647
2018/04/03 11:45:55 raft.go:651: INFO: 2 [logterm: 59, index: 1578263] sent MsgVote request to 3 at term 41647
2018/04/03 11:45:55 raft.go:651: INFO: 2 [logterm: 59, index: 1578263] sent MsgVote request to 1 at term 41647
Other servers' logs at this time are:
2018/04/03 11:47:55 raft.go:692: INFO: 1 [logterm: 59, index: 1578309, vote: 1] ignored MsgVote from 2 [logterm: 59, index: 1578263] at term 59: lease is not expired (remaining ticks: 52)
2018/04/03 11:47:57 raft.go:692: INFO: 1 [logterm: 59, index: 1578309, vote: 1] ignored MsgVote from 2 [logterm: 59, index: 1578263] at term 59: lease is not expired (remaining ticks: 46)
2018/04/03 11:47:58 draft.go:653: Couldn't take snapshot, txn watermark: [1529797], applied watermark: [1578309]
2018/04/03 11:48:01 raft.go:692: INFO: 1 [logterm: 59, index: 1578309, vote: 1] ignored MsgVote from 2 [logterm: 59, index: 1578263] at term 59: lease is not expired (remaining ticks: 50)
2018/04/03 11:48:04 raft.go:692: INFO: 1 [logterm: 59, index: 1578309, vote: 1] ignored MsgVote from 2 [logterm: 59, index: 1578263] at term 59: lease is not expired (remaining ticks: 38)
2018/04/03 11:48:06 raft.go:692: INFO: 1 [logterm: 59, index: 1578309, vote: 1] ignored MsgVote from 2 [logterm: 59, index: 1578263] at term 59: lease is not expired (remaining ticks: 9)
2018/04/03 11:48:08 raft.go:692: INFO: 1 [logterm: 59, index: 1578309, vote: 1] ignored MsgVote from 2 [logterm: 59, index: 1578263] at term 59: lease is not expired (remaining ticks: 8)
2018/04/03 11:48:10 raft.go:692: INFO: 1 [logterm: 59, index: 1578309, vote: 1] ignored MsgVote from 2 [logterm: 59, index: 1578263] at term 59: lease is not expired (remaining ticks: 96)
2018/04/03 11:45:22 raft.go:692: INFO: 3 [logterm: 59, index: 1578309, vote: 0] ignored MsgVote from 2 [logterm: 59, index: 1578263] at term 59: lease is not expired (remaining ticks: 100)
2018/04/03 11:45:25 raft.go:692: INFO: 3 [logterm: 59, index: 1578309, vote: 0] ignored MsgVote from 2 [logterm: 59, index: 1578263] at term 59: lease is not expired (remaining ticks: 100)
2018/04/03 11:45:27 raft.go:692: INFO: 3 [logterm: 59, index: 1578309, vote: 0] ignored MsgVote from 2 [logterm: 59, index: 1578263] at term 59: lease is not expired (remaining ticks: 100)
2018/04/03 11:45:31 raft.go:692: INFO: 3 [logterm: 59, index: 1578309, vote: 0] ignored MsgVote from 2 [logterm: 59, index: 1578263] at term 59: lease is not expired (remaining ticks: 100)
2018/04/03 11:45:33 raft.go:692: INFO: 3 [logterm: 59, index: 1578309, vote: 0] ignored MsgVote from 2 [logterm: 59, index: 1578263] at term 59: lease is not expired (remaining ticks: 100)
2018/04/03 11:45:37 raft.go:692: INFO: 3 [logterm: 59, index: 1578309, vote: 0] ignored MsgVote from 2 [logterm: 59, index: 1578263] at term 59: lease is not expired (remaining ticks: 100)
2018/04/03 11:45:40 raft.go:692: INFO: 3 [logterm: 59, index: 1578309, vote: 0] ignored MsgVote from 2 [logterm: 59, index: 1578263] at term 59: lease is not expired (remaining ticks: 100)
The text was updated successfully, but these errors were encountered:
Are the three dgraph servers replicas, i.e. Zero was started with replicas as 3?
Firstly, I'd recommend upgrading to the nightly release as it contains a fix for the issue where snapshot wasn't happening in the case where a group had replicas. See #2266. That could be one reason for OOM. If you share some logs before the OOM crash, I can confirm that.
What version of Dgraph are you using?
1.0.4
Have you tried reproducing the issue with latest release?
Yes
What is the hardware spec (RAM, OS)?
ubuntu 14.04 / 8 core 32GB
Steps to reproduce the issue (command/config used to run Dgraph).
Out of memory condition invoked and Dgraph process got killed. Details below.
Expected behaviour and actual result.
No out of memory.
__
We have a 3 node setup for dgraphzero and a 3 node setup for dgraph-server.
We got an OOM in one of the dgraph-servers on Apr 1, 21:58 WIB:
At this time as you see,
anon-rss
is32 GB
.We got another OOM on another dgraph-server on Apr 2, 01:23 WIB:
Here too,
anon-rss
is32GB
.When I check the
p/
folder on each node data is ~3.5 G
. During normal hours RSS usage from top is limited to under6 G
:From logs during that time, I see following:
One of the three server is constantly emitting following logs as I see now:
Other servers' logs at this time are:
The text was updated successfully, but these errors were encountered: