ETCD DB size metrics is not correct sometimes #8080

armstrongli · 2017-06-12T02:43:09Z

We hit this issue when re-adding members to a cluster whose DB size is larger than 5GB.

The flow I followed is:

1. Start one ETCD cluster (3 members)
2. Write more than 5GB of data to the ETCD cluster
3. Remove one member from the cluster
4. Stop all members of the cluster
5. Restart all members

The DB size reported by ETCD servers is 0.

After taking a snapshot, the DB size reported by every member is back to normal.

Here are the metrics I curled from one of the members:

Before the snapshot

# HELP etcd_debugging_mvcc_db_total_size_in_bytes Total size of the underlying database in bytes.
# TYPE etcd_debugging_mvcc_db_total_size_in_bytes gauge
etcd_debugging_mvcc_db_total_size_in_bytes 0

metrics-before-snapshot.txt.zip

After the snapshot

# HELP etcd_debugging_mvcc_db_total_size_in_bytes Total size of the underlying database in bytes.
# TYPE etcd_debugging_mvcc_db_total_size_in_bytes gauge
etcd_debugging_mvcc_db_total_size_in_bytes 7.014617088e+09

metrics-after-snapshot.txt.zip

Cluster endpoint status

+-------------------------------------------------+------------------+---------+---------+-----------+-----------+------------+
|                    ENDPOINT                     |        ID        | VERSION | DB SIZE | IS LEADER | RAFT TERM | RAFT INDEX |
+-------------------------------------------------+------------------+---------+---------+-----------+-----------+------------+
| https://tess-node-p4hnm-1189140.33.tess.io:4001 | 416b952ae811a0d7 | 3.0.15  | 7.0 GB  | false     |       378 |    1670896 |
| https://tess-node-nf3nq-1189139.33.tess.io:4001 | c0fc36cf224bac7c | 3.0.15  | 7.0 GB  | false     |       378 |    1670896 |
| https://tess-node-phkwk-1189138.33.tess.io:4001 | f9d31f1cd85ee669 | 3.0.15  | 7.0 GB  | true      |       378 |    1670897 |
+-------------------------------------------------+------------------+---------+---------+-----------+-----------+------------+
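
As a cross-check of the numbers above, here is a minimal Python sketch that pulls the gauge out of the Prometheus text output and renders it in the same units as the `DB SIZE` column. The helper names `parse_gauge` and `format_gb` are hypothetical, and the 1000-based GB conversion is an assumption about how the status table rounds the value.

```python
METRIC = "etcd_debugging_mvcc_db_total_size_in_bytes"

# Metric samples copied from the issue text above.
BEFORE_SNAPSHOT = """\
# HELP etcd_debugging_mvcc_db_total_size_in_bytes Total size of the underlying database in bytes.
# TYPE etcd_debugging_mvcc_db_total_size_in_bytes gauge
etcd_debugging_mvcc_db_total_size_in_bytes 0
"""

AFTER_SNAPSHOT = """\
# HELP etcd_debugging_mvcc_db_total_size_in_bytes Total size of the underlying database in bytes.
# TYPE etcd_debugging_mvcc_db_total_size_in_bytes gauge
etcd_debugging_mvcc_db_total_size_in_bytes 7.014617088e+09
"""


def parse_gauge(metrics_text, name):
    """Return the value of an unlabelled gauge, or None if it is absent."""
    for line in metrics_text.splitlines():
        line = line.strip()
        # Skip blank lines and the '# HELP' / '# TYPE' comment lines.
        if not line or line.startswith("#"):
            continue
        parts = line.split()
        if len(parts) >= 2 and parts[0] == name:
            return float(parts[1])
    return None


def format_gb(size_in_bytes):
    """Render a byte count in decimal gigabytes (assumed: 1 GB = 10**9 bytes)."""
    return f"{size_in_bytes / 1e9:.1f} GB"
```

With these samples, the value after the snapshot formats to the same 7.0 GB that the endpoint status shows, while the value before the snapshot parses as 0.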


armstrongli · 2017-06-12T02:44:39Z

The endpoint status is correct both before and after taking the snapshot.

xiang90 · 2017-06-12T02:50:58Z

The database size is a debugging metric and is calculated lazily. If no new data is committed, it won't be updated.
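
To illustrate what lazy calculation means here, the following is a toy Python model, not etcd's actual code. The class `ToyBackend`, its fields, and the `set_metric_on_restore` switch are all hypothetical. The gauge is refreshed only when data is committed, so a restarted member that receives no new commits keeps reporting 0. Setting the gauge while the store is loaded from disk avoids that.

```python
class ToyBackend:
    """Toy stand-in for a store that exports its size as a gauge."""

    def __init__(self, size_on_disk, set_metric_on_restore):
        # Bytes already on disk when the process (re)starts.
        self.size_on_disk = size_on_disk
        # A gauge in a freshly started process reads zero until it is set.
        self.db_size_metric = 0
        if set_metric_on_restore:
            # Assumed fix: publish the size as soon as the store is restored.
            self.db_size_metric = self.size_on_disk

    def commit(self, new_bytes):
        # Lazy update: the gauge is only refreshed as a side effect of a commit.
        self.size_on_disk += new_bytes
        self.db_size_metric = self.size_on_disk


# Restarted member with 7 GB on disk and no new writes.
lazy_only = ToyBackend(7_014_617_088, set_metric_on_restore=False)
with_restore = ToyBackend(7_014_617_088, set_metric_on_restore=True)
```

In this model `lazy_only.db_size_metric` stays at 0 until the first `commit`, while `with_restore.db_size_metric` reports the on-disk size immediately.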

armstrongli · 2017-06-12T05:53:45Z

It didn't get updated for days.

xiang90 · 2017-06-12T05:56:34Z

Can you share a script that I can run locally to reproduce it?

armstrongli · 2017-06-16T06:48:03Z

Just create one cluster with v3.0.15 and v3.1.8 members together, and run a benchmark on it. That's all.


armstrongli changed the title ~~ETCD metrics is not correct sometimes~~ ETCD DB size metrics is not correct sometimes Jun 12, 2017

heyitsanthony pushed a commit to heyitsanthony/etcd that referenced this issue Jun 16, 2017

mvcc: set db size metric on restore

7f149d8

Fixes etcd-io#8080

heyitsanthony mentioned this issue Jun 16, 2017

mvcc: set db size metric on restore #8120

Merged

heyitsanthony closed this as completed in #8120 Jun 16, 2017

gyuho pushed a commit that referenced this issue Jun 20, 2017

mvcc: set db size metric on restore

ed7ef5b

Fixes #8080

yudai pushed a commit to yudai/etcd that referenced this issue Oct 5, 2017

mvcc: set db size metric on restore

118d50e

Fixes etcd-io#8080
