Add a m3msg server for ingestion #1028
Conversation
Codecov Report
@@            Coverage Diff             @@
##           master    #1028      +/-   ##
==========================================
+ Coverage   76.99%   77.07%   +0.08%
==========================================
  Files         439      439
  Lines       37191    37191
==========================================
+ Hits        28636    28666      +30
+ Misses       6502     6481      -21
+ Partials     2053     2044       -9
Continue to review full report at Codecov.
	writeFn WriteFn,
	iOpts instrument.Options,
) (server.Server, error) {
	scope := iOpts.MetricsScope().Tagged(map[string]string{"server": "m3msg"})
Is what you're doing with Tagged different from calling SubScope?
Yep, it's different: SubScope appends the string to the metric name, while Tagged adds a tag to the metric and doesn't touch the name.
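To illustrate the difference, a minimal self-contained sketch using the tally metrics library (the prefix and metric names here are made up for the example):

package main

import "github.com/uber-go/tally"

func main() {
	// No reporter configured; tally falls back to a no-op reporter.
	scope, _ := tally.NewRootScope(tally.ScopeOptions{Prefix: "coordinator"}, 0)

	// SubScope extends the metric name: this emits "coordinator.m3msg.writes".
	scope.SubScope("m3msg").Counter("writes").Inc(1)

	// Tagged leaves the name alone and attaches a tag instead: this emits
	// "coordinator.writes" with the tag server=m3msg.
	scope.Tagged(map[string]string{"server": "m3msg"}).Counter("writes").Inc(1)
}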
		h.processMessage(msg)
	}
	if msgErr != nil && msgErr != io.EOF {
How will the loop get restarted?
The consumer is long-lived with a TCP connection; c.Message() is a blocking call, and the loop keeps calling it to decode messages from the connection.
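For readers following along, a hedged sketch of the loop shape being discussed, assuming a handler type with a processMessage method and a logger (neither name is guaranteed to match this diff exactly):

func (h *handler) Handle(c consumer.Consumer) {
	var (
		msg    consumer.Message
		msgErr error
	)
	for {
		// Message() blocks on the long-lived TCP connection until the next
		// message is decoded or the connection errors out, so the loop never
		// needs to be "restarted": it just calls Message() again.
		msg, msgErr = c.Message()
		if msgErr != nil {
			break
		}
		h.processMessage(msg)
	}
	if msgErr != nil && msgErr != io.EOF {
		// io.EOF simply means the peer closed the connection.
		h.logger.Errorf("could not read message: %v", msgErr)
	}
	c.Close()
}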
	metricTimeNanos int64,
	value float64,
	sp policy.StoragePolicy,
	callback *RefCountedCallback,
Super nit: might be nice to give this CallbackSuccess() and CallbackFailure() methods.
I'm open to it; I went with the current interface because it makes it cheaper to add more callback types in the future.
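If we did go that way, the wrappers could be thin sugar over the existing entry point; a sketch (OnFailure is hypothetical, only OnSuccess appears in this diff):

// CallbackSuccess and CallbackFailure as sugar over Callback(t CallbackType).
func (r *RefCountedCallback) CallbackSuccess() { r.Callback(OnSuccess) }
func (r *RefCountedCallback) CallbackFailure() { r.Callback(OnFailure) }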
// Callback performs the callback.
func (r *RefCountedCallback) Callback(t CallbackType) {
	if t == OnSuccess {
So if they call back with a failure, refCount will never reach 0 and m3msg will retry? Is there any concept of an explicit nack?
We actually do nack in statsdex: when we get a non-retriable error, we still call back with success and ack the message. I could use another callback type for that case so it's easier to understand.
}
// NewRefCountedCallback creates a RefCountedCallback.
func NewRefCountedCallback(msg consumer.Message) *RefCountedCallback {
If you're not doing any pooling, is all this ref counting necessary? It seems like you could just Ack or Nack at the end.
Yeah, the thing is, each message can contain more than one metric, so we only ack the message once all of its metrics have been ingested successfully. If one of them fails, we don't ack the message and it will be retried.
Could you "fail faster" by having Callback(OnRetriableError) immediately cause a nack?
Unfortunately m3msg does not support that right now; a retry only happens when a message is not acked within X amount of time. We could consider adding it in the future.
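Pulling the thread together, a hedged sketch of the ref-counting scheme described above; everything beyond the names quoted in the diff (e.g. IncRef and the refCount field) is an assumption, not the exact code:

// Assumes "sync/atomic" and the m3msg consumer package are imported.

type CallbackType int

const (
	// OnSuccess is the only callback type shown in this diff; more types
	// (e.g. for non-retriable errors) could be added later, per the thread.
	OnSuccess CallbackType = iota
)

// RefCountedCallback acks a message only after every metric decoded from it
// has been ingested successfully.
type RefCountedCallback struct {
	refCount int32
	msg      consumer.Message
}

// NewRefCountedCallback creates a RefCountedCallback.
func NewRefCountedCallback(msg consumer.Message) *RefCountedCallback {
	return &RefCountedCallback{msg: msg}
}

// IncRef is called once per metric decoded from the message.
func (r *RefCountedCallback) IncRef() {
	atomic.AddInt32(&r.refCount, 1)
}

// Callback performs the callback. Each success decrements the count; when it
// reaches zero the message is acked. If any metric fails, the count never
// reaches zero, the message is never acked, and m3msg redelivers it once the
// ack timeout expires.
func (r *RefCountedCallback) Callback(t CallbackType) {
	if t == OnSuccess {
		if atomic.AddInt32(&r.refCount, -1) == 0 {
			r.msg.Ack()
		}
	}
}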
LGTM
Part 1 of the ingestion change in M3coordinator.
This diff adds an m3msg server that decodes traffic from the m3msg consumer into metrics, taking in a WriteFn to write those metrics.
Part 2 will implement a storage-based ingester to fulfill the WriteFn.
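For a sense of the contract, the WriteFn plumbed through here plausibly looks something like the following, judging from the parameters quoted in the diff above (the ctx and id parameters are assumptions):

// WriteFn writes one decoded metric. Part 2's storage-based ingester will
// implement it, and it must invoke the callback so the m3msg message can
// eventually be acked.
type WriteFn func(
	ctx context.Context,
	id []byte,
	metricTimeNanos int64,
	value float64,
	sp policy.StoragePolicy,
	callback *RefCountedCallback,
)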