Fix imports of multiple databases in one export file #9352

wwilfinger · 2018-01-22T02:24:49Z

I'm not happy with some of the names here. I also want to add a test that hits the 5000 batchSize to verify everything works there. Will probably look at cmd/influx_inspect/export/export_test.go to see how to generate larger test dbs.

batchWrite was using the last database and retention policy read from
the input file. Because batchWrite was called only every batchSize lines
or at EOF, databases with fewer than batchWrite points could be imported
into the incorrect database.

This change forces a flush with batchWrite whenever processDML reads a
change in database or retention policy.

Required for all non-trivial PRs

Rebased/mergable
Tests pass
CHANGELOG.md updated
Sign CLA (if not already signed)

jsternberg · 2018-01-30T15:31:44Z

importer/v8/importer.go

@@ -178,10 +181,18 @@ func (i *Importer) processDML(scanner *bufio.Reader) error {
 			return nil
 		}
 		if strings.HasPrefix(line, "# CONTEXT-DATABASE:") {
-			i.database = strings.TrimSpace(strings.Split(line, ":")[1])
+			i.dmlDatabase = strings.TrimSpace(strings.Split(line, ":")[1])


Is there a reason why you don't just call i.batchWrite() here before changing the database? It seems like there would be less of a code change if the batch were just flushed whenever the context database or retention policy is changed. It looks like batchWrite() is pretty safe to call multiple times. The only caveat is that, for some reason, this section of code:

i.batch = i.batch[:0]

This is in batchAccumulator rather than inside of batchWrite. If you move that piece of code to the end of batchWrite and remove it from batchAccumulator, you can just call batchWrite whenever you want to flush the writes.

You're right, thanks. That was much easier. I rebased and pushed.

Not sure how to write a test with more than 5000 points without pasting in a gigantic string of json. Open to suggestions.

aanthony1243

straightforward change, tests look good.

batchWrite was using the last database and retention policy read from the input file. Because batchWrite was called only every batchSize lines or at EOF, databases with fewer than batchWrite points could be imported into the incorrect database. This change forces a flush with batchWrite whenever processDML reads a change in database or retention policy.

jsternberg · 2018-02-05T18:19:46Z

I resolved the changelog conflict and pushed an update. As soon as it passes tests, I'm going to merge this.

Thanks for the help! We should have this change included in the 1.5 release.

jsternberg · 2018-02-06T05:01:23Z

Hi @wwilfinger, can you sign the link to the CLA that's here? Thanks.

jsternberg · 2018-02-08T17:09:06Z

@wwilfinger ping. Can you sign the CLA otherwise we're going to have to revert this change and redo it.

Thanks.

wwilfinger · 2018-02-09T15:22:20Z

@jsternberg My employer's legal team is taking a while to get back to me if I can sign the CLA. Go ahead and revert. I apologize for the waste of time.

…t-export-import" This reverts commit 9aeae7c, reversing changes made to 35b44cc. The contributor was unable to sign the contributor license agreement so we have to revert this commit.

ghost added the proposed label Jan 22, 2018

jsternberg suggested changes Jan 30, 2018

View reviewed changes

rbetts requested a review from aanthony1243 February 5, 2018 16:55

aanthony1243 approved these changes Feb 5, 2018

View reviewed changes

jsternberg approved these changes Feb 5, 2018

View reviewed changes

ghost assigned jsternberg Feb 5, 2018

ghost added review and removed proposed labels Feb 5, 2018

jsternberg merged commit 9aeae7c into influxdata:master Feb 5, 2018

ghost removed the review label Feb 5, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix imports of multiple databases in one export file #9352

Fix imports of multiple databases in one export file #9352

wwilfinger commented Jan 22, 2018

jsternberg Jan 30, 2018

wwilfinger Feb 3, 2018

aanthony1243 left a comment

jsternberg commented Feb 5, 2018

jsternberg commented Feb 6, 2018

jsternberg commented Feb 8, 2018

wwilfinger commented Feb 9, 2018 •

edited

Loading

Fix imports of multiple databases in one export file #9352

Fix imports of multiple databases in one export file #9352

Conversation

wwilfinger commented Jan 22, 2018

Required for all non-trivial PRs

jsternberg Jan 30, 2018

Choose a reason for hiding this comment

wwilfinger Feb 3, 2018

Choose a reason for hiding this comment

aanthony1243 left a comment

Choose a reason for hiding this comment

jsternberg commented Feb 5, 2018

jsternberg commented Feb 6, 2018

jsternberg commented Feb 8, 2018

wwilfinger commented Feb 9, 2018 • edited Loading

wwilfinger commented Feb 9, 2018 •

edited

Loading