
CDC: 19.1 updates #4403

Merged: 14 commits merged from the cdc branch into master on Apr 3, 2019
Conversation

@lnhsingh (Contributor) commented Feb 20, 2019

Changes addressing #3992:

  • Added a Responses section to CREATE CHANGEFEED explaining which messages are emitted to a Kafka topic for DML statements.
  • Added more description for the updated and resolved timestamps, and for cursor.
  • Removed results_buffer_size from the docs.
  • Added info about schema changes with column backfill.
  • Added info about cloud storage sinks.
  • Added Avro data types.
  • Added info about how to debug changefeeds.

Misc changes:

  • Added / edited Avro core changefeed instructions

Closes #3992.
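
For orientation, a minimal sketch of the kind of statement these sections document. The Kafka address is a placeholder, and the options shown are the ones discussed in this PR (updated and resolved); it is not copied from the docs.

~~~ sql
-- Enterprise changefeed emitting to a Kafka sink with updated and resolved timestamps.
> CREATE CHANGEFEED FOR TABLE office_dogs
    INTO 'kafka://localhost:9092'
    WITH updated, resolved = '10s';
~~~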

@cockroach-teamcity (Member) commented:

This change is Reviewable

Lauren added 5 commits March 28, 2019 14:37
Changes include:

- Added / edited Avro core changefeed instructions

Minor edit

Add expected responses for enterprise changefeeds

CDC updates

- Fix broken links
- Add info about cursor
- Add info about updated timestamps
- Add info about schema changes with backfill
Minor edits / links
@lnhsingh lnhsingh changed the title from "(WIP) CDC: 19.1 updates" to "CDC: 19.1 updates" on Mar 28, 2019
@lnhsingh lnhsingh marked this pull request as ready for review March 28, 2019 18:39
@lnhsingh lnhsingh requested review from danhhz and rolandcrosby March 28, 2019 18:39
@lnhsingh lnhsingh requested a review from Amruta-Ranade March 28, 2019 18:39
@rolandcrosby left a comment

looking good so far!

Reviewed 4 of 8 files at r2, 1 of 2 files at r3.
Reviewable status: :shipit: complete! 0 of 0 LGTMs obtained (waiting on @Amruta-Ranade, @danhhz, @lhirata, and @rolandcrosby)


v19.1/change-data-capture.md, line 82 at r3 (raw file):

Rows that have been backfilled by a schema change are always re-emitted because Avro's default schema change functionality is not powerful enough to represent the schema changes that CockroachDB supports (e.g., CockroachDB columns can have default values that are arbitrary SQL expressions, but Avro only supports static default values).

To ensure that the Avro schemas that CockroachDB publishes will work with the (undocumented and inconsistent) schema compatibility rules used by the Confluent schema registry, CockroachDB emits all fields in Avro as nullable unions. This ensures that Avro and Confluent consider the schemas to be both backward- and forward-compatible. Note that the original CockroachDB column definition is also included in the schema as a doc field, so it's still possible to distinguish between a `NOT NULL` CockroachDB column and a `NULL` CockroachDB column.

on second thought, that parenthetical I added about the schema compatibility rules is a bit gratuitous


v19.1/change-data-capture.md, line 146 at r3 (raw file):

{% include copy-clipboard.html %}
~~~ sql
> CREATE CHANGEFEED FOR TABLE name INTO 'schema://host:port';
~~~

nit: scheme


v19.1/change-data-capture.md, line 213 at r3 (raw file):

{{site.data.alerts.callout_info}}
Debugging is only available for enterprise changefeeds.

"debugging is only available" sounds a little strange. Maybe "This section only applies to enterprise changefeeds using Kafka"?


v19.1/change-data-capture.md, line 216 at r3 (raw file):

{{site.data.alerts.end}}

For changefeeds connected to Kafka, use log information to debug connection issues (i.e., `kafka: client has run out of available brokers to talk to (Is your cluster reachable?)`). Debug by looking for lines in the logs with `[kafka-producer]` in them:

Link 'log information' to a page explaining CockroachDB's log files (assuming we have one)
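
As a supplementary, hedged sketch (not part of the docs hunk above): a changefeed job's status and error message can also be inspected from SQL. The `job_type` filter assumes the standard `SHOW JOBS` output columns.

~~~ sql
-- List changefeed jobs with their status and most recent error, if any.
> SELECT job_id, status, error
    FROM [SHOW JOBS]
   WHERE job_type = 'CHANGEFEED';
~~~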


v19.1/change-data-capture.md, line 243 at r3 (raw file):

    {% include copy-clipboard.html %}
    ~~~ shell
    oach sql --url="postgresql://[email protected]:26257?sslmode=disable" --format=csv
    ~~~

what happened to the beginning of this line?


v19.1/change-data-capture.md, line 306 at r3 (raw file):

    ~~~

### Create a core changefeed in Avro

"in Avro" sounds odd to me; maybe "using the Avro output format" or something?


v19.1/change-data-capture.md, line 308 at r3 (raw file):

### Create a core changefeed in Avro

<span class="version-tag">New in v19.1:</span> In this example, you'll set up a core changefeed for a single-node cluster that emits [Avro](https://docs.confluent.io/current/schema-registry/docs/serializer-formatter.html#wire-format) records.

Add a quick explanation of what the Confluent stuff is for - like "The binary Avro encoding convention used by CockroachDB uses the Confluent Schema Registry to store Avro schemas"
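
For illustration, a hedged sketch of a core changefeed emitting Avro records through a Confluent Schema Registry. The registry address is a placeholder, and the option names assume the v19.1 syntax (`format = experimental_avro`, `confluent_schema_registry`).

~~~ sql
-- Core changefeed using Avro; schemas are registered with the Confluent Schema Registry.
> EXPERIMENTAL CHANGEFEED FOR office_dogs
    WITH format = experimental_avro, confluent_schema_registry = 'http://localhost:8081';
~~~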


v19.1/change-data-capture.md, line 752 at r3 (raw file):

{% include {{ page.version.version }}/misc/experimental-warning.md %}

<span class="version-tag">New in v19.1:</span> In this example, you'll set up a changefeed for a single-node cluster that is connected to an AWS sink. Note that you can set up changefeeds for any of [these cloud storages](create-changefeed.html#cloud-storage-sink).

nit: cloud storage providers (also maybe say AWS S3 instead of just AWS)


v19.1/change-data-capture.md, line 828 at r3 (raw file):

    {% include copy-clipboard.html %}
    ~~~ sql
    > CREATE CHANGEFEED FOR TABLE office_dogs INTO 'experimental-s3://test-s3encryption/test?AWS_ACCESS_KEY_ID=enter_key-here&AWS_SECRET_ACCESS_KEY=enter_key_here' with updated, resolved='10s';
    ~~~

'test-s3encryption' is a slightly confusing name, maybe just 'example-bucket-name'?


v19.1/create-changefeed.md, line 61 at r3 (raw file):

----------+-------+---------------
`topic_prefix` | [`STRING`](string.html) | Adds a prefix to all of the topic names.<br><br>For example, `CREATE CHANGEFEED FOR TABLE foo INTO 'kafka://...?topic_prefix=bar_'` would emit rows under the topic `bar_foo` instead of `foo`.
`tls_enabled=true` | [`BOOL`](bool.html) | If `true`, use a Transport Layer Security (TLS) connection. This can be used with a `ca_cert` (see below).

"If true, enable Transport Layer Security on the connection to Kafka"


v19.1/create-changefeed.md, line 63 at r3 (raw file):

`tls_enabled=true` | [`BOOL`](bool.html) | If `true`, use a Transport Layer Security (TLS) connection. This can be used with a `ca_cert` (see below).
`ca_cert` | [`STRING`](string.html) | The base64-encoded `ca_cert` file.<br><br>Note: To encode your `ca.cert`, run `base64 -w 0 ca.cert`.
`sasl_enabled` | [`BOOL`](bool.html) | If `true`, use Simple Authentication and Security Layer (SASL) to authenticate. This requires a `sasl_user` and `sasl_password` (see below).

specifically SASL/PLAIN (link to https://docs.confluent.io/current/kafka/authentication_sasl/authentication_sasl_plain.html)
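
Putting the parameters above together, a hedged sketch of a Kafka sink URI with TLS and SASL/PLAIN enabled. The broker host, certificate, and credentials are placeholders.

~~~ sql
-- Kafka sink secured with TLS and SASL/PLAIN; ca_cert is the base64-encoded ca.cert.
> CREATE CHANGEFEED FOR TABLE office_dogs
    INTO 'kafka://broker.example.com:9093?tls_enabled=true&ca_cert=<base64-encoded ca.cert>&sasl_enabled=true&sasl_user=<user>&sasl_password=<password>'
    WITH updated;
~~~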


v19.1/create-changefeed.md, line 69 at r3 (raw file):

#### Cloud storage sink

Example of a cloud storage sink (i.e., AWS) URI:

AWS S3


v19.1/create-changefeed.md, line 87 at r3 (raw file):

Option | Value | Description
-------|-------|------------
`updated` | N/A | Include updated timestamps with each row.<br><br>If a `cursor` is provided, the "updated" timestamps will match the [MVCC](../v19.1/architecture/storage-layer.html#mvcc) timestamps of the emitted rows, and there is no initial scan.. If a `cursor` is not provided, the changefeed will perform an initial scan (as of the time the changefeed was created), and the "updated" timestamp for each change record emitted in the initial scan will be the timestamp of the initial scan. Similarly, when a [backfill is performed for a schema change](change-data-capture.html#schema-changes-with-column-backfill), the "updated" timestamp is set to the first timestamp for when the new schema is valid.

nit: .. -> .


v19.1/create-changefeed.md, line 128 at r3 (raw file):

## Responses

The messages (i.e., keys and values) emitted to a Kafka topic are composed of the following:

this is specific to the envelope format specified by the user (the default format is 'wrapped' I believe, which produces this output)


v19.1/create-changefeed.md, line 130 at r3 (raw file):

The messages (i.e., keys and values) emitted to a Kafka topic are composed of the following:

- **Key**: Always composed of the table's `PRIMARY KEY` field (e.g., `[1]` or `{"id":1}`).

specifically, the key is an array of the primary key fields of the row


v19.1/create-changefeed.md, line 131 at r3 (raw file):

- **Key**: Always composed of the table's `PRIMARY KEY` field (e.g., `[1]` or `{"id":1}`).
- **Value**:

should specify that there are three possible level fields in the value of a record emitted to CDC:

  • after, which contains the state of the row after the update (or 'null' for deletes)
  • updated, which contains the updated timestamp
  • resolved, which is emitted for records representing resolved timestamps (these records won't include an after field since they only function as checkpoints)
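
To make the three fields concrete, a hedged sketch; the sample key/value lines in the comments are illustrative, not copied from the docs.

~~~ sql
> CREATE CHANGEFEED FOR TABLE office_dogs
    INTO 'kafka://localhost:9092'
    WITH updated, resolved;
-- An emitted row update might look like (illustrative):
--   key:   [1]
--   value: {"after": {"id": 1, "name": "Petee"}, "updated": "1536242855577149065.0000000000"}
-- A resolved-timestamp record carries no "after" field (illustrative):
--   value: {"resolved": "1536242856000000000.0000000000"}
~~~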

@lnhsingh (Contributor, Author) left a comment

Reviewable status: :shipit: complete! 0 of 0 LGTMs obtained (waiting on @Amruta-Ranade, @danhhz, and @rolandcrosby)


v19.1/change-data-capture.md, line 82 at r3 (raw file):

Previously, rolandcrosby (Roland Crosby) wrote…

on second thought, that parenthetical I added about the schema compatibility rules is a bit gratuitous

Removed


v19.1/change-data-capture.md, line 146 at r3 (raw file):

Previously, rolandcrosby (Roland Crosby) wrote…

nit: scheme

Done.


v19.1/change-data-capture.md, line 213 at r3 (raw file):

Previously, rolandcrosby (Roland Crosby) wrote…

"debugging is only available" sounds a little strange. Maybe "This section only applies to enterprise changefeeds using Kafka"?

On second thought, the callout seems redundant with the first sentence. Removing.


v19.1/change-data-capture.md, line 216 at r3 (raw file):

Previously, rolandcrosby (Roland Crosby) wrote…

Link 'log information' to a page explaining CockroachDB's log files (assuming we have one)

Done.


v19.1/change-data-capture.md, line 243 at r3 (raw file):

Previously, rolandcrosby (Roland Crosby) wrote…

what happened to the beginning of this line?

Weird. Fixed.


v19.1/change-data-capture.md, line 306 at r3 (raw file):

Previously, rolandcrosby (Roland Crosby) wrote…

"in Avro" sounds odd to me; maybe "using the Avro output format" or something?

Does "using Avro" make sense?


v19.1/change-data-capture.md, line 308 at r3 (raw file):

Previously, rolandcrosby (Roland Crosby) wrote…

Add a quick explanation of what the Confluent stuff is for - like "The binary Avro encoding convention used by CockroachDB uses the Confluent Schema Registry to store Avro schemas"

Edited


v19.1/change-data-capture.md, line 752 at r3 (raw file):

Previously, rolandcrosby (Roland Crosby) wrote…

nit: cloud storage providers (also maybe say AWS S3 instead of just AWS)

Done.


v19.1/change-data-capture.md, line 828 at r3 (raw file):

Previously, rolandcrosby (Roland Crosby) wrote…

'test-s3encryption' is a slightly confusing name, maybe just 'example-bucket-name'?

Done.


v19.1/create-changefeed.md, line 61 at r3 (raw file):

Previously, rolandcrosby (Roland Crosby) wrote…

"If true, enable Transport Layer Security on the connection to Kafka"

Done.


v19.1/create-changefeed.md, line 63 at r3 (raw file):

Previously, rolandcrosby (Roland Crosby) wrote…

specifically SASL/PLAIN (link to https://docs.confluent.io/current/kafka/authentication_sasl/authentication_sasl_plain.html)

Done.


v19.1/create-changefeed.md, line 69 at r3 (raw file):

Previously, rolandcrosby (Roland Crosby) wrote…

AWS S3

Done.


v19.1/create-changefeed.md, line 87 at r3 (raw file):

Previously, rolandcrosby (Roland Crosby) wrote…

nit: .. -> .

Done.


v19.1/create-changefeed.md, line 128 at r3 (raw file):

Previously, rolandcrosby (Roland Crosby) wrote…

this is specific to the envelope format specified by the user (the default format is 'wrapped' I believe, which produces this output)

Done.


v19.1/create-changefeed.md, line 130 at r3 (raw file):

Previously, rolandcrosby (Roland Crosby) wrote…

specifically, the key is an array of the primary key fields of the row

Done.


v19.1/create-changefeed.md, line 131 at r3 (raw file):

Previously, rolandcrosby (Roland Crosby) wrote…

should specify that there are three possible level fields in the value of a record emitted to CDC:

  • after, which contains the state of the row after the update (or 'null' for deletes)
  • updated, which contains the updated timestamp
  • resolved, which is emitted for records representing resolved timestamps (these records won't include an after field since they only function as checkpoints)

Done.

@danhhz (Contributor) left a comment

:lgtm: once roland is happy

Reviewable status: :shipit: complete! 1 of 0 LGTMs obtained (waiting on @Amruta-Ranade, @danhhz, @lhirata, and @rolandcrosby)


_includes/v19.1/cdc/core-url.md, line 2 at r8 (raw file):

{{site.data.alerts.callout_info}}
Because core changefeeds return results differently than other SQL statements, they require a dedicated database connection with specific settings around result buffering. In normal operation, CockroachDB improves performance by buffering results server-side before returning them to a client. Core changefeeds also have different cancellation behavior than other queries: they can only be canceled by closing the underlying connection or issuing a  [`CANCEL QUERY`](cancel-query.html) statement on a separate connection. Combined, these attributes of changefeeds mean that applications should explicitly create dedicated connections to consume changefeed data, instead of using a connection pool as most client drivers do by default.

If we're going to mention the results buffer, then in place of the sentence you deleted, we should mention that we automatically disable it for core changefeeds. I'm also okay just removing any mention of results buffering. Up to you
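
Relatedly, a hedged sketch of cancelling a core changefeed from a separate connection, per the quoted passage; the query ID shown is hypothetical.

~~~ sql
-- On the dedicated consuming connection:
> EXPERIMENTAL CHANGEFEED FOR office_dogs;

-- On a separate connection: find the changefeed's query ID, then cancel it.
> SHOW QUERIES;
> CANCEL QUERY '15f92c745fa69bd80000000000000001';  -- hypothetical query ID
~~~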


_includes/v19.1/sql/settings/settings.md, line 83 at r8 (raw file):

<tr><td><code>sql.defaults.experimental_vectorize</code></td><td>enumeration</td><td><code>0</code></td><td>default experimental_vectorize mode [off = 0, on = 1, always = 2]</td></tr>
<tr><td><code>sql.defaults.optimizer</code></td><td>enumeration</td><td><code>1</code></td><td>default cost-based optimizer mode [off = 0, on = 1, local = 2]</td></tr>
<tr><td><code>sql.defaults.results_buffer.size</code></td><td>byte size</td><td><code>16 KiB</code></td><td>default size of the buffer that accumulates results for a statement or a batch of statements before they are sent to the client. This can be overridden on an individual connection with the 'results_buffer_size' parameter. Note that auto-retries generally only happen while no results have been delivered to the client, so reducing this size can increase the number of retriable errors a client receives. On the other hand, increasing the buffer size can increase the delay until the client receives the first result row. Updating the setting only affects new connections. Setting to 0 disables any buffering.</td></tr>

This one is still true. May want to leave it.


v19.1/change-data-capture.md, line 82 at r3 (raw file):

Previously, lhirata wrote…

Removed

lol. I do think it's worth calling out (without the shade) that confluent schema registry has a different set of rules for backward and forward schema compatibility than avro does. this was surprising to me


v19.1/change-data-capture.md, line 15 at r8 (raw file):

The core feature of CDC is the [changefeed](create-changefeed.html). Changefeeds target a whitelist of tables, called the "watched rows". Every change to a watched row is emitted as a record in a configurable format (JSON or Avro) to a configurable sink ([Kafka](https://kafka.apache.org/)).

## Ordering guarantees

I gave an overview to roland once about how all these rules build up to some useful (and much easier to reason about) top-level invariants. We should document them at the top here and use that as context for all this stuff below. @rolandcrosby, do you have time to sync with lauren and go over that?

I'm happy letting this be a followup, just happened to think of it while reading the changes in this PR


v19.1/change-data-capture.md, line 80 at r8 (raw file):

When schema changes with column backfill (e.g., adding a column with a default, adding a computed column, adding a `NOT NULL` column, dropping a column) are made to watched rows, the changefeed will emit some duplicates during the backfill. When it finishes, CockroachDB outputs all watched rows using the new schema.

Rows that have been backfilled by a schema change are always re-emitted because Avro's default schema change functionality is not powerful enough to represent the schema changes that CockroachDB supports (e.g., CockroachDB columns can have default values that are arbitrary SQL expressions, but Avro only supports static default values).

"not powerful enough" feels too shade-y for my taste. can we rephrase?


v19.1/change-data-capture.md, line 80 at r8 (raw file):

When schema changes with column backfill (e.g., adding a column with a default, adding a computed column, adding a `NOT NULL` column, dropping a column) are made to watched rows, the changefeed will emit some duplicates during the backfill. When it finishes, CockroachDB outputs all watched rows using the new schema.

Rows that have been backfilled by a schema change are always re-emitted because Avro's default schema change functionality is not powerful enough to represent the schema changes that CockroachDB supports (e.g., CockroachDB columns can have default values that are arbitrary SQL expressions, but Avro only supports static default values).

the transition here to talking about avro feels abrupt
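
For example, a hedged sketch of a schema change that would trigger the column backfill described in the quoted passage (the column name is illustrative):

~~~ sql
-- Adding a column with a default value backfills existing rows, so the changefeed
-- emits some duplicates during the backfill and then all watched rows under the new schema.
> ALTER TABLE office_dogs ADD COLUMN likes_treats BOOL DEFAULT true;
~~~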


v19.1/create-changefeed.md, line 130 at r3 (raw file):

Previously, lhirata wrote…

Done.

Maybe [1] for json or {"id":1} for avro? I was confused when I first read this and only figured it out after I read the below.

Also nit: it's not an array in avro


v19.1/create-changefeed.md, line 67 at r8 (raw file):

`sasl_password` | [`STRING`](string.html) | Your SASL password.

#### Cloud storage sink

we should mention somewhere that cloud storage sink currently only works with format=json


v19.1/create-changefeed.md, line 90 at r8 (raw file):

`resolved` | [`INTERVAL`](interval.html) | Periodically emit resolved timestamps to the changefeed. Optionally, set a minimum duration between emitting resolved timestamps. If unspecified, all resolved timestamps are emitted.<br><br>Example: `resolved='10s'`
`envelope` | `key_only` / `wrapped` | Use `key_only` to emit only the key and no value, which is faster if you only want to know when the key changes.<br><br>Default: `envelope=wrapped`
`cursor` | [Timestamp](as-of-system-time.html#parameters)  | Emits any changes after the given timestamp, but does not output the current state of the table first. If `cursor` is not specified, the changefeed starts by doing an initial scan of all the watched rows and emits the current value, then moves to emitting any changes that happen after the scan.<br><br>When starting a changefeed at a specific `cursor`, the `cursor` cannot be before the configured garbage collection window (see [`gc.ttlseconds`](configure-replication-zones.html#replication-zone-variables)) for the table you're trying to follow; otherwise, the changefeed will error. By default, you cannot create a changefeed that starts more than 25 hours in the past.<br><br>`cursor` can be used to [start a new changefeed where a previous changefeed ended.](#start-a-new-changefeed-where-another-ended)<br><br>Example: `CURSOR=1536242855577149065.0000000000`

nit: "With default garbage collection settings, this means you cannot"


v19.1/create-changefeed.md, line 132 at r8 (raw file):

- **Key**: An array always composed of the row's `PRIMARY KEY` field(s) (e.g., `[1]` or `{"id":1}`).
- **Value**:
    - One of three possible level fields:

I think roland meant top-level : - )

@rolandcrosby left a comment

Reviewed 1 of 3 files at r6, 1 of 2 files at r8.
Reviewable status: :shipit: complete! 1 of 0 LGTMs obtained (waiting on @Amruta-Ranade, @danhhz, @lhirata, and @rolandcrosby)


v19.1/change-data-capture.md, line 306 at r3 (raw file):

Previously, lhirata wrote…

Does "using Avro" make sense?

yeah, I like that


v19.1/change-data-capture.md, line 752 at r3 (raw file):

Previously, lhirata wrote…

Done.

can you change the link text to "these cloud storage providers" too?


v19.1/change-data-capture.md, line 15 at r8 (raw file):

Previously, danhhz (Daniel Harrison) wrote…

I gave an overview to roland once about how all these rules build up to some useful (and much easier to reason about) top-level invariants. We should document them at the top here and use that as context for all this stuff below. @rolandcrosby, do you have time to sync with lauren and go over that?

I'm happy letting this be a followup, just happened to think of it while reading the changes in this PR

I was just talking to Lauren offline about providing some pseudocode for "how to correctly consume a topic and interpret changefeed messages" - that might also be a good place to talk about the invariants?


v19.1/create-changefeed.md, line 130 at r3 (raw file):

Previously, danhhz (Daniel Harrison) wrote…

Maybe [1] for json or {"id":1} for avro? I was confused when I first read this and only figured it out after I read the below.

Also nit: it's not an array in avro

d'oh, I forgot it was a record in Avro, I second Dan's suggestion


v19.1/create-changefeed.md, line 67 at r8 (raw file):

Previously, danhhz (Daniel Harrison) wrote…

we should mention somewhere that cloud storage sink currently only works with format=json

good catch, yes, should specifically say it only works with JSON and always emits newline-delimited json files


v19.1/create-changefeed.md, line 132 at r8 (raw file):

Previously, danhhz (Daniel Harrison) wrote…

I think roland meant top-level : - )

yup!

@lnhsingh (Contributor, Author) left a comment

Reviewable status: :shipit: complete! 1 of 0 LGTMs obtained (waiting on @Amruta-Ranade, @danhhz, and @rolandcrosby)


_includes/v19.1/cdc/core-url.md, line 2 at r8 (raw file):

Previously, danhhz (Daniel Harrison) wrote…

If we're going to mention the results buffer, then in place of the sentence you deleted, we should mention that we automatically disable it for core changefeeds. I'm also okay just removing any mention of results buffering. Up to you

Added.


_includes/v19.1/sql/settings/settings.md, line 83 at r8 (raw file):

Previously, danhhz (Daniel Harrison) wrote…

This one is still true. May want to leave it.

Added back.


v19.1/change-data-capture.md, line 82 at r3 (raw file):

Previously, danhhz (Daniel Harrison) wrote…

lol. I do think it's worth calling out (without the shade) that confluent schema registry has a different set of rules for backward and forward schema compatibility than avro does. this was surprising to me

Done.


v19.1/change-data-capture.md, line 306 at r3 (raw file):

Previously, rolandcrosby (Roland Crosby) wrote…

yeah, I like that

👍


v19.1/change-data-capture.md, line 752 at r3 (raw file):

Previously, rolandcrosby (Roland Crosby) wrote…

can you change the link text to "these cloud storage providers" too?

Done.


v19.1/change-data-capture.md, line 80 at r8 (raw file):

Previously, danhhz (Daniel Harrison) wrote…

"not powerful enough" feels too shade-y for my taste. can we rephrase?

I think I can just remove and combine with the above paragraph


v19.1/change-data-capture.md, line 80 at r8 (raw file):

Previously, danhhz (Daniel Harrison) wrote…

the transition here to talking about avro feels abrupt

I feel like I was trying to shoehorn this into the section. Created a new section for it.


v19.1/create-changefeed.md, line 130 at r3 (raw file):

Previously, rolandcrosby (Roland Crosby) wrote…

d'oh, I forgot it was a record in Avro, I second Dan's suggestion

Done.


v19.1/create-changefeed.md, line 67 at r8 (raw file):

Previously, rolandcrosby (Roland Crosby) wrote…

good catch, yes, should specifically say it only works with JSON and always emits newline-delimited json files

Done.


v19.1/create-changefeed.md, line 90 at r8 (raw file):

Previously, danhhz (Daniel Harrison) wrote…

nit: "With default garbage collection settings, this means you cannot"

Done.


v19.1/create-changefeed.md, line 132 at r8 (raw file):

Previously, rolandcrosby (Roland Crosby) wrote…

yup!

My bad! Done.

@Amruta-Ranade (Contributor) left a comment

@lhirata Awesome work! 🎉


Parameter | Value | Description
----------+-------+---------------
`topic_prefix` | [`STRING`](string.html) | Adds a prefix to all of the topic names.<br><br>For example, `CREATE CHANGEFEED FOR TABLE foo INTO 'kafka://...?topic_prefix=bar_'` would emit rows under the topic `bar_foo` instead of `foo`.
@Amruta-Ranade (Contributor) commented:

nit: "to all of the" > "to all"

- **Key**: An array always composed of the row's `PRIMARY KEY` field(s) (e.g., `[1]` for `JSON` or `{"id":1}` for Avro).
- **Value**:
- One of three possible top-level fields:
- `after`, which contains the state of the row after the update (or 'null' for `DELETE`s).
@Amruta-Ranade (Contributor) commented:

nit: 'null' > null?

@lnhsingh (Contributor, Author) left a comment

Reviewable status: :shipit: complete! 0 of 0 LGTMs obtained (and 1 stale) (waiting on @Amruta-Ranade, @danhhz, and @rolandcrosby)


v19.1/change-data-capture.md, line 15 at r8 (raw file):

Previously, rolandcrosby (Roland Crosby) wrote…

I was just talking to Lauren offline about providing some pseudocode for "how to correctly consume a topic and interpret changefeed messages" - that might also be a good place to talk about the invariants?

FYI, moved this into a separate issue: #4590


v19.1/create-changefeed.md, line 60 at r9 (raw file):

Previously, Amruta-Ranade (Amruta Ranade) wrote…

nit: "to all of the" > "to all"

Done.


v19.1/create-changefeed.md, line 139 at r9 (raw file):

Previously, Amruta-Ranade (Amruta Ranade) wrote…

nit: 'null' > null?

Good catch. Done.

@rolandcrosby left a comment

:lgtm:

Reviewed 3 of 4 files at r9, 1 of 1 files at r10.
Reviewable status: :shipit: complete! 1 of 0 LGTMs obtained (and 1 stale)

@lnhsingh lnhsingh requested a review from jseldess April 3, 2019 14:50
@jseldess (Contributor) left a comment

Excellent work, @lhirata. :lgtm_strong: as long as one of the reviewers actually tested the steps.

Reviewable status: :shipit: complete! 2 of 0 LGTMs obtained (and 1 stale) (waiting on @jseldess)

@lnhsingh lnhsingh merged commit 6bba1a9 into master Apr 3, 2019
@lnhsingh lnhsingh deleted the cdc branch April 3, 2019 16:56
Successfully merging this pull request may close these issues: Change Data Capture (CDC) Iteration 2

6 participants