
target-bigquery

target-bigquery is a BATCH-aware Singer target for BigQuery.

Built with the Meltano Target SDK.

Capabilities

  • about
  • stream-maps
  • schema-flattening

Settings

| Setting | Required | Default | Description |
| --- | --- | --- | --- |
| project_id | True | None | Google project ID |
| dataset | True | None | Dataset to load data into |
| location | False | None | Dataset location |
| table_prefix | False | None | Optional prefix to add to table names |
| batch_size | False | 100000 | Maximum size of batches when records are streamed in. BATCH messages are not affected by this property. |
| max_batch_age | False | 5.0 | Maximum time in minutes between state messages when records are streamed in. BATCH messages are not affected by this property. |
| add_record_metadata | False | 1 | Add Singer Data Capture (SDC) metadata to records |
| default_partition_column | False | None | Default partition column for all streams |
| truncate_before_load | False | 0 | Whether tables should be truncated before new data is loaded |
| append_only | False | 0 | Only append data; don't overwrite existing data |
| table_configs | False | None | Stream-specific configs, such as partition keys |
| stream_maps | False | None | Config object for the stream maps capability. For more information check out Stream Maps. |
| stream_map_config | False | None | User-defined config values to be used within map expressions. |
| flattening_enabled | False | None | 'True' to enable schema flattening and automatically expand nested properties. |
| flattening_max_depth | False | None | The max depth to flatten schemas. |
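
As a brief illustration of the stream maps capability, the following hypothetical snippet adds a derived property to a stream. The stream name animals and the property name_upper are made up for this example; see the Stream Maps documentation for the expression syntax:

config:
  stream_maps:
    ## Hypothetical stream name; expressions are evaluated per record.
    animals:
      name_upper: name.upper()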

A full list of supported settings and capabilities is available by running: target-bigquery --about

Utilizing partitioned tables in BigQuery

Partitioning your destination tables can, depending on your data, reduce the amount of data (and thus cost) that needs to be queried on each batch. The partition column must be a timestamp and its value must remain constant over time; a creation timestamp is usually a good choice.

To take advantage of this feature, use either the default_partition_column or the table_configs setting.

A full example Meltano config is as follows:

loaders:
- name: target-bigquery
  config:
    project_id: <PROJ-ID>
    location: <LOCATION>
    dataset: <DATASET>
    table_prefix: ${MELTANO_EXTRACT__LOAD_SCHEMA}_
    batch_size: 10000
    ## No partitioning by default
    # default_partition_column:
    table_configs:
    ## All hubspot streams have a createdAt property we can use.
    ## We are using the fact that all table names will be prefixed with the tap name
    ## thanks to the table_prefix setting above
    - table_prefix: tap_hubspot_
      partition_column: createdAt
    ## Here we specify a specific stream and what partition column is correct
    - table_name: tap_postgres_public-e_users
      partition_column: date_created

If neither table_prefix nor table_name matches, then default_partition_column is used as a fallback.

If the resulting partition_column is falsy, either None (the default) or an empty string, then no partitioning will be performed.

Also note that the target will not perform any ALTER TABLE operations. Only tables being created for the first time will be partitioned.
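
The resolution order described above can be summarized in code. This is an illustrative Python sketch, not the target's actual implementation; the function name and the in-order precedence between table_name and table_prefix matches are assumptions:

def resolve_partition_column(table_name, table_configs, default_partition_column=None):
    # Walk the stream-specific configs in order; the first table_name or
    # table_prefix match wins (ordering is an assumption for illustration).
    for cfg in table_configs or []:
        if cfg.get("table_name") == table_name:
            return cfg.get("partition_column") or None
        prefix = cfg.get("table_prefix")
        if prefix and table_name.startswith(prefix):
            return cfg.get("partition_column") or None
    # Fall back to the global default; None or "" means no partitioning.
    return default_partition_column or None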

Configure using environment variables

This Singer target will automatically import any environment variables within the working directory's .env file if --config=ENV is provided, such that config values will be considered if a matching environment variable is set either in the terminal context or in the .env file.
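
For example, following the Meltano SDK convention of upper-cased setting names prefixed with the target name, a .env file might contain the following (the values are placeholders):

TARGET_BIGQUERY_PROJECT_ID=my-gcp-project
TARGET_BIGQUERY_DATASET=raw
TARGET_BIGQUERY_LOCATION=EU

Running target-bigquery --config=ENV will then pick these values up.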

Google authentication

Google authentication is done by providing the path to a service account JSON file in the GOOGLE_APPLICATION_CREDENTIALS environment variable.
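
For example (the path is a placeholder):

export GOOGLE_APPLICATION_CREDENTIALS=/path/to/service-account.json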

Usage

You can easily run target-bigquery by itself or in a pipeline using Meltano.

Executing the Target Directly

target-bigquery --version
target-bigquery --help
# Test using the "Carbon Intensity" sample:
tap-carbon-intensity | target-bigquery --config /path/to/target-bigquery-config.json
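
A minimal config file for the invocation above might look like this; project_id and dataset are the only required settings, and the values are placeholders:

{
  "project_id": "my-gcp-project",
  "dataset": "raw"
}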

Developer Resources

Follow these instructions to contribute to this project.

Initialize your Development Environment

pipx install poetry
poetry install

Create and Run Tests

Create tests within the target_bigquery/tests subfolder and then run:

poetry run pytest

You can also test the target-bigquery CLI interface directly using poetry run:

poetry run target-bigquery --help

Testing with Meltano

Note: This target will work in any Singer environment and does not require Meltano. Examples here are for convenience and to streamline end-to-end orchestration scenarios.

Install Meltano (if you haven't already) and any needed plugins:

# Install meltano
pipx install meltano
# Initialize meltano within this directory
cd target-bigquery
meltano install

Now you can test and orchestrate using Meltano:

# Test invocation:
meltano invoke target-bigquery --version
# OR run a test `elt` pipeline with the Carbon Intensity sample tap:
meltano elt tap-carbon-intensity target-bigquery

SDK Dev Guide

See the dev guide for more instructions on how to use the Meltano SDK to develop your own Singer taps and targets.
