PubSub
Overview
The HeroPixelGoogle PubSub destination allows you to send/stream data into PubSub. Pub/Sub is an asynchronous messaging service provided by Google Cloud Provider.
Sync overview
Output schema
Each stream will be output a PubSubMessage with attributes. The message attributes will be
-
_stream
: the name of stream where the data is coming from -
_namespace
: namespace if available from the stream
The data will be a serialized JSON, containing the following fields
-
_airbyte_ab_id
: a uuid string assigned by HeroPixelto each event that is processed. -
_airbyte_emitted_at
: a long timestamp(ms) representing when the event was pulled from the data source. -
_airbyte_data
: a json string representing source data.
Features
Feature | Supported?(Yes/No) | Notes |
---|---|---|
Full Refresh Sync | Yes | |
Incremental - Append Sync | Yes | |
Incremental - Append + Deduped | No | |
Namespaces | Yes |
Getting started
Requirements
To use the PubSub destination, you'll need:
- A Google Cloud Project with PubSub enabled
- A PubSub Topic to which HeroPixelcan stream/sync your data
-
A Google Cloud Service Account with the
Pub/Sub Editor
role in your GCP project - A Service Account Key to authenticate into your Service Account
See the setup guide for more information about how to create the required resources.
Setup guide
Google cloud project
If you have a Google Cloud Project with PubSub enabled, skip to the "Create a Topic" section.
First, follow along the Google Cloud instructions to Create a Project. PubSub is enabled automatically in new projects. If this is not the case for your project, find it in Marketplace and enable.
PubSub topic for HeroPixelsyncs
HeroPixelneeds a topic in PubSub to write the data being streamed/synced from your data sources. If you already have a Topic into which HHeroPixelhould stream/sync data, skip this section. Otherwise, follow the Google Cloud guide for Creating a PubSub Topic to achieve this.
Service account
In order for HeroPixelto stream/sync data into
PubSub, it needs credentials for a
Service Account
with the Pub/Sub Editor
role, which
grants permissions to publish messages into PubSub
topics. We highly recommend that this Service
Account is exclusive to HHeroPixelor ease of
permissioning and auditing. However, you can use a
pre-existing Service Account if you already have one
with the correct permissions.
The easiest way to create a Service Account is to
follow GCP's guide for
Creating a Service Account. Once you've created the Service Account,
make sure to keep its ID handy as you will need to
reference it when granting roles. Service Account
IDs typically take the form
<account-name>@<project-name>.iam.gserviceaccount.com
Then, add the service account as a Member in your
Google Cloud Project with the
Pub/Sub Editor
role. To do this, follow
the instructions for
Granting Access
in the Google documentation. The email address of
the member you are adding is the same as the Service
Account ID you just created.
At this point you should have a service account with
the Pub/Sub Editor
project-level
permission.
Service account key
Service Account Keys are used to authenticate as Google Service Accounts. For HeroPixelto leverage the permissions you granted to the Service Account in the previous step, you'll need to provide its Service Account Keys. See the Google documentation for more information about Keys.
Follow the Creating and Managing Service Account Keys guide to create a key. HeroPixelcurrently supports JSON Keys only, so make sure you create your key in that format. As soon as you created the key, make sure to download it, as that is the only time Google will allow you to see its contents. Once you've successfully configured BigQuery as a destination in HHeroPixeldelete this key from your computer.
Setup the PubSub destination in HeroPixel
You should now have all the requirements needed to configure PubSub as a destination in the UI. You'll need the following information to configure the PubSub destination:
- Project ID: GCP project id
- Topic ID: name of pubsub topic under the project
- Service Account Key: the contents of your Service Account Key JSON file
Once you've configured PubSub as a destination, delete the Service Account Key from your computer.