Database CI/CD and Schema Migration with Snowflake and GitHub

Estimated: 30 mins
Database CI/CD and Schema Migration with Snowflake and GitHub

A series of articles about Database CI/CD and Schema Migration with Snowflake


Overview

In the last article Database CI/CD and Schema Migration with Snowflake, you have tried UI workflow in Bytebase.

This tutorial will bring your Snowflake schema change to the next level by introducing the GitOps workflow, where you commit the schema change script to the GitHub repository, which will in turn trigger the schema deployment pipeline in Bytebase.

You can use Bytebase's Community Plan to finish the tutorial.

Prerequisites

Before you start this tutorial, make sure:

Step 1 - Run Bytebase in Docker and set the External URL generated by ngrok

ngrok is a reverse proxy tunnel, and in our case, we need it for a public network address in order to receive webhooks from VCS. ngrok we used here is for demonstration purposes. For production use, we recommend using Caddy.

ngrok-reverse-proxy

  1. Run Bytebase in Docker with the following command:

    docker run --rm --init \
      --name bytebase \
      --publish 8080:8080 --pull always \
      --volume ~/.bytebase/data:/var/opt/bytebase \
      bytebase/bytebase:3.3.0
  2. Bytebase is running successfully in Docker, and you can visit it via localhost:8080. Register an admin account and it will be granted the workspace admin role automatically.

  3. Login to ngrok Dashboard and complete the Getting Started steps to install and configure. If you want to use the same domain each time you launch ngrok, go to Cloud Edge > Domains, where you'll find the domain <<YOURS>>.ngrok-free.app linked to your account.

  4. Run the ngrok command ngrok http --domain=<<YOURS>>.ngrok-free.app 8080 to start ngrok with your specific domain, and you will see the output displayed below:

    terminal-ngrok

  5. Log in Bytebase and click the gear icon (Settings) on the top right. Click General under Workspace. Paste <<YOURS>>.ngrok-free.app as External URL under Network section and click Update.

    external-url

  6. Now you can access Bytebase via <<YOURS>>.ngrok-free.app.

Step 2 - Find your Snowflake account in Bytebase

  1. Visit Bytebase Console through the browser via your ngrok URL. Log in using your account created from the previous tutorial.

  2. Create one or two new databases on your Snowflake instances for different environments, refer to previous tutorial if you need help. home

Step 3 - Connect Bytebase with GitHub.com

  1. Go to Bytebase homepage, and click Integration > GitOps on the left sidebar. Choose GitHub.com as Git provider. What we need is a github personal access token. bb-gitops-no-access-token

  2. Go to your GitHub account. Click your avatar and then click Settings on the menu. Click Developer settings on the left sidebar, and then click Personal access tokens > Fine-grained tokens. gh-fine-grained-tokens

  3. Click Generate new token, fill in the fields and check the scopes according to the description on Bytebase. Click Generate token.

  4. Copy the token and paste it back into Bytebase Integration > GitOps. Click Confirm and add. bb-gitops-access-token

Step 4 - Enable GitOps workflow with Snowflake

  1. Go to the project Sample Project, click Integration > GitOps. Click Add Enable GitOps connector. bb-project-gitops-add

  2. Choose GitHub.com - the provider you just added. It will display all the repositories you can manipulate. Choose test-bb-gitops. bb-project-select-repo

  3. Keep the default setting, and click Finish. Pay attention to Database Group, which is the database group that the schema change will be applied to. With Community Plan, the changes will automatically affect all databases in the project. With Enterprise Plan, you'll have the option to specify the target database group. bb-project-gitops-configure

Step 5 - Change schema for Snowflake by pushing SQL schema change files to GitHub

  1. In your GitHub repository test-bb-gitops, create a folder bytebase, then create an sql file 202405111600_create_t1.sql.

    Paste the sql script in it.

    CREATE SCHEMA DEMO;
    CREATE TABLE T1
    (
       "id" INTEGER NOT NULL
    );
  2. Create a new branch for this commit and start a pull request. Click Merge pull request to merge the new branch into the main branch. gh-branch-t1

  3. There will be a comment saying there's a rollout in Bytebase. Click the link to the issue page, you’ll see

    1. The issue is created via GitHub.com, there's a link to the GitHub commit.

    2. The SQL is exactly the one we have committed to the GitHub repository.

    3. The SQL has passed the automatic task checks and rollout automatically.

    4. Since there're two databases in the project, Bytebase creates a 2-staged pipeline to roll out the change sequentially.

      bb-issue-done

Summary and Next

Now you have tried out GitOps workflow, which will store your database schema in GitHub and trigger the change upon committing the change to the repository via Pull Request, to bring your database change workflow to the next level of Database DevOps - Database as Code.

If the built-in workflow is not suitable, you can opt to Bytebase API to fully customize the workflow to integrate with your CI pipeline. Automating Database Schema Change workflow Using GitHub Actions is an example.

Edit this page on GitHub

Subscribe to Newsletter

By subscribing, you agree with Bytebase's Terms of Service and Privacy Policy.