This is a series of articles about Database Change Management with ClickHouse.
This is a series of articles about Database Change Management with ClickHouse.
ClickHouse is a fast open-source column-oriented database management system that allows generating analytical data reports in real-time using SQL queries.
This tutorial will guide you step-by-step to set up database change management for ClickHouse (Cloud and self-managed) in Bytebase. With Bytebase, a team can have a formalized review and rollout process to make ClickHouse schema change and data change.
You’ll have a GUI and the full change history. You can use Bytebase free version to finish the tutorial.
There is also a bonus section about drift detection for those advanced users if needed.
Before you start this tutorial, make sure:
docker run --init \
--name bytebase \
--platform linux/amd64 \
--restart always \
--publish 5678:8080 \
--health-cmd "curl --fail http://localhost:5678/healthz || exit 1" \
--health-interval 5m \
--health-timeout 60s \
--volume ~/.bytebase/data:/var/opt/bytebase \
bytebase/bytebase:1.13.0 \
--data /var/opt/bytebase \
--port 8080
Bytebase is running successfully in Docker, and you can visit it vialocalhost:5678
.
Visit localhost:5678 in your browser. Register the first admin account which will be granted Workspace Owner
.
In Bytebase, an Instance could be your on-premises MySQL instance, an AWS RDS instance etc, in this tutorial, a ClickHouse Cloud account or a self-managed instance.
Visitlocalhost:5678
and log in asWorkspace Owner
.
ClickAdd Instance.
Add aClickHouse instanceand clickCreate.
Pay attention to some fields:
Environment: chooseTest
, if you chooseProd
, issues will wait for approval by default. In this tutorial, we try to keep it simple. However, it’s all configurable.
If you use Cloud version. Go to your ClickHouse Cloud account, and clickView connection string.
Host or SocketandPortare in the grey box as parameters.
UsernameandPasswordare generated and stored inclickhouse_credentials.txt
while you registered the ClickHouse Cloud account.
Be sure to allow access to this service fromAnywhere
.
ChooseCA Certificate
for SSL Connection, if you use macOS, open the file/etc/ssl/cert.pem
, copy the content as a whole then paste it into theCA Certificatefield box. ClickTest Connectionto verify it’s working.
In Bytebase,Projectis the container to group logically relatedDatabases,IssuesandUserstogether, which is similar to the project concept in other dev tools such as Jira, GitLab. So before you deal with the database, a project must be created.
After the instance is created, clickProjectson the top bar.
ClickNew Projectto create a new projectTestClickHouse
, key isTCH
, mode isstandard
. ClickCreate.
In Bytebase, aDatabaseis the one created by 'CREATE DATABASE xxx'. A database always belongs to a singleProject.Issuerepresents a specific collaboration activity between Developer and DBA such as creating a database, altering a schema. It's similar to the issue concept in other issue management tools.
After the project is created. ClickNew DBon the project top bar.
Fill the form withName-db_demo
,Environment-Test
, andInstance-ClickHouse instance
. ClickCreate.
It will create an issue “CREATE DATABASE ….” automatically. Because it’s forTest
environment, the issue will run without waiting for your approval by default. ClickResolve, and the issue isDone
. The database is created.
Go back to the home page by clickingHomeon the left sidebar. On the home page, you can see the project, the database, and the issue you just resolved.
In Step 4, you created an issue to create a database via UI workflow and then executed it. Let’s try to create another issue to alter that database.
Go to projectTestClickHouse
, and clickAlter Schema.
Choosedb_demo
and clickNext. It could generate a pipeline if you have different databases for different environments.
Input SQL as follows, and clickCreate.
CREATE TABLE
t1 (id UInt64, name String) ENGINE = MergeTree
ORDER BY id;
Bytebase will do some basic checks and then execute the SQL. Since it’s forTest
environment, the issue is automatically approved by default. ClickResolve issue, and the issue status will becomeDone
.
On the issue page, clickView change. You will see diff for the change.
You can also go toChange Historyunder the project to view the full history. Or go into a specific database to view its history.
This section requires you to haveEnterprise Plan(you can start 14 days trial directly in the product without credit card).
Now you can see the full change history ofdb_demo
. However, what isEstablish new baseline? When should it be used?
By adopting Bytebase, we expect teams to use Bytebase exclusively for all schema changes. Meanwhile, if someone has made ClickHouse schema change out side of Bytebase, obviously Bytebase won’t know it. And because Bytebase has recorded its own copy of schema, when Bytebase compares that with the live schema having that out-of-band schema change, it will notice a discrepancy and surface a schema drift anomaly. If that change is intended, then you should baseline the schema state again to reconcile.
In this section, you’ll be guided through this process.
age
there. Make sure the new column is added.ALTER TABLE t1 ADD COLUMN age UInt8;
database db_demo
Anomaly Center
ClickView diff, you will see the exact drift.
Go todb_demo
>Change Historyand clickEstablish new baselineto reconcile the schema.
It will create a baseline issue. ClickResolveto mark it done.
Go back todb_demo
or Anomaly Center, and you will find the drift is gone.
Now you have connected ClickHouse with Bytebase, and tried out the UI workflow to do schema change. Bytebase will record the full change history for you. WithEnterprise Plan, you can even have drift detection.
In the next article, you’ll try out GitOps workflow, which will store your ClickHouse schema in GitHub and trigger the change upon committing the change to the repository, to bring your ClickHouse change workflow to the next level, akaDatabase DevOps- Database as Code .