Supabase-Kubernetes Roadmap: Feature Prioritization and Enhancements #53

arpagon · 2024-04-12T16:02:10Z

This project is in its early stages. Let's use this issue to brainstorm and outline the essential components of our Supabase-Kubernetes Helm chart. Here's what we can discuss:

Must-have components: Which Supabase services are non-negotiable for the next release?
Configuration flexibility: What key parameters should users be able to customize during deployment?
Basic architecture: Outline a preliminary structure for the chart's resources and dependencies. (biname, other postgres charts...)

Additionally, we'd love your input on:

Propose new features: What functionalities would you like to see included in the chart?
Testing: How can we collaboratively test the chart during development?
Chart release methods: What are the preferred methods for releasing new versions of the chart?

arpagon · 2024-04-12T16:11:46Z

@AntonOfTheWoods, @koryonik, @heresandyboy, and @drpsyko101 - We'd love your participation in shaping the roadmap for the Supabase-Kubernetes Helm chart! Your knowledge and experience would be invaluable as we outline its future.

@drpsyko101, a special thank you for your work on pull request #48. Your work on the Helm chart's is a solid foundation to continue building upon. This is an essential step in making the Supabase-Kubernetes Helm chart a valuable tool for the community.

drpsyko101 · 2024-04-13T11:58:37Z

AntonOfTheWoods · 2024-04-14T01:44:12Z

@drpsyko101 I think that is a great checklist, however I would challenge the multi-node placement. That should be prioritised first and should be done before any other work is really even planned because otherwise the chart is only able to deploy toy deployments.

Kubernetes introduces a massive amount of complexity over single-node systems like plain compose but it brings phenomenal benefits - mainly because you can scale from a single node on a dev or CI/CD box to 1000-node clusters with relative ease. That is precisely what helm charts help you do in an elegant and maintainable way. That also brings in significant complexity over compose though...

I think it is very important to make it clear whether this chart is meant to be for (hobbyist/dev) single-node (minikube, microk8s) clusters or whether it will be a chart usable for large-scale, high-performance clusters. The current postgresql defaults to a deployment that doesn't have any sort of persistence. That means if you stop-restart the cluster the data disappears. This is exactly the sort of thing that defaults to something sane with charts like bitnami (or the operators you mention). Writing charts that scale and do what reasonably sized deployments need them to takes a lot of time, effort, and expertise and reinventing the wheel needs to be done only if you know you have the time and expertise to do better! :-)

drpsyko101 · 2024-04-14T10:35:41Z

@AntonOfTheWoods I understand your concerns about putting the multi-node support in the short term goals. But scaling up services other than vector also mandates high-availability support. This is especially difficult to set postgres and storage/minio to StatefulSet due to their complex replication setup, hence placing them in the long term goals.

However, it is a different story if the user uses external HA postgres and minio. Then it is viable to set vector to use DaemonSet setup as you've mentioned above. From what I can see, there shouldn't be many changes to other resources to do so.

bartekus · 2024-04-15T03:20:12Z

As a consumer of this setup (specifically the revision that @drpsyko101 recently contributed) I would reckon that the setup that Supabase provide (docker-compose based) and which has been replicated for cloud providers like Digital Ocean fill the hobby/small segment requirements quite well. As to Kubernetes setup, I would venture to assume that nobody is going to use anything in production that has Postgres as fundamental requirements and does not provide HA variant as the general target. Even out DevOps team was ok with single-node for initial trial, but flagged lack of HA Postgres as essentially compliance killer that is going to prevent the setup from ever being production grade approved and used beyond the scope of POC/MVP development. Helm charts is what our dev team uses so no complains there. Just my 2 cents, and thank you for all your continuous contributions and amazing work! You all rock!

24601 · 2024-04-17T01:06:39Z

You are definitely right. We use Supabase in production, self-hosted, on k8s. I think (and I am guessing) that the intent of the chart writers was not that people would use that PG instance provided, but that users (like us) would drop in our appropriate PG solution as necessary.

Now, that was not very easy, and I think you have a point that could be made much, much easier. It was a lot of chart surgery for us to drop in StackGres in lieu.

We are going to refactor our changes with some lessons learned and will PR them back at some point.

As a consumer of this setup (specifically the revision that @drpsyko101 recently contributed) I would reckon that the setup that Supabase provide (docker-compose based) and which has been replicated for cloud providers like Digital Ocean fill the hobby/small segment requirements quite well. As to Kubernetes setup, I would venture to assume that nobody is going to use anything in production that has Postgres as fundamental requirements and does not provide HA variant as the general target. Even out DevOps team was ok with single-node for initial trial, but flagged lack of HA Postgres as essentially compliance killer that is going to prevent the setup from ever being production grade approved and used beyond the scope of POC/MVP development. Helm charts is what our dev team uses so no complains there. Just my 2 cents, and thank you for all your continuous contributions and amazing work! You all rock!

AntonOfTheWoods · 2024-04-17T01:21:03Z

@AntonOfTheWoods I understand your concerns about putting the multi-node support in the short term goals. But scaling up services other than vector also mandates high-availability support. This is especially difficult to set postgres and storage/minio to StatefulSet due to their complex replication setup, hence placing them in the long term goals.

However, it is a different story if the user uses external HA postgres and minio. Then it is viable to set vector to use DaemonSet setup as you've mentioned above. From what I can see, there shouldn't be many changes to other resources to do so.

I think you are over-complicating things. HA doesn't have a specific definition that means anything useful. Something that calls a support engineer who then clicks a button that reloads from a backup on a new server is "HA" in many organisations. "Highly" can mean any number of 9s and you can take any number of factors into account - or not. How many separate network providers do you need before you consider your setup "HA"?

Now, that was not very easy, and I think you have a point that could be made much, much easier. It was a lot of chart surgery for us to drop in StackGres in lieu.

This is why I think it's just silly not to use bitnami where it makes sense. They have an excellent and very widely used system for making this very easy, including with proper secrets management. Their system has templates and formalisms for doing all this that could simply be copy/pasted. While they do have an "HA" version of a postgres chart, I didn't have much success with it but once you use their abstractions, it is very clear to everyone (who has very likely seen/used a bitnami chart before) how to swap out without having to spend a few hours making sure you are changing the right things in the right places.

drpsyko101 · 2024-04-17T10:41:54Z

If you're familiar with bitnami images, I'd reckon you've looked at their postgresql-ha. PostgreSQL, like many other databases, needs some sort of replication system at scale. Either by using master-standby, master-read, sharding nodes, etc. to ensure that the data is consistent across nodes in the event of node/container failure. In the case of bitnami, it uses pgpool to allocate master-read containers.

While they do have an "HA" version of a Postgres chart, I didn't have much success with it...

This is exactly why it takes time to implement multi-node support.

As for the overall chart syntax, it can be slowly improved over time. Making several breaking changes without any care for the existing users is bad for the community, no?

AntonOfTheWoods · 2024-04-19T05:44:20Z

As for the overall chart syntax, it can be slowly improved over time. Making several breaking changes without any care for the existing users is bad for the community, no?

I personally think that delivering a db module that is based on a deployment and has autoscaling clearly available in the template and values - even though what would happen if autoscaling did kick in is anyone's guess - is very much "without any care for the existing users".

In any case, I have been working on extending the bitnami supabase chart (recently updated with recent images for the supported modules) with the missing modules and it looks like it is going to be successful. That's the beauty of open source!

This is exactly why it takes time to implement multi-node support.

And exactly why I am going to put my trust in bitnami. Honestly, there are so many horrible bugs in this chart it is a danger to the community. Unless you there is a massive "THIS IS NOT MULTI-NODE COMPATIBLE, AND YOU CAN'T USE THE PROVIDED postgresql OR minio" at the top of the readme it's going to waste many people's time. Anyone who wants a single-node option would be a fool not to simply use the upstream-provided compose!

LinuxSuRen · 2024-04-19T08:32:38Z

Please consider having an OCI helm chart. And create an early version of the early adoption users.

Thank @drpsyko101 for letting me know this thread. See also #56 (comment)

drpsyko101 · 2024-04-19T08:39:47Z

@AntonOfTheWoods I think we aren't on the same page here. Both of us want the chart to be multi-node compliant. But as I've said above, I'm okay starting with external db & S3 for multi-node deployment. As the chart progresses, we can then implement our own HA solutions to fit the needs of some users who require self-contained charts.

Honestly, there are so many horrible bugs in this chart it is a danger to the community.

Not gonna deny this, it is no different from vanilla helm create. I sincerely hope more PR will be added to address that without many breaking changes.

We're still in version 0.1. There is a lot of room for improvement to make this chart better. Your contribution is very much appreciated!

bartekus · 2024-04-19T18:00:59Z

The good news is that with Supabase going GA bitnami updated their supabase helm charts (this was done just 4 days ago).

AntonOfTheWoods · 2024-04-29T13:50:32Z

If anyone is interested in a bitnami-oriented take on all this, have a look at https://github.com/supafull/helm-charts/ . It still has a lot of rough edges (and not everything has been tested, like the imgproxy, etc.) but it has mostly the same level of feature support as this chart now does, but it uses bitnami charts where possible, and the remaining stuff (analytics, functions and imgproxy) was heavily inspired by the bitnami way. That means it:

uses the bitnami supabase chart as a dependency for the main features
uses the bitnami minio chart as a dependency for the in-cluster s3-compatible storage
uses the bitnami postgresql chart to install a completely separate postgres instance for the analyticsdb (because spraying a firehose of logs at your main DB is just silly...)
uses the upstream vector chart. This chart is not very well written (though written by the datadog guys by the looks of it...) but it at least uses a daemonset for the agent. It doesn't currently allow for their more advanced architectures but the agent-only setup should be a good start.

There isn't a lot of documentation yet but I'm going to be putting it through it's paces for a demo/example project using react-admin + electric-sql + supabase (before trying to migrate another real project to this), and (after my upcoming 5-day May-day break) will be making sure it all works and giving it a damned good documenting (though I might not bother with testing AWS S3 or Google BigQuery unless I get some stars!).

Please let the feedback flow freely!

heresandyboy · 2024-11-08T11:50:22Z

You are definitely right. We use Supabase in production, self-hosted, on k8s. I think (and I am guessing) that the intent of the chart writers was not that people would use that PG instance provided, but that users (like us) would drop in our appropriate PG solution as necessary.

Now, that was not very easy, and I think you have a point that could be made much, much easier. It was a lot of chart surgery for us to drop in StackGres in lieu.

Hi, it has been a long while since I gave up on running supabase in k8s in production.
I want to revisit it for a project I am working on, in particular I am very interested in how to setup StackGres as the postgres database for supabase. @24601 you had mentioned above that you had this working are were thinking of sharing. It would really help if you have anything to share.

I'd be happy to run through it all and pull out whatever we can contribute here, to give us a decent option for a compatible external database for this chart with full instructions.

arpagon added the enhancement New feature or request label Apr 12, 2024

drpsyko101 mentioned this issue Apr 13, 2024

Add support for existing secret references #54

Merged

drpsyko101 mentioned this issue Apr 16, 2024

Add GitHub Action for chart release #55

Draft

4 tasks

drpsyko101 mentioned this issue Apr 19, 2024

Request to add OCI helm registry support #56

Open

teocns mentioned this issue Nov 16, 2024

Multi node roadmap features #85

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Supabase-Kubernetes Roadmap: Feature Prioritization and Enhancements #53

Supabase-Kubernetes Roadmap: Feature Prioritization and Enhancements #53

arpagon commented Apr 12, 2024

arpagon commented Apr 12, 2024

drpsyko101 commented Apr 13, 2024 •

edited

Loading

AntonOfTheWoods commented Apr 14, 2024

drpsyko101 commented Apr 14, 2024

bartekus commented Apr 15, 2024

24601 commented Apr 17, 2024

AntonOfTheWoods commented Apr 17, 2024

drpsyko101 commented Apr 17, 2024

AntonOfTheWoods commented Apr 19, 2024 •

edited

Loading

LinuxSuRen commented Apr 19, 2024 •

edited

Loading

drpsyko101 commented Apr 19, 2024

bartekus commented Apr 19, 2024

AntonOfTheWoods commented Apr 29, 2024 •

edited

Loading

heresandyboy commented Nov 8, 2024

Supabase-Kubernetes Roadmap: Feature Prioritization and Enhancements #53

Supabase-Kubernetes Roadmap: Feature Prioritization and Enhancements #53

Comments

arpagon commented Apr 12, 2024

arpagon commented Apr 12, 2024

drpsyko101 commented Apr 13, 2024 • edited Loading

AntonOfTheWoods commented Apr 14, 2024

drpsyko101 commented Apr 14, 2024

bartekus commented Apr 15, 2024

24601 commented Apr 17, 2024

AntonOfTheWoods commented Apr 17, 2024

drpsyko101 commented Apr 17, 2024

AntonOfTheWoods commented Apr 19, 2024 • edited Loading

LinuxSuRen commented Apr 19, 2024 • edited Loading

drpsyko101 commented Apr 19, 2024

bartekus commented Apr 19, 2024

AntonOfTheWoods commented Apr 29, 2024 • edited Loading

heresandyboy commented Nov 8, 2024

drpsyko101 commented Apr 13, 2024 •

edited

Loading

AntonOfTheWoods commented Apr 19, 2024 •

edited

Loading

LinuxSuRen commented Apr 19, 2024 •

edited

Loading

AntonOfTheWoods commented Apr 29, 2024 •

edited

Loading