Mini Cloud Platform β Bare-Metal Infrastructure
Private datacenter-equivalent infrastructure β built from scratch with MAAS, k3s, ArgoCD, and GitOps.
What This Project Isβ
This documentation covers a complete bare-metal infrastructure built locally using MAAS (Metal as a Service), provisioning a 3-node cluster ready for Kubernetes and production workloads.
Equivalent to: AWS EC2 + VPC + Auto Provisioning β but local.
Infrastructure at a Glanceβ
| Node | IP | Role | Hardware |
|---|---|---|---|
| set-hog | 10.0.0.2 | Control Plane | ThinkPad T15 Gen 1 |
| fast-skunk | 10.0.0.4 | Worker | ThinkPad T490 |
| fast-heron | 10.0.0.7 | Worker | ThinkPad T490 |
MAAS Controller: Ubuntu + dual NIC (WiFi β internet, Ethernet β 10.0.0.1)
Complete Roadmapβ
Each phase builds directly on the previous one β nothing requires something that hasn't been set up yet.
| Phase | Topic | Key Technology | Status |
|---|---|---|---|
| 0 | MAAS + 3-node provisioning | MAAS, PXE, cloud-init | β Done |
| 1 | Kubernetes cluster | k3s | π Next |
| 2 | kubectl local access | kubeconfig | π |
| 3 | Remote access from anywhere | Tailscale, Cloudflare Tunnel, Homer | π |
| 4 | Load balancer IPs on bare-metal | MetalLB | π |
| 5 | Persistent storage | Longhorn, NFS | π |
| 6 | Expose apps to the network | NGINX Ingress | π |
| 7 | Private container registry | Harbor + Trivy | π |
| 8 | Cluster monitoring | Prometheus, Grafana | π |
| 9 | First real workload | kubectl, containerd | π |
| 10 | Infrastructure automation | Ansible | π |
| 11 | Infrastructure as Code | Terraform / OpenTofu, Crossplane | π |
| 12 | GitOps deployment | ArgoCD | π |
| 13 | CI/CD pipelines | GitLab / Gitea | π |
| 14 | Backup & disaster recovery | Velero, etcd snapshots | π |
| 15 | Security hardening | Vault, cert-manager, RBAC | π |
| 16 | Automation & workflows | n8n, Temporal, Apache Airflow | π |
| 17 | Event-driven & autoscaling | KEDA, NATS | π |
| 18 | Developer platform | Backstage | π |
| 19 | AI / ML platform | Ollama, MLflow, Kubeflow | π |
| 20 | Reliability & chaos testing | Chaos Mesh | π |
| 21 | Advanced observability | Loki, Jaeger, Alertmanager | π |
| 22 | eBPF networking (advanced) | Cilium, Hubble | π |
| β | Data Layer | Kafka/Redpanda, ClickHouse, dbt, Superset, OpenMetadata | π |
| β | Security Layer | Keycloak, OPA/Gatekeeper, Falco, Cosign+SBOM, kube-bench | π |
Final Stack (When Complete)β
ββ INFRASTRUCTURE ββββββββββββββββββββββββββββββββββββββββββββββββββ
MAAS β bare-metal provisioning
k3s β Kubernetes cluster
MetalLB β load balancer IPs
Longhorn β distributed storage
Harbor β private container registry
ββ AUTOMATION & DELIVERY βββββββββββββββββββββββββββββββββββββββββββ
Ansible β infrastructure automation
Terraform β infrastructure as code
Crossplane β Kubernetes-native IaC
ArgoCD β GitOps
GitLab β CI/CD
ββ PLATFORM SERVICES βββββββββββββββββββββββββββββββββββββββββββββββ
Velero β backup & disaster recovery
Vault β secrets management
n8n β visual workflow automation
Temporal β code-based workflow orchestration
Airflow β data pipeline scheduling
KEDA β event-driven autoscaling
NATS β message broker
Backstage β developer portal
ββ OBSERVABILITY βββββββββββββββββββββββββββββββββββββββββββββββββββ
Prometheus β metrics
Grafana β dashboards
Loki β logs
Jaeger β traces
Chaos Mesh β reliability testing
Cilium β eBPF networking + Hubble
Tailscale β remote access VPN
ββ DATA LAYER ββββββββββββββββββββββββββββββββββββββββββββββββββββββ
Redpanda β event streaming (Kafka-compatible)
Debezium β change data capture (CDC)
ClickHouse β columnar analytics warehouse
dbt β SQL transformation layer
Superset β self-hosted BI dashboards
OpenMetadata β data catalog, lineage, governance
ββ SECURITY LAYER ββββββββββββββββββββββββββββββββββββββββββββββββββ
Keycloak β SSO / OIDC identity provider
OPA/Gatekeeperβ admission control (policy as code)
Falco β runtime threat detection (eBPF)
Cosign β image signing + SBOM supply chain
kube-bench β CIS compliance scoring
ββ AI / ML βββββββββββββββββββββββββββββββββββββββββββββββββββββββββ
Ollama β local LLMs (Mistral, LLaMA 3)
MLflow β ML experiment tracking
Kubeflow β ML pipelines + distributed training
CV / LinkedIn Summaryβ
- Designed and deployed a 3-node bare-metal infrastructure using MAAS
- Implemented PXE-based automated OS provisioning via network boot (PXE)
- Built isolated cluster network (10.0.0.0/24) with DHCP/DNS management
- Resolved complex networking issues (IPv6 conflicts, DHCP overlap, alias interfaces)
- Deployed full Kubernetes platform: k3s, ArgoCD, Prometheus, Harbor, Vault
- Built private AI platform with local LLM serving (Ollama) and ML pipelines (Kubeflow)
- Implemented remote access via Tailscale VPN and Cloudflare Tunnel
- Applied chaos engineering with Chaos Mesh to validate cluster resilience
- Built end-to-end data platform: Kafka/Redpanda β ClickHouse β dbt β Superset with OpenMetadata governance
- Implemented enterprise security: Keycloak SSO, OPA/Gatekeeper admission control, Falco runtime detection, Cosign supply chain signing