DevOps – ACE IT SKILLS

Hybrid cloud và multicloud

Hybrid cloud và multicloud là hai cụm từ xuất hiện rất nhiều trong các cuộc nói chuyện về chiến lược hạ tầng, nhưng chúng cũng là hai khái niệm dễ bị dùng mơ hồ nhất. Video của TechWorld with Nana làm rõ một điều quan trọng: doanh nghiệp không chọn các mô hình này chỉ vì nghe có vẻ hiện đại, mà vì có những ràng buộc thực tế về kỹ thuật, chi phí, tuân thủ và tổ chức.

Bài viết này viết lại nội dung đó bằng tiếng Việt theo hướng thực dụng hơn, để bạn hiểu khi nào hybrid cloud hoặc multicloud thật sự có ý nghĩa với team DevOps và kiến trúc hệ thống.

Bạn sẽ học được gì

hybrid cloud và multicloud khác nhau thế nào
vì sao doanh nghiệp áp dụng các mô hình này
lợi ích thật sự và cái giá phải trả
những hiểu lầm phổ biến khi bàn về chiến lược cloud
các bài nên đọc tiếp để nối sang Kubernetes và AWS

Hybrid cloud là gì

Hybrid cloud thường nói tới mô hình kết hợp giữa on-premises hoặc private infrastructure với public cloud. Một phần hệ thống có thể vẫn chạy trong data center nội bộ, trong khi phần khác được đưa lên AWS, Azure hoặc GCP.

Lý do không phải lúc nào cũng là kỹ thuật thuần túy. Có khi doanh nghiệp cần giữ một số hệ thống cũ ở chỗ cũ, có khi do yêu cầu compliance, có khi do dữ liệu nhạy cảm chưa thể chuyển hết lên public cloud trong một bước.

Multicloud là gì

Multicloud thường nghĩa là dùng từ hai public cloud trở lên trong cùng chiến lược hạ tầng. Ví dụ một phần workload chạy trên AWS, phần khác chạy trên GCP hoặc Azure. Mục tiêu có thể là tận dụng dịch vụ tốt nhất của từng nền tảng, giảm phụ thuộc vào một nhà cung cấp, hoặc bám theo yêu cầu của từng bộ phận kinh doanh.

Dù nghe hấp dẫn, multicloud không tự động tốt hơn single cloud. Nó chỉ đáng giá khi lợi ích lớn hơn độ phức tạp bổ sung.

Vì sao doanh nghiệp chọn hybrid cloud

không thể di chuyển toàn bộ hệ thống legacy trong một lần
muốn giữ dữ liệu hoặc hệ thống nhạy cảm ở môi trường kiểm soát chặt hơn
cần kết nối dần dần giữa hạ tầng cũ và nền tảng cloud mới
có ràng buộc pháp lý hoặc địa lý dữ liệu

Điểm quan trọng là hybrid cloud thường sinh ra từ nhu cầu chuyển đổi thực tế chứ không phải từ một bản vẽ kiến trúc lý tưởng ngay từ đầu.

Vì sao doanh nghiệp chọn multicloud

muốn tránh phụ thuộc tuyệt đối vào một cloud provider
muốn chọn dịch vụ tốt nhất ở từng nền tảng
có nhiều team hoặc nhiều công ty thành viên với lịch sử hạ tầng khác nhau
cần tối ưu theo khu vực địa lý, năng lực hoặc pricing riêng

Tuy nhiên, multicloud không chỉ là thêm một tài khoản cloud nữa. Nó kéo theo thêm sự phức tạp ở IAM, networking, quan sát hệ thống, chính sách bảo mật, đào tạo đội ngũ và chi phí vận hành.

Lợi ích có thật, nhưng không miễn phí

Cả hybrid cloud và multicloud đều mang lại độ linh hoạt. Nhưng cái giá của sự linh hoạt là kiến trúc khó hơn, nhiều công cụ hơn và yêu cầu đội ngũ trưởng thành hơn. Nếu team còn chưa làm tốt trên một cloud, việc nhảy sang hai hoặc ba cloud thường không làm mọi thứ tốt lên. Nó chỉ khiến rối nhanh hơn.

Đó là một điểm rất đáng giữ lại từ video: nhiều công ty nói về multicloud như một mục tiêu trưởng thành, trong khi trên thực tế nó chỉ nên là hệ quả của nhu cầu rõ ràng.

Khi nào không nên theo đuổi mô hình này

Nếu sản phẩm còn nhỏ, đội ngũ còn mỏng, quy trình release chưa ổn định, logging và monitoring còn yếu, thì ưu tiên hợp lý hơn thường là làm thật tốt trên một nền tảng trước. Hybrid cloud hay multicloud lúc đó dễ trở thành gánh nặng hơn là lợi thế.

Tóm tắt nhanh

Hybrid cloud là kết hợp giữa hạ tầng nội bộ và public cloud.
Multicloud là dùng nhiều public cloud trong cùng chiến lược.
Cả hai mô hình đều có giá trị khi có lý do kinh doanh và kỹ thuật rõ ràng.
Nếu không kiểm soát tốt độ phức tạp, chúng có thể làm chi phí vận hành tăng mạnh.

Đọc tiếp

Cloud technology basics để nhìn lại nền tảng trước khi đi vào chiến lược cloud.
Google cloud platform basics nếu bạn muốn so thêm góc nhìn cloud provider.
Các lựa chọn container trên AWS để nối chiến lược cloud với bài toán chạy workload.
Managed Kubernetes cluster là gì nếu bạn muốn hiểu lớp nền tảng ứng dụng trên cloud.

Nguồn tham khảo

Video gốc: Hybrid Cloud and MultiCloud | Why are companies adopting it?
AWS hybrid cloud overview: https://aws.amazon.com/hybrid/
Google Cloud hybrid and multicloud overview: https://cloud.google.com/learn/what-is-hybrid-cloud

Các lựa chọn container trên AWS

Khi bước vào hệ sinh thái container trên AWS, người mới rất dễ bị ngợp vì quá nhiều tên dịch vụ xuất hiện cùng lúc: ECS, EKS, Fargate, ECR. Video của TechWorld with Nana làm rất tốt một việc quan trọng, đó là đặt các mảnh ghép này cạnh nhau để thấy chúng không phải các công cụ hoàn toàn tách biệt, mà là các lựa chọn giải quyết những phần khác nhau của bài toán container.

Bài viết này viết lại các ý chính theo cách ngắn gọn, rõ ràng và thực dụng hơn cho người học DevOps bằng tiếng Việt.

Bạn sẽ học được gì

ECR, ECS, EKS và Fargate khác nhau ở điểm nào
dịch vụ nào quản lý image, dịch vụ nào quản lý workload
khi nào nên dùng ECS thay vì EKS
vai trò của Fargate trong mô hình serverless cho container
cách ghép các dịch vụ này vào một flow triển khai thực tế

Bức tranh lớn trước khi đi vào từng dịch vụ

Trước tiên cần tách bài toán container trên AWS thành hai lớp. Lớp thứ nhất là nơi bạn lưu image. Lớp thứ hai là nơi bạn chạy workload. Nhiều người mới học lẫn hai lớp này với nhau nên cảm thấy mọi thứ rối hơn mức cần thiết.

ECR nằm ở lớp lưu trữ image. ECS và EKS nằm ở lớp điều phối workload. Fargate là cách chạy workload mà bạn không phải trực tiếp quản lý server cho phần compute đó.

ECR: nơi lưu image

ECR là Elastic Container Registry. Hãy nghĩ đơn giản đây là kho chứa image Docker hoặc OCI image trong hệ sinh thái AWS. Khi bạn build một image cho ứng dụng, ECR là nơi rất tự nhiên để đẩy image lên trước khi cho ECS hoặc EKS kéo về chạy.

Nếu team đã dùng AWS nhiều, ECR thường là lựa chọn thuận tay vì quyền truy cập, tích hợp và quy trình đẩy image có thể gắn chặt với pipeline sẵn có.

ECS: điều phối container theo cách đơn giản hơn

ECS là Elastic Container Service. So với EKS, ECS thường dễ tiếp cận hơn cho team chỉ muốn chạy container trên AWS mà chưa cần sức mạnh và độ linh hoạt của Kubernetes. Bạn không phải học toàn bộ mô hình object của Kubernetes, nhưng vẫn có một hệ thống điều phối để chạy, scale và cập nhật container.

Điểm mạnh của ECS là gần với hệ sinh thái AWS, ít khái niệm hơn, và thường phù hợp với team muốn đi nhanh. Điểm đổi lại là ECS ít portable hơn Kubernetes nếu sau này bạn muốn đa cloud hoặc chuyển nền tảng.

EKS: khi bạn cần Kubernetes

EKS là lựa chọn khi bạn muốn chuẩn Kubernetes, cần một mô hình orchestration phổ biến hơn trong thị trường, hoặc muốn nền tảng dễ gắn với tooling Kubernetes rộng lớn hơn. EKS mạnh hơn về mặt tiêu chuẩn và hệ sinh thái, nhưng cũng đòi hỏi nhiều kiến thức hơn.

Nếu bài toán của bạn là “chạy container ổn, đơn giản, nhanh trên AWS”, ECS có thể đủ. Nếu bài toán là “xây nền tảng Kubernetes nghiêm túc”, EKS mới là câu trả lời hợp lý hơn.

Fargate: lớp compute ít phải quản server

Fargate không phải registry và cũng không phải một orchestrator độc lập như ECS hay EKS. Nó là mô hình compute để chạy container mà bạn không phải quản EC2 instance ở mức thông thường. Điều này giúp giảm phần việc quản lý máy chủ, vá hệ điều hành và một số tác vụ vận hành hạ tầng.

Fargate rất hấp dẫn cho những workload vừa và nhỏ, workload theo sự kiện hoặc team muốn giảm mạnh phần vận hành node. Tuy nhiên nó không phải lúc nào cũng rẻ hoặc phù hợp cho mọi kiểu tải lớn.

Một cách ghép dễ nhớ

ECR để lưu image
ECS hoặc EKS để điều phối workload
Fargate để cung cấp compute mà bạn ít phải lo server hơn

Ví dụ, một team có thể build image, đẩy lên ECR, sau đó chạy ứng dụng qua ECS trên Fargate. Một team khác có thể đẩy image lên ECR rồi deploy qua EKS vì họ cần Kubernetes.

Những sai lầm thường gặp

nghĩ rằng ECS, EKS, Fargate và ECR là bốn dịch vụ cạnh tranh trực tiếp với nhau
chọn EKS chỉ vì thấy Kubernetes đang phổ biến mà chưa rõ nhu cầu thật
chọn Fargate nhưng không theo dõi chi phí khi workload tăng đều
không tách rõ nơi lưu image với nơi chạy container

Tóm tắt nhanh

ECR là nơi lưu image container.
ECS và EKS là hai cách điều phối workload container trên AWS.
Fargate là mô hình compute ít phải quản server hơn.
Lựa chọn đúng phụ thuộc vào độ phức tạp, nhu cầu tiêu chuẩn hóa và khả năng vận hành của team.

Đọc tiếp

Tạo cụm EKS theo cách đơn giản nếu bạn đang nghiêng về Kubernetes trên AWS.
Hybrid cloud và multicloud nếu bạn muốn hiểu khi nào hạ tầng vượt khỏi một cloud duy nhất.
Why docker containers matter để củng cố phần nền tảng container.
Aws basics for engineers để nắm lại bức tranh AWS căn bản.

Nguồn tham khảo

Video gốc: Containers on AWS Overview: ECS | EKS | Fargate | ECR
AWS ECS docs: https://docs.aws.amazon.com/AmazonECS/latest/developerguide/Welcome.html
AWS Fargate docs: https://docs.aws.amazon.com/AmazonECS/latest/developerguide/AWS_Fargate.html
AWS ECR docs: https://docs.aws.amazon.com/AmazonECR/latest/userguide/what-is-ecr.html

Tạo cụm EKS theo cách đơn giản

EKS thường là nơi nhiều người gặp Kubernetes trên AWS lần đầu tiên. Video của TechWorld with Nana đi theo hướng rất thực tế: không sa đà vào triết lý, mà tập trung trả lời một câu hỏi đơn giản hơn nhiều, đó là làm sao tạo một Kubernetes cluster trên Amazon EKS theo cách dễ tiếp cận nhất.

Bài viết này tóm lược lại ý chính của video bằng tiếng Việt, đồng thời bổ sung thêm góc nhìn vận hành để người mới không lầm tưởng rằng tạo được cluster là đã hoàn thành phần khó nhất.

Bạn sẽ học được gì

EKS là gì và phù hợp với tình huống nào
một flow tạo cluster EKS đơn giản thường gồm những bước nào
phần nào là AWS giúp bạn, phần nào bạn vẫn phải tự lo
những lỗi tư duy thường gặp khi mới dùng EKS
các bài nên đọc tiếp để nối sang kiến trúc container trên AWS

EKS thực chất là gì

Amazon EKS là dịch vụ managed Kubernetes của AWS. Cũng giống các managed service khác, EKS giúp bạn bỏ bớt gánh nặng dựng control plane và tích hợp Kubernetes với hệ sinh thái AWS tốt hơn. Điều hấp dẫn nhất của EKS không chỉ là tạo cluster nhanh, mà là kết nối được với IAM, VPC, load balancer, autoscaling và các dịch vụ khác trong cùng nền tảng.

Với team đã dùng AWS sẵn, điều này giúp hành trình vào Kubernetes mượt hơn nhiều so với việc tự dựng cụm trên EC2 từ đầu.

Một flow tạo EKS dễ hiểu

Nếu bỏ qua chi tiết nhỏ của từng câu lệnh, flow đơn giản để tạo EKS thường gồm:

chuẩn bị tài khoản, region và quyền truy cập phù hợp
tạo cluster EKS
gắn worker node hoặc node group
cấu hình `kubectl` để kết nối tới cluster
triển khai ứng dụng thử nghiệm để xác minh môi trường hoạt động

Điểm quan trọng ở đây là cluster chỉ thật sự hữu ích khi có node để chạy workload. Nhiều người mới nhìn thấy cluster tạo thành công trong console và nghĩ mọi thứ đã sẵn sàng, nhưng nếu chưa có node group hợp lệ thì bạn vẫn chưa chạy được ứng dụng thực tế.

EKS giúp gì và không giúp gì

EKS giúp bạn đỡ phần control plane, tích hợp bảo mật theo cách quen thuộc của AWS, và tận dụng được nhiều dịch vụ xung quanh. Nhưng nó không tự động giải quyết toàn bộ bài toán nền tảng ứng dụng.

Bạn vẫn cần lo:

chiến lược node và autoscaling
thiết kế network và IAM cho workload
quy trình CI/CD để đưa image vào cluster
giám sát log, metric và alert
kiểm soát chi phí khi môi trường mở rộng

Đây là lý do EKS phù hợp với tư duy platform engineering hoặc DevOps trưởng thành hơn là tư duy “bấm nút xong là có production”.

Vì sao EKS hấp dẫn với team AWS

EKS đặc biệt hấp dẫn khi hệ thống của bạn đã dùng VPC, IAM, CloudWatch, ECR hoặc các dịch vụ hạ tầng khác trên AWS. Khi đó, Kubernetes không phải một đảo tách biệt mà trở thành một phần của hệ sinh thái đang dùng.

Với người mới, đây là lợi thế lớn vì bạn không phải học lại mọi thứ từ đầu. Nhưng mặt trái là bạn cũng phải hiểu thêm nhiều khái niệm AWS hơn một cluster Kubernetes chung chung.

Những sai lầm thường gặp

chỉ tập trung tạo cluster mà quên thiết kế node group hợp lý
không kiểm soát IAM nên quyền quá rộng hoặc quá rối
không dự trù chi phí cho worker node, load balancer và storage
đưa workload lên EKS trước khi có logging và monitoring cơ bản

Nếu mới học, cách tốt nhất là bắt đầu từ một cụm nhỏ, một ứng dụng demo đơn giản, sau đó mới mở rộng dần các lớp như ingress, autoscaling và observability.

Tóm tắt nhanh

EKS là managed Kubernetes service của AWS.
Nó giúp bạn tăng tốc phần dựng cluster và tích hợp tốt hơn với hệ sinh thái AWS.
Tạo được cluster chưa phải là xong, vì node, bảo mật, deploy và giám sát vẫn là phần việc lớn.
EKS đặc biệt phù hợp khi team đã có nền tảng AWS từ trước.

Đọc tiếp

Các lựa chọn container trên AWS để hiểu EKS đứng ở đâu trong bức tranh AWS.
Managed Kubernetes cluster là gì để nắm tư duy managed cluster trước khi đi sâu vào EKS.
Aws basics for engineers để nhìn lại nền tảng AWS căn bản.
Kubernetes in practice nếu bạn muốn quay về bức tranh Kubernetes chung.

Nguồn tham khảo

Video gốc: AWS EKS – Create Kubernetes cluster on Amazon EKS | the easy way
Amazon EKS docs: https://docs.aws.amazon.com/eks/
AWS ECR docs: https://docs.aws.amazon.com/AmazonECR/latest/userguide/what-is-ecr.html

Triển khai ứng dụng lên LKE với Helm

Sau khi đã hiểu managed Kubernetes cluster là gì, câu hỏi tự nhiên tiếp theo là: vậy deploy một ứng dụng thực tế lên đó như thế nào? Video thứ hai trong playlist đi theo hướng rất thực dụng: không nói quá nhiều lý thuyết, mà cho thấy quy trình đẩy ứng dụng lên LKE và dùng Helm để đóng gói cấu hình.

Bài này viết lại nội dung theo cách dễ áp dụng hơn cho người đang học DevOps. Mục tiêu không phải chép lại từng lệnh, mà giúp bạn hiểu vì sao Helm xuất hiện trong quy trình Kubernetes và nó giúp team triển khai ổn định hơn ra sao.

Bạn sẽ học được gì

quy trình triển khai ứng dụng lên managed Kubernetes cluster
vai trò của image registry, Helm chart và release
vì sao Helm giúp deployment đỡ rối hơn
những điểm cần kiểm tra khi deploy lần đầu
nên đọc tiếp bài nào để nối sang AWS và cloud platform

LKE và bối cảnh của bài toán

LKE là Linode Kubernetes Engine, tức một managed Kubernetes service. Dù video dùng LKE, ý chính của nó áp dụng rộng hơn cho cả EKS, GKE hay AKS: bạn có sẵn cluster, và nhiệm vụ của bạn là đưa ứng dụng cùng cấu hình chạy được trong môi trường đó.

Quy trình này thường có ba phần chính: build image, lưu image ở registry, sau đó deploy workload vào cluster. Nếu làm thủ công hoàn toàn bằng nhiều file YAML rời rạc, bạn sẽ rất nhanh gặp vấn đề về trùng lặp và khó kiểm soát phiên bản cấu hình. Đó là lý do Helm trở nên quan trọng.

Helm giúp gì cho deployment

Helm thường được gọi là package manager cho Kubernetes. Cách hiểu dễ nhất là Helm cho phép bạn gom các manifest liên quan vào một chart, sau đó truyền giá trị đầu vào để tạo ra cấu hình phù hợp cho từng môi trường.

Thay vì phải sửa nhiều file YAML mỗi lần đổi image tag, replica count, service type hoặc domain, bạn chỉ cần quản lý các biến trong file giá trị. Điều này đặc biệt hữu ích khi team có nhiều môi trường như dev, staging và production.

Một flow triển khai điển hình

Nội dung video cho thấy một flow rất gần với thực tế production nhỏ hoặc vừa:

chuẩn bị image của ứng dụng
đẩy image lên registry
kết nối `kubectl` tới cluster LKE
tạo hoặc chỉnh Helm chart
chạy lệnh cài đặt hoặc nâng cấp release
kiểm tra pod, service và trạng thái rollout

Khi nhìn flow này, điều quan trọng không phải là thuộc từng câu lệnh, mà là hiểu mối liên hệ: registry giữ artifact, Helm chart mô tả deployment, Kubernetes chịu trách nhiệm chạy workload theo trạng thái mong muốn.

Vì sao release management quan trọng

Một điểm rất hay trong cách tiếp cận bằng Helm là tư duy release. Mỗi lần bạn deploy không chỉ là “ném file YAML lên cluster”, mà là tạo hoặc cập nhật một release có version, có giá trị cấu hình, có lịch sử thay đổi tương đối rõ ràng.

Điều này giúp team dễ rollback hơn, dễ so sánh cấu hình giữa các lần deploy hơn, và nhất quán hơn khi nhiều người cùng thao tác trên cluster.

Những gì nên kiểm tra sau khi deploy

Deploy xong chưa có nghĩa là xong việc. Một số bước kiểm tra cơ bản nhưng cực kỳ quan trọng là:

pod có thật sự chạy ổn định hay bị crash loop
service đã expose đúng cổng cần thiết chưa
image tag có đúng version mong muốn không
biến môi trường và secret đã vào pod chưa
ingress hoặc load balancer đã route đúng chưa

Đây là chỗ nhiều người mới học Kubernetes hay bỏ qua. Họ tập trung vào chuyện “deploy được lệnh” nhưng chưa kiểm tra “ứng dụng thật sự dùng được chưa”.

Khi nào Helm có thể trở nên rối

Helm rất hữu ích, nhưng cũng có thể gây rối nếu chart bị nhồi quá nhiều logic hoặc biến cấu hình không có quy ước rõ ràng. Một chart tốt nên đủ linh hoạt cho các môi trường khác nhau, nhưng không nên biến thành một hệ thống template quá phức tạp mà không ai ngoài người viết đầu tiên hiểu nổi.

Vì vậy, nếu team mới bắt đầu, hãy giữ chart gọn, đặt tên biến rõ ràng và tách rành mạch giữa cấu hình chung với cấu hình riêng cho từng môi trường.

Tóm tắt nhanh

Deploy ứng dụng lên managed Kubernetes thường đi theo flow image registry, chart và release.
Helm giúp gom cấu hình và quản lý deploy nhất quán hơn.
Điều quan trọng không chỉ là chạy được lệnh, mà là kiểm chứng workload chạy ổn định.
Helm hiệu quả nhất khi chart đủ gọn và có quy ước rõ ràng.

Đọc tiếp

Managed Kubernetes cluster là gì để hiểu tầng nền của bài toán.
Tạo cụm EKS theo cách đơn giản nếu bạn muốn đi tiếp sang AWS.
Docker and kubernetes helm repositories để xem thêm góc nhìn về chart repository.
Why docker containers matter nếu bạn muốn nhìn lại tầng container bên dưới.

Nguồn tham khảo

Video gốc: Step by Step Application Deployment on LKE using Helm | Kubernetes on Cloud (2/2)
Helm documentation: https://helm.sh/docs/
Linode Kubernetes Engine docs: https://www.linode.com/docs/products/containers/kubernetes/

Managed Kubernetes cluster là gì

Khi mới học Kubernetes trên cloud, nhiều người thường nghe tới cụm từ managed Kubernetes cluster nhưng chưa thật sự hiểu nhà cung cấp cloud đã làm giúp mình đến đâu và phần nào vẫn là việc của team kỹ thuật. Đây cũng là ý chính của video mở đầu trong playlist của TechWorld with Nana về Kubernetes on Cloud.

Bài viết này viết lại nội dung theo góc nhìn thực hành DevOps: không chép lại video, mà tóm lược những điểm quan trọng nhất để bạn hiểu managed cluster giúp gì, không giúp gì, và khi nào nên chọn nó.

Bạn sẽ học được gì

managed Kubernetes cluster là gì
cloud provider thường quản lý những phần nào
những việc team của bạn vẫn phải tự chịu trách nhiệm
khi nào managed cluster hợp lý hơn self-managed cluster
nên đọc tiếp bài nào trong cluster DevOps này

Managed Kubernetes cluster là gì

Managed Kubernetes cluster là mô hình mà nhà cung cấp cloud đứng ra vận hành phần control plane của Kubernetes cho bạn. Điều đó thường bao gồm API server, scheduler, controller manager, lưu trữ trạng thái cluster và một phần cơ chế nâng cấp nền tảng.

Nói ngắn gọn, bạn vẫn dùng Kubernetes, vẫn deploy pod, service, ingress, config map hay secret như bình thường, nhưng bạn không phải tự dựng mọi thành phần từ đầu như khi cài cluster thủ công trên máy ảo.

Cloud provider làm giúp những gì

Điểm dễ thấy nhất của managed cluster là giảm gánh nặng ở lớp hạ tầng nền. Thay vì phải tự cài đặt control plane, cấu hình high availability, quản lý chứng chỉ, vá lỗi bảo mật và theo dõi tính ổn định của các thành phần lõi, bạn nhận được một cụm Kubernetes đã được chuẩn bị sẵn.

Trong thực tế, cloud provider thường chịu trách nhiệm cho các phần sau:

triển khai và duy trì control plane
backup hoặc bảo vệ trạng thái cluster ở mức nền tảng
một phần quy trình nâng cấp phiên bản Kubernetes
tích hợp với networking, load balancer, storage và IAM của cloud
giao diện hoặc API để tạo cluster nhanh hơn

Đó là lý do managed cluster đặc biệt phù hợp với team nhỏ, team đang tăng tốc hoặc team muốn tập trung nhiều hơn vào ứng dụng thay vì vận hành control plane.

Những gì bạn vẫn phải tự làm

Một nhầm lẫn rất phổ biến là nghĩ rằng dùng managed Kubernetes thì gần như không còn việc vận hành nào nữa. Thực tế không phải vậy. Cloud provider giúp bạn bớt phần khó ở nền tảng, nhưng lớp workload, bảo mật ứng dụng và quy trình release vẫn là việc của đội ngũ nội bộ.

Thông thường bạn vẫn phải tự xử lý:

thiết kế node group hoặc worker node phù hợp
cấu hình namespace, RBAC và network policy
quản lý image, secret, config và quy trình deploy
theo dõi chi phí, autoscaling và hiệu năng ứng dụng
logging, monitoring, alerting và phản ứng khi sự cố xảy ra

Nói cách khác, managed cluster không loại bỏ DevOps. Nó chỉ dịch chuyển phần việc DevOps từ dựng hạ tầng Kubernetes thủ công sang quản trị workload và platform hiệu quả hơn.

Khi nào nên chọn managed cluster

Managed Kubernetes thường là lựa chọn hợp lý khi bạn cần triển khai nhanh, muốn có môi trường production ổn định hơn, hoặc không muốn dành quá nhiều thời gian cho việc chăm sóc control plane. Đây là mô hình đặc biệt thực dụng với startup, sản phẩm SaaS đang tăng trưởng, hoặc doanh nghiệp muốn chuẩn hóa nền tảng cho nhiều team.

Ngược lại, nếu bạn cần kiểm soát rất sâu vào control plane, có ràng buộc đặc biệt về môi trường on-prem, hoặc đang xây một platform nội bộ quá đặc thù, self-managed cluster vẫn có chỗ đứng. Nhưng trong đa số trường hợp hiện nay, managed cluster là điểm khởi đầu thực tế hơn.

Một cách nghĩ đơn giản

Nếu self-managed cluster giống như tự xây và tự bảo trì cả ngôi nhà, thì managed cluster giống như thuê một tòa nhà có phần móng, điện nước và thang máy đã được quản lý chuyên nghiệp. Bạn vẫn phải sắp xếp nội thất, vận hành công việc hằng ngày, và xử lý những vấn đề thuộc về hoạt động của riêng mình.

Những sai lầm thường gặp

nghĩ rằng managed cluster đồng nghĩa với zero-ops
không hiểu ranh giới trách nhiệm giữa cloud provider và team nội bộ
tạo cluster nhanh nhưng không có chuẩn về logging, security và backup workload
đánh giá thấp chi phí khi số lượng node, load balancer và storage tăng lên

Tóm tắt nhanh

Managed Kubernetes cluster giúp bạn bỏ bớt phần vận hành control plane.
Cloud provider hỗ trợ nền tảng, nhưng bạn vẫn phải quản lý workload và vận hành ứng dụng.
Mô hình này rất phù hợp cho team muốn đi nhanh và ổn định hơn trên cloud.
Muốn dùng hiệu quả, bạn vẫn cần tư duy DevOps rõ ràng về CI/CD, quan sát hệ thống và bảo mật.

Đọc tiếp

Triển khai ứng dụng lên LKE với Helm để thấy cách một ứng dụng thực sự đi vào cluster.
Tạo cụm EKS theo cách đơn giản nếu bạn muốn nhìn bài toán này trên AWS.
Kubernetes in practice để củng cố bức tranh tổng thể.
Seven skills for strong devops engineers để thấy Kubernetes chỉ là một phần trong năng lực DevOps.

Nguồn tham khảo

Video gốc: Managed Kubernetes Cluster explained | Kubernetes on Cloud (1/2)
Kubernetes documentation: https://kubernetes.io/docs/concepts/overview/
Amazon EKS documentation: https://docs.aws.amazon.com/eks/

Seven skills for strong devops engineers

DevOps is not one tool and not one job title with a fixed meaning everywhere. In practice, a strong DevOps engineer is someone who helps teams deliver software faster and more reliably by improving systems, automation, visibility, and collaboration.

1. Linux and System Fundamentals

Most modern infrastructure still depends on strong operating system knowledge. Understanding processes, services, files, networking, permissions, and system diagnostics gives you the base to troubleshoot real production problems.

2. Scripting and Automation

Manual work does not scale well. Shell scripting, Python, or similar tools help automate deployment, validation, backups, reporting, and environment setup.

3. CI/CD

Continuous integration and continuous delivery pipelines are central to modern delivery. A DevOps engineer should understand testing flow, artifact creation, deployment stages, rollback strategies, and release reliability.

4. Containers and Orchestration

Docker and Kubernetes matter because they standardize packaging and deployment. You do not need to use every advanced feature, but you should understand the operational model behind them.

5. Cloud Infrastructure

AWS, GCP, and Azure all provide different services, but the underlying engineering ideas remain similar: scalable compute, storage, networking, IAM, and monitoring.

6. Observability

Logs, metrics, traces, and alerts are what turn a running system into an understandable system. You cannot improve reliability if you cannot see what is happening.

7. Communication and Ownership

This is often underestimated. DevOps engineers work across teams, so they must explain tradeoffs, reduce friction, and take responsibility for delivery quality, not just infrastructure syntax.

Final Thoughts

A strong DevOps engineer is not simply a person who knows many tools. It is a person who can use the right tools to reduce operational risk and improve the speed and confidence of delivery.

Kubernetes ide tools that help

Kubernetes can be powerful, but the raw command-line experience is not always comfortable, especially for engineers who are still learning how cluster resources relate to each other. That is why Kubernetes IDE and desktop tools became popular: they reduce friction when exploring workloads, logs, configurations, and cluster state.

What These Tools Usually Help With

browsing namespaces, pods, deployments, and services,
viewing logs and events,
editing YAML manifests,
inspecting port forwarding and shell access,
visualizing relationships between resources.

Common Examples

Different teams use different tools, including Lens, OpenLens, editor extensions, terminal dashboards, and cloud-native consoles. The value is not only the UI itself. The value is faster understanding when something in the cluster is wrong.

A Practical Scenario

Suppose a deployment is healthy according to the desired replica count, but users still report failures. A good Kubernetes desktop tool can help you quickly inspect:

whether pods are restarting,
whether liveness or readiness probes are failing,
whether the service points to the correct labels,
and whether recent events show image pull or permission issues.

Why the CLI Still Matters

These tools are helpful, but they should not replace basic command-line understanding. Engineers still need to know how to use kubectl get, describe, logs, and exec. GUI tools are best seen as accelerators, not substitutes for conceptual understanding.

Final Thoughts

The best Kubernetes IDE is the one that helps your team debug faster without hiding too much of the underlying model. Good tooling reduces cognitive load, but strong operators still need to understand the platform behind the screen.

When vagrant still makes sense

Vagrant is a tool for creating reproducible development environments, usually by automating virtual machine setup through providers such as VirtualBox or VMware. It became popular because teams needed a simpler way to share environments without manually documenting every installation step.

What Problem Vagrant Solves

When every developer sets up a machine by hand, small differences in operating system packages, runtime versions, and service configuration create avoidable bugs. Vagrant solves this by defining an environment as code.

How It Works

A Vagrantfile describes the base box, networking, synced folders, and provisioning steps. Once that file exists, anyone on the team can run a small set of commands and get a very similar environment.

Example Vagrantfile

Vagrant.configure("2") do |config|
  config.vm.box = "ubuntu/jammy64"
  config.vm.network "private_network", ip: "192.168.56.10"
  config.vm.provision "shell", inline: <<-SHELL
    apt-get update
    apt-get install -y nginx
  SHELL
end

Where Vagrant Still Makes Sense

legacy applications that depend on VM-style environments,
training and demos where full machine isolation is useful,
infrastructure experiments that should feel closer to real servers than containers.

When Containers May Be Better

For many modern application teams, Docker is lighter and faster. If you only need process-level isolation and a reproducible runtime, containers may be enough. Vagrant becomes more useful when you specifically need operating system behavior that looks like a full machine.

Final Thoughts

Vagrant is less fashionable than it once was, but it is still a useful tool when a project benefits from reproducible VM-based environments. The key is not choosing the newest tool. The key is choosing the right level of isolation for the problem.

Understanding metal as a service

Hello again! Today’s class is metal as a service MAAS so we’ve been doing a number of classes lately on these different services, we’ve talked about software as a service we’ve talked about infrastructure as a service, and we have talked about platform as a service. Now when you’re talking about these services by far the most popular solution out there is software as a service that is where you basically you go out and you lease software from companies and the software is all run on their server so we’re talking about software as a service think about things like Google Docs think about things like Salesforce the software is not installed on your local computer.

Its installed on their servers and you access that somehow these are either through a web browser or through some type of a terminal connection something like that tied by the infrastructure as a service basically when your your infrastructure all those things that you would buy and install in your premises you no longer actually own anymore.

So think about your telephone systems companies you should spend $50 thousand on their telephone system and that telephone system was installed in their premises and they owned it now you can get hosted voice over IP solutions such as on sip you get hosted firewall solutions you can have hosted server rooms why have your own servers when you can go to Amazon Web Services and simply spin up a number of virtual services on their platform so that is infrastructure as a service.

We then talked about platform as a service platform as a service is where you create your web apps and then you are looking for some place to host them so the basic idea is think about a shared hosting web plan you create your web site you create your web application and then you simply upload that to GoDaddy or one in one servers their servers have PHP installed their servers have my sequel installed their servers have Ruby or Perl or any of these other scripting languages that you need all that you have to worry about is your application of course that gets much more complicated once you go over to Google App Engine and some of the more and more advanced things but that’s the the basic concept so now we’re getting to basically probably the last service that I will be talking about is metal as a service now this is one of those really really really cool ideas that actually it is pretty cutting-edge I’m not sure if it’s bleeding edge but it’s pretty cutting edge so some of you guys looking to create businesses out there really should listen to what I’m talking about right now because it really is a good business opportunity right now because not very many people actually offer this service so when we’re talking about metal as a service what we are literally talking about is providing the server hardware as a service to clients so this is not the same thing as what you would normally think about with cloud computing or virtualization where you install a hypervisor onto a server and then they spin up some kind of a instance of a server and this isn’t the same thing as a dedicated server so for thanhguyensite.net and a couple of other things that I do we rent a dedicated server from a company called one and one comm with that I get a specific server 12 gigs of ram quad core processor blah blah blah.

With a certain version of Linux on it and then from that point I can configure up but with that when you are purchasing something like a dedicated server you have to use whatever operating systems are provided by the provider so with one and one com if you go with them you can use like Server 2008 or Server 2012 Fedora sent OS if you wanted to use something else tough luck if you wanted to use freebsd on one of these services that you couldn’t do it if you wanted to install a hypervisor on Twenties dedicated services servers you couldn’t do it the reason is is because although you’re renting the dedicated server it has to have a bare minimum installation on it before you are actually able to get access to it the cool thing with metal as a service it is the concept is that you are literally renting the physical box with nothing else on it so this is where you would go to a provider and you would literally rent it would be a quad-core xeon processor with so much RAM so much hard drive and that’s it there would not necessarily be sent OS on it there were not necessarily be windows on it there wouldn’t necessarily be anything on it.

You are literally renting the metal as a service so basically now instead of having to have your own server rooms with your own equipment your own server racks your own HVAC you’re all redundant to power supply and your ups and all of that kind of stuff you can have the same thing sitting in somebody else’s server room so they are renting to you the metal as a service now that you may be wondering why why would you bother with that if you can get virtual servers if you can use infrastructure as a service if you can even get dedicated servers why would you run want to rent or lease the bare metal as a service well as you go through with your companies.

If you have a startup company or if you have a technology company and you start growing what, you are going to find is no matter what operating system distribution you use. It is probably not going to be optimized for whatever it is you’re doing so you know we all know with Windows we all know with Windows. Windows hogs up a lot of extra resources to do things most of us really truly don’t care about it we’ll be happy if it did it but a lot of people don’t realize is even with Linux.

Even with Linux there are resources that are used there are security vulnerabilities that are opened up simply because when you install the default installation of whatever Linux you’re going to be using it installs a base level of applications and services and a lot of times you don’t need that so imagine if you were a company where you want to spin out a lot of database servers and you want those database servers to run at the absolute optimum the fastest they can possibly run well you may want to go in to a distribution of an operating system and literally rip out all the crap that you don’t want you don’t need notepad you don’t need tar you don’t need a lot of these these things you just need that server to run as fast as possible to do a specific task possibly do something like a database server because this becomes very important especially when you start dealing with larger companies that are dealing with a larger load on their servers because when you rip out all of the crap you don’t need on it on a servers operating system.

You can gain efficiencies now this is not you’re not going to probably speed up the server by 200% or 500% or a thousand percent right that’s not what what the target you’re going to hit you may be able to actually speed up the server though by something like five percent now for you especially if you haven’t dealt with real server rooms if you haven’t dealt with real loads on servers optimizing an operating system we get five percent improvement probably doesn’t sound like a big deal but with companies if you if you have 20 servers up and running or 40 servers up and running a five percent increase in speed can be very very very very very very very significant so with these companies they may be looking to optimize the things like I say the operating systems that will be installed how all this will be configured how all this will be managed and so all they want is the bare metal they want the server they want the hard drives they want the RAM.

They want the CPU but they don’t want anybody else to mess with the rest of it they want to be able to build this thing from the ground up and again there can be a lot of reasons for this nowadays things like again efficiency making sure that the resources on the server are optimized but also issues such as compliance so compliance is becoming a bigger and bigger deal within the the corporate world what compliance means is that you are running your IT systems to certain specifications for security and reliability so as more and more companies start using cloud computers and servers and all that to run the infrastructure of their business they have to make sure that that infrastructure is reliable enough for their industry now one of the problems if you go out and you use a standard instance of an operating system or you use a standard load of an operating system from one of these providers is you don’t necessarily know all the security flaws you haven’t necessarily been able to sit down and do penetration test and do hardening testing and do all of those things so when you when you are leasing let’s say from 1 + 1 , dedicated server you can’t guarantee that this is that the the server operating system that has been installed is as hardened as it should be.

Now again for somebody like me I don’t care again do good backups and you should be fine for and this is one of the things you have to think about for 98% of the business population this type of concept doesn’t matter but for that 2% it is very very very important it is very important that they know that whatever operating system and software that’s going to be installed in that server it lives up to certain specifications so that’s why they would want to be able to rent that that bare metal as a service so metal as a service now one of the questions that you’re going to be coming up in with thanh’s you’re gonna be saying work and got me saying.

Well Thanh uh I don’t understand how you would interact with metal as a service then because you know when we think about dealing with these virtual computers when we think about dealing with it with cloud computers and all that we have a basic interface to deal with so basically again if you get a dedicated server you get a virtual private server they spin up the operating system and then they give you the login credentials so basically the company that you’re buying the service from they have already installed the instance of the operating system they’ve already created the first user account they then give you that information for that first user account and then you can figure it out.

However it is you want so the question you may be asking say well I don’t I don’t understand that because if you’re literally renting the metal and the metal let’s say five states away well wait a minute but there’s no operating system to interact with and the metals five states away so you don’t want to drive there so so I don’t understand how you would configure it or work with it well one of the cool things and not really new but but they’re they’re coming too more into vogue is something called IPP KVM switches so kate.

KVM switches been around for forever long far longer than i’ve been in the computer industry keyboard video mouse switches so what these are generally when you’re dealing with a server rack is you plug all the servers in the rack into one KVM switch and then you can press a button then when you press that button that gives you access to the server from one keyboard video and mouse combination so you have a monitor you have a keyboard and mouse and you say oh I need to deal with the wit server – and you just hit the server – button and server – pops up oh I need to deal with server 10 you click the server 10 button and the server 10 pops up well with KVM they now have IP k via what this means is that you can deal with that server from the basic input/output so the basic video keyboard and mouse and you can do that over an IP connection so you can either open that up through a web browser and be able to log in or you can open it up through some kind of terminal session or or some kind of application so basically you can be sitting in your in your office five states away from this bare metal the company that you’re dealing with will plug in the KVM switch and whatever else and then basically you can hit the on there.

They can hit the on button or you may have some kind of remote wait at the on button and from there it will literally load into a BIOS screen then from there depending on what the the service provider has for you you can go and you can install your your your your operating system and do all of your configurations but literally you have remote access to the lowest level of that server so you could literally reboot that server and go into the BIOS and change BIOS configuration settings you literally have that ability whether you’re five states away or you know on an entirely different continent now especially with PDUs so the the power distribution units basically what most people will call surge protectors even those have remote access so that you can do things like power cycle the server because again the question where you’re like well Thanh I don’t understand if if you have metal as a service.

If you have that metal and you do something wrong and it freezes up how do you force it to restart because again you know you’re installing operating systems you’re doing all kind of wacky stuff sometimes it’ll freeze and if you have access to the metal what are you going to do well with these surge protectors these power distribution units you can actually power cycle them again remotely so this is the cool stuff with metal as a service I think this is going to become a much more prominent thing right now this is one of those things that it is offered by companies you don’t see it around a lot but it is something that you should be looking at and you should be thinking about because again this way you can have you you can have your own custom servers that have been hardened up to your specifications but they are sitting in somebody else’s data center you don’t have to worry about it just like with all these commoditized items it is less expensive for you to be able to rent this service from somebody else that can have a thousand or five thousand or twenty thousand of these servers up and running basically they can have five or ten technicians running around making sure all the metal is doing with metal supposed to do versus if you had servers in each one of your individual offices and having people run around and deal with that kind of stuff so that’s the basic concept of metal as a service again all it is at the end of the day is you are literally leasing or renting that bare metal so you’ve got a server with absolutely no operating system on it.

That is what you’re leasing that allows you to do a lot of really cool sexy amazing cool stuff um and with the modern technology like I say it’s actually very very very doable today it’s one of those things when you got to start thinking we think about the cloud I mean that’s a whole one of the problems with us old technicians right is we’re used to touching stuff we like you know when we work on computers we’re used to keyboards we’re used to like plugging away and we’ve got the server in front of us and we got the router in front of us we got all this stuff in front of us so like mentally we think about all this equipment like being in our server room being in our office being in our facility and so what you have to realize is in this modern world that we’re in you can have the exact same functionality that you would have if the equipment was in your building but it can be somewhere else it can be provisioned given to you very quickly it can be given to you very inexpensively and you can be provided as securely or more securely than what you could do yourself again a lot of people you know I’m starting to talk you know talking about things like metal as a service and everybody gets worried about security everybody’s like oh how do I how do I know Thanh.

How do I know my servers are going to be secure how do I know that data center is going to be secure well one you do something called due diligence you you actually make sure that the company that you’re dealing with is a legitimate company you probably if you’re going to be running your business office stuff you should fly out to their data center at least once to actually take a look at it make sure they’ve got all the security stuff and all that but beyond that what a lot of people don’t realize is how in secure their facilities actually are they always worry about how insecure the cloud provider is and they somehow completely ignore just the crappy crappy crappy crappy crappy security that they have on their facility again I’m here in Baltimore Maryland in the Baltimore City.

We have an incredibly high crime rate and so one of the real problems that you have is you can have antivirus on your servers you can have your firewalls on your servers you can of your intrusion detection on the servers you can have your ups on the servers and some crackhead could break into your building literally rip the server out of server rack and walk away with it and try to sell for 25 bucks to the local pawn shop and when they can’t sell it at the local pawn shop then they’ll get pissed and they’ll just throw it in a ditch and keep walking again that’s the nice thing with these data centers at least with that kind of physical security you would be surprised many times they have much much much much better physical security in the rest of this then then you have take a real honest hard look at the security in your facility and if you’re honest about it you probably know that it’s it’s probably pretty bad it’s probably probably probably your servers would be better off in some kind of hosted solution so that’s all there is for from metal as a service.

I enjoy taking this class and look forward to see you the next one you.

Linux tips for beginners

In this post I will share experience using linux command, and was we can do and play with a operating system for Developer, or shorter is Linux. You don’t need to know more about this, I will show you how you can create funny program with some Linux command, what you can’t do in Windows.

1. Linux updates and upgrades do not require you to reboot

Re-booting after every software installation or update is very annoying in Windows. I keep wondering why this is not necessary in Linux, but on Windows is rule of tumb – every installation is asking you to re-boot, or after you download the Upgrades annoying window pop up and say – will reboot in 10 minutes, save your work. This is really annoying.

With Linux all you need to do is run : sudo apt update

Then have a cup of coffee until it update your computer.

2. No need to install drivers every time you plug in your computer USB device

OK, I understand that there are custom devices which use uncommon drivers like printers, cameras, etc, but why on earth every time you plug in Windows simple mass storage disk drive, or USB-Serial converter, or Mouse, or Keyboard – devices which are standard and are embedded in every Windows OS after W98 it ALWAYS ask for drivers which are already there? Why on Linux I plug the external HHD or Flash drive and it automatically mounts on my computer and I can work with it, while Windows is asking me for drivers and several minutes scans and show me different windows with warnings and Continue buttons like I’m doing something scaring which may ruin my OS?

3. You can move image/boot-able drive between machines without need for reinstall

Yes! this is something windows users can’t imagine is possible! I do remember back in the dark ages Windows asked me to re-install after I have upgraded the RAM memory size! Now imagine you get your Windows boot-able HDD and plug it on other computer, will it boot? no way!

4. system config in files not registers mess

Now this is one of the most annoying WIndows features – after several months of installing and removing software your registers and windows/system directory becomes so bloated with shared DLLs and mess that some people start making money by writing registry cleaning software!

5. You can’t boot windows from USB Stick

Probably they didn’t find a way to ask you for registration and to collect your money every time they sense this USB is plug to other computer??

6. You cant see this on Windows:

# uptime
15:54PM up 122 days, 11:22, 5 users, load average: 0.12, 0.30, 0.13

every few days if you do not reboot windows machine it starts to act slowly due to the severe memory fragmentation