惯性聚合 高效追踪和阅读你感兴趣的博客、新闻、科技资讯
阅读原文 在惯性聚合中打开

推荐订阅源

F
Full Disclosure
WordPress大学
WordPress大学
小众软件
小众软件
Cloudbric
Cloudbric
AWS News Blog
AWS News Blog
腾讯CDC
量子位
人人都是产品经理
人人都是产品经理
大猫的无限游戏
大猫的无限游戏
freeCodeCamp Programming Tutorials: Python, JavaScript, Git & More
V
Vulnerabilities – Threatpost
Scott Helme
Scott Helme
Hugging Face - Blog
Hugging Face - Blog
博客园_首页
C
CXSECURITY Database RSS Feed - CXSecurity.com
The Hacker News
The Hacker News
奇客Solidot–传递最新科技情报
奇客Solidot–传递最新科技情报
IT之家
IT之家
Jina AI
Jina AI
Attack and Defense Labs
Attack and Defense Labs
S
SegmentFault 最新的问题
Simon Willison's Weblog
Simon Willison's Weblog
The Cloudflare Blog
阮一峰的网络日志
阮一峰的网络日志
T
Tailwind CSS Blog
Last Week in AI
Last Week in AI
博客园 - 【当耐特】
Google Online Security Blog
Google Online Security Blog
美团技术团队
OSCHINA 社区最新新闻
OSCHINA 社区最新新闻
V
Visual Studio Blog
罗磊的独立博客
L
LINUX DO - 最新话题
博客园 - Franky
博客园 - 叶小钗
Apple Machine Learning Research
Apple Machine Learning Research
The Last Watchdog
The Last Watchdog
J
Java Code Geeks
AI
AI
C
Cisco Blogs
酷 壳 – CoolShell
酷 壳 – CoolShell
C
Cyber Attacks, Cyber Crime and Cyber Security
Cisco Talos Blog
Cisco Talos Blog
博客园 - 三生石上(FineUI控件)
雷峰网
雷峰网
Help Net Security
Help Net Security
钛媒体:引领未来商业与生活新知
钛媒体:引领未来商业与生活新知
云风的 BLOG
云风的 BLOG
I
Intezer
S
Securelist

Ogenki

Self-hosted LLM stack: a solid foundation for an open-weight platform built to evolve A few months with `Claude Code`: tips and workflows that helped me `Agentic Coding`: concepts and hands-on Platform Engineering use cases `PostgreSQL`: From Metrics to Query Plan Analysis `VictoriaLogs`: What if logs management became simple and performant? `VictoriaMetrics` : Effective alerts, from theory to practice 🛠️ Harness the Power of `VictoriaMetrics` and `Grafana` Operators for Metrics Management `Dagger`: The missing piece of the developer experience? `TLS` with Gateway API: Efficient and Secure Management of Public and Private Certificates Going Further with `Crossplane`: Compositions and Functions Beyond Traditional VPNs: Simplifying Cloud Access with `Tailscale` `Gateway API`: Can I replace my Ingress Controller with `Cilium`? Applying GitOps Principles to Infrastructure: An overview of `tf-controller` `CloudNativePG`: An easy way to run PostgreSQL on Kubernetes 100% `GitOps` using Flux My Kubernetes cluster (GKE) with `Crossplane` Manage tools versions with `asdf` Helm workshop: Templating exercises Helm workshop: Build your first chart Helm workshop: Lifecycle operations Helm workshop: Ecosystem Helm workshop: Third party charts Helm workshop Kubernetes workshop: Manage permissions in Kubernetes Kubernetes workshop: Troubleshooting Kubernetes workshop: Complete application stack Kubernetes workshop: Local environment Run an application on Kubernetes Kubernetes workshop
Kubernetes workshop: Resources allocation and autoscaling
2021-05-06 · via Ogenki

Resources allocation in Kubernetes

Resources allocation in Kubernetes is made using requests and limits in the container's definition.

  • requests: What the container is guaranteed to get. These values are used when the scheduler takes a decision on where (what node) to place a given pod.
  • limits: Are values that cannot be exceeded

ℹ️ You can use explain to have a look to the documentation of resources.

1kubectl explain --recursive pod.spec.containers.resources.limits
2KIND:     Pod
3VERSION:  v1
4
5FIELD:    limits <map[string]string>
6
7DESCRIPTION:
8     Limits describes the maximum amount of compute resources allowed. More
9...

The wordpress we've created in the previous lab doesn't have resources definition. There are different ways to edit its current state (kubectl edit, apply, patch ...)

1kubectl edit deploy wordpress

replace resources: {} with this block

1...
2        resources:
3          requests:
4            cpu: 100m
5            memory: 100Mi
6          limits:
7            cpu: 1000m
8            memory: 200Mi
9...

The pods resources usage can be displayed using (this might take a few seconds)

1kubectl top pods
2NAME                               CPU(cores)   MEMORY(bytes)
3wordpress-694866c6b7-mqxdd         1m           171Mi
4wordpress-mysql-6c597b98bd-4mbbd   1m           531Mi

Configure the autoscaling base on cpu usage. When a pod reaches 50% of its allocated cpu a new pod is created.

1kubectl autoscale deployment wordpress --cpu-percent=50 --min=1 --max=5
2horizontalpodautoscaler.autoscaling/wordpress autoscaled

It takes up to 15 seconds (default configuration) to get the first values

1kubectl get hpa
2NAME        REFERENCE              TARGETS         MINPODS   MAXPODS   REPLICAS   AGE
3wordpress   Deployment/wordpress   <unknown>/50%   1         5         0          10s
4
5kubectl get hpa
6NAME        REFERENCE              TARGETS   MINPODS   MAXPODS   REPLICAS   AGE
7wordpress   Deployment/wordpress   1%/50%    1         5         1          20s

Now we'll run an HTTP bench using wrk. Open a new shell and run

1kubectl run -ti --rm bench --image=jess/wrk -- /bin/sh -c 'wrk -t12 -c100 -d180s http://wordpress'

During the benchmark above (3 minutes duration) let's have a look to the hpa

1watch kubectl get hpa
2Every 2.0s: kubectl get hpa
3hostname: Tue Jun 22 11:13:08 2021
4
5NAME        REFERENCE              TARGETS   MINPODS   MAXPODS   REPLICAS   AGE
6wordpress   Deployment/wordpress   1%/50%    1         5         1          8m28s

After a few seconds we'll see that the upscaling will be done automatically. Here the number of replicas will reach the maximum we defined (5 pods).

1Every 2.0s: kubectl get hpa
2hostname: Tue Jun 22 11:14:13 2021
3
4NAME        REFERENCE              TARGETS    MINPODS   MAXPODS   REPLICAS   AGE
5wordpress   Deployment/wordpress   998%/50%   1         5         5          9m33s

That was a pretty simple configuration, basing the autoscaling on CPU usage for a webserver makes sense. You can also base the autoscaling on any other metrics that are reported by your application.

➡️ Next: Troubleshooting