惯性聚合 高效追踪和阅读你感兴趣的博客、新闻、科技资讯
阅读原文 在惯性聚合中打开

推荐订阅源

D
Docker
爱范儿
爱范儿
T
The Exploit Database - CXSecurity.com
量子位
T
Tailwind CSS Blog
T
Threatpost
The GitHub Blog
The GitHub Blog
AWS News Blog
AWS News Blog
云风的 BLOG
云风的 BLOG
K
Kaspersky official blog
P
Proofpoint News Feed
博客园 - 司徒正美
L
LangChain Blog
T
Threat Research - Cisco Blogs
C
CERT Recently Published Vulnerability Notes
罗磊的独立博客
酷 壳 – CoolShell
酷 壳 – CoolShell
博客园 - 叶小钗
S
Secure Thoughts
The Last Watchdog
The Last Watchdog
Spread Privacy
Spread Privacy
H
Hacker News: Front Page
T
Troy Hunt's Blog
Cyber Security Advisories - MS-ISAC
Cyber Security Advisories - MS-ISAC
Google DeepMind News
Google DeepMind News
W
WeLiveSecurity
A
Arctic Wolf
Apple Machine Learning Research
Apple Machine Learning Research
cs.AI updates on arXiv.org
cs.AI updates on arXiv.org
P
Proofpoint News Feed
T
Tor Project blog
T
The Blog of Author Tim Ferriss
I
Intezer
P
Privacy & Cybersecurity Law Blog
美团技术团队
N
Netflix TechBlog - Medium
博客园_首页
OSCHINA 社区最新新闻
OSCHINA 社区最新新闻
V
Vulnerabilities – Threatpost
Application and Cybersecurity Blog
Application and Cybersecurity Blog
G
Google Developers Blog
Attack and Defense Labs
Attack and Defense Labs
T
Tenable Blog
月光博客
月光博客
Stack Overflow Blog
Stack Overflow Blog
J
Java Code Geeks
腾讯CDC
Microsoft Security Blog
Microsoft Security Blog
A
About on SuperTechFans
Last Week in AI
Last Week in AI

IT Notes - proxmox

IT Notes IT Notes IT Notes IT Notes IT Notes IT Notes IT Notes IT Notes IT Notes IT Notes IT Notes IT Notes IT Notes
IT Notes
Stefano Marinelli · 2018-09-16 · via IT Notes - proxmox

Sometimes, servers can become unresponsive, both physical and virtual, and you may be unable to connect to them, particularly when they are overloaded. In such cases, a watchdog can be a solution.

A watchdog device, assisted by a watchdog application, monitors the server to ensure it is active and healthy. Every 30 seconds (though this interval can be adjusted), the daemon checks if everything is functioning correctly. If it is, that's fine; if not, the watchdog device can perform certain actions. In my case, I usually request the device to execute a hard reboot of the server to restore its reliability.

Proxmox allows the installation and configuration of a watchdog device, enabling you to specify what actions to take when problems arise.

The easiest way to enable it is as follows: on the Proxmox server, navigate to /etc/pve/qemu-server/ (if no cluster has been configured) and edit the VM config file.

Add a watchdog device by appending this line to the VM definition:

watchdog: model=i6300esb,action=reset

This instructs Proxmox to perform a hard reset of the VM if it becomes unresponsive. Shut down and restart the VM.

This step is necessary, as the watchdog will be created at the next "start" of the VM, and a simple reboot will not suffice.

The next step is to install and configure the watchdog daemon inside the VM. Be cautious, as some GNU/Linux distributions (e.g., Ubuntu) may blacklist the watchdog kernel module. If this is the case, check /etc/modprobe.d/blacklist-watchdog.conf (if it exists). In my situation, I removed the i6300esb from the blacklist and added it to /etc/modules so that it would load at boot.

After installing the daemon, configure it as desired.

To test the entire setup, you can intentionally hang the kernel by executing the following command:

echo c > /proc/sysrq-trigger

After waiting for a few seconds, the VM should automatically restart.