1Panel 无故停止/无法自启

故障描述

经常性发生面板服务停止(频繁时一日内多次),服务停止时日志也没有相关报错。

重启服务器时,经常性无法自启面版服务(偶尔正常),需要手动执行 1panel 指令才能启动。

自行查阅资料尝试解决无果。

系统环境

Linux Bookworm 6.8.12-2-pve #1 SMP PREEMPT_DYNAMIC PMX 6.8.12-2 (2024-09-05T10:03Z) x86_64 GNU/Linux

系统日志

11/10 似乎发生了两次停止,第一次自己重启成功了,10点发生的第二次重启失败

[2024-11-10 00:00:00] [INFO] Website scheduled task has completed[2024-11-10 00:00:00] [INFO] The scheduled certificate update task is currently in progress ...[2024-11-10 00:00:00] [INFO] The scheduled certificate update task has completed[2024-11-10 01:24:00] [INFO] AppStore scheduled task in progress ...[2024-11-10 01:24:00] [INFO] Starting synchronization with App Store...[2024-11-10 01:24:00] [INFO] [AppStore] download file from https://apps-assets.fit2cloud.com/stable/1panel.json.zip[2024-11-10 01:24:00] [INFO] The App Store is at the latest version[2024-11-10 01:24:00] [INFO] AppStore scheduled task has completed[2024-11-10 03:30:00] [DEBUG] tar --warning=no-file-changed --ignore-failed-read --exclude-from=<(find /mnt/nfs/backup/1panel -type s -printf '%P' | sed 's|^|./|') -zcf /mnt/nfs/backup/1panel/system/snapshot_1panel_v1.10.20-lts_amd64_2024111003300024u13/1panel/1panel_backup.tar.gz --exclude ./system --exclude ./system_snapshot -C /mnt/nfs/backup/1panel .[2024-11-10 03:30:00] [DEBUG] docker save linuxserver/qbittorrent:4.6.7 adguard/adguardhome:v0.107.53 homeassistant/home-assistant:2024.10.2 emqx/emqx:5.8.1 whyour/qinglong:2.17.11 snowdreamtech/frpc:0.60.0 jellyfin/jellyfin:10.9.11 mysql:8.4.2 yidadaa/chatgpt-next-web:2.15.5 ghcr.io/screego/server:1.11.1 1panel/openresty:1.21.4.3-3-3-focal ghcr.io/gethomepage/homepage:v0.9.10 linuxserver/code-server:4.93.1 redis:7.4.1 | gzip -c > /mnt/nfs/backup/1panel/system/snapshot_1panel_v1.10.20-lts_amd64_2024111003300024u13/docker/docker_image.tar[2024-11-10 03:41:36] [DEBUG] tar --warning=no-file-changed --ignore-failed-read --exclude-from=<(find /opt/1panel -type s -printf '%P' | sed 's|^|./|') -zcf /mnt/nfs/backup/1panel/system/snapshot_1panel_v1.10.20-lts_amd64_2024111003300024u13/1panel/1panel_data.tar.gz --exclude ./tmp --exclude ./log --exclude ./cache --exclude ./db/1Panel.db-* --exclude .*.sock -C /opt/1panel .[2024-11-10 03:47:40] [DEBUG] tar --warning=no-file-changed --ignore-failed-read --exclude-from=<(find /mnt/nfs/backup/1panel/system/snapshot_1panel_v1.10.20-lts_amd64_2024111003300024u13 -type s -printf '%P' | sed 's|^|./|') -zcf /opt/1panel/tmp/system/snapshot_1panel_v1.10.20-lts_amd64_2024111003300024u13.tar.gz -C /mnt/nfs/backup/1panel/system/snapshot_1panel_v1.10.20-lts_amd64_2024111003300024u13 .[2024-11-10 03:54:59] [ERROR] do handle tar failed, stdout: stderr:gzip: stdout: No space left on devicetar: /opt/1panel/tmp/system/snapshot_1panel_v1.10.20-lts_amd64_2024111003300024u13.tar.gz:只写入了 10240 个中的 2048 个字节tar: Child returned status 1tar: Error is not recoverable: exiting now, err: exit status 2[2024-11-10 03:54:59] [ERROR] update record status failed, err: disk I/O error (4874)[2024-11-10 03:55:35] [ERROR] create record status failed, err: disk I/O error (4874)[2024-11-10 03:55:35] [ERROR] update record status failed, err: disk I/O error (4874)[2024-11-10 03:56:35] [ERROR] create record status failed, err: disk I/O error (4874)[2024-11-10 03:56:35] [ERROR] update record status failed, err: disk I/O error (4874)[2024-11-10 03:57:35] [ERROR] create record status failed, err: disk I/O error (4874)[2024-11-10 03:57:35] [ERROR] update record status failed, err: disk I/O error (4874)[2024-11-10 03:58:35] [ERROR] create record status failed, err: disk I/O error (4874)[2024-11-10 03:58:35] [ERROR] update record status failed, err: disk I/O error (4874)[2024-11-10 03:59:03] [ERROR] Insert basic monitoring data failed, err: database or disk is full (13); SQL logic error: cannot rollback - no transaction is active (1)[2024-11-10 03:59:03] [ERROR] Insert io monitoring data failed, err: database or disk is full (13); SQL logic error: cannot rollback - no transaction is active (1)[2024-11-10 03:59:03] [ERROR] Insert network monitori[2024-11-10 10:24:14] [INFO] init logger successfully[2024-11-10 10:24:14] [INFO] init monitor db successfully[2024-11-10 10:24:14] [INFO] init db successfully[2024-11-10 10:24:14] [INFO] Migration run successfully[2024-11-10 10:26:26] [INFO] init logger successfully[2024-11-10 10:26:26] [INFO] init monitor db successfully[2024-11-10 10:26:26] [INFO] init db successfully[2024-11-10 10:26:26] [INFO] Migration run successfully[2024-11-10 10:26:27] [INFO] init cache successfully[2024-11-10 10:26:27] [INFO] init session successfully

11/11 早上发现服务异常,手动进入SSH执行 1panel 命令启动,因此日志从11点才有记录:

[2024-11-11 11:00:28] [INFO] init logger successfully[2024-11-11 11:00:28] [INFO] init monitor db successfully[2024-11-11 11:00:28] [INFO] init db successfully[2024-11-11 11:00:28] [INFO] Migration run successfully[2024-11-11 11:00:28] [INFO] init cache successfully[2024-11-11 11:00:28] [INFO] init session successfully[2024-11-11 11:00:31] [INFO] add shell job 校园网拨号 successful[2024-11-11 11:00:31] [INFO] start cronjob entryID: 5[2024-11-11 11:00:31] [INFO] add clean job 清理缓存 successful[2024-11-11 11:00:31] [INFO] start cronjob entryID: 6[2024-11-11 11:00:31] [INFO] add snapshot job 系统快照 successful[2024-11-11 11:00:31] [INFO] start cronjob entryID: 7[2024-11-11 11:00:31] [INFO] [xpack] init db successfully[2024-11-11 11:00:31] [INFO] [xpack] init alert db successfully[2024-11-11 11:00:31] [INFO] [xpack] migration run successfully[2024-11-11 11:00:31] [INFO] Starting synchronization with App Store...[2024-11-11 11:00:31] [INFO] listen at http://0.0.0.0:36615 [tcp4][2024-11-11 11:00:41] [ERROR] App Store synchronization failed[2024-11-11 11:00:48] [ERROR] load remote time with [ntp.aliyun.com] failed, err: failed to read server response: read udp 192.168.1.5:60473->203.107.6.88:123: i/o timeout[2024-11-11 11:00:53] [ERROR] load latest version from oss failed, err: Get "https://resource.fit2cloud.com/1panel/package/stable/latest": net/http: TLS handshake timeout[2024-11-11 11:01:03] [ERROR] load latest version from oss failed, err: Get "https://resource.fit2cloud.com/1panel/package/stable/latest.current": net/http: TLS handshake timeout

可能是磁盘满了