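[Help] Master node failed; after reinstalling on a new machine and reconnecting to the original external MySQL database, default-host-health cannot become ready while all other services are Running #22781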

@sun3book

os: ubuntu 22.04.05
cloudpods: v3.11.10

default-host-health cannot become ready. Its log shows the following errors:

root@zhcx-cloudpods01:/opt# kubectl logs default-host-health-4wcm6 -n onecloud
[info 250625 10:49:17 procutils.WaitZombieLoop(zombie_others.go:36)] My pid is not 1 and no need to wait zombies
[info 250625 10:49:17 options.ParseOptions(options.go:310)] Use configuration file: /etc/yunion/host.conf
[warning 250625 10:49:17 structarg.(*ArgumentParser).parseJSONKeyValue(structarg.go:1214)] Cannot find argument enable-health-checker
[warning 250625 10:49:17 structarg.(*ArgumentParser).parseJSONKeyValue(structarg.go:1214)] Cannot find argument health-driver
[warning 250625 10:49:17 structarg.(*ArgumentParser).parseJSONKeyValue(structarg.go:1214)] Cannot find argument enable-rbac
[warning 250625 10:49:17 structarg.(*ArgumentParser).parseJSONKeyValue(structarg.go:1214)] Cannot find argument enable-qmp-monitor
[warning 250625 10:49:17 structarg.(*ArgumentParser).parseJSONKeyValue(structarg.go:1214)] Cannot find argument disk-is-ssd
[info 250625 10:49:17 options.ParseOptions(options.go:331)] Set log level to "info"
[info 2025-06-25 10:49:17 options.ParseOptions(options.go:310)] Use configuration file: /etc/yunion/common/common.conf
[warning 2025-06-25 10:49:17 structarg.(*ArgumentParser).parseJSONKeyValue(structarg.go:1214)] Cannot find argument enable-isolated-device-whitelist
[warning 2025-06-25 10:49:17 structarg.(*ArgumentParser).parseJSONKeyValue(structarg.go:1214)] Cannot find argument ext4-hugefile-size-gb
[warning 2025-06-25 10:49:17 structarg.(*ArgumentParser).parseJSONKeyValue(structarg.go:1214)] Cannot find argument default-process-timeout-seconds
[warning 2025-06-25 10:49:17 structarg.(*ArgumentParser).parseJSONKeyValue(structarg.go:1214)] Cannot find argument deploy-concurrent
[warning 2025-06-25 10:49:17 structarg.(*ArgumentParser).parseJSONKeyValue(structarg.go:1214)] Cannot find argument log-with-time-zone
[warning 2025-06-25 10:49:17 structarg.(*ArgumentParser).parseJSONKeyValue(structarg.go:1214)] Cannot find argument no-hpet
[warning 2025-06-25 10:49:17 structarg.(*ArgumentParser).parseJSONKeyValue(structarg.go:1214)] Cannot find argument ext4-largefile-size-gb
[warning 2025-06-25 10:49:17 structarg.(*ArgumentParser).parseJSONKeyValue(structarg.go:1214)] Cannot find argument image-deploy-driver
[warning 2025-06-25 10:49:17 structarg.(*ArgumentParser).parseJSONKeyValue(structarg.go:1214)] Cannot find argument image-cache-expire-days
[warning 2025-06-25 10:49:17 structarg.(*ArgumentParser).parseJSONKeyValue(structarg.go:1214)] Cannot find argument image-cache-cleanup-on-startup
[warning 2025-06-25 10:49:17 structarg.(*ArgumentParser).parseJSONKeyValue(structarg.go:1214)] Cannot find argument enable-default-policy
[warning 2025-06-25 10:49:17 structarg.(*ArgumentParser).parseJSONKeyValue(structarg.go:1214)] Cannot find argument dhcp-renewal-time
[warning 2025-06-25 10:49:17 structarg.(*ArgumentParser).parseJSONKeyValue(structarg.go:1214)] Cannot find argument enable-app-profiling
[warning 2025-06-25 10:49:17 structarg.(*ArgumentParser).parseJSONKeyValue(structarg.go:1214)] Cannot find argument enable-change-owner-auto-rename
[warning 2025-06-25 10:49:17 structarg.(*ArgumentParser).parseJSONKeyValue(structarg.go:1214)] Cannot find argument log-timestamp-format
[warning 2025-06-25 10:49:17 structarg.(*ArgumentParser).parseJSONKeyValue(structarg.go:1214)] Cannot find argument image-cache-cleanup-percentage
[warning 2025-06-25 10:49:17 structarg.(*ArgumentParser).parseJSONKeyValue(structarg.go:1214)] Cannot find argument executor-connect-timeout-seconds
[warning 2025-06-25 10:49:17 structarg.(*ArgumentParser).parseJSONKeyValue(structarg.go:1214)] Cannot find argument dhcp-lease-time
[warning 2025-06-25 10:49:17 structarg.(*ArgumentParser).parseJSONKeyValue(structarg.go:1214)] Cannot find argument image-cache-cleanup-dry-run
[info 2025-06-25 10:49:17 options.ParseOptions(options.go:331)] Set log level to "info"
[info 2025-06-25 10:49:17 main.(*SHostHealthService).InitService(main.go:42)] exec socket path: /var/run/onecloud/exec.sock
[info 2025-06-25 10:49:20 main.(*SHostHealthService).RunService.func1(main.go:55)] Auth complete!!
[fatal 2025-06-25 10:49:23 main.(*SHostHealthService).RunService.func1(main.go:64)] Init host health manager failed init default etcd client: context deadline exceeded
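
Most of the log above consists of "Cannot find argument" warnings; the line that actually stops the pod is the final [fatal] entry about the etcd client. A minimal sketch of how such a log could be reduced to a per-level count plus the error/fatal messages is shown below. The function name `summarize` and the sample text are illustrative assumptions, not part of Cloudpods.

```python
import re
from collections import Counter

# Log prefix as it appears above: "[level date time func(file:line)] message".
LINE = re.compile(r'^\[(?P<level>\w+) (?P<rest>[^\]]*)\] (?P<msg>.*)$')


def summarize(log_text):
    """Return (count per log level, list of error/fatal messages)."""
    levels = Counter()
    problems = []
    for line in log_text.splitlines():
        m = LINE.match(line.strip())
        if not m:
            continue  # skip lines that do not follow the bracketed prefix
        levels[m.group("level")] += 1
        if m.group("level") in ("error", "fatal"):
            problems.append(m.group("msg"))
    return levels, problems


# Three lines copied from the log above, used only as sample input.
SAMPLE = """\
[info 250625 10:49:17 options.ParseOptions(options.go:310)] Use configuration file: /etc/yunion/host.conf
[warning 250625 10:49:17 structarg.(*ArgumentParser).parseJSONKeyValue(structarg.go:1214)] Cannot find argument enable-rbac
[fatal 2025-06-25 10:49:23 main.(*SHostHealthService).RunService.func1(main.go:64)] Init host health manager failed init default etcd client: context deadline exceeded
"""

if __name__ == "__main__":
    levels, problems = summarize(SAMPLE)
    print(dict(levels))
    for msg in problems:
        print("PROBLEM:", msg)
```

In practice the input would be the saved output of the kubectl logs command above rather than the embedded sample.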

etcd container log: kubectl logs default-etcd-b7pfbdgsps -n onecloud

2025-06-25 10:50:48.184549 I | embed: rejected connection from "10.40.175.203:48584" (error "remote error: tls: bad certificate", ServerName "default-etcd-client.onecloud.svc")
2025-06-25 10:50:48.288926 I | embed: rejected connection from "10.158.155.250:29416" (error "remote error: tls: bad certificate", ServerName "default-etcd-client.onecloud.svc")
2025-06-25 10:50:48.293628 I | embed: rejected connection from "10.158.155.250:38735" (error "remote error: tls: bad certificate", ServerName "default-etcd-client.onecloud.svc")
2025-06-25 10:50:48.517165 I | embed: rejected connection from "10.40.175.203:48586" (error "remote error: tls: bad certificate", ServerName "default-etcd-client.onecloud.svc")
2025-06-25 10:50:48.645475 I | embed: rejected connection from "10.40.175.236:44916" (error "remote error: tls: bad certificate", ServerName "default-etcd-client.onecloud.svc")
2025-06-25 10:50:48.685110 I | embed: rejected connection from "10.40.175.219:45608" (error "remote error: tls: bad certificate", ServerName "default-etcd-client.onecloud.svc")
2025-06-25 10:50:48.711510 I | embed: rejected connection from "10.40.175.236:44926" (error "remote error: tls: bad certificate", ServerName "default-etcd-client.onecloud.svc")
2025-06-25 10:50:48.843433 I | embed: rejected connection from "10.40.175.202:54052" (error "remote error: tls: bad certificate", ServerName "default-etcd-client.onecloud.svc")
2025-06-25 10:50:49.302068 I | embed: rejected connection from "10.158.155.250:32993" (error "remote error: tls: bad certificate", ServerName "default-etcd-client.onecloud.svc")
2025-06-25 10:50:49.309920 I | embed: rejected connection from "10.158.155.250:6663" (error "remote error: tls: bad certificate", ServerName "default-etcd-client.onecloud.svc")
2025-06-25 10:50:49.349389 I | embed: rejected connection from "10.40.175.242:57344" (error "remote error: tls: bad certificate", ServerName "default-etcd-client.onecloud.svc")
2025-06-25 10:50:49.660983 I | embed: rejected connection from "10.40.175.226:56932" (error "remote error: tls: bad certificate", ServerName "default-etcd-client.onecloud.svc")
2025-06-25 10:50:49.727535 I | embed: rejected connection from "10.40.175.226:56938" (error "remote error: tls: bad certificate", ServerName "default-etcd-client.onecloud.svc")
2025-06-25 10:50:50.363164 I | embed: rejected connection from "10.40.175.245:52502" (error "remote error: tls: bad certificate", ServerName "default-etcd-client.onecloud.svc")
2025-06-25 10:50:50.372084 I | embed: rejected connection from "10.40.175.247:49518" (error "remote error: tls: bad certificate", ServerName "default-etcd-client.onecloud.svc")
2025-06-25 10:50:50.381436 I | embed: rejected connection from "10.158.155.250:21579" (error "remote error: tls: bad certificate", ServerName "default-etcd-client.onecloud.svc")
2025-06-25 10:50:50.386546 I | embed: rejected connection from "10.40.175.247:49522" (error "remote error: tls: bad certificate", ServerName "default-etcd-client.onecloud.svc")
2025-06-25 10:50:50.395513 I | embed: rejected connection from "10.40.175.203:48588" (error "remote error: tls: bad certificate", ServerName "default-etcd-client.onecloud.svc")
2025-06-25 10:50:50.849522 I | embed: rejected connection from "10.40.175.252:53582" (error "remote error: tls: bad certificate", ServerName "default-etcd-client.onecloud.svc")
2025-06-25 10:50:50.858701 I | embed: rejected connection from "10.40.175.251:59614" (error "remote error: tls: bad certificate", ServerName "default-etcd-client.onecloud.svc")
2025-06-25 10:50:50.859751 I | embed: rejected connection from "10.40.175.251:59622" (error "remote error: tls: bad certificate", ServerName "default-etcd-client.onecloud.svc")
2025-06-25 10:50:50.940406 I | embed: rejected connection from "10.40.175.235:45158" (error "remote error: tls: bad certificate", ServerName "default-etcd-client.onecloud.svc")
2025-06-25 10:50:51.031000 I | embed: rejected connection from "10.158.155.250:7208" (error "remote error: tls: bad certificate", ServerName "default-etcd-client.onecloud.svc")
2025-06-25 10:50:51.125064 I | embed: rejected connection from "10.40.175.202:54054" (error "remote error: tls: bad certificate", ServerName "default-etcd-client.onecloud.svc")
2025-06-25 10:50:51.136335 I | embed: rejected connection from "[::1]:57212" (error "EOF", ServerName "localhost")
2025-06-25 10:50:51.186872 I | embed: rejected connection from "10.158.155.250:16680" (error "remote error: tls: bad certificate", ServerName "default-etcd-client.onecloud.svc")
2025-06-25 10:50:51.370387 I | embed: rejected connection from "10.40.175.245:52506" (error "remote error: tls: bad certificate", ServerName "default-etcd-client.onecloud.svc")
2025-06-25 10:50:51.382119 I | embed: rejected connection from "10.40.175.247:49526" (error "remote error: tls: bad certificate", ServerName "default-etcd-client.onecloud.svc")
2025-06-25 10:50:51.388826 I | embed: rejected connection from "10.158.155.250:17171" (error "remote error: tls: bad certificate", ServerName "default-etcd-client.onecloud.svc")
2025-06-25 10:50:51.396584 I | embed: rejected connection from "10.40.175.247:49532" (error "remote error: tls: bad certificate", ServerName "default-etcd-client.onecloud.svc")
2025-06-25 10:50:51.858713 I | embed: rejected connection from "10.40.175.252:53586" (error "remote error: tls: bad certificate", ServerName "default-etcd-client.onecloud.svc")
2025-06-25 10:50:51.868161 I | embed: rejected connection from "10.40.175.251:59638" (error "remote error: tls: bad certificate", ServerName "default-etcd-client.onecloud.svc")
2025-06-25 10:50:51.868342 I | embed: rejected connection from "10.40.175.251:59650" (error "remote error: tls: bad certificate", ServerName "default-etcd-client.onecloud.svc")
2025-06-25 10:50:51.946039 I | embed: rejected connection from "10.40.175.235:45160" (error "remote error: tls: bad certificate", ServerName "default-etcd-client.onecloud.svc")
2025-06-25 10:50:52.687141 I | embed: rejected connection from "10.40.175.245:52508" (error "remote error: tls: bad certificate", ServerName "default-etcd-client.onecloud.svc")
2025-06-25 10:50:52.837009 I | embed: rejected connection from "10.40.175.247:49534" (error "remote error: tls: bad certificate", ServerName "default-etcd-client.onecloud.svc")
2025-06-25 10:50:52.871918 I | embed: rejected connection from "10.158.155.250:58319" (error "remote error: tls: bad certificate", ServerName "default-etcd-client.onecloud.svc")
2025-06-25 10:50:53.158600 I | embed: rejected connection from "10.40.175.251:59664" (error "remote error: tls: bad certificate", ServerName "default-etcd-client.onecloud.svc")
2025-06-25 10:50:53.197802 I | embed: rejected connection from "10.40.175.247:49546" (error "remote error: tls: bad certificate", ServerName "default-etcd-client.onecloud.svc")
2025-06-25 10:50:53.316993 I | embed: rejected connection from "10.40.175.252:53600" (error "remote error: tls: bad certificate", ServerName "default-etcd-client.onecloud.svc")
2025-06-25 10:50:53.336526 I | embed: rejected connection from "10.40.175.251:59678" (error "remote error: tls: bad certificate", ServerName "default-etcd-client.onecloud.svc")
2025-06-25 10:50:53.849820 I | embed: rejected connection from "10.40.175.235:37976" (error "remote error: tls: bad certificate", ServerName "default-etcd-client.onecloud.svc")
2025-06-25 10:50:55.092513 I | embed: rejected connection from "10.158.155.250:61568" (error "remote error: tls: bad certificate", ServerName "default-etcd-client.onecloud.svc")
2025-06-25 10:50:55.700054 I | embed: rejected connection from "10.40.175.251:56434" (error "remote error: tls: bad certificate", ServerName "default-etcd-client.onecloud.svc")
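
The rejections above come from many different client addresses, so it may help to group them by source host and error before deciding which nodes to look at. The sketch below does that for lines in the format shown; `tally_rejections` and the sample text are illustrative assumptions. As a possible reading (not a confirmed diagnosis): in Go's TLS library the "remote error" prefix generally means the alert was sent by the peer, which here would suggest that the clients on those hosts are rejecting the certificate presented by etcd.

```python
import re
from collections import Counter

# Matches etcd "rejected connection" lines of the form shown in the log above.
REJECTED = re.compile(
    r'rejected connection from "(?P<addr>[^"]+)" '
    r'\(error "(?P<error>[^"]*)", ServerName "(?P<server>[^"]*)"\)'
)


def tally_rejections(log_text):
    """Count rejected connections per (client host, error) pair."""
    counts = Counter()
    for line in log_text.splitlines():
        m = REJECTED.search(line)
        if not m:
            continue
        # Drop the ephemeral source port; this also handles "[::1]:57212".
        host = m.group("addr").rsplit(":", 1)[0]
        counts[(host, m.group("error"))] += 1
    return counts


# Four lines copied from the etcd log above, used only as sample input.
SAMPLE = """\
2025-06-25 10:50:48.184549 I | embed: rejected connection from "10.40.175.203:48584" (error "remote error: tls: bad certificate", ServerName "default-etcd-client.onecloud.svc")
2025-06-25 10:50:48.288926 I | embed: rejected connection from "10.158.155.250:29416" (error "remote error: tls: bad certificate", ServerName "default-etcd-client.onecloud.svc")
2025-06-25 10:50:48.517165 I | embed: rejected connection from "10.40.175.203:48586" (error "remote error: tls: bad certificate", ServerName "default-etcd-client.onecloud.svc")
2025-06-25 10:50:51.136335 I | embed: rejected connection from "[::1]:57212" (error "EOF", ServerName "localhost")
"""

if __name__ == "__main__":
    for (host, error), n in tally_rejections(SAMPLE).most_common():
        print(f"{n:3d}  {host:<16} {error}")
```

Feeding the full etcd log into `tally_rejections` would list every host whose client is still failing the TLS handshake, which can then be compared against the hosts registered in the cluster.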

Already ran: climc service-cert-delete etcd (to regenerate the certificate)
