[chrony-users] Chrony is making NTP Server offline on the system boot |
[ Thread Index |
Date Index
| More chrony.tuxfamily.org/chrony-users Archives
]
- To: "chrony-users@xxxxxxxxxxxxxxxxxxxx" <chrony-users@xxxxxxxxxxxxxxxxxxxx>, "chrony-dev-request@xxxxxxxxxxxxxxxxxxxx" <chrony-dev-request@xxxxxxxxxxxxxxxxxxxx>
- Subject: [chrony-users] Chrony is making NTP Server offline on the system boot
- From: "Iqbal Singh Aulakh -X (iaulakh - HCL AMERICA INC at Cisco)" <iaulakh@xxxxxxxxx>
- Date: Sun, 8 Jan 2023 08:33:59 +0000
- Accept-language: en-US
- Arc-authentication-results: i=1; mx.microsoft.com 1; spf=pass smtp.mailfrom=cisco.com; dmarc=pass action=none header.from=cisco.com; dkim=pass header.d=cisco.com; arc=none
- Arc-message-signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=microsoft.com; s=arcselector9901; h=From:Date:Subject:Message-ID:Content-Type:MIME-Version:X-MS-Exchange-AntiSpam-MessageData-ChunkCount:X-MS-Exchange-AntiSpam-MessageData-0:X-MS-Exchange-AntiSpam-MessageData-1; bh=YFk73H4TyvYqUjmRpqt4MNX9SdCnmJFZFIttfmDoaEs=; b=bEVmyEsJ0XzKaonXr4sl6ohIXsxfnx6MTfP+QNNjXGERtsRqYmKjCxThjfiHQS+6DDh6FkON8VHOEsIZAQz6ur6RSVD/8FCFPMbpcmgDBqBm1d8hTArKMiq0JIR3Iq83B9+LTLtgG2sZ6fBL7ZKN3Mh4q4x15PCTeXuUYOY9wH8IQVKeFduzS6/4C87FBoh/JHvo61kijMz2aqBuCCsxCFxAB8a0LR0L/U6lthIbtP1IVIH1iyzoBBcT15nkiqiVIpudoz1YSm8CYGHOBG1hrt+UN182Z6VuLqkDImV+7iMcl0By0j8UQW9nBZ1MzTqxgRBWwF1oZ/NoXI4SGO7PLw==
- Arc-seal: i=1; a=rsa-sha256; s=arcselector9901; d=microsoft.com; cv=none; b=V4X8IG+sgXYYUkJ0KFpxAweAQwEVALJQPnEymwtAIQeAujajFtvGjWCa6tcYgy67VmF5FYbbSSmOxlG7R7B9Q4pUbLcZcVOAahKl3+ThidcUFJ/3TqSDjccv/yyCxBaHSgNtnHLzEIszJ9rvWYhw3Z1hF13S+24sdB+ZdFhpodnfxtyNhHyB1CZIZV8eKVCOimbfJeqrABHx1A70lwIbI6H3vNkl1viGnxhYjbO2V3XOW8TrHlnK1Pofett6Zoysh5OviHG5nUzDw1YWFPj1UrsNN/oQ0iDfhPLnPjotb0i6q85QdaH54SYisgR51icryft/79yc9KoIZ60YAQm7fQ==
- Authentication-results: dkim=none (message not signed) header.d=none;dmarc=none action=none header.from=cisco.com;
- Cc: "Kiran Kumar Pamula -X (kpamula - HCL TECHNOLOGIES LIMITED at Cisco)" <kpamula@xxxxxxxxx>
- Dkim-signature: v=1; a=rsa-sha256; c=relaxed/simple; d=cisco.com; i=@cisco.com; l=4478; q=dns/txt; s=iport; t=1673166852; x=1674376452; h=from:to:cc:subject:date:message-id: content-transfer-encoding:mime-version; bh=BlfjQxbsSzsG8m5//zCm+bVNbtniNNu3GuPkN0ZO87M=; b=UqZLupdVyz8lyANnfd4JFOSZ2v1/3LQjl2MrouzBXhkGd8i/i9G8GhHF 9QwiSfKI6tBJgLOd6Nd2DlgvT+kmYT6iWm6nJO7d4TN8XH8dOVApaeuIe qzwpjtPdJlVckmSTdz/nUBBDEFbiCuemTO3+deoEcn3pzfLwZD4ATtZyu s=;
- Dkim-signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=cisco.com; s=selector1; h=From:Date:Subject:Message-ID:Content-Type:MIME-Version:X-MS-Exchange-SenderADCheck; bh=YFk73H4TyvYqUjmRpqt4MNX9SdCnmJFZFIttfmDoaEs=; b=L/T+uATAbZnyVKa9PcR9XIPl34K4334LonUQ249K+6B5xD2tEgKxeGTsMQRDAjC/e1PgYiwOE3mqUppq9a3rJO+/U7iHGTjJ/hbTGYNYgafrYb6PUCzh1/wqmfwqdWGrsc+xXIIMjHeXlRPrqzXWiYx3F7XyK/KCdEOhgE1RZoE=
- Ironport-data: A9a23:JiLQSq14zKjuM0coi/bD5fhxkn2cJEfYwER7XKvMYLTBsI5bp2QHy TYZX22HaavfM2Wgc9wlaYrj90kEuJLTy4JiHVY43Hw8FHgiRegpqji6wuYcGwvIc6UvmWo+t 512huHodZxyFjmGzvuUGuCJQUNUjclkfZKhTr+aUsxNbVU8Enx50Eo8w7VRbrNA2LBVPSvc4 bsenOWHULOV82Yc3rU8sv/rRLtH5ZweiRtA1rAMTakjUGz2yxH5OKkiyZSZdBMUdGX78tmSH I4vxJnhlo/QEoxE5tmNyt4XeWVSKlLe0JTnZnd+A8CfbhZ+SiMaz6ljZcANUBpssB6CtPtDk MVQh5r3cFJ8VkHMsLx1vxhwGiV6O+hN/6XKZCn5us2IxEqAeHzpqxlsJBhpZstDpKAuWicXr 61wxDMlNnhvg8q3ya+/Q+psrs8iN8LseogYvxmMyBmDVah/EM6dGs0m4/d30Dc7vcRyAMyFT O83MR9eYTDrSExAbwJ/5JUWxbf02SaXnydjgFmVv60x8i3fwRI0yrX0LdfOZvSBRd9SmFfeu n/W8W38AxULctuFxlKt+XK2gene2D7gVZgJPLa47PlskRuP23wdARgXUUr9puO24nNSQPpWL 0gSvyEpt6V3pQqgT8L2WFuzp3vsUgMgt8R4DuJiuTuc8/fv2C2SLUM2Fi5cWtphjZpjLdA17 WOhk9TsDD1plbSaT3OB67uZxQ9e3wBIdAfuggdZEWM4D8nfTJIb1UmWEos6eEKhppikRmmuk mHiQD0W2u17sCId60msEbkraRqAq57VSQhdCu7/ATz/t1sRiGJIm+WVBbXz5PJEKsOSSUOM+ SlCkMmF5+dIBpaI/MBsfAnvNO/wjxpmGGSD6bKKI3XH327yk5JEVdsKiAyS3G8zbq45lcbBO Sc/Qz956p5JJ2eNZqRqeY+3AMlC5fG+So+8D66ONIsWPscZmOq7EMdGOBL4M4fFzRZErE3DE czznTuEVCxDUv03kFJauc9EiuJ0rszB+Y8jbcmrk0v4uVZvTHWUUrwCeECfdfw06bjsnekm2 4g3Cid+8D0GCLeWSnCOqeY7dAlWRVBlXsqeg5IMKYa+zv9ORTtJ5wn5m+1xIuSIXs19y4/1w 51KchYFmQKi3SGadl3ih7IKQOqHYKuTZEkTZUQEVWtEEVB6CWpzxM/zr6cKQIQ=
- Ironport-hdrordr: A9a23:4lD7r6N8msKBYMBcT3n155DYdb4zR+YMi2TDiHoedfUFSKOlfp 6V8MjzjSWE8gr4WBkb6LS90dq7MA7hHPlOkMMs1NaZLULbUQ6TTb2KgrGSuwEIdxeOlNK1tp 0QPpSWaueAdmSS5PySiGLTfrZQo+Vvm5rY4ts2uk0dND2CHJsQiTuRZDzrd3FedU1jP94UBZ Cc7s1Iq36LYnIMdPm2AXEDQqzqu8DLvIiOW29LOzcXrC21yR+44r/zFBaVmj0EVSlU/Lsk+W /Z1yTk+6SYte2hwBO07R6d030Woqqu9jJwPr3NtiEnEESutu9uXvUiZ1S2hkF1nAho0idurD CDmWZlAy050QKsQoj8m2qT5+Cn6kdo15cnomXo2EcKZqfCNXQH4oN69PxkWwqc5Ew6sN5m1q VXm2qfqppMFBvF2D/w/t7SSnhR5zyJSFcZ4JouZkZkIPwjQa4UqZZa8FJeEZ8GEi6/4Ic7EP N2BMWZ4PpNa1uVY33Qo2EqmbWXLzwONwbDRlJHtt2e0jBQknw8x0wExNYHlnNF8J4mUZFL6+ nNL6wtnrBTSc0da757GY46MIKKI32IRQiJPHOZIFzhGq1CM3XRq4Tv6LFw/+2ucIxg9upGpH 0AaiIriYcfQTOcNSTV5uw7zvnkehTMYQjQ
- Ironport-phdr: A9a23:u9VLPBCK6EPz+yr16HV/UyQVaBdPi9zP1kY95pkmjudIdaKut9TnM VfE7PpgxFnOQc3A6v1ChuaX1sKoWWEJ7Zub9nxXdptKWkwJjMwMlFkmB8iIQUTwMP/taXk8G 8JPHF9o9n22Kw5bAsH7MlbTuXa1qzUVH0aXCA==
- Thread-index: AdkjOjFlG38OYXvTTomv4DB5Cs768w==
- Thread-topic: Chrony is making NTP Server offline on the system boot
Hello
We are having issue with chrony and it's making ntp server offline only on system boot.
Need hand and feet support in this regard.
Issue: Chrony Marking NTP servers as offline Upon reboot of the nodes.
We do not see anything in the message log or puppet logs.
Customers' Multiple Data centers/Sites got affected by this issue.
Due to the marking of NTP offline, the clock went out of sync, resulting in a loss of traffic and causing a nationwide outage.
Output from some of the affected nodes.
This is not limited to CentOS 7, CentOS 8 or Alma Linux and happened in multiple environments.
The last reported issue was in CentOS 7
Currently, we captured the data on CentOS; they are the affected nodes in the customer's setup.
Note: In our implementation, we are not managing network interfaces with Network Manager and Looked into the following.
And it did not make any difference.
Another Note: Some nodes are affected in Multiple Data Centers, and the rest are fine.
In affected nodes, it's reproducible upon reboot and where we have not seen this issue, even in the same env on different nodes.
it's not happening over there.
We tried the following link from red hat, but it was also not applicable.
https://access.redhat.com/solutions/3968471
We made changes to our puppet to bring up interfaces only after network interfaces are up but it didn't make any difference.
Here is the output
[qns@pcc01 ~]$ uname -a
Linux pcc01 3.10.0-957.21.3.el7.x86_64 #1 SMP Tue Jun 18 16:35:19 UTC 2019 x86_64 x86_64 x86_64 GNU/Linux
[qns@pcc01 ~]$ cat /etc/*-release
CentOS Linux release 7.6.1810 (Core)
NAME="CentOS Linux"
VERSION="7 (Core)"
ID="centos"
ID_LIKE="rhel fedora"
VERSION_ID="7"
PRETTY_NAME="CentOS Linux 7 (Core)"
ANSI_COLOR="0;31"
CPE_NAME="cpe:/o:centos:centos:7"
HOME_URL=https://www.centos.org/
BUG_REPORT_URL=https://bugs.centos.org/
CENTOS_MANTISBT_PROJECT="CentOS-7"
CENTOS_MANTISBT_PROJECT_VERSION="7"
REDHAT_SUPPORT_PRODUCT="centos"
REDHAT_SUPPORT_PRODUCT_VERSION="7"
CentOS Linux release 7.6.1810 (Core)
CentOS Linux release 7.6.1810 (Core)
[qns@pcc01 ~]$ rpm -qa | sort -i | grep -i '(chrony|ntp)'
chrony-3.2-2.el7.x86_64
fontpackages-filesystem-1.44-8.el7.noarch
ntp-4.2.6p5-28.el7.centos.x86_64
ntpdate-4.2.6p5-28.el7.centos.x86_64
[qns@pcc01 ~]$ ps auxwwwf | grep -i '(chrony|ntp)'
ntp 30958 0.0 0.0 25720 1896 ? Ss 2021 0:50 /usr/sbin/ntpd -u ntp:ntp -g
qns 3827 0.0 0.0 112708 1012 pts/0 S+ 19:30 0:00 _ grep --color=auto -i (chrony|ntp)
[qns@pcc01 ~]$ date
Thu Sep 1 19:30:04 UTC 2022
[qns@pps02 ~]$ chronyc sources
210 Number of sources = 2
MS Name/IP address Stratum Poll Reach LastRx Last sample
^? lb01 2 10 0 44d +3581ms[+3581ms] +/- 16.0s
^? lb02 3 10 0 47d +46us[ +46us] +/- 15.9s
[qns@pps02 ~]$ chronyc tracking
Reference ID : AC1AF20C (lb02)
Stratum : 4
Ref time (UTC) : Thu Jun 09 10:20:05 2022
System time : 0.000000153 seconds slow of NTP time
Last offset : -0.000002489 seconds
RMS offset : 0.187102452 seconds
Frequency : 17.639 ppm slow
Residual freq : -0.000 ppm
Skew : 0.002 ppm
Root delay : 0.038741432 seconds
Root dispersion : 7.168707848 seconds
Update interval : 1031.8 seconds
Leap status : Normal
[qns@pps02 ~]$ chronyc sourcestats
210 Number of sources = 2
Name/IP Address NP NR Span Frequency Freq Skew Offset Std Dev
lb01 18 11 92m +6.918 0.022 +30.4s 42us
lb02 64 33 18h +0.000 0.000 +885us 18us
[qns@lb01 log]$ chronyc sources
210 Number of sources = 2
MS Name/IP address Stratum Poll Reach LastRx Last sample
^* NTP-server1.> 1 6 0 50d -25us[ -161us] +/- 17ms
^- NTP-server2.> 2 6 0 50d +685us[ +685us] +/- 34ms
[qns@lb01
log]$ chronyc tracking
Reference ID : 9BAED650 (NTP-server1.com)
Stratum : 2
Ref time (UTC) : Wed Jun 08 06:58:03 2022
System time : 0.000000028 seconds slow of NTP time
Last offset : -0.000135603 seconds
RMS offset : 0.000135603 seconds
Frequency : 15.377 ppm slow
Residual freq : -31.511 ppm
Skew : 0.026 ppm
Root delay : 0.030828953 seconds
Root dispersion : 142.074813843 seconds
Update interval : 2.1 seconds
Leap status : Normal
[qns@lb01 log]$ chronyc sourcestats
210 Number of sources = 2
Name/IP Address NP NR Span Frequency Freq Skew Offset Std Dev
NTP-server1.> 4 4 6 -31.511 1011.791 -137.6s 149us
NTP-server2.> 4 4 6 -179.900 1376.575 -785.5s 152us
Thanks
Iqbal Singh
--
To unsubscribe email chrony-users-request@xxxxxxxxxxxxxxxxxxxx
with "unsubscribe" in the subject.
For help email chrony-users-request@xxxxxxxxxxxxxxxxxxxx
with "help" in the subject.
Trouble? Email listmaster@xxxxxxxxxxxxxxxxxxxx.