RE: [chrony-users] Silent Failure -- Enhancement Request

[ Thread Index | Date Index | More chrony.tuxfamily.org/chrony-users Archives ]


It wounds like you are saying that there is only one source. That is very bead
practice, as the chrony and ntp notes state. You should have at least three
sources, precisely for the cases like yours. If one source dies you have
backups. So the default configuration on your end should be 3-5 sources. It probably
does not matter if the other sources are of lower quality than the one. They
are a backup.


William G. Unruh __| Canadian Institute for|____ Tel: +1(604)822-3273
Physics&Astronomy _|___ Advanced Research _|____ Fax: +1(604)822-5324
UBC, Vancouver,BC _|_ Program in Cosmology |____ unruh@xxxxxxxxxxxxxx
Canada V6T 1Z1 ____|____ and Gravity ______|_    theory.physics.ubc.ca/

On Fri, 19 Apr 2024, Chris Knox wrote:

[CAUTION: Non-UBC Email]

Bryah, thanks for the answer.  Yes, now that we have the scars, we're monitoring chronyd's health carefully.  But my question goes a bit beyond that.  If chronyd is configured and running, that implies that the owner of the system wants the time to be correct. If the configured time source is not reachable, it seems at least as important as chronyd being out by a half-second and worth logging to syslog where someone who is not a time synchronization expert will notice something being amiss.  In effect, chronyd will tell me if it has a figurative hangnail, but will suffer in silence if it is starving.

I would suggest the Chrony authors add a default configuration to call for help if the time source is unreachable.  Is this an appropriate venue to ask for that enhancement?


--
Chris Knox
IT Infrastructure Engineer
tel         1.602.308.5438
18615 N. Claret Dr., Scottsdale, AZ  85255
Christopher.Knox@xxxxxxxxxxxxxxxx

-----Original Message-----
From: Bryan Christianson <bryan@xxxxxxxxxxxxx>
Sent: Thursday, April 18, 2024 4:28 PM
To: chrony-users@xxxxxxxxxxxxxxxxxxxx
Subject: Re: [chrony-users] Silent Failure -- Enhancement Request

You could monitor the Reach field of chronyc and check that it has a value of 377, raising an appropriate alarm for your system on failure.

On 19 Apr 2024, at 10:45, Chris Knox <Christopher.Knox@xxxxxxxxxxxxxxxx> wrote:

We recently moved a bunch of systems out of a data center and shut it down.  Time sync was an overlooked item in the move.  As a result, the time server was not reachable, but it did not become apparent until servers started drifting enough to create issues.  Looking in the syslog of the various systems, the only entries I see are when chronyd could again hit the time server.  Over the previous weeks, chronyd suffered in silence until we were able to establish a valid time server, at which point log entries came fast and furious because the time was more than .5 seconds out.  Yet there was no message for all that time (several months) during which one server drifted out synch by more than 30 seconds.  Is there a configuration in chrony.conf to complain when the time server is not reachable?  If there is, why isn’t that the default behavior?
 --
Chris


Bryan Christianson
bryan@xxxxxxxxxxxxx




--
To unsubscribe email chrony-users-request@xxxxxxxxxxxxxxxxxxxx
with "unsubscribe" in the subject.
For help email chrony-users-request@xxxxxxxxxxxxxxxxxxxx
with "help" in the subject.
Trouble?  Email listmaster@xxxxxxxxxxxxxxxxxxxx.



Mail converted by MHonArc 2.6.19+ http://listengine.tuxfamily.org/