Re: [chrony-dev] Running chronyd without syncing system clock |
On Thu, 23 Feb 2012, Ed W wrote:
On 23/02/2012 08:24, Leo Baltus wrote:
Op 22/02/2012 om 23:07:51 +0000, schreef Ed W:
In our setup we do not like to pin a service to a specific piece of
hardware. If, for some reason, a service should run elsewhere we just
stop it and start it elsewhere. bind() makes it invisible to the
outside, and firewalls do not need to know about it either. This is
what we do for all our services, except ... ntp
I do something similar, but it later occurred to me that it serves
no useful purpose to put two ntp servers on a machine with a single
clock?
This is exactly why I want to separate the systemclock sync from the
networkservice so that each instance serves a specific purpose.
Hmm, one of us has got the wrong idea I think? Miroslav - is it me?
My thought process (please knock it down) is:
- We can't know what the "correct" time is, all we have is a bunch of
measurements from a variety of sources that are assumed to have various
random errors associated
- Based on some heuristics we pick one of these inaccurate sources to
sync against, being fully aware that we can't measure the source
exactly, only give or take some error term (which we hope will average
out over time)
- Because the source isn't a constant high resolution tick we need some
local high res clock to use for all normal clock requirements. This
clock is also inaccurate so we have a combined problem to measure the
inaccuracy of our local clock vs the source clock.
- With a local high res clock and an occasional glimpse at an upstream
clock assumed to be accurate, we can use the two things and estimate
the "correct current time" based on offsetting the local high res clock
using a bunch of maths.
- Note that the local clock doesn't normally match the upstream clock;
we initially skew it close over some time period and then continually
monitor it, computing some error term based on its observed deviation
from the source
An important difference between chrony and ntpd is that chrony uses not only
the current time difference between the local clock and the remote clock, but
also a history of up to 64 previous measurements (compensated for the changes
in offset and rate that we have imposed on the local clock).
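To illustrate the idea (this is only a sketch, not chrony's actual algorithm,
which uses weighted regression with more careful statistics): given a history
of (local time, measured offset) samples, a single least-squares line fit
recovers both the current offset and the frequency error of the local clock:

```python
# Sketch: estimate local clock offset and frequency error from a
# history of (local_time, measured_offset) samples by ordinary least
# squares -- the idea behind keeping ~64 past measurements.

def fit_offset_and_rate(samples):
    """samples: list of (t, offset) where t is the local clock reading
    in seconds and offset is the measured (remote - local) offset at t.
    Returns (intercept, rate) of the best-fit line
    offset(t) ~= intercept + rate * t."""
    n = len(samples)
    sum_t = sum(t for t, _ in samples)
    sum_o = sum(o for _, o in samples)
    sum_tt = sum(t * t for t, _ in samples)
    sum_to = sum(t * o for t, o in samples)
    denom = n * sum_tt - sum_t * sum_t
    rate = (n * sum_to - sum_t * sum_o) / denom
    intercept = (sum_o - rate * sum_t) / n
    return intercept, rate

# Synthetic example: local clock losing 50 ppm against the source,
# with a constant initial offset of 1 ms.
history = [(t, 0.001 + 50e-6 * t) for t in range(0, 640, 10)]
intercept, rate = fit_offset_and_rate(history)
```

With intercept and rate in hand, a daemon can both slew the clock and predict
the offset between measurements, rather than reacting only to the latest one.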
Now you could:
a) Run one process to sync the local clock, and one or more separate
ntp processes to sync that clock to other consumers via ntp. All ntp
processes serve the same single, physical, hardware clock.
No idea what this means. sync the local clock to what? There is nothing to
sync it to without the measurements of the remote clocks. The system has no
idea what the "real true" time is except for the measurements made on the
remote clocks. (sync is a comparison, you cannot sync one clock on its own,
you sync one clock to another, or sync two clocks to each other).
And if you have two ntp processes trying to handle the same hardware clock,
how does process A know what changes process B has made, and how do they
resolve contentions? A wants to speed up the clock, B to slow it down? Do they
have a fight? How do they even know that there is a contention?
b) Run multiple processes in virtualised spaces, with virtualised
clocks that drift from the real hardware clock. Each process computes
its own clock error term and serves that via NTP to consumers.
No idea what this means. What is a "virtual clock"? Clocks are things in which
some hardware delivers a series of hardware "ticks", whether clock interrupts
or processor cycles. What is it that "virtual" clocks are supposed to deliver?
The number of times that two is multiplied by 3 in the software on that virtual
machine?
c) Run single process to sync the local clock AND serve NTP to
consumers. Allow NTP process to answer requests from multiple IP
addresses, ie masquerade as multiple NTP servers.
That is the way things run now, but I have no idea why this is "masquerade as
multiple NTP servers"
The problem with a) is that the separate NTP processes have no
knowledge of the sync state with the upstream source clock. All they
can do is serve the physical hardware clock (which is of course being
skewed periodically by some separate NTP process). I don't know how
well that works in practice, but certainly it seems redundant to have
multiple NTP processes serving *a single clock*, additional processes
have no new knowledge and hence no obvious advantage over a single NTP
process.
Problem with b) is that you are faking several inaccurate clocks from
an upstream "inaccurate" clock... It doesn't seem obvious that a
virtualised clock which is allowed to drift from the real hardware
clock can be anything other than more unstable and less accurate than
the real hardware clock? So now we have multiple processes trying to
sync a clock which is the composite of a clock with two sources of
jitter/drift (real hardware clock + virtualisation inaccuracies).
Therefore this seems less optimal than having the virtual machines use
the real hardware clock (and now only one process can be in charge of
conditioning the clock again)
c) seems most optimal. One NTP process per real physical clock. That
single process then has complete knowledge of the modelled inaccuracies
of the hardware clock and the upstream source clocks and can make an
integrated decision on what to supply to downstream clients.
I think you prefer either a) or b), but from what I can see they both
have significant disadvantages in terms of accuracy and seem quite
redundant? Please shoot down the logic of the above! (be gentle...)
You prefer them why?
My solution was to pin NTP instances to hardware and if
they go down then they go down (do you care?) - if you do care then
why not make the failover system be something which pushes IPs to
working instances (so some individual instances might appear to be
two servers) rather than instances which know their IPs..?
In that case I cannot bind them to a specific IP address, which is
needed to be transparent for firewalls inside and outside my network.
If so then that seems to only be a limitation of your current
virtualisation system? I don't see any reason why it can't be done
easily?
At the simplest, if your IPs are only used for the NTP process then you
can literally just attach one or more IPs to an existing virtual
machine and it will answer on all those IPs. I haven't double checked,
but as far as I know chrony will listen on multiple IPs and so it
will answer on all of them? Plenty of options exist to shuffle IPs
between machines, some with very high speed failover.
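As a minimal sketch of the "shuffle IPs" approach (the addresses and interface
name here are made up; by default chronyd listens on all local addresses, so
adding the IP is usually all that's needed):

```shell
# Attach the failed server's service IP (example RFC 5737 address)
# to a surviving machine; chronyd will answer on it automatically:
ip addr add 192.0.2.12/24 dev eth0

# Send gratuitous ARP so neighbours update their caches quickly
# (arping here is the iputils one):
arping -c 3 -U -I eth0 192.0.2.12
```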
Another option is some kind of front end load balancer/nat. Sounds
like you don't desire that kind of option, but it might be the most
straightforward for those with a firewall in front of the services?
Slightly crude, but you could use iptables DNAT to forward packets from
the downed service. I see no reason why this shouldn't work adequately
but likely it doesn't fit neatly into your virtualisation system so I
suspect it's the least desirable. Basically the idea would be that if
the (one) virtual server is downed on a piece of hardware, then you
bring up some local firewall rule on that physical machine to proxy
incoming connections to some other machine.
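A rough sketch of that DNAT idea (the addresses are placeholders, and the
rules would need fitting into your existing firewall policy):

```shell
# On the machine whose NTP instance has died, forward incoming NTP
# (UDP port 123) to a still-working server:
iptables -t nat -A PREROUTING -p udp --dport 123 \
  -j DNAT --to-destination 192.0.2.13

# Masquerade the source so replies come back through this box:
iptables -t nat -A POSTROUTING -p udp -d 192.0.2.13 --dport 123 \
  -j MASQUERADE

# Forwarding must be enabled for the redirected packets to pass:
sysctl -w net.ipv4.ip_forward=1
```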
Consider you have ntp[1-3].example.com. For some reason ntp2 fails.
The options seem to be:
- Only run ntp[1,3] and leave ntp2 not answering. Most consumers will
set multiple ntp sources and so this should be invisible
- Have ntp1 answer both ntp1 and ntp2 IPs. This helps consumers who
only choose a single source, but will potentially skew consumers who
see ntp[1-3] as their sources since they will appear to see two sources
with strongly correlated performance?
- Leave ntp2's IP dead, but change your DNS to point ntp2 to some other
IP. Multiple issues for high availability, but probably satisfactory
for many situations
I guess you need to think about the above first, because presumably
the limitations of your downstream clients define the most appropriate
solution?
I'm really struggling to see any benefit in running more than one NTP
process per real, physical clock? If it's imperative that something
answers on a particular IP address then it seems more optimal to have
one of the still running ntp processes take over that IP?
I am afraid I was unable to penetrate his writing to figure out what he wanted
or meant.
Where have I gone wrong?
Good luck with whatever you pick! Please do share your final solution
- I have the same challenge! (My idea is simply that if an NTP machine
goes down, then it goes down...)
Well, that is what it does. If you have 16 machines, all able to take over
from each other, then those 16 machines can all be ntp servers, and if one
goes down, then chrony will use the other 15.
I have been having trouble recently with my Sure GPS PPS clock. It will go
down for hours at a time -- no idea why. The system just naturally falls over
to using some other server as its time source. And when the PPS comes back up,
it goes back to using it as the time source.
Chrony does have a minor problem in using only one server at a time for its
time guidance. Thus when things fall over, there is a potential discontinuity
in the time (in my case at the 10 us to 100 us level). But with only one high
resolution GPS time source, that is inevitable. (And doubling it up is not
really an option, since the interrupt servicing time is about 20 us, which
means that one or the other of the clocks would always be late by that kind
of time. And which interrupt gets serviced first tends to be random.)
Ed W
--
William G. Unruh    | Canadian Institute for | Tel: +1(604)822-3273
Physics & Astronomy | Advanced Research      | Fax: +1(604)822-5324
UBC, Vancouver, BC  | Program in Cosmology   | unruh@xxxxxxxxxxxxxx
Canada V6T 1Z1      | and Gravity            | www.theory.physics.ubc.ca/
--
To unsubscribe email chrony-dev-request@xxxxxxxxxxxxxxxxxxxx with "unsubscribe" in the subject.
For help email chrony-dev-request@xxxxxxxxxxxxxxxxxxxx with "help" in the subject.
Trouble? Email listmaster@xxxxxxxxxxxxxxxxxxxx.