Hi group,
On one of the networks I run, some of the NTP servers have failed and
proven difficult to get back up and running. This has resulted in the
usual ensemble consisting of one working server and two not running
servers.
Unfortunately, some ntpd-based NTP clients have decided that having
only one time source means they should stop adjusting the local clock,
thus causing machines to drift apart rather than fall back to using
the surviving server.
Is there a way (via ntp.conf or similar) to tell the ntpd algorithms
to not give up in this scenario.
Unfortunately, some ntpd-based NTP clients have decided that having
only one time source means they should stop adjusting the local clock,
thus causing machines to drift apart rather than fall back to using
the surviving server.
On 07/01/2026 10:33, Jakob Bohm wrote:
Unfortunately, some ntpd-based NTP clients have decided that having
only one time source means they should stop adjusting the local clock,
thus causing machines to drift apart rather than fall back to using
the surviving server.
Do the problem clients have the local clock driver configured? I seem
to remember that this will be treated normally, as far as rejecting
outliers is concerned, so if there is one other source, and the error
bounds don't overlap, both will be rejected. (Pure clients should not
be using this.)
On 07/01/2026 10:33, Jakob Bohm wrote:
Unfortunately, some ntpd-based NTP clients have decided that having
only one time source means they should stop adjusting the local clock,
thus causing machines to drift apart rather than fall back to using
the surviving server.
Do the problem clients have the local clock driver configured? I seem
to remember that this will be treated normally, as far as rejecting
outliers is concerned, so if there is one other source, and the error
bounds don't overlap, both will be rejected. (Pure clients should not
be using this.)
Also, the affected clients seem to be running the ntpsec modification of ntpd, but I was hoping it would be a setting not specific to ntpsec.
On 07/01/2026 17:04, Jakob Bohm wrote:
Also, the affected clients seem to be running the ntpsec modification
of ntpd, but I was hoping it would be a setting not specific to ntpsec.
What is the reason, reported by ntpq, for rejecting the server (the
tally character, from the peers sub-command)?
"+" rather than "*", and the offset diverges slowly but surely far from
0 (some clients reached multiple seconds of error).
On 13/01/2026 07:09, Jakob Bohm wrote:
"+" rather than "*", and the offset diverges slowly but surely far
from 0 (some clients reached multiple seconds of error).
If it is showing "+", it should be included in the solution for the
offset and used to adjust the clock. My assumption would be that it was
an ntpsec bug, but I'm not aware of any expertise on ntpsec on this newsgroup/mailing list.
On 13/01/2026 12:54, David Woolley wrote:
On 13/01/2026 07:09, Jakob Bohm wrote:
"+" rather than "*", and the offset diverges slowly but surely far
from 0 (some clients reached multiple seconds of error).
If it is showing "+", it should be included in the solution for the
offset and used to adjust the clock. My assumption would be that it
was an ntpsec bug, but I'm not aware of any expertise on ntpsec on
this newsgroup/mailing list.
Yes, I know it should be included, problem is that if there is only one
time source that qualifies, some ntpd versions (at least my copy of
ntpsec) finds no solution and doesn't discipline the local clock.
Work around for now is to make some low quality local servers (lots of jitter) selectable, thus letting ntpd select a solution with both good
and bad sources.
| Sysop: | DaiTengu |
|---|---|
| Location: | Appleton, WI |
| Users: | 1,104 |
| Nodes: | 10 (0 / 10) |
| Uptime: | 492393:28:49 |
| Calls: | 14,151 |
| Calls today: | 2 |
| Files: | 186,281 |
| D/L today: |
8,575 files (2,834M bytes) |
| Messages: | 2,501,316 |
| Posted today: | 1 |