time-nuts@lists.febo.com

Discussion of precise time and frequency measurement

View all threads

Re: [time-nuts] Trimble Thunderbolt no longer determines the correct date

TV
Tom Van Baak
Fri, Aug 11, 2017 12:59 AM

Brad,

Strictly speaking, there's no problem with GPS, or Trimble, or the Thunderbolt GPSDO: each did exactly as designed and documented. Anyone working with timing systems (from astronomy to calendars to watch making to operating systems) knows there are many subtle details. Just wait until 2038 for MOAR (the unix Mother of all Rollovers)!

Each GPS receiver manufacturer deals with dates and times and rollovers differently. Still, it sounds like the 3rd party who wrote software for the E911 or paging system forgot to read the manual on the GPS receiver they chose.

The Trimble Thunderbolts are used in many Radio Paging Systems to
synchronize their transmitters in simulcast mode.
Systems that are using the models affected by the 1024 week epoch
are currently off the air or functioning poorly.

If you are able, can you explain to us what exactly in the transmitters is the problem? All of our TBolts continue to give valid 1 PPS and 10 MHz outputs. What exactly do you mean by "synchronizing transmitters"?

Trimble's only distributor, Novotech, did not know about it and had no
inventory of the new replacement E series Thunderbolt GPS Receivers.
Trimble says they are shipping new units from their overseas factory in
about 2 weeks -- that's the best they can offer!

Is there no way for E911 people to manually set the clock on their system?

I read here on the Time Nuts messages that some are considering: "some in-line
device that re-writes the serial data as it comes out of the Thunderbolt

Right, that was an idea I mentioned. Easy to do but I don't think anyone bothered, because:

  1. Mark had already fixed Heather.exe for his users, some NTP people added code to their TBolt drivers, Didier updated his LCD Monitor board -- these are all solutions that keep the same TBolt and just tweak the interpretation of the received TSIP date. Hence no need to update the TBolt or create an inline date-fixer gizmo.

  2. Almost all of us use TBolts for their precise time & frequency outputs, not their TSIP packets, so rollovers don't matter.

Does anyone plan to do this? Or does anyone have any ideas for a short-term solution.
Any suggestions would be sincerely appreciated.

Presumably the E911 system runs some kind of software or operating system? Surely there's a way to have the guys who wrote the s/w put a fix in? That should be much faster and cheaper than waiting 2 weeks for new h/w, no?

Bob,

Next up is the comment that it took two weeks and $27,000 to fix.

Wow! At that price, I bet quite a few time-nuts will have a R-Pi or Arduino or PIC solution ready by the weekend. Me, I'm busy with family and eclipse and so will have to pass on the offer.

You'll want to continuously read serial, unpack TSIP (the DLE stuff), fix the 0x8F-AB message, repack into TSIP and serial output. You may even get away with a single UART since the Rx/Tx pins are both 9600 baud and full duplex. The 0x8F-AB date fix involves converting UTC ymd to #days, adding 1024, converting #days to UTC ymd, where #days is any linear day counting system that works from 1980 to 2100. Both Mark and I posted samples a while ago. It may take some work to make the inline translator code asynchronous enough to avoid data loss or excessive packet latency, though. And it's impossible to do real-time byte-for-byte translation because the DLE escapes in TSIP can slightly alter the byte count.

Adrian,

While it wouldn't be difficult to build such a device, manufacturing a decent
quantity in less than 2 weeks to beat Trimble would be a tall order.

Agreed. You know as there's a customer, a weekend project can easily turn into a 2 week project.

/tvb

Brad, Strictly speaking, there's no problem with GPS, or Trimble, or the Thunderbolt GPSDO: each did exactly as designed and documented. Anyone working with timing systems (from astronomy to calendars to watch making to operating systems) knows there are many subtle details. Just wait until 2038 for MOAR (the unix Mother of all Rollovers)! Each GPS receiver manufacturer deals with dates and times and rollovers differently. Still, it sounds like the 3rd party who wrote software for the E911 or paging system forgot to read the manual on the GPS receiver they chose. > The Trimble Thunderbolts are used in many Radio Paging Systems to > synchronize their transmitters in simulcast mode. > Systems that are using the models affected by the 1024 week epoch > are currently off the air or functioning poorly. If you are able, can you explain to us what exactly in the transmitters is the problem? All of our TBolts continue to give valid 1 PPS and 10 MHz outputs. What exactly do you mean by "synchronizing transmitters"? > Trimble's only distributor, Novotech, did not know about it and had no > inventory of the new replacement E series Thunderbolt GPS Receivers. > Trimble says they are shipping new units from their overseas factory in > about 2 weeks -- that's the best they can offer! Is there no way for E911 people to manually set the clock on their system? > I read here on the Time Nuts messages that some are considering: "some in-line > device that re-writes the serial data as it comes out of the Thunderbolt Right, that was an idea I mentioned. Easy to do but I don't think anyone bothered, because: 1) Mark had already fixed Heather.exe for his users, some NTP people added code to their TBolt drivers, Didier updated his LCD Monitor board -- these are all solutions that keep the same TBolt and just tweak the interpretation of the received TSIP date. Hence no need to update the TBolt or create an inline date-fixer gizmo. 2) Almost all of us use TBolts for their precise time & frequency outputs, not their TSIP packets, so rollovers don't matter. > Does anyone plan to do this? Or does anyone have any ideas for a short-term solution. > Any suggestions would be sincerely appreciated. Presumably the E911 system runs some kind of software or operating system? Surely there's a way to have the guys who wrote the s/w put a fix in? That should be much faster and cheaper than waiting 2 weeks for new h/w, no? Bob, > Next up is the comment that it took two weeks and $27,000 to fix. Wow! At that price, I bet quite a few time-nuts will have a R-Pi or Arduino or PIC solution ready by the weekend. Me, I'm busy with family and eclipse and so will have to pass on the offer. You'll want to continuously read serial, unpack TSIP (the DLE stuff), fix the 0x8F-AB message, repack into TSIP and serial output. You may even get away with a single UART since the Rx/Tx pins are both 9600 baud and full duplex. The 0x8F-AB date fix involves converting UTC ymd to #days, adding 1024, converting #days to UTC ymd, where #days is any linear day counting system that works from 1980 to 2100. Both Mark and I posted samples a while ago. It may take some work to make the inline translator code asynchronous enough to avoid data loss or excessive packet latency, though. And it's impossible to do real-time byte-for-byte translation because the DLE escapes in TSIP can slightly alter the byte count. Adrian, > While it wouldn't be difficult to build such a device, manufacturing a decent > quantity in less than 2 weeks to beat Trimble would be a tall order. Agreed. You know as there's a customer, a weekend project can easily turn into a 2 week project. /tvb
BD
Brad Dye
Fri, Aug 11, 2017 1:58 AM

Tom, et.al.

The E911 installation, in the news, is just one of several. Others are hospitals, fire stations, etc. using different dispatch systems.

In a wide-area simulcast-overlap paging system, the transmitters in the same coverage area are carefully set to all transmit at exactly the same time. Each transmitter frequency is set to be slightly different in order to control signal overlap/signal cancellation (read: a few Hertz). This has allowed simulcast paging to achieve better coverage that any other technology that I am aware of — especially in a one-to-many transmission.  Some of the fine points of this system engineering are discussed here: http://braddye.com/angus_flex_at_6400.html http://braddye.com/angus_flex_at_6400.html

A simple explanation of simulcasting is here: http://braddye.com/simulcasting.html http://braddye.com/simulcasting.html

So to me "synchronizing transmitters” means the control system sends the traffic out to all the transmitters (over satellite) and tells them all to hold the messages in a buffer until “the big hand points straight up” or whatever data command the system uses. (excuse the vernacular)

The problems being experienced right now appear to be the interface of the ThunderBolt with the Zetron Model 620 simulcast controller over TSIP. The Zetron box is also called a “wireless data encoder.” https://www.zetron.com/Portals/0/PDFs/products/Spec%20Sheet%20PDFs/04-Paging/005-1232%20Model%20600-620%20Spec.pdf https://www.zetron.com/Portals/0/PDFs/products/Spec%20Sheet%20PDFs/04-Paging/005-1232%20Model%20600-620%20Spec.pdf This is beyond my pay grade since I am 75 years old and semi-retired.

It is not our goal to blame a particular piece of equipment for this problem. The facts are the 1024 roll over happened and just about nobody in the paging business knew it was coming. A solution does not need to be found before replacement hardware arrives from wherever it is manufactured—in two weeks. I am sure the operators would rather solve it with a translator box, or something, than buy a new GPSDO for every transmitter in a system — because there are many.

When I worked in the paging industry and there was a public safety emergency because a system was off the air, no one ate or slept until it was fixed. Well that may be a slight exaggeration because I have never missed many meals.

Best Regards,

Brad Dye, K9IQY (licensed 60 years)
Editor, The Wireless Messaging News
P.O. Box 266
Fairfield, IL  62837 USA
Telephone: 618-599-7869
Skype: braddye
http://www.braddye.com http://www.braddye.com/

On Aug 10, 2017, at 7:59 PM, Tom Van Baak <tvb@LeapSecond.com mailto:tvb@LeapSecond.com> wrote:

Brad,

Strictly speaking, there's no problem with GPS, or Trimble, or the Thunderbolt GPSDO: each did exactly as designed and documented. Anyone working with timing systems (from astronomy to calendars to watch making to operating systems) knows there are many subtle details. Just wait until 2038 for MOAR (the unix Mother of all Rollovers)!

Each GPS receiver manufacturer deals with dates and times and rollovers differently. Still, it sounds like the 3rd party who wrote software for the E911 or paging system forgot to read the manual on the GPS receiver they chose.

The Trimble Thunderbolts are used in many Radio Paging Systems to
synchronize their transmitters in simulcast mode.
Systems that are using the models affected by the 1024 week epoch
are currently off the air or functioning poorly.

If you are able, can you explain to us what exactly in the transmitters is the problem? All of our TBolts continue to give valid 1 PPS and 10 MHz outputs. What exactly do you mean by "synchronizing transmitters"?

Trimble's only distributor, Novotech, did not know about it and had no
inventory of the new replacement E series Thunderbolt GPS Receivers.
Trimble says they are shipping new units from their overseas factory in
about 2 weeks -- that's the best they can offer!

Is there no way for E911 people to manually set the clock on their system?

I read here on the Time Nuts messages that some are considering: "some in-line
device that re-writes the serial data as it comes out of the Thunderbolt

Right, that was an idea I mentioned. Easy to do but I don't think anyone bothered, because:

  1. Mark had already fixed Heather.exe for his users, some NTP people added code to their TBolt drivers, Didier updated his LCD Monitor board -- these are all solutions that keep the same TBolt and just tweak the interpretation of the received TSIP date. Hence no need to update the TBolt or create an inline date-fixer gizmo.

  2. Almost all of us use TBolts for their precise time & frequency outputs, not their TSIP packets, so rollovers don't matter.

Does anyone plan to do this? Or does anyone have any ideas for a short-term solution.
Any suggestions would be sincerely appreciated.

Presumably the E911 system runs some kind of software or operating system? Surely there's a way to have the guys who wrote the s/w put a fix in? That should be much faster and cheaper than waiting 2 weeks for new h/w, no?

Bob,

Next up is the comment that it took two weeks and $27,000 to fix.

Wow! At that price, I bet quite a few time-nuts will have a R-Pi or Arduino or PIC solution ready by the weekend. Me, I'm busy with family and eclipse and so will have to pass on the offer.

You'll want to continuously read serial, unpack TSIP (the DLE stuff), fix the 0x8F-AB message, repack into TSIP and serial output. You may even get away with a single UART since the Rx/Tx pins are both 9600 baud and full duplex. The 0x8F-AB date fix involves converting UTC ymd to #days, adding 1024, converting #days to UTC ymd, where #days is any linear day counting system that works from 1980 to 2100. Both Mark and I posted samples a while ago. It may take some work to make the inline translator code asynchronous enough to avoid data loss or excessive packet latency, though. And it's impossible to do real-time byte-for-byte translation because the DLE escapes in TSIP can slightly alter the byte count.

Adrian,

While it wouldn't be difficult to build such a device, manufacturing a decent
quantity in less than 2 weeks to beat Trimble would be a tall order.

Agreed. You know as there's a customer, a weekend project can easily turn into a 2 week project.

/tvb


time-nuts mailing list -- time-nuts@febo.com mailto:time-nuts@febo.com
To unsubscribe, go to https://www.febo.com/cgi-bin/mailman/listinfo/time-nuts https://www.febo.com/cgi-bin/mailman/listinfo/time-nuts
and follow the instructions there.

Tom, et.al. The E911 installation, in the news, is just one of several. Others are hospitals, fire stations, etc. using different dispatch systems. In a wide-area simulcast-overlap paging system, the transmitters in the same coverage area are carefully set to all transmit at exactly the same time. Each transmitter frequency is set to be slightly different in order to control signal overlap/signal cancellation (read: a few Hertz). This has allowed simulcast paging to achieve better coverage that any other technology that I am aware of — especially in a one-to-many transmission. Some of the fine points of this system engineering are discussed here: http://braddye.com/angus_flex_at_6400.html <http://braddye.com/angus_flex_at_6400.html> A simple explanation of simulcasting is here: http://braddye.com/simulcasting.html <http://braddye.com/simulcasting.html> So to me "synchronizing transmitters” means the control system sends the traffic out to all the transmitters (over satellite) and tells them all to hold the messages in a buffer until “the big hand points straight up” or whatever data command the system uses. (excuse the vernacular) The problems being experienced right now appear to be the interface of the ThunderBolt with the Zetron Model 620 simulcast controller over TSIP. The Zetron box is also called a “wireless data encoder.” https://www.zetron.com/Portals/0/PDFs/products/Spec%20Sheet%20PDFs/04-Paging/005-1232%20Model%20600-620%20Spec.pdf <https://www.zetron.com/Portals/0/PDFs/products/Spec%20Sheet%20PDFs/04-Paging/005-1232%20Model%20600-620%20Spec.pdf> This is beyond my pay grade since I am 75 years old and semi-retired. It is not our goal to blame a particular piece of equipment for this problem. The facts are the 1024 roll over happened and just about nobody in the paging business knew it was coming. A solution does not need to be found before replacement hardware arrives from wherever it is manufactured—in two weeks. I am sure the operators would rather solve it with a translator box, or something, than buy a new GPSDO for every transmitter in a system — because there are many. When I worked in the paging industry and there was a public safety emergency because a system was off the air, no one ate or slept until it was fixed. Well that may be a slight exaggeration because I have never missed many meals. Best Regards, Brad Dye, K9IQY (licensed 60 years) Editor, The Wireless Messaging News P.O. Box 266 Fairfield, IL 62837 USA Telephone: 618-599-7869 Skype: braddye http://www.braddye.com <http://www.braddye.com/> > On Aug 10, 2017, at 7:59 PM, Tom Van Baak <tvb@LeapSecond.com <mailto:tvb@LeapSecond.com>> wrote: > > Brad, > > Strictly speaking, there's no problem with GPS, or Trimble, or the Thunderbolt GPSDO: each did exactly as designed and documented. Anyone working with timing systems (from astronomy to calendars to watch making to operating systems) knows there are many subtle details. Just wait until 2038 for MOAR (the unix Mother of all Rollovers)! > > Each GPS receiver manufacturer deals with dates and times and rollovers differently. Still, it sounds like the 3rd party who wrote software for the E911 or paging system forgot to read the manual on the GPS receiver they chose. > >> The Trimble Thunderbolts are used in many Radio Paging Systems to >> synchronize their transmitters in simulcast mode. >> Systems that are using the models affected by the 1024 week epoch >> are currently off the air or functioning poorly. > > If you are able, can you explain to us what exactly in the transmitters is the problem? All of our TBolts continue to give valid 1 PPS and 10 MHz outputs. What exactly do you mean by "synchronizing transmitters"? > >> Trimble's only distributor, Novotech, did not know about it and had no >> inventory of the new replacement E series Thunderbolt GPS Receivers. >> Trimble says they are shipping new units from their overseas factory in >> about 2 weeks -- that's the best they can offer! > > Is there no way for E911 people to manually set the clock on their system? > >> I read here on the Time Nuts messages that some are considering: "some in-line >> device that re-writes the serial data as it comes out of the Thunderbolt > > Right, that was an idea I mentioned. Easy to do but I don't think anyone bothered, because: > > 1) Mark had already fixed Heather.exe for his users, some NTP people added code to their TBolt drivers, Didier updated his LCD Monitor board -- these are all solutions that keep the same TBolt and just tweak the interpretation of the received TSIP date. Hence no need to update the TBolt or create an inline date-fixer gizmo. > > 2) Almost all of us use TBolts for their precise time & frequency outputs, not their TSIP packets, so rollovers don't matter. > >> Does anyone plan to do this? Or does anyone have any ideas for a short-term solution. >> Any suggestions would be sincerely appreciated. > > Presumably the E911 system runs some kind of software or operating system? Surely there's a way to have the guys who wrote the s/w put a fix in? That should be much faster and cheaper than waiting 2 weeks for new h/w, no? > > Bob, > >> Next up is the comment that it took two weeks and $27,000 to fix. > > Wow! At that price, I bet quite a few time-nuts will have a R-Pi or Arduino or PIC solution ready by the weekend. Me, I'm busy with family and eclipse and so will have to pass on the offer. > > You'll want to continuously read serial, unpack TSIP (the DLE stuff), fix the 0x8F-AB message, repack into TSIP and serial output. You may even get away with a single UART since the Rx/Tx pins are both 9600 baud and full duplex. The 0x8F-AB date fix involves converting UTC ymd to #days, adding 1024, converting #days to UTC ymd, where #days is any linear day counting system that works from 1980 to 2100. Both Mark and I posted samples a while ago. It may take some work to make the inline translator code asynchronous enough to avoid data loss or excessive packet latency, though. And it's impossible to do real-time byte-for-byte translation because the DLE escapes in TSIP can slightly alter the byte count. > > Adrian, > >> While it wouldn't be difficult to build such a device, manufacturing a decent >> quantity in less than 2 weeks to beat Trimble would be a tall order. > > Agreed. You know as there's a customer, a weekend project can easily turn into a 2 week project. > > /tvb > > _______________________________________________ > time-nuts mailing list -- time-nuts@febo.com <mailto:time-nuts@febo.com> > To unsubscribe, go to https://www.febo.com/cgi-bin/mailman/listinfo/time-nuts <https://www.febo.com/cgi-bin/mailman/listinfo/time-nuts> > and follow the instructions there.
TV
Tom Van Baak
Fri, Aug 11, 2017 4:26 PM

The E911 installation, in the news, is just one of several. Others are hospitals,
fire stations, etc. using different dispatch systems.

Hey, at least important things like mobile phones, ISP's, Google, Amazon, FedEx and Starbucks aren't affected ;-)

In a wide-area simulcast-overlap paging system, the transmitters in the same
coverage area are carefully set to all transmit at exactly the same time.

That's fine. And very clever. But why is this "life safety" system tied to GPS, to a particular vendor, to a particular model of receiver (that clearly states in the documentation that it has a 1024 week / 19.6 year window of valid UTC times)?

So to me "synchronizing transmitters” means the control system sends the
traffic out to all the transmitters (over satellite) and tells them all to hold the
messages in a buffer until “the big hand points straight up” or whatever data
command the system uses. (excuse the vernacular)

Exactly. In most of the precise timing world the "big hand" is the "top of the second", or the so-called 1 PPS pulse. The idea is that all 1PPS agree with each other, whether from a cesium clock, or WWVB receiver, or NTP, or GPS (or any other GNSS system).

Since the paging system failed it sounds like it was synchronized to some "hand" other than 1PPS. The rare GPS rollover events tend not to disrupt the 1PPS output -- it is still perfectly aligned with UTC -- which is why almost no one else worries about the recent TBolt episode, or any other GPS receiver for that matter.

The problems being experienced right now appear to be the interface of the ThunderBolt
with the Zetron Model 620 simulcast controller over TSIP. The Zetron box is also called
a “wireless data encoder.”

Ah, ok. So do you or anyone have contact within Zetron? The easy fix would be for them to upgrade their firmware and send out a patch. Probably cheaper than supplying new receivers from Trimble. I don't know; for us, a s/w fix is easy compared to a h/w fix or a h/w swap-out. But in the real world, once technicians have driven to a remote installation, maybe there's no real difference between a s/w fix and a h/w swap.

It is not our goal to blame a particular piece of equipment for this problem.

Right, no need to blame. I think many of us would just want to pinpoint the root cause of the problem, out of engineering curiosity. By root cause I mean actual schematics or lines of source code. It's always been my hope, after every one of these widespread infrastructure events, that the actual source code or design decisions be published eventually so that we can all learn from it.

The facts are the 1024 roll over happened and just about nobody in the paging
business knew it was coming.

Ok, now you know about GPS rollovers! Fun, isn't it? Leap seconds are fun too.

When the dust settles, you may want to look into the more general topic of life safety infrastructure vs. free-from-the-sky time & frequency. These days nanosecond precise time is cheaper than water -- but it's also fragile. A lot has been written about this. It's both a wake-up call for naive vendors of products based on GPS alone and also an opportunity for vendors who know how to design and market resilient timing products.

/tvb

> The E911 installation, in the news, is just one of several. Others are hospitals, > fire stations, etc. using different dispatch systems. Hey, at least important things like mobile phones, ISP's, Google, Amazon, FedEx and Starbucks aren't affected ;-) > In a wide-area simulcast-overlap paging system, the transmitters in the same > coverage area are carefully set to all transmit at exactly the same time. That's fine. And very clever. But why is this "life safety" system tied to GPS, to a particular vendor, to a particular model of receiver (that clearly states in the documentation that it has a 1024 week / 19.6 year window of valid UTC times)? > So to me "synchronizing transmitters” means the control system sends the > traffic out to all the transmitters (over satellite) and tells them all to hold the > messages in a buffer until “the big hand points straight up” or whatever data > command the system uses. (excuse the vernacular) Exactly. In most of the precise timing world the "big hand" is the "top of the second", or the so-called 1 PPS pulse. The idea is that all 1PPS agree with each other, whether from a cesium clock, or WWVB receiver, or NTP, or GPS (or any other GNSS system). Since the paging system failed it sounds like it was synchronized to some "hand" other than 1PPS. The rare GPS rollover events tend not to disrupt the 1PPS output -- it is still perfectly aligned with UTC -- which is why almost no one else worries about the recent TBolt episode, or any other GPS receiver for that matter. > The problems being experienced right now appear to be the interface of the ThunderBolt > with the Zetron Model 620 simulcast controller over TSIP. The Zetron box is also called > a “wireless data encoder.” Ah, ok. So do you or anyone have contact within Zetron? The easy fix would be for them to upgrade their firmware and send out a patch. Probably cheaper than supplying new receivers from Trimble. I don't know; for us, a s/w fix is easy compared to a h/w fix or a h/w swap-out. But in the real world, once technicians have driven to a remote installation, maybe there's no real difference between a s/w fix and a h/w swap. > It is not our goal to blame a particular piece of equipment for this problem. Right, no need to blame. I think many of us would just want to pinpoint the root cause of the problem, out of engineering curiosity. By root cause I mean actual schematics or lines of source code. It's always been my hope, after every one of these widespread infrastructure events, that the actual source code or design decisions be published eventually so that we can all learn from it. > The facts are the 1024 roll over happened and just about nobody in the paging > business knew it was coming. Ok, now you know about GPS rollovers! Fun, isn't it? Leap seconds are fun too. When the dust settles, you may want to look into the more general topic of life safety infrastructure vs. free-from-the-sky time & frequency. These days nanosecond precise time is cheaper than water -- but it's also fragile. A lot has been written about this. It's both a wake-up call for naive vendors of products based on GPS alone and also an opportunity for vendors who know how to design and market resilient timing products. /tvb
MS
Mark Spencer
Fri, Aug 11, 2017 4:51 PM

Nice post Tom.  I was also wondering about the replacement hardware vs software patch issue.  Just speculation on my part but perhaps changing the software involves an extensive QA / test process, vs swapping out a piece of hardware ?  Again this is just pure speculation on my part.

Mark Spencer

On Aug 11, 2017, at 9:26 AM, Tom Van Baak tvb@LeapSecond.com wrote:

The E911 installation, in the news, is just one of several. Others are hospitals,
fire stations, etc. using different dispatch systems.

Hey, at least important things like mobile phones, ISP's, Google, Amazon, FedEx and Starbucks aren't affected ;-)

In a wide-area simulcast-overlap paging system, the transmitters in the same
coverage area are carefully set to all transmit at exactly the same time.

That's fine. And very clever. But why is this "life safety" system tied to GPS, to a particular vendor, to a particular model of receiver (that clearly states in the documentation that it has a 1024 week / 19.6 year window of valid UTC times)?

So to me "synchronizing transmitters” means the control system sends the
traffic out to all the transmitters (over satellite) and tells them all to hold the
messages in a buffer until “the big hand points straight up” or whatever data
command the system uses. (excuse the vernacular)

Exactly. In most of the precise timing world the "big hand" is the "top of the second", or the so-called 1 PPS pulse. The idea is that all 1PPS agree with each other, whether from a cesium clock, or WWVB receiver, or NTP, or GPS (or any other GNSS system).

Since the paging system failed it sounds like it was synchronized to some "hand" other than 1PPS. The rare GPS rollover events tend not to disrupt the 1PPS output -- it is still perfectly aligned with UTC -- which is why almost no one else worries about the recent TBolt episode, or any other GPS receiver for that matter.

The problems being experienced right now appear to be the interface of the ThunderBolt
with the Zetron Model 620 simulcast controller over TSIP. The Zetron box is also called
a “wireless data encoder.”

Ah, ok. So do you or anyone have contact within Zetron? The easy fix would be for them to upgrade their firmware and send out a patch. Probably cheaper than supplying new receivers from Trimble. I don't know; for us, a s/w fix is easy compared to a h/w fix or a h/w swap-out. But in the real world, once technicians have driven to a remote installation, maybe there's no real difference between a s/w fix and a h/w swap.

It is not our goal to blame a particular piece of equipment for this problem.

Right, no need to blame. I think many of us would just want to pinpoint the root cause of the problem, out of engineering curiosity. By root cause I mean actual schematics or lines of source code. It's always been my hope, after every one of these widespread infrastructure events, that the actual source code or design decisions be published eventually so that we can all learn from it.

The facts are the 1024 roll over happened and just about nobody in the paging
business knew it was coming.

Ok, now you know about GPS rollovers! Fun, isn't it? Leap seconds are fun too.

When the dust settles, you may want to look into the more general topic of life safety infrastructure vs. free-from-the-sky time & frequency. These days nanosecond precise time is cheaper than water -- but it's also fragile. A lot has been written about this. It's both a wake-up call for naive vendors of products based on GPS alone and also an opportunity for vendors who know how to design and market resilient timing products.

/tvb


time-nuts mailing list -- time-nuts@febo.com
To unsubscribe, go to https://www.febo.com/cgi-bin/mailman/listinfo/time-nuts
and follow the instructions there.

Nice post Tom. I was also wondering about the replacement hardware vs software patch issue. Just speculation on my part but perhaps changing the software involves an extensive QA / test process, vs swapping out a piece of hardware ? Again this is just pure speculation on my part. Mark Spencer On Aug 11, 2017, at 9:26 AM, Tom Van Baak <tvb@LeapSecond.com> wrote: >> The E911 installation, in the news, is just one of several. Others are hospitals, >> fire stations, etc. using different dispatch systems. > > Hey, at least important things like mobile phones, ISP's, Google, Amazon, FedEx and Starbucks aren't affected ;-) > >> In a wide-area simulcast-overlap paging system, the transmitters in the same >> coverage area are carefully set to all transmit at exactly the same time. > > That's fine. And very clever. But why is this "life safety" system tied to GPS, to a particular vendor, to a particular model of receiver (that clearly states in the documentation that it has a 1024 week / 19.6 year window of valid UTC times)? > >> So to me "synchronizing transmitters” means the control system sends the >> traffic out to all the transmitters (over satellite) and tells them all to hold the >> messages in a buffer until “the big hand points straight up” or whatever data >> command the system uses. (excuse the vernacular) > > Exactly. In most of the precise timing world the "big hand" is the "top of the second", or the so-called 1 PPS pulse. The idea is that all 1PPS agree with each other, whether from a cesium clock, or WWVB receiver, or NTP, or GPS (or any other GNSS system). > > Since the paging system failed it sounds like it was synchronized to some "hand" other than 1PPS. The rare GPS rollover events tend not to disrupt the 1PPS output -- it is still perfectly aligned with UTC -- which is why almost no one else worries about the recent TBolt episode, or any other GPS receiver for that matter. > >> The problems being experienced right now appear to be the interface of the ThunderBolt >> with the Zetron Model 620 simulcast controller over TSIP. The Zetron box is also called >> a “wireless data encoder.” > > Ah, ok. So do you or anyone have contact within Zetron? The easy fix would be for them to upgrade their firmware and send out a patch. Probably cheaper than supplying new receivers from Trimble. I don't know; for us, a s/w fix is easy compared to a h/w fix or a h/w swap-out. But in the real world, once technicians have driven to a remote installation, maybe there's no real difference between a s/w fix and a h/w swap. > >> It is not our goal to blame a particular piece of equipment for this problem. > > Right, no need to blame. I think many of us would just want to pinpoint the root cause of the problem, out of engineering curiosity. By root cause I mean actual schematics or lines of source code. It's always been my hope, after every one of these widespread infrastructure events, that the actual source code or design decisions be published eventually so that we can all learn from it. > >> The facts are the 1024 roll over happened and just about nobody in the paging >> business knew it was coming. > > Ok, now you know about GPS rollovers! Fun, isn't it? Leap seconds are fun too. > > When the dust settles, you may want to look into the more general topic of life safety infrastructure vs. free-from-the-sky time & frequency. These days nanosecond precise time is cheaper than water -- but it's also fragile. A lot has been written about this. It's both a wake-up call for naive vendors of products based on GPS alone and also an opportunity for vendors who know how to design and market resilient timing products. > > /tvb > > _______________________________________________ > time-nuts mailing list -- time-nuts@febo.com > To unsubscribe, go to https://www.febo.com/cgi-bin/mailman/listinfo/time-nuts > and follow the instructions there. >
BK
Bob kb8tq
Fri, Aug 11, 2017 4:53 PM

Hi

The whole “offset frequency” simulcast thing is pretty old. It most certainly pre-dates
GPS. It’s actually old enough that the first OCXO I ever designed at Motorola went into that kind
of system. The “time sync” thing came along a while after that.

There’s always been a lot of infrastructure gear that goes on chugging for a long time.
Budgets are pretty skinny for a lot of very important stuff. That doesn’t just go for the
gear, it also goes for the payroll to support them. Maybe it should not be so, but in most
of the places I’m aware of, it is.

Bob

On Aug 11, 2017, at 12:26 PM, Tom Van Baak tvb@leapsecond.com wrote:

The E911 installation, in the news, is just one of several. Others are hospitals,
fire stations, etc. using different dispatch systems.

Hey, at least important things like mobile phones, ISP's, Google, Amazon, FedEx and Starbucks aren't affected ;-)

In a wide-area simulcast-overlap paging system, the transmitters in the same
coverage area are carefully set to all transmit at exactly the same time.

That's fine. And very clever. But why is this "life safety" system tied to GPS, to a particular vendor, to a particular model of receiver (that clearly states in the

Hi The whole “offset frequency” simulcast thing is pretty old. It most certainly pre-dates GPS. It’s actually old enough that the first OCXO I ever designed at Motorola went into that kind of system. The “time sync” thing came along a while after that. There’s always been a lot of infrastructure gear that goes on chugging for a *long* time. Budgets are pretty skinny for a lot of very important stuff. That doesn’t just go for the gear, it also goes for the payroll to support them. Maybe it should not be so, but in most of the places I’m aware of, it is. Bob > On Aug 11, 2017, at 12:26 PM, Tom Van Baak <tvb@leapsecond.com> wrote: > >> The E911 installation, in the news, is just one of several. Others are hospitals, >> fire stations, etc. using different dispatch systems. > > Hey, at least important things like mobile phones, ISP's, Google, Amazon, FedEx and Starbucks aren't affected ;-) > >> In a wide-area simulcast-overlap paging system, the transmitters in the same >> coverage area are carefully set to all transmit at exactly the same time. > > That's fine. And very clever. But why is this "life safety" system tied to GPS, to a particular vendor, to a particular model of receiver (that clearly states in the
BK
Bob kb8tq
Fri, Aug 11, 2017 4:57 PM

Hi

Best guess is that the “real work” on the firmware took place …errr… a bit over 19.6 years ago.
That’s a massively long time ago in terms of development tools and hardware. Simply getting
a tool suite back up and going on the “old code” would be a big task in most organizations. Ask
me about stuff I did 20 years ago and you’ll get a bit of a blank stare. That assumes you still have
me on the payroll. Dig into it from scratch with nobody having direct experience … yikes.  Yes, I’ve
been involved in those sort of projects, they rarely end well ….

Bob

On Aug 11, 2017, at 12:51 PM, Mark Spencer mark@alignedsolutions.com wrote:

Nice post Tom.  I was also wondering about the replacement hardware vs software patch issue.  Just speculation on my part but perhaps changing the software involves an extensive QA / test process, vs swapping out a piece of hardware ?  Again this is just pure speculation on my part.

Mark Spencer

On Aug 11, 2017, at 9:26 AM, Tom Van Baak tvb@LeapSecond.com wrote:

The E911 installation, in the news, is just one of several. Others are hospitals,
fire stations, etc. using different dispatch systems.

Hey, at least important things like mobile phones, ISP's, Google, Amazon, FedEx and Starbucks aren't affected ;-)

In a wide-area simulcast-overlap paging system, the transmitters in the same
coverage area are carefully set to all transmit at exactly the same time.

That's fine. And very clever. But why is this "life safety" system tied to GPS, to a particular vendor, to a particular model of receiver (that clearly states in the documentation that it has a 1024 week / 19.6 year window of valid UTC times)?

So to me "synchronizing transmitters” means the control system sends the
traffic out to all the transmitters (over satellite) and tells them all to hold the
messages in a buffer until “the big hand points straight up” or whatever data
command the system uses. (excuse the vernacular)

Exactly. In most of the precise timing world the "big hand" is the "top of the second", or the so-called 1 PPS pulse. The idea is that all 1PPS agree with each other, whether from a cesium clock, or WWVB receiver, or NTP, or GPS (or any other GNSS system).

Since the paging system failed it sounds like it was synchronized to some "hand" other than 1PPS. The rare GPS rollover events tend not to disrupt the 1PPS output -- it is still perfectly aligned with UTC -- which is why almost no one else worries about the recent TBolt episode, or any other GPS receiver for that matter.

The problems being experienced right now appear to be the interface of the ThunderBolt
with the Zetron Model 620 simulcast controller over TSIP. The Zetron box is also called
a “wireless data encoder.”

Ah, ok. So do you or anyone have contact within Zetron? The easy fix would be for them to upgrade their firmware and send out a patch. Probably cheaper than supplying new receivers from Trimble. I don't know; for us, a s/w fix is easy compared to a h/w fix or a h/w swap-out. But in the real world, once technicians have driven to a remote installation, maybe there's no real difference between a s/w fix and a h/w swap.

It is not our goal to blame a particular piece of equipment for this problem.

Right, no need to blame. I think many of us would just want to pinpoint the root cause of the problem, out of engineering curiosity. By root cause I mean actual schematics or lines of source code. It's always been my hope, after every one of these widespread infrastructure events, that the actual source code or design decisions be published eventually so that we can all learn from it.

The facts are the 1024 roll over happened and just about nobody in the paging
business knew it was coming.

Ok, now you know about GPS rollovers! Fun, isn't it? Leap seconds are fun too.

When the dust settles, you may want to look into the more general topic of life safety infrastructure vs. free-from-the-sky time & frequency. These days nanosecond precise time is cheaper than water -- but it's also fragile. A lot has been written about this. It's both a wake-up call for naive vendors of products based on GPS alone and also an opportunity for vendors who know how to design and market resilient timing products.

/tvb


time-nuts mailing list -- time-nuts@febo.com
To unsubscribe, go to https://www.febo.com/cgi-bin/mailman/listinfo/time-nuts
and follow the instructions there.


time-nuts mailing list -- time-nuts@febo.com
To unsubscribe, go to https://www.febo.com/cgi-bin/mailman/listinfo/time-nuts
and follow the instructions there.

Hi Best guess is that the “real work” on the firmware took place …errr… a bit over 19.6 years ago. That’s a massively long time ago in terms of development tools and hardware. Simply getting a tool suite back up and going on the “old code” would be a big task in most organizations. Ask me about stuff I did 20 years ago and you’ll get a bit of a blank stare. That assumes you still have me on the payroll. Dig into it from scratch with nobody having direct experience … yikes. Yes, I’ve been involved in those sort of projects, they rarely end well …. Bob > On Aug 11, 2017, at 12:51 PM, Mark Spencer <mark@alignedsolutions.com> wrote: > > Nice post Tom. I was also wondering about the replacement hardware vs software patch issue. Just speculation on my part but perhaps changing the software involves an extensive QA / test process, vs swapping out a piece of hardware ? Again this is just pure speculation on my part. > > Mark Spencer > > > > On Aug 11, 2017, at 9:26 AM, Tom Van Baak <tvb@LeapSecond.com> wrote: > >>> The E911 installation, in the news, is just one of several. Others are hospitals, >>> fire stations, etc. using different dispatch systems. >> >> Hey, at least important things like mobile phones, ISP's, Google, Amazon, FedEx and Starbucks aren't affected ;-) >> >>> In a wide-area simulcast-overlap paging system, the transmitters in the same >>> coverage area are carefully set to all transmit at exactly the same time. >> >> That's fine. And very clever. But why is this "life safety" system tied to GPS, to a particular vendor, to a particular model of receiver (that clearly states in the documentation that it has a 1024 week / 19.6 year window of valid UTC times)? >> >>> So to me "synchronizing transmitters” means the control system sends the >>> traffic out to all the transmitters (over satellite) and tells them all to hold the >>> messages in a buffer until “the big hand points straight up” or whatever data >>> command the system uses. (excuse the vernacular) >> >> Exactly. In most of the precise timing world the "big hand" is the "top of the second", or the so-called 1 PPS pulse. The idea is that all 1PPS agree with each other, whether from a cesium clock, or WWVB receiver, or NTP, or GPS (or any other GNSS system). >> >> Since the paging system failed it sounds like it was synchronized to some "hand" other than 1PPS. The rare GPS rollover events tend not to disrupt the 1PPS output -- it is still perfectly aligned with UTC -- which is why almost no one else worries about the recent TBolt episode, or any other GPS receiver for that matter. >> >>> The problems being experienced right now appear to be the interface of the ThunderBolt >>> with the Zetron Model 620 simulcast controller over TSIP. The Zetron box is also called >>> a “wireless data encoder.” >> >> Ah, ok. So do you or anyone have contact within Zetron? The easy fix would be for them to upgrade their firmware and send out a patch. Probably cheaper than supplying new receivers from Trimble. I don't know; for us, a s/w fix is easy compared to a h/w fix or a h/w swap-out. But in the real world, once technicians have driven to a remote installation, maybe there's no real difference between a s/w fix and a h/w swap. >> >>> It is not our goal to blame a particular piece of equipment for this problem. >> >> Right, no need to blame. I think many of us would just want to pinpoint the root cause of the problem, out of engineering curiosity. By root cause I mean actual schematics or lines of source code. It's always been my hope, after every one of these widespread infrastructure events, that the actual source code or design decisions be published eventually so that we can all learn from it. >> >>> The facts are the 1024 roll over happened and just about nobody in the paging >>> business knew it was coming. >> >> Ok, now you know about GPS rollovers! Fun, isn't it? Leap seconds are fun too. >> >> When the dust settles, you may want to look into the more general topic of life safety infrastructure vs. free-from-the-sky time & frequency. These days nanosecond precise time is cheaper than water -- but it's also fragile. A lot has been written about this. It's both a wake-up call for naive vendors of products based on GPS alone and also an opportunity for vendors who know how to design and market resilient timing products. >> >> /tvb >> >> _______________________________________________ >> time-nuts mailing list -- time-nuts@febo.com >> To unsubscribe, go to https://www.febo.com/cgi-bin/mailman/listinfo/time-nuts >> and follow the instructions there. >> > _______________________________________________ > time-nuts mailing list -- time-nuts@febo.com > To unsubscribe, go to https://www.febo.com/cgi-bin/mailman/listinfo/time-nuts > and follow the instructions there.
JN
Jeremy Nichols
Fri, Aug 11, 2017 5:54 PM

Agreed. Re-inventing stuff from twenty years ago was uneconomic and
possibly impossible when I was in Silicon Valley, twenty years ago.

Jeremy

On Fri, Aug 11, 2017 at 9:57 AM, Bob kb8tq kb8tq@n1k.org wrote:

Hi

Best guess is that the “real work” on the firmware took place …errr… a bit
over 19.6 years ago.
That’s a massively long time ago in terms of development tools and
hardware. Simply getting
a tool suite back up and going on the “old code” would be a big task in
most organizations. Ask
me about stuff I did 20 years ago and you’ll get a bit of a blank stare.
That assumes you still have
me on the payroll. Dig into it from scratch with nobody having direct
experience … yikes.  Yes, I’ve
been involved in those sort of projects, they rarely end well ….

Bob

On Aug 11, 2017, at 12:51 PM, Mark Spencer mark@alignedsolutions.com

wrote:

Nice post Tom.  I was also wondering about the replacement hardware vs

software patch issue.  Just speculation on my part but perhaps changing the
software involves an extensive QA / test process, vs swapping out a piece
of hardware ?  Again this is just pure speculation on my part.

Mark Spencer

On Aug 11, 2017, at 9:26 AM, Tom Van Baak tvb@LeapSecond.com wrote:

The E911 installation, in the news, is just one of several. Others are

hospitals,

fire stations, etc. using different dispatch systems.

Hey, at least important things like mobile phones, ISP's, Google,

Amazon, FedEx and Starbucks aren't affected ;-)

In a wide-area simulcast-overlap paging system, the transmitters in

the same

coverage area are carefully set to all transmit at exactly the same

time.

That's fine. And very clever. But why is this "life safety" system tied

to GPS, to a particular vendor, to a particular model of receiver (that
clearly states in the documentation that it has a 1024 week / 19.6 year
window of valid UTC times)?

So to me "synchronizing transmitters” means the control system sends

the

traffic out to all the transmitters (over satellite) and tells them

all to hold the

messages in a buffer until “the big hand points straight up” or

whatever data

command the system uses. (excuse the vernacular)

Exactly. In most of the precise timing world the "big hand" is the "top

of the second", or the so-called 1 PPS pulse. The idea is that all 1PPS
agree with each other, whether from a cesium clock, or WWVB receiver, or
NTP, or GPS (or any other GNSS system).

Since the paging system failed it sounds like it was synchronized to

some "hand" other than 1PPS. The rare GPS rollover events tend not to
disrupt the 1PPS output -- it is still perfectly aligned with UTC -- which
is why almost no one else worries about the recent TBolt episode, or any
other GPS receiver for that matter.

The problems being experienced right now appear to be the interface of

the ThunderBolt

with the Zetron Model 620 simulcast controller over TSIP. The Zetron

box is also called

a “wireless data encoder.”

Ah, ok. So do you or anyone have contact within Zetron? The easy fix

would be for them to upgrade their firmware and send out a patch. Probably
cheaper than supplying new receivers from Trimble. I don't know; for us, a
s/w fix is easy compared to a h/w fix or a h/w swap-out. But in the real
world, once technicians have driven to a remote installation, maybe there's
no real difference between a s/w fix and a h/w swap.

It is not our goal to blame a particular piece of equipment for this

problem.

Right, no need to blame. I think many of us would just want to pinpoint

the root cause of the problem, out of engineering curiosity. By root cause
I mean actual schematics or lines of source code. It's always been my hope,
after every one of these widespread infrastructure events, that the actual
source code or design decisions be published eventually so that we can all
learn from it.

The facts are the 1024 roll over happened and just about nobody in the

paging

business knew it was coming.

Ok, now you know about GPS rollovers! Fun, isn't it? Leap seconds are

fun too.

When the dust settles, you may want to look into the more general topic

of life safety infrastructure vs. free-from-the-sky time & frequency. These
days nanosecond precise time is cheaper than water -- but it's also
fragile. A lot has been written about this. It's both a wake-up call for
naive vendors of products based on GPS alone and also an opportunity for
vendors who know how to design and market resilient timing products.

/tvb


time-nuts mailing list -- time-nuts@febo.com
To unsubscribe, go to https://www.febo.com/cgi-bin/

mailman/listinfo/time-nuts

and follow the instructions there.


time-nuts mailing list -- time-nuts@febo.com
To unsubscribe, go to https://www.febo.com/cgi-bin/

mailman/listinfo/time-nuts

and follow the instructions there.


time-nuts mailing list -- time-nuts@febo.com
To unsubscribe, go to https://www.febo.com/cgi-bin/
mailman/listinfo/time-nuts
and follow the instructions there.

Agreed. Re-inventing stuff from twenty years ago was uneconomic and possibly impossible when I was in Silicon Valley, twenty years ago. Jeremy On Fri, Aug 11, 2017 at 9:57 AM, Bob kb8tq <kb8tq@n1k.org> wrote: > Hi > > Best guess is that the “real work” on the firmware took place …errr… a bit > over 19.6 years ago. > That’s a massively long time ago in terms of development tools and > hardware. Simply getting > a tool suite back up and going on the “old code” would be a big task in > most organizations. Ask > me about stuff I did 20 years ago and you’ll get a bit of a blank stare. > That assumes you still have > me on the payroll. Dig into it from scratch with nobody having direct > experience … yikes. Yes, I’ve > been involved in those sort of projects, they rarely end well …. > > Bob > > > On Aug 11, 2017, at 12:51 PM, Mark Spencer <mark@alignedsolutions.com> > wrote: > > > > Nice post Tom. I was also wondering about the replacement hardware vs > software patch issue. Just speculation on my part but perhaps changing the > software involves an extensive QA / test process, vs swapping out a piece > of hardware ? Again this is just pure speculation on my part. > > > > Mark Spencer > > > > > > > > On Aug 11, 2017, at 9:26 AM, Tom Van Baak <tvb@LeapSecond.com> wrote: > > > >>> The E911 installation, in the news, is just one of several. Others are > hospitals, > >>> fire stations, etc. using different dispatch systems. > >> > >> Hey, at least important things like mobile phones, ISP's, Google, > Amazon, FedEx and Starbucks aren't affected ;-) > >> > >>> In a wide-area simulcast-overlap paging system, the transmitters in > the same > >>> coverage area are carefully set to all transmit at exactly the same > time. > >> > >> That's fine. And very clever. But why is this "life safety" system tied > to GPS, to a particular vendor, to a particular model of receiver (that > clearly states in the documentation that it has a 1024 week / 19.6 year > window of valid UTC times)? > >> > >>> So to me "synchronizing transmitters” means the control system sends > the > >>> traffic out to all the transmitters (over satellite) and tells them > all to hold the > >>> messages in a buffer until “the big hand points straight up” or > whatever data > >>> command the system uses. (excuse the vernacular) > >> > >> Exactly. In most of the precise timing world the "big hand" is the "top > of the second", or the so-called 1 PPS pulse. The idea is that all 1PPS > agree with each other, whether from a cesium clock, or WWVB receiver, or > NTP, or GPS (or any other GNSS system). > >> > >> Since the paging system failed it sounds like it was synchronized to > some "hand" other than 1PPS. The rare GPS rollover events tend not to > disrupt the 1PPS output -- it is still perfectly aligned with UTC -- which > is why almost no one else worries about the recent TBolt episode, or any > other GPS receiver for that matter. > >> > >>> The problems being experienced right now appear to be the interface of > the ThunderBolt > >>> with the Zetron Model 620 simulcast controller over TSIP. The Zetron > box is also called > >>> a “wireless data encoder.” > >> > >> Ah, ok. So do you or anyone have contact within Zetron? The easy fix > would be for them to upgrade their firmware and send out a patch. Probably > cheaper than supplying new receivers from Trimble. I don't know; for us, a > s/w fix is easy compared to a h/w fix or a h/w swap-out. But in the real > world, once technicians have driven to a remote installation, maybe there's > no real difference between a s/w fix and a h/w swap. > >> > >>> It is not our goal to blame a particular piece of equipment for this > problem. > >> > >> Right, no need to blame. I think many of us would just want to pinpoint > the root cause of the problem, out of engineering curiosity. By root cause > I mean actual schematics or lines of source code. It's always been my hope, > after every one of these widespread infrastructure events, that the actual > source code or design decisions be published eventually so that we can all > learn from it. > >> > >>> The facts are the 1024 roll over happened and just about nobody in the > paging > >>> business knew it was coming. > >> > >> Ok, now you know about GPS rollovers! Fun, isn't it? Leap seconds are > fun too. > >> > >> When the dust settles, you may want to look into the more general topic > of life safety infrastructure vs. free-from-the-sky time & frequency. These > days nanosecond precise time is cheaper than water -- but it's also > fragile. A lot has been written about this. It's both a wake-up call for > naive vendors of products based on GPS alone and also an opportunity for > vendors who know how to design and market resilient timing products. > >> > >> /tvb > >> > >> _______________________________________________ > >> time-nuts mailing list -- time-nuts@febo.com > >> To unsubscribe, go to https://www.febo.com/cgi-bin/ > mailman/listinfo/time-nuts > >> and follow the instructions there. > >> > > _______________________________________________ > > time-nuts mailing list -- time-nuts@febo.com > > To unsubscribe, go to https://www.febo.com/cgi-bin/ > mailman/listinfo/time-nuts > > and follow the instructions there. > > _______________________________________________ > time-nuts mailing list -- time-nuts@febo.com > To unsubscribe, go to https://www.febo.com/cgi-bin/ > mailman/listinfo/time-nuts > and follow the instructions there. >
MD
Magnus Danielson
Thu, Aug 17, 2017 7:40 AM

On 08/11/2017 06:26 PM, Tom Van Baak wrote:

The E911 installation, in the news, is just one of several. Others are hospitals,
fire stations, etc. using different dispatch systems.

Hey, at least important things like mobile phones, ISP's, Google, Amazon, FedEx and Starbucks aren't affected ;-)

In a wide-area simulcast-overlap paging system, the transmitters in the same
coverage area are carefully set to all transmit at exactly the same time.

That's fine. And very clever. But why is this "life safety" system tied to GPS, to a particular vendor, to a particular model of receiver (that clearly states in the documentation that it has a 1024 week / 19.6 year window of valid UTC times)?

Due to ignorance. This is why I have been working for over a decade to
spread the knowledge. What is "generally known" at time-nuts and PTTI
does not reach these folks. There is no common knowledge here.

So to me "synchronizing transmitters” means the control system sends the
traffic out to all the transmitters (over satellite) and tells them all to hold the
messages in a buffer until “the big hand points straight up” or whatever data
command the system uses. (excuse the vernacular)

Exactly. In most of the precise timing world the "big hand" is the "top of the second", or the so-called 1 PPS pulse. The idea is that all 1PPS agree with each other, whether from a cesium clock, or WWVB receiver, or NTP, or GPS (or any other GNSS system).

Since the paging system failed it sounds like it was synchronized to some "hand" other than 1PPS. The rare GPS rollover events tend not to disrupt the 1PPS output -- it is still perfectly aligned with UTC -- which is why almost no one else worries about the recent TBolt episode, or any other GPS receiver for that matter.

If they also extract the time, and several systems do that, then you are
out of luck thought.

There is great benefits in synchrocasting, something I have worked with
for over a decade too, but then delivering alternative to GPS dependence
on each station.

The problems being experienced right now appear to be the interface of the ThunderBolt
with the Zetron Model 620 simulcast controller over TSIP. The Zetron box is also called
a “wireless data encoder.”

Ah, ok. So do you or anyone have contact within Zetron? The easy fix would be for them to upgrade their firmware and send out a patch. Probably cheaper than supplying new receivers from Trimble. I don't know; for us, a s/w fix is easy compared to a h/w fix or a h/w swap-out. But in the real world, once technicians have driven to a remote installation, maybe there's no real difference between a s/w fix and a h/w swap.

Might work out, but to be honest, don't hold your breath. Transmitter
vendors have not the best record of understanding the system aspects and
how they can contribute. Worth a try thought. Any user of GPS should
have a workaround for the 1024 weeks wrap-around if they use the time.

Oh, there is a lack of common specification of GPS user equipment
covering stuff like this, every little market have their "unique"
requirements. It ends up that they are not comparing notes, learning
from each other and find common strategies for common problems. Most of
the uniqueness is in interfacing to their environment.

It is not our goal to blame a particular piece of equipment for this problem.

Right, no need to blame. I think many of us would just want to pinpoint the root cause of the problem, out of engineering curiosity. By root cause I mean actual schematics or lines of source code. It's always been my hope, after every one of these widespread infrastructure events, that the actual source code or design decisions be published eventually so that we can all learn from it.

No, it is treated as company secrets. We are far from the point where it
is open source and can be inspected by many eyes. You are expected to
buy a "black box" and trust the vendor to do the right thing. When the
vendor do not do the right thing, throw the box and buy a new one. Some
vendors have been black-listed in the process.

I just wished the vendors was acting better.
I had to go to Washington for this mess.

Cheers,
Magnus

On 08/11/2017 06:26 PM, Tom Van Baak wrote: >> The E911 installation, in the news, is just one of several. Others are hospitals, >> fire stations, etc. using different dispatch systems. > > Hey, at least important things like mobile phones, ISP's, Google, Amazon, FedEx and Starbucks aren't affected ;-) > >> In a wide-area simulcast-overlap paging system, the transmitters in the same >> coverage area are carefully set to all transmit at exactly the same time. > > That's fine. And very clever. But why is this "life safety" system tied to GPS, to a particular vendor, to a particular model of receiver (that clearly states in the documentation that it has a 1024 week / 19.6 year window of valid UTC times)? Due to ignorance. This is why I have been working for over a decade to spread the knowledge. What is "generally known" at time-nuts and PTTI does not reach these folks. There is no common knowledge here. >> So to me "synchronizing transmitters” means the control system sends the >> traffic out to all the transmitters (over satellite) and tells them all to hold the >> messages in a buffer until “the big hand points straight up” or whatever data >> command the system uses. (excuse the vernacular) > > Exactly. In most of the precise timing world the "big hand" is the "top of the second", or the so-called 1 PPS pulse. The idea is that all 1PPS agree with each other, whether from a cesium clock, or WWVB receiver, or NTP, or GPS (or any other GNSS system). > > Since the paging system failed it sounds like it was synchronized to some "hand" other than 1PPS. The rare GPS rollover events tend not to disrupt the 1PPS output -- it is still perfectly aligned with UTC -- which is why almost no one else worries about the recent TBolt episode, or any other GPS receiver for that matter. If they also extract the time, and several systems do that, then you are out of luck thought. There is great benefits in synchrocasting, something I have worked with for over a decade too, but then delivering alternative to GPS dependence on each station. >> The problems being experienced right now appear to be the interface of the ThunderBolt >> with the Zetron Model 620 simulcast controller over TSIP. The Zetron box is also called >> a “wireless data encoder.” > > Ah, ok. So do you or anyone have contact within Zetron? The easy fix would be for them to upgrade their firmware and send out a patch. Probably cheaper than supplying new receivers from Trimble. I don't know; for us, a s/w fix is easy compared to a h/w fix or a h/w swap-out. But in the real world, once technicians have driven to a remote installation, maybe there's no real difference between a s/w fix and a h/w swap. Might work out, but to be honest, don't hold your breath. Transmitter vendors have not the best record of understanding the system aspects and how they can contribute. Worth a try thought. Any user of GPS should have a workaround for the 1024 weeks wrap-around if they use the time. Oh, there is a lack of common specification of GPS user equipment covering stuff like this, every little market have their "unique" requirements. It ends up that they are not comparing notes, learning from each other and find common strategies for common problems. Most of the uniqueness is in interfacing to their environment. >> It is not our goal to blame a particular piece of equipment for this problem. > > Right, no need to blame. I think many of us would just want to pinpoint the root cause of the problem, out of engineering curiosity. By root cause I mean actual schematics or lines of source code. It's always been my hope, after every one of these widespread infrastructure events, that the actual source code or design decisions be published eventually so that we can all learn from it. No, it is treated as company secrets. We are far from the point where it is open source and can be inspected by many eyes. You are expected to buy a "black box" and trust the vendor to do the right thing. When the vendor do not do the right thing, throw the box and buy a new one. Some vendors have been black-listed in the process. I just wished the vendors was acting better. I had to go to Washington for this mess. Cheers, Magnus