On 6/11/21 10:16 AM, Jon Lewis wrote:
On Fri, 11 Jun 2021, Seth Mattinen wrote:
Did Any2 LAX barf last night between about 1am and 8am Pacific time?
More like 00:00-7:45 (Pacific time).
Anyone know what broke, and why the IX was dead for nearly 8 hours? This is our second recent issue with "an Any2 IX", having dealt with an IX partition event at Any2 Denver just a few weeks ago.
What I saw was a lot of unreachable nexthops (I'm in LA2) on routes advertised through the route servers. Most of my direct BGP sessions were down, but a handful were still working including the route servers. For example, I was getting routes for AS29791 from the route servers, but nexthop 206.72.211.106 was dead to me. Not to pick on Internap other than a mutual customer called me directly at 1am and wanted to know why things were down. I killed the route server sessions and went back to sleep. Feels like LA1 and LA2 got split, but however the route servers interconnect still worked, which was problematic.