Original message <m0uscvd-000F25C@roam.psg.com> From: randy@psg.com (Randy Bush) Date: Aug 19, 15:36 Subject: Re: Customer AS
just what i always wanted, two connections to a broken provider. you must be kidding. I guess this brings to mind the question: Why would you want *any* connections to a broken provider?
all providers break. -- End of excerpt from Randy Bush
Agreed. That can't be stressed enough. *ALL* providers break. Many of them choose different days and times to do so, and multihoming to different providers causes a huge increase in total reliability as a result. Speaking of broken providers, AGIS just mailed me and said that Sprint is "broken" today, and so they've decided to filter out Sprint routes until it gets fixed. Anyone know the _real_ story? -matthew kaufman
Speaking of broken providers, AGIS just mailed me and said that Sprint is "broken" today, and so they've decided to filter out Sprint routes until it gets fixed. Anyone know the _real_ story?
As a sprintlink customer I can say that today was from hell. Couldn't reach *any* of the root nameservers for about 2-3 hours, of course before I discovered it was a sprintlink problem I killed and restarted named thus loosing my cache :-( and things still aren't back to normal. NOC said that they were having major "route-flapping" problems, and were trying to determine the source of the problem. I hope they've figured it out. regards, -joe
Isn't this the provider that says that dual-homing to them (ie, not to another provider) should always be adequate?
As a sprintlink customer I can say that today was from hell. Couldn't reach *any* of the root nameservers for about 2-3 hours, of course before I discovered it was a sprintlink problem I killed and restarted named thus loosing my cache :-( and things still aren't back to normal. NOC said that they were having major "route-flapping" problems, and were trying to determine the source of the problem. I hope they've figured it out.
[DISCLAIMER: I have not read all or even most articles in this thread, so perhaps what I write below has already come up] About a year ago I had a chat over e-mail with a Cisco engineer where I proposed a new feature which would allow a simple solution to the problem of 'multi-homing with two providers for backup while using PA space' and yet would allow carriers to apply a policy like Sprint's. This came up because at the time I was working with an ISP that was multi-homed, but which, at the time, experienced AS splits very often; at those times I wanted to drop some of the aggregates to reflect the split and so allow users in either half of the AS to continue to have access to the rest of the AS and the Internet. But I wanted this to happen automatically. So what I proposed was that route-maps be allowed to match conditions where other routes are or are not known to the router. The way to implement this would be to take these special match clausesand apply them to any new/withdrawn routes heard via BGP or added/removed to the routing table by other protocols. Thus a router could be configured to stop suppressing certain routes heard from outside if those routes became unavailable internally, or could stop suppressing the announcement of certain routes if .... and so on. Thus if a carrier's (let's call it A) customer (C) multi-homes to a second carrier (B), A can suppress all announcements of C's routes heard at A's border with B, but stop suppressing those routes if C's connection to A went down, causing the router at the A-B border to lose its internal route to C; thus when the A-C link goes down A learns how to route to C via B automatically. Other benefits of such a feature would be to allow A and B to peer a many exchanges but exchange routes at only one of them; if the exchange went down or one or both of the ASes partitioned, then A and B could automatically start exchange some routes wherever necessary. There are some drawbacks. The BGP process must keep track of suppressed routes, it can't just drop them, which of course requires more memory. But then, this only applies to suppressed routes that should be automatically unsuppressed. Another drawback is that AS partitioning events would add to the route flapping normally experienced (though this would be a function of the frequency of such events). On the other hand this could reduce the costs of redundancy; the costs of implementing this feature and using it could be made up for by charging multi-homed sites for implementing it for them. I am not sure if this is worth considering at all or not. It may be that someone already implements something like the above using syslog to keep track of route flapping and expect scripts to change filters on the fly. Nick
participants (4)
-
joe@smartlink.net
-
jon@branch.com
-
matthew@scruz.net
-
Nick Williams