question about ipv6 prefix from ripe to ISP

tdensmore · October 10, 2012, 10:00:57 AM

Quote from: kasperd on October 09, 2012, 03:19:51 PM
Quote from: tdensmore on October 09, 2012, 09:40:32 AMOne of my concerns is "happy eyeballs" and the like would start to prefer v6, and for a huge number of users, google, youtube, and facebook = "The Internet." I had thought of using something stateful to allow expected inbound traffic back in, but applying that to every interface might be a PITA. I obviously haven't done much research in this area yet at all, but I have seen ip6tables in pre-2.6.30 kernels totally break IPv6 when conntrack isn't possible.
I guess the hardware you'd want to do this on is routers that can forward packets in hardware, and AFAIK ip6tables isn't used on such hardware. But regardless of which platform is being used, it is obviously possible, that implementations are buggy. I know the principles, but I don't know about the exact set of features and bugs available in specific products.

Sorry - I was using ip6tables as an example of stateless vs stateful where stateless breaks things if bad assumptions are made. IIRC, pre-2.6.30 kernels didn't support ipv6 conntrack, so all the stateful ip6tables rules would silently fail closed since none of the state= options will match, and what's left is "reject." The stateless "accept tcp if != syn" workaround is ok, but I like stateful tracking better.

Quote from: kasperd on October 09, 2012, 03:19:51 PM
Quote from: tdensmore on October 09, 2012, 09:40:32 AMI burn a /64 per P2P but only assign a /126 to P2P links, not for sparseness, but because I want to avoid ping-pong, etc
Is a /126 sufficient to prevent that? I would have thought you had to go to a /127 to not have any destination address, which could loop on a link. But I was also under the impression, that both ends of the connection are responsible for not forwarding a packet back on the link on which it arrived, so the problem you describe could only happen if both ends of the link are running a buggy stack.

I don't honestly know if it's enough to stop issues with ping-pong specifically, but it seems like it should help a lot with ND exhaustion attacks or whatever else is discovered down the road. I think there was some reason /127 was worse than /126 in certain very specific cases (something to do with multicast?), but again, I just picked one of many options and ran with it. I'm hoping that small subnets and network edge ACLs that block traffic to P2P links will be enough. Time will tell on that one. Honestly, of all the DoSes I've ever been a victim of, most of them haven't been very interesting. They've generally been the small packet, high PPS UDP variety, not super exciting 0-day.

kasperd · October 10, 2012, 11:15:10 AM

Quote from: tdensmore on October 10, 2012, 10:00:57 AMI like stateful tracking better.

There are advantages and disadvantages.

Stateful tracking breaks down if packets in the two directions do not travel through the same device. So once you do stateful tracking, you cannot do redundant connectivity. Probably you already do have a single point of failure on the connection to your customer, and if you can put the stateful tracking there, then you are not introducing another single point of failure.

Stateful tracking also breaks down if the device maintaining the state is restarted. An open TCP connection is supposed to be able to stay alive for long periods of time with no packets being exchanged. As long as the two endpoints of the TCP connection are not restarted, they can just resume communication when there is new data to transfer (or when a receiver is ready for more data).

Routers on the path may have been restarted in the meantime, and even if they haven't, they have no way of knowing if the connection is gone because both ends of the connection were restarted. So there is no certain way of knowing if an entry in the table can be disposed of.

Some of these problems can be avoided with a stateful firewall, which is capable of reestablishing state mid-connection. For example if a typical TCP packet is received from the outside, you let it through to the customer, even if there is no matching entry. If a packet is send by the customer, you let it through regardless of missing state, and if it is not a RST packet, you create the state.

Packets you allow going in can have ACK and PSH flags, but no other flags. And they can have a single timestamp option, and no other options. Make sure they are completely ordinary such that there is no way they could be exploiting a bug in the TCP stack. That way you can be confident that in the worst case, it just going to trigger a RST, like it should. If the packet is actually valid, the customer's computer will respond, and the state will be recreated when the outgoing packet goes through the firewall.

This doesn't solve the problem of the two directions going through different firewalls. But you can restart the firewall or reroute both directions through another firewall, in case one goes down.

As far as UDP traffic goes, I think most traffic is simple RPC protocols like DNS lookups. Those are very short lived. And should the state get lost in the middle of an RPC, then the client will retransmit. And more complicated UDP based protocols can easily be made to deal with stateful tracking. All they have to do is to ensure that packets are periodically sent in both directions. Sending a stream of UDP packets without periodically getting confirmation, that the receiver is still expecting packets, is bad practice anyway. (I know an interesting story about what bad stuff can happen, unfortunately that is confidential.)

Quote from: tdensmore on October 10, 2012, 10:00:57 AMit seems like it should help a lot with ND exhaustion attacks

Certainly. If your prefix is /64 or /96 or anything in between, then ND exhaustion attacks is an issue to worry about. But anything from /116 to /127 should keep you protected from ND exhaustion attacks.

As long as you also need to support IPv4 on those links, you probably don't want thousands of nodes, so a /116 would give enough addresses. I think you said you wanted to keep the prefixes on nibble boundaries, so you could go with /116, /120, or /124 depending on the number of nodes on link. But I don't think using a /126 for P2P links add much confusion either.

Quote from: tdensmore on October 10, 2012, 10:00:57 AMI just picked one of many options and ran with it.

That's also the way to go. I was just wondering because the problem you described and the solution didn't sound like they were matching.

Quote from: tdensmore on October 10, 2012, 10:00:57 AMHonestly, of all the DoSes I've ever been a victim of, most of them haven't been very interesting. They've generally been the small packet, high PPS UDP variety, not super exciting 0-day.

That sort of issue is not specific to either version of the protocol. I don't think there are many nodes on the network, which can actually handle a sustained flow of minimum sized packets. How you react to such a flow is of course an issue. I have seen operating systems that would crash if you looped an Ethernet cable back to the same switch.

There is actually a weakness in TCP which can be used to remotely cause a peer to send a large number of small packets, that can overload that peer's local network. I have PoC code. When testing it my own LAN I found three different places that were vulnerable to such a DoS attack.

The attacked computer would disable the affected network interface for about five seconds, each time it was attacked. Judging from the kernel log, it sounded like the driver was suspecting a hardware fault.
Flow control on gigabit Ethernet is flawed. Pushback means that in certain situations a 100Mbit/s flow can fully block a 1Gbit/s link.
One computer would have a kernel panic if it was on the path from the attacked computer towards the attacker.

News:

question about ipv6 prefix from ripe to ISP

tdensmore

kasperd