However fast we can handle them, it's antisocial to allow others to
make us spam the rest of the network.
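Roughly the shape such a limiter takes (a hypothetical sketch, not the
actual connectd code; the burst size here is invented):

    /* Hypothetical per-peer limiter: at most 4 onion messages per
     * second, with a small burst allowance.  Burst size invented. */
    #include <stdbool.h>
    #include <time.h>

    #define ONION_MSGS_PER_SEC 4.0
    #define ONION_BURST 8.0

    struct onion_limiter {
            double tokens;          /* messages we may still accept */
            struct timespec last;   /* last refill time */
    };

    static bool onion_msg_allowed(struct onion_limiter *l)
    {
            struct timespec now;
            double elapsed;

            clock_gettime(CLOCK_MONOTONIC, &now);
            elapsed = (now.tv_sec - l->last.tv_sec)
                      + (now.tv_nsec - l->last.tv_nsec) / 1e9;
            l->last = now;

            l->tokens += elapsed * ONION_MSGS_PER_SEC;
            if (l->tokens > ONION_BURST)
                    l->tokens = ONION_BURST;
            if (l->tokens < 1.0)
                    return false;   /* over the rate: drop it */
            l->tokens -= 1.0;
            return true;
    }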
Changelog-Protocol: onion messages: we limit incoming to 4 per second, allowing a little burst.
Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>
This basically means moving the code from gossipd to connectd to handle
these queries.
This gives connectd finer control over rate-limiting them.
Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>
This is more efficient in a few ways:
1. It's trivial to get to the end of the gossip_store: we don't have
   to iterate.
2. It tends to be mmapped, so we don't have to call pread() (see the
   sketch below).
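To illustrate why (a standalone sketch, not the gossip_store code
itself):

    /* Why an mmap'd store is cheap: the end is just the file size,
     * and reads are memory accesses rather than pread() syscalls. */
    #include <fcntl.h>
    #include <stdint.h>
    #include <stdio.h>
    #include <sys/mman.h>
    #include <sys/stat.h>
    #include <unistd.h>

    int main(void)
    {
            struct stat st;
            const uint8_t *map;
            int fd = open("gossip_store", O_RDONLY);

            if (fd < 0 || fstat(fd, &st) != 0 || st.st_size == 0)
                    return 1;
            map = mmap(NULL, st.st_size, PROT_READ, MAP_SHARED, fd, 0);
            if (map == MAP_FAILED)
                    return 1;

            /* End of store: no record-by-record iteration needed. */
            printf("store ends at offset %lld\n", (long long)st.st_size);
            /* A read at any offset is just a memory access. */
            printf("version byte: %u\n", (unsigned)map[0]);

            munmap((void *)map, st.st_size);
            close(fd);
            return 0;
    }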
Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>
We currently stream gossip as fast as we can, even if the peer starts
at timestamp 0. Instead, use a simple token bucket filter and only let
them have 1MB per second (500 bytes per second for testing).
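A minimal sketch of such a filter (names and the burst cap are
assumptions, not the actual connectd code):

    /* Hypothetical per-peer token bucket pacing outgoing gossip.
     * The one-second burst cap is an assumption. */
    #include <stddef.h>
    #include <time.h>

    #define GOSSIP_BYTES_PER_SEC 1000000.0  /* 500 under test, per above */

    struct gossip_bucket {
            double tokens;          /* bytes we may still send */
            struct timespec last;
    };

    /* How many of `want` bytes may be sent now; the caller queues
     * the rest until more tokens accumulate. */
    static size_t gossip_bytes_allowed(struct gossip_bucket *b, size_t want)
    {
            struct timespec now;
            double elapsed;
            size_t ok;

            clock_gettime(CLOCK_MONOTONIC, &now);
            elapsed = (now.tv_sec - b->last.tv_sec)
                      + (now.tv_nsec - b->last.tv_nsec) / 1e9;
            b->last = now;

            b->tokens += elapsed * GOSSIP_BYTES_PER_SEC;
            if (b->tokens > GOSSIP_BYTES_PER_SEC)
                    b->tokens = GOSSIP_BYTES_PER_SEC;

            ok = (b->tokens < (double)want) ? (size_t)b->tokens : want;
            b->tokens -= (double)ok;
            return ok;
    }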
Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>
Changelog-Protocol: connectd: we now throttle outgoing gossip at 1MB/second per peer.
Currently, anything which doesn't have a live channel is considered transient.
We free these first under stress, and also those which are still
connecting.
Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>
We use a crude heuristic: if we were trying to contact them, it's a
"deliberate" connection, and should be preserved.
Changelog-Changed: connectd: prioritize peers with channels (and log!) if we run low on file descriptors.
Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>
I thought I was going to want a convenient way of counting these,
but it turns out to be unnecessary. Still, this is slightly more
efficient and simpler, so I am including it.
Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>
We still refuse to run dev commands if lightningd sends them to us
when we're not in developer mode, but that's mainly paranoia.
Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>
This also requires us to expose memleak tracking when !DEVELOPER;
however, we only ever used it when the LIGHTNINGD_DEV_MEMLEAK
environment variable was set, so keep that condition.
Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>
Most of this is piping the flag through so we know it's a websocket!
Reported-by: @ShahanaFarooqui
Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>
This allows us to detect when lightningd hasn't seen our latest
disconnect/reconnect; in particular, we would hit the following pattern:
1. lightningd says to connect a subd.
2. connectd disconnects and reconnects.
3. connectd reads message, connects subd.
4. lightningd reads disconnect and reconnect, sends msg to connect to subd again.
5. connectd asserts because the subd is already connected.
This way connectd can tell if lightningd is talking about the previous
connection, and ignore it.
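One way to implement that detection (a sketch under the assumption
that each connection carries a counter which lightningd echoes back;
names invented):

    /* Hypothetical: bump a counter on every (re)connection and have
     * lightningd echo the value it last saw in its messages. */
    #include <stdbool.h>
    #include <stdint.h>

    struct peer {
            uint64_t connection_counter;    /* bumped on each connect */
    };

    static bool msg_refers_to_current_conn(const struct peer *peer,
                                           uint64_t echoed_counter)
    {
            /* A stale value means lightningd hasn't yet seen our
             * latest disconnect/reconnect, so ignore the message. */
            return echoed_counter == peer->connection_counter;
    }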
Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>
We don't have to put aside a peer which is reconnecting and wait for
lightningd to remove the old peer: we can now simply free the old one
and add the new.
Fixes: #5240
Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>
Now that we have separate peer-draining logic, we can simply use it
when connectd tells us to release the peer, without waiting. (We
could simply free the peer, but that's a bit rude, as messages could
get lost.)
This removes various complex flags and logic we had before.
Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>
Changelog-Fixed: `connectd`: various crashes and issues fixed by simplification and rewrite.
This removes it from the hashtable, and forces it to do nothing but
send out any remaining packets, then close.
It is, in effect, reduced to a stub, with no further interactions
with the rest of the system (all subds are freed already).
This also removes the need for an explicit "final_msg".
Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>
Got complaints about us hanging up on some nodes because they don't respond
to pings in a timely manner (e.g. ACINQ?), but that turned out to be something
else.
Nonetheless, we've had reports in the past of LND badly prioritizing gossip
traffic, and thus important messages can get queued behind gossip dumps!
Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>
Changelog-Changed: connectd: give busy peers more time to respond to pings.
This was fixed in 1c495ca5a8 ("connectd:
fix accidental handling of old reconnections.") and then reverted by
the rework in "connectd: avoid use-after-free upon multiple
reconnections by a peer".
The latter made the race much less likely, since we cleaned up the
reconnecting struct once the connection was hung up by the remote
node, but it's still theoretically possible.
Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>
`peer_reconnected` was freeing a `struct peer_reconnected` instance
while a pointer to that instance was registered to be passed as an
argument to the `retry_peer_connected` callback function. This caused a
use-after-free crash when `retry_peer_connected` attempted to reparent
the instance to the temporary context.
Instead, never have `peer_reconnected` free a `struct peer_reconnected`
instance, and only ever allow such an instance to be freed after the
`retry_peer_connected` callback has finished with it. To ensure that the
instance is freed even if the connection is closed before the callback
can be invoked, parent the instance to the connection rather than to the
daemon.
Absent the need to free `struct peer_reconnected` instances outside of
the `retry_peer_connected` callback, there is no use for the
`reconnected` hashtable, so remove it as well.
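A minimal sketch of that ownership change, using ccan/tal (the
hierarchical allocator connectd uses); `struct conn` stands in for
the real connection object, and the fields are elided:

    #include <ccan/tal/tal.h>
    #include <stdio.h>

    struct conn { int fd; };                     /* stand-in */
    struct peer_reconnected { int dummy; };      /* fields elided */

    int main(void)
    {
            struct conn *conn = tal(NULL, struct conn);

            /* Parented to the connection, not the daemon: if the peer
             * hangs up before retry_peer_connected runs, freeing the
             * connection frees this too, so nothing dangles and no
             * hashtable bookkeeping is needed. */
            struct peer_reconnected *pr =
                    tal(conn, struct peer_reconnected);
            pr->dummy = 1;

            tal_free(conn);     /* pr freed along with its parent */
            printf("conn and its children freed\n");
            return 0;
    }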
See: https://github.com/ElementsProject/lightning/issues/5282#issuecomment-1141454255
Fixes: #5282
Fixes: #5284
Changelog-Fixed: connectd no longer crashes when peers reconnect.
We had multiple reports of channels being unilaterally closed because
it seemed like the peer was sending old revocation numbers.
Turns out, it was actually old reestablish messages! When we have a
reconnection, we would put the new connection aside, and tell lightningd
to close the current connection: when it did, we would restart
processing of the initial reconnection.
However, we could end up with *multiple* "reconnecting" connections,
while waiting for an existing connection to close. Though the
connections were long gone, there could still be messages queued
(particularly the channel_reestablish message, which comes early on).
Eventually, a normal reconnection would cause us to process one of
these reconnecting connections, and channeld would see the (perhaps
very old!) messages, and get confused.
(I have a test which triggers this, but it also hangs the connect
command, due to other issues we will fix in the next release...)
Fixes: #5240
Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>
This seems to prevent broad propagation, due to LND not allowing it. See
https://github.com/lightningnetwork/lnd/issues/6432
We still announce it if you disable deprecated-apis, so tests still work,
and hopefully we can re-enable it in the future.
Fixes: #5196
Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>
Changelog-EXPERIMENTAL: Protocol: disabled websocket announcement due to LND propagation issues
Gossipd didn't actually suppress all gossip, resulting in a flake!
Doing it in connectd now makes much more sense.
Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>
Either because lightningd tells us it wants to talk, or because the peer
says something about a channel.
We also introduce a behavior change: we disconnect after a failed open.
We might want to modify this later, but it's a side-effect of openingd
not holding onto idle connections.
Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>
The message from lightningd simply acknowledges that we are allowed to
discard the peer (because no subdaemons are talking to it anymore).
This difference becomes more stark once connectd holds on to idle
peers.
Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>
We always added to both arrays, so we might as well keep just one.
We make mayfail an explicit flag, rather than relying on the presence
of errstr, which is never NULL now.
Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>
Gossipd now simply gets told by channeld when peers arrive or leave.
(It only needs to know for the seeker.)
Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>
We want to stream gossip through this, but currently connectd treats the
fd as synchronous. While we work on getting rid of that, it's easiest to
have two fds.
Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>
We would lose packets sometimes due to this previously, but it
doesn't happen over localhost so our tests didn't notice. However,
now that connectd is the sole thing talking to peers, we can do a
more elegant shutdown, which should fix closing.
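The usual shape of such a shutdown, sketched (not the actual connectd
code): stop sending, let the kernel flush what's queued, and wait for
the peer's close before ours.

    /* Sketch: flush-friendly close.  shutdown() queues a FIN after
     * any pending data; we then drain until the peer closes, rather
     * than risking close() resetting the connection while our final
     * messages are still queued. */
    #include <sys/socket.h>
    #include <unistd.h>

    static void drain_and_close(int fd)
    {
            char buf[256];

            shutdown(fd, SHUT_WR);
            while (read(fd, buf, sizeof(buf)) > 0)
                    ;               /* discard until EOF from peer */
            close(fd);
    }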
Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>
Changelog-Fixed: Protocol: Always flush sockets to increase the chance that the final message gets to the peer (esp. error packets).
dev_blackhole_fd was a hack, and doesn't work well now that we are async
(it worked for sync comms in per-peer daemons, but now we could sneak
through a read before we get to the next write).
So, make explicit flags and use them. This is much easier now that we
have all peer comms in one place.
Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>
We actually intercept the gossip_timestamp_filter, so the gossip_store
mechanism inside the per-peer daemon never kicks off for normal connections.
The gossipwith tool doesn't set OPT_GOSSIP_QUERIES, so it gets both, but
that only affects one place.
Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>
@thestick613 noticed that since tor versions below 0.3.2.2-alpha do
not support V3 ed25519 address formats, the error message from the
CLI is not very helpful.
So now we add a hint.
Changelog-None:
Signed-off-by: Saibato <saibato.naga@pm.me>
connectd/connectd.h: Add helper function to update the conn error list
Signed-off-by: Saibato <saibato.naga@pm.me>
It's almost always "their_features" and "our_features" respectively, so
make those names clear.
Suggested-by: @cdecker
Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>
This will help with the next patch, where we wean ourselves off using
a global for features: connectd.c has access to the feature bits.
Since connectd might now want to send a message, it needs the crypto_state
non-const, which makes this less trivial than it would otherwise be.
Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>