mirrors/core-lightning

mirror of https://github.com/ElementsProject/lightning.git synced 2025-02-24 15:10:51 +01:00

Author	SHA1	Message	Date
Rusty Russell	8e4b589a9e	connectd: message to tell lightningd if we couldn't forward an onion message. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2024-12-05 17:38:16 +10:30
Rusty Russell	2e90f59dfe	connectd: fix crash when we get an incoming conn while outgoing attempt is ratelimited. ``` Program received signal SIGSEGV, Segmentation fault. 0x000000001014e9d8 in io_set_finish_ (conn=0x0, finish=0x0, arg=0x0) at ccan/ccan/io/io.c:137 137 conn->finish = finish; (gdb) bt incoming=true) at connectd/connectd.c:394 ``` Fixes: #7871 Reported-by: grubles Signed-off-by: Rusty Russell <rusty@rustcorp.com.au> Changelog-None: broken in this release	2024-11-28 17:24:47 +10:30
Alex Myers	d099f9fe5b	connectd: force our own channel gossip to more peers Large nodes were not always getting their own channel gossip out reliably. The number of peers we spam our own channel gossip to is limited to save large nodes on startup, but this should be relaxed slightly to ensure propagation. Changelog-Fixed: Own-channel gossip is broadcast to more peers on connect.	2024-11-28 14:54:08 +10:30
Rusty Russell	faf7ae6ad4	pytest: add test for connection ratelimiting. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2024-11-25 15:39:13 +10:30
Rusty Russell	3d294f813d	connectd: limit to 10 connections at once. We wait until a connection fails, or a subd is connected to the peer, before letting another one through. This should prevent us from overwhelming lightningd on large nodes, but unlike the previous back-off, it's based on how fast lightningd is, not an arbitrary time. We also let one through each second, in case we're connecting to many, but not doing anything but gossip (e.g. 100 explicit connect commands). Signed-off-by: Rusty Russell <rusty@rustcorp.com.au> Changelog-Changed: Reconnecting to peers at startup should be significantly faster (dependent on machine speed).	2024-11-25 15:39:13 +10:30
Rusty Russell	3587afeaa2	connectd: remove transient flag. The important flag replaces it, and now we can be more intelligent about eviction in overload. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2024-11-25 15:39:13 +10:30
Rusty Russell	15950bb7d4	connectd: reconnect for non-transient connections. Rather than have lightningd call us repeatedly to try to connect, have it tell us what peers are transient and aren't, and connectd will automatically try to maintain that connection. There's a new "downgrade_peer" message to tell it a peer is now transient: to make it non-transient we simply tell connectd to connect as a non-transient. The first time, I missed that dual_open_control does its own state transitions :( Signed-off-by: Rusty Russell <rusty@rustcorp.com.au> Changelog-Changed: `connectd` now handles maintaining/reconnecting to important peers, and we remember the last successful address we connected to.	2024-11-25 15:39:13 +10:30
Rusty Russell	4ee59e7a49	connectd: expose --dev-no-reconnect and --dev-fast-reconnect options. Once connectd is controlling reconnections, it'll need these. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2024-11-25 15:39:13 +10:30
Rusty Russell	23dc10cf81	connectd: get our own addresses to contact node from node_announcements. Let lightningd feed us hints to try first, but we can extract the addresses from node_announcement messages ourselves. (Lightningd used to ask gossipd on our behalf: this is far simpler!) One side effect of this is that we don't hand back address hints given to us by lightningd: it would use these again for reconnecting. This is breaks test_sendpay_grouping, so we disable it temporarily. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2024-11-25 15:39:13 +10:30
Rusty Russell	5b92383b02	connectd: send self-advertizing gossip rather than having gossipd do it. It's now trivial for us to do this ourselves, since we have gossmap. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2024-11-25 15:39:13 +10:30
Rusty Russell	45533584e2	global: rename blinding to path_key everywhere. Get with the modern nomenclature: the pubkey inside a blinded path is called the `path_key` now. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2024-10-16 07:14:32 +10:30
Rusty Russell	4ee9d1d2f2	gossmap: include cltv_expiry_delta in gossmap_chan_get_update_details for completeness. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2024-10-15 09:58:04 +10:30
Rusty Russell	4e6bac6d36	connectd: fix double-free crash on connection timeout. tmpctx may not get cleaned immediately, so the timeout (a child of the struct early_peer at this point) can still outlast the conn. Do the clearer thing, and explicitly free the timeout. Changelog-Fixed: connectd: crash on erroneous timeout. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2024-10-14 16:36:58 +01:00
Rusty Russell	af90fdc0bb	common/utils: macros to help get copy/compare across different types right. Things are often equivalent but different types: 1. u8 arrays in libwally. 2. sha256 3. Secrets derived via sha256 4. txids Rather than open-coding a BUILD_ASSERT & memcpy, create a macro to do it. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2024-07-23 09:54:47 +09:30
Rusty Russell	5d42600076	connectd: ratelimit onion messages However fast we can handle them, it's antisocial to allow others to make us spam the rest of the network. Changelog-Protocol: onion messages: we limit incoming to 4 per second, allowing a little burst. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2024-07-10 13:34:00 +02:00
Rusty Russell	621bfe370e	connectd: forward onion messages by scid as well as node_id. This is now permitted in the offers PR, so we should support it. But we can't just look up in the gossmap, since the "short_channel_id" could be an alias. So we get lightningd to tell us all scid->peer mappings, and look up in that. Changelog-Added: Protocol: onion messages can now be forwarded by short_channel_id. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2024-07-10 13:34:00 +02:00
Rusty Russell	f122c0beb4	connectd: include map of scid->peer node id. This will let us fwd onion messages via scid, even if they're aliases. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2024-07-10 13:34:00 +02:00
Rusty Russell	b5f921ce0a	lightningd: add routine to directly inject an onion message. Unlike "sendonionmessage" which instructs us to send to a peer, this process it locally (presumably, it contains the next hop). This is useful because it allows us to process an onion message which starts with us (a legal case for a blinded path supplied by someone else!). It also opens the door to bolt12 self-pay. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2024-07-10 13:34:00 +02:00
Rusty Russell	ba82592196	common/onion_message_parse: return string, not bool. Allows for caller to log, but more importantly, when we add a command to inject onion messages, allows for us to capture the error. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2024-07-10 13:34:00 +02:00
Rusty Russell	47584bd504	connectd: tie gossip query responses into ratelimiting code. A bit tricky, since we get more than one message at a time. However, this just means we go over quota for a bit, and will get caught when those are sent (we do this for a single message already, so it's not that much worse). Note: this not only limits sending, but it limits the actuall query processing, which is nice. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2024-07-10 12:21:19 +09:30
Rusty Russell	4a78d17748	connectd: do response to gossip queries, don't hand them to gossipd. This basically means moving the code from gossipd to connectd to handle these queries. This will get connectd have finer control over ratelimiting them. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2024-07-10 12:21:19 +09:30
Rusty Russell	d60977f37f	connectd: use gossmap streaming interface. This is more efficient in a few ways: 1. It's trivial to get to the end of the gossip_store, we don't have to iterate. 2. It tends to be mmaped so we don't have to call pread(). Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2024-07-10 12:21:19 +09:30
Rusty Russell	401533667d	connectd: throttle streaming gossip for peers. We currently stream gossip as fast as we can, even if they start at timestamp 0. Instead, use a simple token bucket filter and only let them have 1MB per second (500 bytes per second for testing). Signed-off-by: Rusty Russell <rusty@rustcorp.com.au> Changelog-Protocol: connectd: we now throttle outgoing gossip at 1MB/second per peer.	2024-07-10 12:21:19 +09:30
Rusty Russell	5e585d061f	connectd: log incoming onion message IO properly. I noticed we were missing this. Move logging up a level so it's easier to spot the omission. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2024-07-09 15:09:29 +02:00
Rusty Russell	01cd605cb1	connectd: fix missing peer close. We were getting the following message in test_feerate_stress: ``` 2024-07-08T02:15:45.5663941Z lightningd-2 2024-07-08T02:13:45.696Z BROKEN 0266e4598d1d3c415f572a8488830b60f7e744ed9235eb0b1ba93283b315c03518-connectd: Peer did not close, forcing close ``` I can reproduce it locally if I run the test enough, and finally found the issue by printing the status of the fd when we time it out (using routines from connectd.c). The peer fd alternates between reading and writing. When we go to discard it, we wake the write queue, so write_to_peer() get called. It won't shutdown the socket if there are still subds attached, and will wait again for a read. The last subd exit has to also wake the write queue if we're draining, so it can do the io_sock_shutdown. Otherwise, we hit the timeout, causing the message above. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2024-07-09 18:03:44 +09:30
Rusty Russell	5a5fee92b3	connectd: don't report socket fds twice. The initial commit had this code twice, for some reason! Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2024-07-09 18:03:44 +09:30
Rusty Russell	002dc60b33	Gossip: BOLT catch, remove initial_routing_sync. Everyone sends a gossip_timestamp_filter message these days to start gossip. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2024-06-19 15:54:24 +09:30
Rusty Russell	06cf5ac841	Doc: update bolts to assume gossip_queries under the new meaning. Everyone understands gossip_queries now, but peers leave it unset to indicate they have nothing useful to say. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2024-06-19 15:54:24 +09:30
Rusty Russell	155311b053	connectd: --dev-handshake-no-reply so we can test pending connections. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2024-05-14 18:16:26 -05:00
Rusty Russell	a9b7402910	pytest: test dropping transient connections. Requires a hack to exhaust connectd fds and make us close a transient. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2024-05-14 18:16:26 -05:00
Rusty Russell	8268df9a4b	connectd: implement "transient" connections. Currently, anything which doesn't have a live channel is considered transient. We free this first under stress, and also if they're still connecting. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2024-05-14 18:16:26 -05:00
Rusty Russell	541cc9dd1f	connectd: fix exhaustion code where we pick random peer. If we don't find one searching from our random spot in the peer table, we're supposed to wrap, not crash! Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2024-05-14 18:16:26 -05:00
Rusty Russell	cb2c4963f2	bolt12: allow first_node_id in blinded path to be a scid. We don't actually support it yet, but this threads through the type change, puts it in "decode" etc. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2024-05-12 19:11:43 -05:00
Rusty Russell	5d061c4cf4	global: remove tags from BOLT quotes now dual-funding is in master A few of them had minor wording changes, too. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2024-05-09 16:14:23 -05:00
Rusty Russell	d3dbcf03fa	channeld: close an unimportant connection when fds get low. We use a crude heuristic: if we were trying to contact them, it's a "deliberate" connection, and should be preserved. Changelog-Changed: connectd: prioritize peers with channels (and log!) if we run low on file descriptors. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2024-05-09 01:23:46 -05:00
Rusty Russell	6a648fd2bc	connectd: use hash table, not linked list, for connecting structs. I thought I was going to want to have a convenient way of counting these, but it turns out unnecessary. Still, this is slightly more efficient and simple, so I am including it. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2024-05-09 01:23:46 -05:00
Rusty Russell	3bfe622413	connectd: log when we fail to receive an fd from lightningd. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2024-05-09 01:23:46 -05:00
Rusty Russell	c8c87e2bf6	connectd: log if we fail an accept() call. This can happen if we're totally out of fds, but previously we gave no log message indicating this! Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2024-05-09 01:23:46 -05:00
Rusty Russell	ba922f9160	lightningd/connectd: remove --experimental-websocket-port Signed-off-by: Rusty Russell <rusty@rustcorp.com.au> Changelog-Removed: Config `experimental-websocket-port` (deprecated 23.08, EOL 24.02)	2024-03-25 15:02:35 +10:30
Rusty Russell	e0e879c003	common: remove type_to_string files altogther. This means including <common/utils.h> where it was indirectly included. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2024-03-20 13:51:48 +10:30
Rusty Russell	37d22f9141	global: change all type_to_string to fmt_X. This has the benefit of being shorter, as well as more reliable (you will get a link error if we can't print it, not a runtime one!). Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2024-03-20 13:51:48 +10:30
Rusty Russell	be546fc60e	Makefiles: remove leftover references to $(EXP) This was also removed a while ago, it's never set. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2024-03-20 13:51:48 +10:30
Rusty Russell	c755dfdfc9	connectd: fix bad assert. This code was trying to check that the address type is not one of the ADDR_TYPE_TOR* types, but the is_toraddr() function checks a domain name! The cast should have been a clue that this was wrong! Anyway, wireaddr_to_addrinfo() aborts on these cases already, so the asserts here are superfluous. Found in unrelated CI run: ``` Valgrind error file: valgrind-errors.20610 ==20610== Conditional jump or move depends on uninitialised value(s) ==20610== at 0x484ED28: strlen (in /usr/libexec/valgrind/vgpreload_memcheck-amd64-linux.so) ==20610== by 0x138FA3: is_toraddr (wireaddr.c:344) ==20610== by 0x11499B: conn_init (connectd.c:729) ==20610== by 0x28FD73: next_plan (io.c:59) ==20610== by 0x28FF94: io_new_conn_ (io.c:116) ==20610== by 0x11531B: try_connect_one_addr (connectd.c:927) ==20610== by 0x1182A8: try_connect_peer (connectd.c:1781) ==20610== by 0x11834E: connect_to_peer (connectd.c:1797) ==20610== by 0x119241: recv_req (connectd.c:2074) ==20610== by 0x12836F: handle_read (daemon_conn.c:35) ==20610== by 0x28FD73: next_plan (io.c:59) ==20610== by 0x2909A8: do_plan (io.c:407) ==20610== ``` Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2024-02-15 12:07:47 +01:00
Rusty Russell	e7f1f29dbd	connectd: don't suppress channel_announcement without channel_update yet. This happens if: 1. The peer sets a timestamp filter to non-zero, and 2. We have a channel_announcement without a channel_update. The timestamp is 0 as a placeholder as part of the recent gossip rework (we used to hold these channel_announcement in memory, which was complex). But this means we won't send it in this case, and if we later send the channel_update, CI will complain about 'Bad gossip order'. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2024-02-12 11:43:33 +01:00
Rusty Russell	07cd4a809b	gossipd: remove spam handling. We weakened this progressively over time, and gossip v1.5 makes spam impossible by protocol, so we can wait until then. Removing this code simplifies things a great deal! Signed-off-by: Rusty Russell <rusty@rustcorp.com.au> Changelog-Removed: Protocol: we no longer ratelimit gossip messages by channel, making our code far simpler.	2024-02-04 09:24:44 +10:30
Rusty Russell	db6f0da3b3	connectd: separate routine to inject message without closing connection. We will want this to send private channel_updates direct to peer. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2024-01-31 14:47:33 +10:30
niftynei	fa8458c00a	dualfund: add test to make sure that tx-sigs sent before commitment results in an error.	2023-11-02 19:32:05 +10:30
Rusty Russell	e4d7266fff	common: add amount_feerate helper. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2023-11-01 14:11:28 +10:30
Rusty Russell	25110ff2cc	connectd: fix fd leak for --offline. ``` BROKEN connectd: dev_report_fds: 5 open but unowned? ``` Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2023-10-26 12:59:55 +10:30
Rusty Russell	043c4ae5eb	pytest: fix flake in test_even_sendcustommsg Make sure plugin has got message to connectd before sending! ``` def test_even_sendcustommsg(node_factory): l1, l2 = node_factory.get_nodes(2, opts={'log-level': 'io', 'allow_warning': True}) l1.connect(l2) # Even-numbered message msg = hex(43690)[2:] + ('ff' * 30) + 'bb' # l2 will hang up when it gets this. l1.rpc.sendcustommsg(l2.info['id'], msg) l2.daemon.wait_for_log(r'\[IN\] {}'.format(msg)) l1.daemon.wait_for_log('Invalid unknown even msg') wait_for(lambda: l1.rpc.listpeers(l2.info['id'])['peers'] == []) # Now with a plugin which allows it l1.connect(l2) l2.rpc.plugin_start(os.path.join(os.getcwd(), "tests/plugins/allow_even_msgs.py")) l1.rpc.sendcustommsg(l2.info['id'], msg) l2.daemon.wait_for_log(r'\[IN\] {}'.format(msg)) > l2.daemon.wait_for_log(r'allow_even_msgs.Got message 43690') tests/test_misc.py:3623: ... > raise TimeoutError('Unable to find "{}" in logs.'.format(exs)) E TimeoutError: Unable to find "[re.compile('allow_even_msgs.Got message 43690')]" in logs. contrib/pyln-testing/pyln/testing/utils.py:327: TimeoutError ``` Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2023-10-24 09:51:43 +02:00

1 2 3 4 5 ...

461 commits