core-lightning

mirror of https://github.com/ElementsProject/lightning.git synced 2024-11-20 02:27:51 +01:00

Author	SHA1	Message	Date
Rusty Russell	5df9e5b7b4	gossipd: allow node_announcements and channel_announcements with unsupported features. The flat feature PR changes the rules so these are OK to propagate. That makes sense: the unsupported features means there's something unsupported about the node or channel, not the msg itself (for that we'd use a different message type). Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-11-10 10:42:29 +01:00
Rusty Russell	5a8677edc6	gossipd: add txout_failure when a close is seen. This prevents a gratuitous lookup of we get a late channel_announce, but even better, it suppresses the "bad gossip" messages in case of a late channel_update, which have plagued Travis (especially since we got aggressive in pushing our own updates). Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-11-07 03:50:53 +00:00
Rusty Russell	abe7133bd5	gossipd: use in_txout_failures to do lookup in channel_announcement. This correctly refreshes the txout entry against aging. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-11-07 03:50:53 +00:00
Rusty Russell	7f45e55d84	gossipd: set the push marker for our own messages. This is a better fix than doing it manually, which turned out to do it in the wrong order (node_announcement followed by channel_announcement) anyway. Should fix many "Bad gossip" messages. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-11-04 17:50:58 +01:00
arowser	0985c6e219	Fix build fail on 32bit OS.	2019-10-23 07:23:33 +11:00
Rusty Russell	bc430cced3	gossipd: fix false-positive memleak detection in pending_node_map. lightning_gossipd(17421): MEMLEAK: 0x564b4b17b5a8 ligtning_gossipd(17421): label=gossipd/routing.c:1490:struct pending_node_announce lightning_gossipd(17421): backtrace: lightning_gossipd(17421): ccan/ccan/tal/tal.c:437 (tal_alloc_) lightning_gossipd(17421): gossipd/routing.c:1490 (catch_node_announcement) lightning_gossipd(17421): gossipd/routing.c:1837 (handle_channel_announcement) lightning_gossipd(17421): gossipd/gossipd.c:238 (handle_channel_announcement_msg) lightning_gossipd(17421): gossipd/gossipd.c:461 (peer_msg_in) lightning_gossipd(17421): common/daemon_conn.c:31 (handle_read) lightning_gossipd(17421): ccan/ccan/io/io.c:59 (next_plan) lightning_gossipd(17421): ccan/ccan/io/io.c:407 (do_plan) lightning_gossipd(17421): ccan/ccan/io/io.c:417 (io_ready) lightning_gossipd(17421): ccan/ccan/io/poll.c:445 (io_loop) lightning_gossipd(17421): gossipd/gossipd.c:1700 (main) lightning_gossipd(17421): parents: lightning_gossipd(17421): gossipd/routing.c:294:struct routing_state Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-10-21 14:08:05 +02:00
Rusty Russell	79e2c3f89a	gossipd: don't crash if we're forced to discard corrupt gossip store. When we're in remove_all_gossip, we don't call free_chan, but free it manually. This trips over the developer-mode check that we called free_chan! Make it also insert the magic so that destroy_chan_check passes: lightning_gossipd: gossipd/routing.c:496: destroy_chan_check: Assertion `chan->sat.satoshis == (u64)chan' failed. lightning_gossipd: FATAL SIGNAL 6 (version v0.7.3rc2-2-gf89d7c1) 0x5632436a4544 send_backtrace common/daemon.c:41 0x5632436a45ea crashdump common/daemon.c:54 0x7f053c3c7f5f ??? ???:0 0x7f053c3c7ed7 ??? ???:0 0x7f053c3a9534 ??? ???:0 0x7f053c3a940e ??? ???:0 0x7f053c3b9011 ??? ???:0 0x563243698b9d destroy_chan_check gossipd/routing.c:496 0x5632436dca46 notify ccan/ccan/tal/tal.c:235 0x5632436dcf35 del_tree ccan/ccan/tal/tal.c:397 0x5632436dd2c1 tal_free ccan/ccan/tal/tal.c:481 0x56324369f004 remove_all_gossip gossipd/routing.c:2981 0x563243692f5d gossip_store_load gossipd/gossip_store.c:772 0x56324368eff4 gossip_init gossipd/gossipd.c:872 0x563243690cbb recv_req gossipd/gossipd.c:1580 0x5632436a4a69 handle_read common/daemon_conn.c:31 0x5632436cc7ae next_plan ccan/ccan/io/io.c:59 0x5632436cd32b do_plan ccan/ccan/io/io.c:407 0x5632436cd369 io_ready ccan/ccan/io/io.c:417 0x5632436cf52f io_loop ccan/ccan/io/poll.c:445 0x56324369102f main gossipd/gossipd.c:1700 Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-10-17 23:40:05 +02:00
Rusty Russell	1e59d2a738	gossipd: count channel_updates on new channels correctly. If we get a channel_update while we're still verifying the channel_announcement we didn't set the peer pointer, so it didn't get credit. As a result, the seeker tended to think we were done gossiping sooner than we were. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-10-15 18:05:54 +02:00
Rusty Russell	bd55f6d940	common/features: only support a single feature bitset. This is mainly an internal-only change, especially since we don't offer any globalfeatures. However, LND (as of next release) will offer global features, and also expect option_static_remotekey to be a global feature. So we send our (merged) feature bitset as both global and local in init, and fold those bitsets together when we get an init msg. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-10-11 02:52:04 +00:00
Rusty Russell	a1644c1b6e	seeker: start doing a channel probe if we see unknown node_announcement msgs. It usually means we're missing something, but there's no way to ask what. Simply start a broad scid probe. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-10-10 21:48:52 -05:00
Rusty Russell	82a5efa932	gossipd: start streaming gossip from last gossip timestamp minus 10 minutes. We assume that the time for gossip propagation is < 10 minutes, so by going back that far from last gossip we won't miss anything, Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-10-10 21:48:52 -05:00
Rusty Russell	877d1eaab3	gossipd: don't request channel_updates if we're being spammed. It's simple: if we wouldn't accept the timestamp we see, don't put the channel in the stale_scid_map. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-10-10 21:48:52 -05:00
Rusty Russell	869b5e40b5	gossipd: simplify seeker state machine. We eliminate the "need peer" states and instead check if the random_peer_softref has been cleared. We can also unify our restart handlers for all these cases; even the probe_scids case, by giving gossip credit for the scids as they come in (at a discount, since scids are 8 bytes vs the ~200 bytes for normal gossip messages). Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-10-10 21:48:52 -05:00
Rusty Russell	79ca9bf998	gossipd: use per-peer information to make messages clearer. We can (usually) indicate what peer caused the bad gossip error. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-10-10 21:48:52 -05:00
Rusty Russell	0091300ee3	gossipd: track what peer gave us gossip msgs so we can credit it. Since we have to validate, there can be a delay (and peer might vanish) between receiving the gossip and actually confirming it, hence the use of softref. We will use this information to check that the peers are making progress as we start asking them for specific information. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-10-10 21:48:52 -05:00
Rusty Russell	4bf0bc1f28	gossipd: age txout_failures map. We do this by keeping a current and an old map, and moving the current to old every hour or 10,000 entries. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-09-27 02:32:53 +00:00
Rusty Russell	aa9024db51	gossipd: fix memleak false-positive 2019-09-26T02:00:47.832Z DEBUG lightning_hsmd(89822): Client: Received message 33 from client' 2019-09-26T02:00:47.837Z BROKEN lightning_gossipd(89828): MEMLEAK: 0x55eddc5d1fd8 2019-09-26T02:00:47.838Z BROKEN lightning_gossipd(89828): label=gossipd/routing.c:1579:struct unupdated_channel 2019-09-26T02:00:47.838Z DEBUG lightning_gossipd(89828): backtrace: 2019-09-26T02:00:47.838Z DEBUG lightning_gossipd(89828): ccan/ccan/tal/tal.c:437 (tal_alloc_) 2019-09-26T02:00:47.838Z DEBUG lightning_gossipd(89828): gossipd/routing.c:1579 (routing_add_channel_announcement) 2019-09-26T02:00:47.838Z DEBUG lightning_gossipd(89828): gossipd/routing.c:1867 (handle_pending_cannouncement) 2019-09-26T02:00:47.838Z DEBUG lightning_gossipd(89828): gossipd/gossipd.c:1543 (handle_txout_reply) 2019-09-26T02:00:47.838Z DEBUG lightning_gossipd(89828): gossipd/gossipd.c:1726 (recv_req) 2019-09-26T02:00:47.838Z DEBUG lightning_gossipd(89828): common/daemon_conn.c:31 (handle_read) 2019-09-26T02:00:47.838Z DEBUG lightning_gossipd(89828): ccan/ccan/io/io.c:59 (next_plan) 2019-09-26T02:00:47.838Z DEBUG lightning_gossipd(89828): ccan/ccan/io/io.c:407 (do_plan) Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-09-27 00:01:34 +00:00
Rusty Russell	d24c850899	gossipd: restore a flag for fast pruning I was seeing some accidental pruning under load / Travis, and in particular we stopped accepting channel_updates because they were 103 seconds old. But making it too long makes the prune test untenable, so restore a separate flag that this test can use. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-09-27 00:01:34 +00:00
Rusty Russell	a071a754b3	gossipd: place limit on pending announcements. Now we queue them, we should place a limit. It's not the worst thing in the world if we discard them (we'll catch up eventually), but we should try not to in case we're just a bit behind. Our behaviour here is also O(n^2) so we don't want a massive queue anyway. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-09-25 04:01:56 +00:00
Rusty Russell	fd2d74aa9b	gossipd: defer asking about txouts until we're synced or they're 6 deep. The first one means we don't discard channels just because we're not synced, and the second is implied by the spec: don't accept channel_announcement if the channel isn't 6 deep. Since LND defers in such cases, we do too (unless it's newer than the current block, in which case we simply discard). Otherwise there's a risk that a slow node might discard valid gossip. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-09-25 04:01:56 +00:00
trueptolemy	d8dce6e61f	cleanup: Use `u32` as the type of `max_hops` in `gossipd`	2019-09-24 16:01:24 +02:00
Rusty Russell	b55ff34f93	gossipd: fix corner case where gossip msg too old after pending delay. Happened under Travis with --dev-fast-gossip (90 second prune time), but can happen anyway if gossip is almost 2 weeks old when we receive it: 2019-09-20T19:16:51.367Z DEBUG lightning_gossipd(20972): Received node_announcement for node 022d223620a359a47ff7f7ac447c85c46c923da53389221a0054c11c1e3ca31d59 2019-09-20T19:16:51.376Z DEBUG lightning_gossipd(20972): Ignoring node_announcement timestamp 1569006918 for 022d223620a359a47ff7f7ac447c85c46c923da53389221a0054c11c1e3ca31d59 2019-09-20T19:16:51.669Z BROKEN lightning_gossipd(20972): pending node_announcement 01013094af771d60f4de69bb39ce045e4edf4a06fe6c80078dfa4fab58ab5617d6ad4fa34b6d3437380db0a8293cea348bbc77f714ef71fcd8515bfc82336667441f00005d852546022d223620a359a47ff7f7ac447c85c46c923da53389221a0054c11c1e3ca31d59022d2253494c454e544152544953542d633961313734610000000000000000000000000000 malformed? (version c9a174a) Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-09-22 20:56:11 +02:00
Rusty Russell	6a8d18c7e3	gossipd: naming cleanups. Suggested-by: @cdecker. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-09-20 06:55:00 +00:00
Rusty Russell	39c9dcbafc	ratelimit: adjust based on --dev-fast-gossip, test. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-09-20 06:55:00 +00:00
Rusty Russell	147eaced2e	developer: consolidiate gossip timing options into one --dev-fast-gossip. It's generally clearer to have simple hardcoded numbers with an #if DEVELOPER around it, than apparent variables which aren't, really. Interestingly, our pruning test was always kinda broken: we have to pass two cycles, since l2 will refresh the channel once to avoid pruning. Do the more obvious thing, and cut the network in half and check that l1 and l3 time out. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-09-20 06:55:00 +00:00
Rusty Russell	8139164aa0	gossipd: disallow far future (+1 day) or far past (2 weeks) timestamps. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-09-20 06:55:00 +00:00
Rusty Russell	76860683aa	gossipd: only allow one channel_update per direction per day. And similar for node_announcement. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-09-20 06:55:00 +00:00
Rusty Russell	a92ead48bf	gossipd: ignore redundant channel_update and node_announcement. If you send a message which simply changes timestamp and signature, we drop it. You shouldn't be doing that, and the door to ignoring them was opened by by option_gossip_query_ex, which would allow clients to ignore updates with the same checksum. This is more aggressive at reducing spam messages, but we allow refreshes (to be conservative, we allow them even when 1/2 of the way through the refresh period). I dropped the now-unnecessary sleep from test_gossip_pruning, too. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-09-20 06:55:00 +00:00
Rusty Russell	0bab2580fc	gossipd: clean up local channel updates. Make update_local_channel use a timer if it's too soon to make another update. 1. Implement cupdate_different() which compares two updates. 2. make update_local_channel() take a single arg for timer usage. 3. Set timestamp of non-disable update back 5 minutes, so we can always generate a disable update if we need to. 4. Make update_local_channel() itself do the "unchanged update" suppression. gossipd: clean up local channel updates. 5. Keep pointer to the current timer so we override any old updates with a new one, to avoid a race. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-09-20 06:55:00 +00:00
Rusty Russell	27d9b75456	gossipd: add shadow structure for local chans. Normally we'd put a pointer into struct half_chan for local information, but it would be NULL on 99.99% of nodes. Instead, keep a separate hash table. This immediately subsumes the previous "map of local-disabled channels", and will be enhanced further. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-09-20 06:55:00 +00:00
trueptolemy	5361a5d059	JSON-API: `getroute` now also support `exclude` nodes	2019-09-16 12:22:06 +08:00
Rusty Russell	a46e880f1d	gossipd: in DEVELOPER mode, catch missing free_chan() For memory-usage reasons, struct chan doesn't use a tal destructor, in favor of us calling free_chan in the right places. In DEVELOPER mode, we should check that is the case. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-09-12 05:11:56 +00:00
Rusty Russell	768d293149	gossipd: don't get upset if we can't add channel_update. In particular, the timestamp might be wrong once we start checking that. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-09-12 05:11:56 +00:00
Rusty Russell	2577ad87d5	gossipd: use gossip_time_now() everywhere. We've been slack, but it's going to be important for testing ratelimiting. And it currently has a minor memory leak. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-09-12 05:11:56 +00:00
darosior	0b0ad4c22d	transition from status_trace() to status_debug	2019-09-10 02:02:51 +00:00
Rusty Russell	aca2e4f722	common/memleak: add dynamic hooks for assisting memleak. Rather than reaching into data structures, let them register their own callbacks. This avoids us having to expose "memleak_remove_xxx" functions, and call them manually. Under the hood, this is done by having a specially-named tal child of the thing we want to assist, containing the callback. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-09-06 14:35:01 +02:00
Rusty Russell	2f1e116510	gossipd: use htable_count() rather than reaching into htable struct. Now ccan/htable provides the helper, let's use it. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-08-26 08:44:22 +00:00
ZmnSCPxj	3e74ca4b86	gossipd/routing.c: Correctly handle a duplicated entry in `exclude` of `getroute`.	2019-08-02 16:06:15 +02:00
Rusty Russell	6bb8525e5d	gossipd: fix crash when we prune old, un-updated channel announcements. We added a random channel to the list, but we can just free it immediately (since traversal of a uintmap isn't altered by deletion). This was introduced in `d1f43d993a` where we explicitly call free_chan rather than relying on destructors. Fixes: #2837 Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-07-28 14:15:32 +02:00
Rusty Russell	b3215a866b	gossipd: fix inverted test in debug print. ==1503== Use of uninitialised value of size 8 ==1503== at 0x566786B: _itoa_word (_itoa.c:179) ==1503== by 0x566AF0D: vfprintf (vfprintf.c:1642) ==1503== by 0x569790F: vsnprintf (vsnprintf.c:114) ==1503== by 0x156CCB: do_vfmt (str.c:66) ==1503== by 0x156DB1: tal_vfmt_ (str.c:92) ==1503== by 0x1289CD: status_vfmt (status.c:141) ==1503== by 0x128AAC: status_fmt (status.c:151) ==1503== by 0x118E05: route_prune (routing.c:2495) ==1503== by 0x11DE2D: gossip_refresh_network (gossipd.c:1997) ==1503== by 0x1292B8: timer_expired (timeout.c:39) ==1503== by 0x12088C: main (gossipd.c:3075) Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-07-17 20:16:55 -05:00
Rusty Russell	8928f0b5f9	gossipd: remove gossip entirely if we hit a problem on load. The crashes in #2750 are mostly caused by us trying to partially truncate the store. The simplest fix for release is to discard the whole thing if we detect a problem. This is a workaround: it'd be far nicer to try to recover. Fixes: #2750 Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-06-21 22:03:35 +00:00
Rusty Russell	8ce3b86aa5	gossipd: tighter correctness checks during gossip_store load. We shouldn't be loading old timestamps, either. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-06-21 22:03:35 +00:00
Rusty Russell	10c503b4b4	gossip_store: clean up a truncated store. We might have channel_announcements which have no channel_update: normally these don't get written into the store until there is one, but if the store was truncated it can happen. We then get upset on compaction, since we don't have an in-memory representation of the channel_announcement. Similarly, we leave the node_announcement pending until after that channel_announcement, leading to a similar case. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-06-15 10:52:05 +02:00
Rusty Russell	745634d9b9	gossipd: don't catch pending node_announcements more than once. We catch node_announcements for nodes where we haven't finished analyzing the channel_announcement yet (either because we're still checking UTXO, or in this case, because we're waiting for a channel_update). But we reference count the pending_node_announce, so if we have multiple channels pending, we might try to insert it twice. Clear it so this doesn't happen. There's a second bug where we continue to catch node_announcements until all the channel_announcements are no longer pending; this is fixed by removing it from the map. Fixes: #2735 Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-06-13 05:58:09 +00:00
Rusty Russell	18069ab3da	gossipd: APIs return more information about routing message handling. In particular, we'll need to know the short_channel_id if a channel_update is unknown (implies we're missing a channel), and whether processing a pending channel_announcement was successful (implies that the channel was real). Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-06-12 00:37:46 +00:00
Rusty Russell	ab31f40aa2	gossipd: don't charge ourselves fees when calculating route. This means there's now a semantic difference between the default `fromid` and setting `fromid` explicitly to our own node_id. In the default case, it means we don't charge ourselves fees on the route. This means we can spend the full channel balance. We still want to consider the pricing of local channels, however: there's a reason to discount one over another, and that is to bias things. So we add the first-hop fee to the risk value instead. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-06-11 23:19:11 +00:00
Rusty Russell	f8b98e032c	gossipd: Don't abort() on duplicate entries in gossip_store. Triggered by a previous variant of this PR, but a goo1d idea to simply discard the store in general when we get a duplicate entry. We crash trying to delete old ones, which means writing to the store. But they should have already been deleted. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-06-04 01:29:39 +00:00
Rusty Russell	34c113a17a	gossipd: trivial clean up of routing_add_channel_update. For some reason I was reluctant to use the hc local variable; I even re-declared it! Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-06-04 01:29:39 +00:00
Rusty Russell	3e733afb2b	gossipd: remove broadcast map altogether. This clarifies things a fair bit: we simply add and remove from the gossip_store directly. Before this series: (--disable-developer, -Og) store_load_msec:20669-20902(20822.2+/-82) vsz_kb:439704-439712(439706+/-3.2) listnodes_sec:0.890000-1.000000(0.92+/-0.04) listchannels_sec:11.960000-13.380000(12.576+/-0.49) routing_sec:3.070000-5.970000(4.814+/-1.2) peer_write_all_sec:28.490000-30.580000(29.532+/-0.78) After: (--disable-developer, -Og) store_load_msec:19722-20124(19921.6+/-1.4e+02) vsz_kb:288320 listnodes_sec:0.860000-0.980000(0.912+/-0.056) listchannels_sec:10.790000-12.260000(11.65+/-0.5) routing_sec:2.540000-4.950000(4.262+/-0.88) peer_write_all_sec:17.570000-19.500000(18.048+/-0.73) Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-06-04 01:29:39 +00:00
Rusty Russell	948490ec58	gossipd: add timestamp in gossip store header. (We don't increment the gossip_store version, since there are only a few commits since the last time we did this). This lets the reader simply filter messages; this is especially nice since the channel_announcement timestamp is derived, not in the actual message. This also creates a 'struct gossip_hdr' which makes the code a bit clearer. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	2019-06-04 01:29:39 +00:00

1 2 3 4 5 ...

312 Commits