cleaning relay lists

Hey folks,

I’ve been poking around a big old list of relays (about 2900 of them, from
the Damus dump):
https://gist.github.com/melvincarvalho/ca79028a47aeea346bfdcf1950b1ba9f

Some fun oddities in there:

- a few aren’t relays at all

- some use encryption or odd schemes

- some aren’t even URIs

- lots of little duplications (trailing slash vs no slash, etc.)

Feels like it might be time for a shared regex (or a tiny lib/CLI tool?) to
help clean up these kinds of lists. Would be handy for all sorts of tooling.

Anyone already tackled this kind of cleanup, or want to pitch in?

Melvin

Received on Friday, 16 May 2025 09:13:47 UTC