As a little side (side, side) project I've hacked together a small mirror of the forum. It's quite simple and updates the most recent posts every ~2 hours.
Have a look, maybe it's useful: https://nimforum.code0.xyz
What did you build it with? I assume you used Nim, so I'm asking about libraries.
As for usefulness, I've found myself annoyed by the lack of no-JS post viewing when browsing historical posts on web.archive.org, so I think the Wayback Machine will make it more useful.
As a side note, I had no idea the forum experienced so much spam. Filtering and mods must do a pretty good job at getting rid of it quickly.
It ebbs and flows, it's been higher than normal lately. But good to hear that our moderation efforts work! The most annoying thing is that new users are in a "moderated" state until manually approved. So the only people who ever see all this spam are the moderators. Which means it's not even effective spam, it's a lose-lose situation.
sqlite has some search capabilities. i think those require having matching words in searches though.
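For what it's worth, here is a minimal sketch of what that could look like, assuming the mirror keeps its posts in SQLite and the SQLite build has the FTS5 extension compiled in (common, but not guaranteed). The table name `posts`, its columns, and the sample rows are made up for illustration; this is not how the mirror is actually built.

```python
import sqlite3

def build_index(posts):
    """Create an in-memory FTS5 table and fill it with (author, body) rows."""
    conn = sqlite3.connect(":memory:")
    # Hypothetical schema: a real mirror would likely store more fields.
    conn.execute("CREATE VIRTUAL TABLE posts USING fts5(author, body)")
    conn.executemany("INSERT INTO posts (author, body) VALUES (?, ?)", posts)
    return conn

def search(conn, query):
    """Return matching rows, best match first (FTS5's built-in `rank`)."""
    rows = conn.execute(
        "SELECT author, body FROM posts WHERE posts MATCH ? ORDER BY rank",
        (query,),
    )
    return rows.fetchall()

# Made-up sample posts, only for the demo.
posts = [
    ("alice", "The forum mirror updates every two hours"),
    ("bob", "Moderators remove spam quickly"),
    ("carol", "Nim has a nice standard library"),
]
conn = build_index(posts)

print(search(conn, "spam"))  # finds bob's post
print(search(conn, "junk"))  # no shared word, so nothing matches
```

The second query shows the limitation mentioned above: with the default tokenizer, a search only hits posts that contain the same word.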
there is also something like wrapping bert.cpp to make neural fingerprints, combine it with some kind of nearest neighbor thing like annoy or hnsw and you can make a neat little neural search engine that way as well. those are nice at finding stuff with inexact matches.
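A rough sketch of the nearest neighbor part, under loud assumptions: the vectors below are hand-made toy numbers standing in for real fingerprints (which would come from a BERT-style model), and the brute-force scan stands in for an approximate index like annoy or hnsw. The post ids and function names are hypothetical.

```python
import math

def cosine(a, b):
    """Cosine similarity of two equal-length, non-zero vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(x * x for x in b))
    return dot / (norm_a * norm_b)

def nearest(query_vec, index, k=1):
    """Return the ids of the k stored vectors most similar to query_vec.

    Brute force over every entry; an approximate index would replace
    this loop once the number of posts gets large.
    """
    scored = sorted(
        index.items(),
        key=lambda item: cosine(query_vec, item[1]),
        reverse=True,
    )
    return [post_id for post_id, _ in scored[:k]]

# Toy 3-dimensional "fingerprints" (assumption: real ones have hundreds of dims).
index = {
    "post-spam": [0.9, 0.1, 0.0],
    "post-mirror": [0.1, 0.9, 0.1],
    "post-search": [0.0, 0.2, 0.9],
}

# A query vector that is close to, but not identical to, "post-spam".
print(nearest([0.8, 0.2, 0.1], index, k=2))
```

The point is that the query does not have to match any stored vector exactly, it only has to be close, which is what makes this kind of search tolerant of inexact wording.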
i finally managed to get rid of the mirrored spam.
It's interesting that some mods of the forum seem to just delete the spam, while others set the author to "Moderated" or "Spammer".
I wonder if the spamming is done by (stupid?) bots, and if we could do some technical shenanigans to avoid that spam.