~drwho

I am everywhere.

https://drwho.virtadpt.net/

Living 20 minutes into the future. Eccentric weirdo. Virtual Adept. Time traveler. Thelemite. Technomage. Hacker on main. APT 3319. Not human. 30% software and implants. H+ - 0.4 on the Berram-7 scale. Furry adjacent. Pan/poly. Burnout. Cyberpunk but I don't have enough hair left for a pink mohawk; still have the leather jacket. Sburb speedrunner. XKCD 705. The Diet Coke(tm) of evil. Antifa cyborg supersoldier. DJ at the crab rave afterparty.

Magneto was right.

I hate Pulseaudio. systemd can eat my entire ass.

Avatar by Avery Liell-Kok.


#491 Lieu search engine is aggressively spidering my site. a month ago

Comment by ~drwho on ~amolith/fediring

I think monthly recrawls make sense, ~amolith. Most sites don't change all that frequently and the ones that do tend to post their links before needing to search for them makes sense.

Is this a monthly full re-crawl, or is there a way to optimize it? Say, by paying attention to If-Modified-Since headers or HTTP 304 status returns?

#491 Lieu search engine is aggressively spidering my site. a month ago

on ~amolith/fediring

we should go with something along the lines of once every two weeks or once every month (depending on what Lieu allows)

Crawling and ingesting are just two commands that execute in sequence in a cronjob, so any cron expression is fine. It had just been @daily, but is now @monthly. There's a thread on fedi about it where the Lieu creator says they crawl the xxiivv ring manually every 3-4 months; based on that, I think leaving it at monthly sounds fine for now. It's not 3-4 months, but it's still a ~30x reduction in traffic ^^'

What do you think ~drwho?

#491 Lieu search engine is aggressively spidering my site. a month ago

Ticket created by ~drwho on ~amolith/fediring

Hello. My website (https://drwho.virtadpt.net/) is part of the Fediring. Earlier this week I noticed that one particular IP address (5.161.53.68 - fediring.net) is the source for about 63.7% of all of the web traffic on my site in the last 31 days.

I'm flattered that somebody's indexing me. However, that seems a little excessive. Is Lieu supposed to be that aggressive when it spiders sites? Or is my site just that big? I don't have any experience with it so I don't know if that's normal or not. Can you please advise?

#108 Request to join Fediring 2 years ago

Comment by ~drwho on ~amolith/fediring

Thanks for the link!

#108 Request to join Fediring 2 years ago

Comment by ~drwho on ~amolith/fediring

By the way, using the e-mail address on fediring.net for submitting site requests doesn't seem to work. My e-mail provider keeps getting the following error:

~amolithfediring@todo.sr.ht: host todo.sr.ht[173.195.146.145] said: 550 The tracker or ticket you requested does not exist. (in reply to end of DATA command)

#108 Request to join Fediring 2 years ago

Ticket created by ~drwho on ~amolith/fediring