~julianmarcos


#32 Unknown link types result in offpunk sync crashing 1 year, 2 months ago

Comment by ~julianmarcos on ~lioploum/offpunk

Referencing the commit that fixes this for posterity: https://git.sr.ht/~lioploum/offpunk/commit/316465835217744f560fe2cd68bc457c1fc998d6 Commit id 316465835217744f560fe2cd68bc457c1fc998d6

Thanks, I have finally synced offpunk on my computer without it crashing. (Yes, I have read the comments you did the moment you posted them, but I've been holding off until offpunk finishes so I don't have to send another comment if there's a bug related to the same field.) I have not had any other issues, thanks.

( Also, I guess I should have tried to fix the adding every fetched link to the tour part as offpunk has already put over 31000 links in there. I guess I accidentally almost made a archive of the world wide web, I should probably also request a feature to ignore certain links when fetching, (like upload.wikimedia.org or en.wikipedia.org) And yes, I think I'm a bit insane using --depth 5 and linking to a capsule list. I guess I should probably email the user-discussion list to make the proposals. )

#32 Unknown link types result in offpunk sync crashing 1 year, 2 months ago

Comment by ~julianmarcos on ~lioploum/offpunk

On Thu, 30 Nov 2023 22:24:20 +0000 "~lioploum" outgoing@sr.ht wrote:

Could you test it with trunk? I think I’ve fixed a similar crash in the upcoming 2.1 No, it seems like my problem from this test with trunk (c3aff6755e256fd977bd5073c5c2880f03d9c177)

https://tracker.debian.org[7]
Traceback (most recent call last):
  File "/usr/sbin/offpunk", line 8, in <module>
    sys.exit(main())
             ^^^^^^
  File "/usr/lib/python3.11/site-packages/offpunk.py", line 1897, in main
    gc.call_sync(refresh_time=refresh_time,depth=depth,lists=args.url)
  File "/usr/lib/python3.11/site-packages/offpunk.py", line 1747, in call_sync
    fetch_list(l,validity=refresh_time,depth=depth,tourchildren=True)
  File "/usr/lib/python3.11/site-packages/offpunk.py", line 1721, in fetch_list
    fetch_url(l,depth=depth,validity=validity,savetotour=tourchildren,count=[counter,end])
  File "/usr/lib/python3.11/site-packages/offpunk.py", line 1711, in fetch_url
    fetch_url(k,depth=d,validity=0,savetotour=savetotour,\
  File "/usr/lib/python3.11/site-packages/offpunk.py", line 1711, in fetch_url
    fetch_url(k,depth=d,validity=0,savetotour=savetotour,\
  File "/usr/lib/python3.11/site-packages/offpunk.py", line 1711, in fetch_url
    fetch_url(k,depth=d,validity=0,savetotour=savetotour,\
  [Previous line repeated 2 more times]
  File "/usr/lib/python3.11/site-packages/offpunk.py", line 1668, in fetch_url
    if not netcache.is_cache_valid(url,validity=validity):
           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/usr/lib/python3.11/site-packages/netcache.py", line 106, in is_cache_valid
    cache = get_cache_path(url)
            ^^^^^^^^^^^^^^^^^^^
  File "/usr/lib/python3.11/site-packages/netcache.py", line 135, in get_cache_path
    parsed = urllib.parse.urlparse(url)
             ^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/usr/lib/python3.11/urllib/parse.py", line 395, in urlparse
    splitresult = urlsplit(url, scheme, allow_fragments)
                  ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/usr/lib/python3.11/urllib/parse.py", line 500, in urlsplit
    _check_bracketed_host(bracketed_host)
  File "/usr/lib/python3.11/urllib/parse.py", line 446, in _check_bracketed_host
    ip = ipaddress.ip_address(hostname) # Throws Value Error if not IPv6 or IPv4
         ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/usr/lib/python3.11/ipaddress.py", line 54, in ip_address
    raise ValueError(f'{address!r} does not appear to be an IPv4 or IPv6 address')
ValueError: '7' does not appear to be an IPv4 or IPv6 address

(I've added a debugging call to fetch_url to print the url every time, so I had the data from fetch_url)

The URL gemini://rkta.srht.site/debbug-subscribe.gmi fails to be able to extract links as netcache seems to interpret the link

=> https://tracker.debian.org [7] https://tracker.debian.org

Erroneously, but that's not handled by offpunk but by urllib, it seems to be run something like the following.

urllib.parse.urlparse("https://tracker.debian.org[7]")

fetch_url() receives url as 'https://tracker.debian.org[7]' which is inherently incorrect I think. Is the page wrong?

It further seems to be called as Line 1710 in offpunk.py calls fetch_url, it is running with mode == "links_only", which means that when AbstractRender().get_links(mode) gets called (from the page which links to the different string) the function should only return the link, which should be https://tracker.debian.org.

I guess to fix this editing some things from lines in between 497 and 523 is needed.

#32 Unknown link types result in offpunk sync crashing 1 year, 2 months ago

Ticket created by ~julianmarcos on ~lioploum/offpunk

Unknown links cause offpunk to raise the following:

Traceback (most recent call last):rc://irc.libera.chat/#xxxx           
  File "/usr/sbin/offpunk", line 8, in <module>
    sys.exit(main())
             ^^^^^^
  File "/usr/lib/python3.11/site-packages/offpunk.py", line 1888, in main
    gc.call_sync(refresh_time=refresh_time,depth=depth,lists=args.url)
  File "/usr/lib/python3.11/site-packages/offpunk.py", line 1739, in call_sync
    fetch_list(l,validity=refresh_time,depth=depth,tourchildren=True)
  File "/usr/lib/python3.11/site-packages/offpunk.py", line 1713, in fetch_list
    fetch_url(l,depth=depth,validity=validity,savetotour=tourchildren,count=[counter,end])
  File "/usr/lib/python3.11/site-packages/offpunk.py", line 1703, in fetch_url
    fetch_url(k,depth=d,validity=0,savetotour=savetotour,\
  File "/usr/lib/python3.11/site-packages/offpunk.py", line 1703, in fetch_url
    fetch_url(k,depth=d,validity=0,savetotour=savetotour,\
  File "/usr/lib/python3.11/site-packages/offpunk.py", line 1703, in fetch_url
    fetch_url(k,depth=d,validity=0,savetotour=savetotour,\
  [Previous line repeated 1 more time]
  File "/usr/lib/python3.11/site-packages/offpunk.py", line 1696, in fetch_url
    links = r.get_links(mode=mode)
            ^^^^^^^^^^^^^^^^^^^^^^
  File "/usr/lib/python3.11/site-packages/ansicat.py", line 528, in get_links
    self._build_body_and_links(mode)
  File "/usr/lib/python3.11/site-packages/ansicat.py", line 515, in _build_body_and_links
    abs_l = urllib.parse.urljoin(self.url,l.split()[0])
            ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/usr/lib/python3.11/urllib/parse.py", line 551, in urljoin
    urlparse(url, bscheme, allow_fragments)
  File "/usr/lib/python3.11/urllib/parse.py", line 395, in urlparse
    splitresult = urlsplit(url, scheme, allow_fragments)
                  ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/usr/lib/python3.11/urllib/parse.py", line 500, in urlsplit
    _check_bracketed_host(bracketed_host)
  File "/usr/lib/python3.11/urllib/parse.py", line 446, in _check_bracketed_host
    ip = ipaddress.ip_address(hostname) # Throws Value Error if not IPv6 or IPv4
         ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/usr/lib/python3.11/ipaddress.py", line 54, in ip_address
    raise ValueError(f'{address!r} does not appear to be an IPv4 or IPv6 address')
ValueError: '7' does not appear to be an IPv4 or IPv6 address

#31 Handling certificates for status 6X are not supported by offpunk. 1 year, 2 months ago

Comment by ~julianmarcos on ~lioploum/offpunk

Just noticed it might be dangerous with sites like bubble, where clicking on a link can do things such as delete a post, so maybe yes implement it, but put some warning up.

#31 Handling certificates for status 6X are not supported by offpunk. 1 year, 2 months ago

Ticket created by ~julianmarcos on ~lioploum/offpunk

Handling certificates for status 6X are not supported by offpunk

Please open a bug report

I ran onto this bug when trying to sync to local the website auragem.letz.dev, (I can't be really sure if it's auragem.letz.dev or ploum.net, as they were both in the sync list) with --depth 5.

Could there be added support for setting certificates to use to load websites?

#190 Accounts not shown in the order requested with `-a` 1 year, 5 months ago

Ticket created by ~julianmarcos on ~rjarry/aerc

aerc doesn't seem to show the accounts in the order requested, so every account seems to be set in a weirdly placed order, instead of the order given when choosing the accounts to open. The suggested solution is to order the accounts in the same order as specified in the -a option.

#156 Be able to move emails in between accounts 1 year, 5 months ago

Comment by ~julianmarcos on ~rjarry/aerc

Correct, when you start typing :mv -a it'll list all accounts, and when you complete one of them or when you write the entire name, it would automatically start completing folder names for that account.

#156 Be able to move emails in between accounts 1 year, 5 months ago

on ~rjarry/aerc

On Mon Aug 28, 2023 at 2:16 PM CEST, ~julianmarcos wrote:

  1. mv grows new option -a meaning the target account, so while in the “FLOSS” account, I could run command mv -a Home Junk to move the message to the Junk folder in the Home account. Seems like the simplest solution of this problem, but seems quite hackish.

I guess this would be the easiest.

How does the completion work? Is it possible that in the moment -a NAMEOFACCOUNT is added on the command line, aerc would complete from the target account not from the current one?

Best,

#Matěj

https://matej.ceplovi.cz/blog/, @mcepl@floss.social GPG Finger: 3C76 A027 CA45 AD70 98B5 BC1D 7920 5802 880B C9D8

Find the dependencies -- and eliminate them. -- according to http://is.gd/oeYpcI the motto of the MS Excel team

#156 Be able to move emails in between accounts 1 year, 5 months ago

Comment by ~julianmarcos on ~rjarry/aerc

On Mon Aug 28, 2023 at 1:42 PM CEST, ~mcepl wrote:

  1. mv grows new option -a meaning the target account, so while in the “FLOSS” account, I could run command mv -a Home Junk to move the message to the Junk folder in the Home account. Seems like the simplest solution of this problem, but seems quite hackish.

I guess this would be the easiest.

  1. Whole from mv needs to be redefined to include (optionally) the account name. E.g., something like mv #Home/Junk (instead of # use any other character which cannot be legitimately part of the account name). Probably more complicated, but seems like closer to The Right Thing™ to me.

Yeah, this would mean limiting the possible account name characters, which may or may not be a breaking change. But I guess you could escape the characters, but I guess we could see which characters couldn't be legitimely part of the folder name, because of that I think 1 would be simpler.

#156 Be able to move emails in between accounts 1 year, 5 months ago

Comment by ~julianmarcos on ~rjarry/aerc

Ops, it didn't stop quoting the data properly.