wvstolzing

wvstolzing@lemmy.ml · 4 months ago

Though ‘finding’ the UDP packet should cost a lot more, because, whoever knows where it is?

wvstolzing@lemmy.ml · 1 year ago

That is a great change to the papers of the past where you have to have an affiliation to a university to get access to a paper and sometimes even that is not enough.

‘Oxford Scholarship Online’ would license different sets of books to different departments; so someone from the philosophy department couldn’t get access to books classified under sociology or history.

Imagine doing something similar at the checkout table in a ‘physical’ library.

wvstolzing@lemmy.ml · 1 year ago

Here’s another video: https://www.youtube.com/watch?v=PriwCi6SzLo (including an interview with the great Alexandra Elbakyan).

Cory Doctorow recently wrote about this in some detail (incl. helpful links): https://pluralistic.net/2024/08/16/the-public-sphere/#not-the-elsevier

wvstolzing@lemmy.ml · 1 year ago

The name of the pdf file inside the torrent is its md5 hashsum without the .pdf extension.

On libgen.rs you can see the md5 hashsum on the download page; on libgen.li you need to look at the JSON file provided at the link on the search result , as they don’t render it on the ui.

wvstolzing@lemmy.ml · 1 year ago

The torrents are alive; as long as you can get the torrent links from libgen, you have access to the files. (No need to share whole archives either, you can pick & choose).

wvstolzing@lemmy.ml · 1 year ago

The Nyxt browser – webkit as rendering engine, extensible by Common Lisp – was making good progress, though its progress slowed down considerably lately; and there are a few ‘showstoppers’ preventing everyday usage, at least for me.

wvstolzing@lemmy.ml · 1 year ago

deleted by creator

wvstolzing@lemmy.ml · 1 year ago

Michael W. Lucas’s “Networking for System Administrators” is a great resource: https://mwl.io/nonfiction/networking#n4sa

wvstolzing@lemmy.ml · 2 years ago

chromium is based on a fork of webkit; webkit proper does remain – I don’t know how much of an influence google has on it though; all I ‘know’ is that it’s Apple’s adoption of a KDE project.

wvstolzing@lemmy.ml · 2 years ago

Firefox is already compatible with v3, by the way, since version 109: https://extensionworkshop.com/documentation/develop/manifest-v3-migration-guide/

wvstolzing@lemmy.ml · edit-2 2 years ago

Another vote for Tesseract – just to clarify the terminology, though: PDF is a fragile format best used read-only; so you really don’t want to edit a pdf, but make a new one using the same (or cleaned-up) bitmaps and a new ocr text layer.

Now, tesseract is excellent at recognizing glyphs; but especially if the scanned image is a little fuzzy, the layout detection falters; and when it falters, you get redundant line breaks, & chunks of text in the wrong order – all of which gets incredibly annoying for searching & copying purposes. So if you can spare the time, and the text requires it, you may need to mark regions (paragraphs & titles mainly) on the bitmap image manually. There exist a few frontends to Tesseract that help with a task like that; check out, e.g., https://github.com/manisandro/gImageReader - inside single paragraph blocks of text, Tesseract doesn’t get as easily confused; and the text output is in the correct reading order, & w/o redundant breaks.

wvstolzing@lemmy.ml · 2 years ago

Recently I became aware of ‘StarLite’ tablets – the prices are pretty steep, but the specs look really good, esp. wrt the screen.