• rayon@lemm.ee
    link
    fedilink
    arrow-up
    2
    ·
    1 year ago

    That's true. Also I guess domain names in most ideogram-based languages cannot be meaningfully converted to ASCII. The best detection method I'm aware of is detecting a mix of different alphabets in the domain, but I imagine even this has a lot of false positives