Homoglyphs and Clustering in Unicode

The dataset used in the paper is a collection of Unicode characters, with a focus on identifying homoglyphs and clustering them into equivalence classes.

Data and Resources

Cite this as

Perry Deng, Cooper Linsky, Matthew Wright (2025). Dataset: Homoglyphs and Clustering in Unicode. https://doi.org/10.57702/6ryjj52o

DOI retrieved: January 3, 2025

Additional Info

Field Value
Created January 3, 2025
Last update January 3, 2025
Defined In https://doi.org/10.1109/ISI49825.2020.9280538
Author Perry Deng
More Authors
Cooper Linsky
Matthew Wright
Homepage https://github.com/PerryXDeng/weaponizing-unicode