unicode_chars.rs is outdated again #51080

@est31

src/libsyntax/parse/lexer/unicode_chars.rs contains a list of confusable characters (see #25957). Its comments say that the list is based on a document, and a newer revision of that document has since been published (link). We should update unicode_chars.rs to include the newly listed confusables as well.

During the last update, in #43629, I suggested going through the list manually. But maybe we can find a way to isolate the characters that are special to us and autogenerate the rest of the list?
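
To make the autogeneration idea more concrete, here is a rough sketch of what the filtering step could look like. It assumes the source document is a text file with lines of the form `SOURCE ; TARGET ; TYPE # comment`, where `SOURCE` and `TARGET` are hexadecimal code points; that format is an assumption, not something this issue states. The names `SPECIAL_ASCII`, `parse_line`, `parse_single`, and `special_confusables` are hypothetical, and the contents of `SPECIAL_ASCII` are only an illustrative guess at which characters are "special to us".

```rust
use std::collections::BTreeMap;

/// ASCII characters we care about as confusable targets.
/// Hypothetical subset for illustration, not the actual list in unicode_chars.rs.
const SPECIAL_ASCII: &[char] = &[
    ' ', '_', '-', ',', ';', ':', '!', '?', '.', '\'', '"',
    '(', ')', '[', ']', '{', '}', '*', '/', '\\', '&', '+', '<', '=', '>',
];

/// Parse a field that must contain exactly one hexadecimal code point.
/// Fields with several code points (multi-character sequences) are rejected.
fn parse_single(field: &str) -> Option<char> {
    let mut parts = field.split_whitespace();
    let first = parts.next()?;
    if parts.next().is_some() {
        return None;
    }
    char::from_u32(u32::from_str_radix(first, 16).ok()?)
}

/// Parse one line in the assumed `SOURCE ; TARGET ; TYPE # comment` format.
/// Returns `None` for comment lines, blank lines, and anything unparsable.
fn parse_line(line: &str) -> Option<(char, char)> {
    // Everything after the first '#' is treated as a comment.
    let data = line.split('#').next()?.trim();
    if data.is_empty() {
        return None;
    }
    let mut fields = data.split(';').map(str::trim);
    let source = parse_single(fields.next()?)?;
    let target = parse_single(fields.next()?)?;
    Some((source, target))
}

/// Keep only non-ASCII characters whose target is one of `SPECIAL_ASCII`.
fn special_confusables(input: &str) -> BTreeMap<char, char> {
    input
        .lines()
        .filter_map(parse_line)
        .filter(|(src, tgt)| !src.is_ascii() && SPECIAL_ASCII.contains(tgt))
        .collect()
}
```

The resulting map could then be printed as table entries, while any characters that need hand-written handling stay in a separate, manually maintained list. Whether a plain target-character filter is enough to capture "special to us" is an open question.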

Metadata

    Labels

    A-Unicode (Area: Unicode)
    C-feature-request (Category: A feature request, i.e: not implemented / a PR.)
    T-libs-api (Relevant to the library API team, which will review and decide on the PR/issue.)
