Add Farsi characters to command detection #12
Loading…
Add table
Add a link
Reference in a new issue
No description provided.
Delete branch "main"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
In addition to Latin characters, packages like xetex enable users to have commands defined by e.g., Farsi characters. This pull request adds such characters to the list of characters that can form a command. This list requires extending to include all Arabic and maybe other language characters.
What if we just put
\u00c0-\uffffthere, rather than trying to special case all scripts? Or can you think of a non-ascii character that is commonly used to terminate a command name?It makes sense to include everything and exclude the ones we don't need. For example, U+200E, U+200F, U+202B, and the rest of the RTL/LTR encoding characters should be excluded. Or not, as they are not visible, they won't change anything visually for the user.
Pull request closed