unicode

Dive deep into Swift String
Today we will go through the fundamentals of the String type in Swift, from the String Encoding to the Swift String API. Let's dive in!
swift:string  unicode 
4 days ago by nanoxd
Vim: enter Unicode characters with 8-digit hex code - Stack Overflow | https://stackoverflow.com/
Things I know:

The command ga on the character 𝓭 gives me hex:0001d4ed.
I can copy it on the clipboard and paste it via "+p.
I know how to enter Unicode values that have a 4 digit hex code:
<C-v>u for example <C-v>u03b1 gives the α character.
vim  unicode  specialcharacters  textediting  solution 
6 days ago by kme
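For the Vim bookmark above: the split between 4-digit and 8-digit hex codes is not specific to Vim. As a rough illustration, Python string escapes draw the same line, with `\u` taking four hex digits and `\U` taking eight. (Vim's own documentation describes an uppercase `<C-v>U` form that accepts up to eight hex digits, which is probably what the question is after.) The helper name `describe` is made up for this sketch.

```python
# Characters from the Vim excerpt: alpha fits in 4 hex digits, script d does not.
alpha = "\u03b1"         # 4-digit escape, enough for U+0000..U+FFFF
script_d = "\U0001d4ed"  # 8-digit escape, needed for code points above U+FFFF


def describe(ch: str) -> str:
    """Return the code point as 8 hex digits (the form `ga` reports) plus its UTF-8 bytes."""
    return f"{ord(ch):08x} utf-8={ch.encode('utf-8').hex(' ')}"


print(describe(alpha))     # 000003b1 utf-8=ce b1
print(describe(script_d))  # 0001d4ed utf-8=f0 9d 93 ad
```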
Is there an alternative to sed that supports unicode? - Unix & Linux Stack Exchange | https://unix.stackexchange.com/
Just use that syntax:

sed 's/馑//g' file1

Or in the escaped form:

sed "s/$(echo -ne '\u9991')//g" file1

(Note that older versions of Bash and some shells do not understand echo -e '\u9991', so check first.)
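
Where the shell's `\u` handling is in doubt, the same deletion can be sketched in Python, which works on code points rather than bytes. This is only an illustration, not part of the linked answer; `strip_char` and the sample string are hypothetical.

```python
TARGET = "\u9991"  # the same code point the sed commands remove


def strip_char(text: str, target: str = TARGET) -> str:
    """Remove every occurrence of `target`, like sed's s/.../ /g with an empty replacement."""
    return text.replace(target, "")


sample = "ab\u9991cd\u9991"  # hypothetical input containing the character twice
print(strip_char(sample))                # abcd
print(TARGET.encode("utf-8").hex(" "))   # e9 a6 91
```

To apply it to a file, read and write with an explicit `encoding="utf-8"` so the result does not depend on the locale.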
sed  unicode  textprocessing  solution 
6 days ago by kme
The World’s Writing Systems
This web site presents one glyph for each of the world’s writing systems. It is the first step of the Missing Scripts Project, a long-term initiative that aims to identify writing systems which are not yet encoded in the Unicode standard. As of today, there are still 146 scripts not yet encoded in Unicode.
typography  history  art  unicode 
8 days ago by bezthomas
Characters hard to substitute with sed - Stack Overflow
I think that the sed command you are looking for is this:

sed 's/\xE2\x80\x94/-/g' thisfile

\xE2\x80\x94 is the hex form of what I assume is the offending byte sequence. (FYI, it is the UTF-8 encoding of U+2014, the em dash.) This is preferable to trying to put special characters directly into a sed command.

If this does not work, use hexdump to find out exactly what the offending bytes are.

hexdump -C thisfile
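
As a cross-check of the byte sequence in that answer, here is a small Python sketch that confirms what the three bytes decode to and performs the same substitution at the byte level. The function name `replace_em_dash` and the sample bytes are assumptions made for illustration.

```python
import unicodedata

EM_DASH_UTF8 = b"\xe2\x80\x94"  # the bytes used in the sed command


def replace_em_dash(data: bytes) -> bytes:
    """Byte-level equivalent of sed 's/\\xE2\\x80\\x94/-/g'."""
    return data.replace(EM_DASH_UTF8, b"-")


# Confirm which character the bytes stand for.
print(unicodedata.name(EM_DASH_UTF8.decode("utf-8")))  # EM DASH

sample = "a\u2014b".encode("utf-8")  # hypothetical file contents
print(sample.hex(" "))               # 61 e2 80 94 62
print(replace_em_dash(sample))       # b'a-b'
```

Working on bytes sidesteps the question of which encoding the tool assumes, which is the same reason the answer prefers hex escapes.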


I understood it, and that is precisely the character. Alas, I ran the sed command and it didn't work. The hexdump shows it as '? 200 224', but a test file I created by typing the same dash shows the same thing in the hexdump. The hexdumps are identical, yet only the files in ISO-8859-15 have problems when displayed in kate or as subs.
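
One possible reading of '? 200 224' is that the dump was printed in octal: 200 and 224 are the octal values of the bytes 0x80 and 0x94. The sketch below checks that arithmetic and also shows that ISO-8859-15 has no em dash at all, which may explain the display problems. The helper `can_encode` is hypothetical, and this is an interpretation of the comment, not something the thread confirms.

```python
em_dash_bytes = "\u2014".encode("utf-8")

# The three UTF-8 bytes in octal; the last two match the "200 224" in the comment.
print(" ".join(f"{b:03o}" for b in em_dash_bytes))  # 342 200 224


def can_encode(ch: str, codec: str) -> bool:
    """Report whether `ch` has a representation in `codec`."""
    try:
        ch.encode(codec)
        return True
    except UnicodeEncodeError:
        return False


print(can_encode("\u2014", "iso8859_15"))  # False

# What a viewer sees if it treats the UTF-8 bytes as ISO-8859-15:
# a-circumflex followed by two invisible C1 control characters.
print(ascii(em_dash_bytes.decode("iso8859_15")))
```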
shell  terminal  cli  bash  utf8  unicode 
9 days ago by dusko
Dynamic charset converter - interactive conversion tables for comparing 8-bit character sets, showing their Unicode names and binary, octal, decimal, hexadecimal, and UTF-8 values
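
To give a feel for the kind of table row such a converter shows, here is a rough Python sketch that describes one byte under two 8-bit character sets. It is not the site's code; `describe_byte` and the choice of byte 0xA4 are assumptions for illustration.

```python
import unicodedata


def describe_byte(value: int, charset: str) -> dict:
    """Build one comparison row: the byte in several bases plus its Unicode identity."""
    ch = bytes([value]).decode(charset)
    return {
        "charset": charset,
        "binary": f"{value:08b}",
        "octal": f"{value:03o}",
        "decimal": value,
        "hex": f"{value:02x}",
        "code_point": f"U+{ord(ch):04X}",
        "name": unicodedata.name(ch, "<unnamed>"),
        "utf8": ch.encode("utf-8").hex(" "),
    }


# Byte 0xA4 is the currency sign in ISO-8859-1 but the euro sign in ISO-8859-15.
for charset in ("iso8859_1", "iso8859_15"):
    print(describe_byte(0xA4, charset))
```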
unicode  utf8 
9 days ago by dusko
