pax_global_header00006660000000000000000000000064140455552320014517gustar00rootroot0000000000000052 comment=351815d8da1dc1b2227e8fd832e690f8aee47747 translitcodec-version-0.7.0/000077500000000000000000000000001404555523200160445ustar00rootroot00000000000000translitcodec-version-0.7.0/.gitignore000066400000000000000000000000251404555523200200310ustar00rootroot00000000000000*.pyc MANIFEST dist/ translitcodec-version-0.7.0/.hgignore000066400000000000000000000000511404555523200176430ustar00rootroot00000000000000syntax: glob MANIFEST dist build *.py? translitcodec-version-0.7.0/AUTHORS000066400000000000000000000005721404555523200171200ustar00rootroot00000000000000translitcodec was originally written by Jason Kirtland in 2008. Contributors are: - Jason Kirtland - Craig Dennis - Piotr Skamruk - Claude Paroz - Wojciech Banaś The translitcodec source distribution includes the 'transtab' package by Markus Kuhn . translitcodec-version-0.7.0/CHANGES000066400000000000000000000021561404555523200170430ustar00rootroot00000000000000===================== translitcodec Changes ===================== 0.7.0 ----- Released on May 8, 2021 - Added support for error handles - Fixed conversion of the German eszett char 0.6.0 ----- Released on December 13, 2020 - Add support for Python 3.9 0.5.2 ----- Released on January 19, 2020 - Install package with setuptools 0.5.1 ----- Released on January 19, 2020 - Add python_requires to prevent installation with Python 2 packages 0.5 --- Released on January 18, 2020 - Complete coverage of the Vietnamese alphabet - Removed Python 2 support 0.4 --- Released on May 11, 2015 - Added Python 3 compatibility 0.3 --- Released on February 14, 2011 - Fixes to the transtab table rebuilding tool. - Added translitcodec.__version__ 0.2 --- Released on January 27, 2011 - Resolves issue of "TypeError: character mapping must return integer, None or unicode" when a blank value (eg: \N{ZERO WIDTH SPACE} \u200B) was encoded. Unicode blanks are now returned. 
- Characters in the ASCII range are no longer included in the translation tables. 0.1 --- Released on December 28, 2008 - Initial packaged release. translitcodec-version-0.7.0/LICENSE000066400000000000000000000020731404555523200170530ustar00rootroot00000000000000Copyright (c) 2008 Jason Kirtland Permission is hereby granted, free of charge, to any person obtaining a copy of this software and associated documentation files (the "Software"), to deal in the Software without restriction, including without limitation the rights to use, copy, modify, merge, publish, distribute, sublicense, and/or sell copies of the Software, and to permit persons to whom the Software is furnished to do so, subject to the following conditions: The above copyright notice and this permission notice shall be included in all copies or substantial portions of the Software. THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE SOFTWARE. translitcodec-version-0.7.0/MANIFEST.in000066400000000000000000000002261404555523200176020ustar00rootroot00000000000000include AUTHORS include LICENSE include CHANGES include README recursive-include tests *py recursive-include scripts *py recursive-include transtab * translitcodec-version-0.7.0/README000066400000000000000000000060511404555523200167260ustar00rootroot00000000000000Unicode to 8-bit charset transliteration codec. This package contains codecs for transliterating ISO 10646 texts into best-effort representations using smaller coded character sets (ASCII, ISO 8859, etc.). 
The translation tables used by the codecs are from the ``transtab`` collection by Markus Kuhn.

Three types of transliterating codecs are provided:

"long", using as many characters as needed to make a natural
replacement. For example, \u00e4 LATIN SMALL LETTER A WITH DIAERESIS
``ä`` will be replaced with ``ae``.

"short", using the minimum number of characters to make a replacement.
For example, \u00e4 LATIN SMALL LETTER A WITH DIAERESIS ``ä`` will be
replaced with ``a``.

"one", only performing single character replacements. Characters that
cannot be transliterated with a single character are passed through
unchanged. For example, \u2639 WHITE FROWNING FACE ``☹`` will be
passed through unchanged.

Using the codecs is simple::

    >>> import translitcodec
    >>> import codecs
    >>> codecs.encode('fácil € ☺', 'translit/long')
    'facil EUR :-)'
    >>> codecs.encode('fácil € ☺', 'translit/short')
    'facil E :-)'

The codecs return Unicode strings (``str``) by default. To receive a
bytestring (``bytes``) back, either chain the output of encode() to
another codec, or append the name of the desired byte encoding to the
codec name::

    >>> codecs.encode('fácil € ☺', 'translit/one').encode('ascii', 'replace')
    b'facil E ?'
    >>> 'fácil € ☺'.encode('translit/one/ascii', 'replace')
    b'facil E ?'

The package also supplies a 'transliterate' codec, an alias for
'translit/long'.

Another way to use the library is through an error handler, passed as
the ``errors`` argument when encoding to a target charset. The
following error handlers are available:

* 'strict/translit/long', 'strict/translit/short', 'strict/translit/one' - similar to 'strict'
* 'ignore/translit/long', 'ignore/translit/short', 'ignore/translit/one' - similar to 'ignore'
* 'replace/translit/long', 'replace/translit/short', 'replace/translit/one' - similar to 'replace'

These error handlers work like Python's built-in ones. The difference
is that transliteration is attempted first; the built-in behaviour
applies only when the translation table offers no replacement for the
character.
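The mechanism behind such a handler can be sketched with the standard
library alone. This is a rough illustration under stated assumptions,
not the package's actual code: ``DEMO_TABLE``, ``demo_replace_translit``
and the handler name ``demo/replace/translit`` are made up for this
example, the table holds only two entries, and the Unicode
normalization step the package performs is left out.

```python
import codecs

# Hypothetical miniature table (code point -> replacement string).
# The real tables shipped with the package are far larger.
DEMO_TABLE = {0x20AC: 'EUR', 0x263A: ':-)'}


def demo_replace_translit(exc):
    """Transliterate one unencodable character, falling back to '?'."""
    if not isinstance(exc, UnicodeEncodeError):
        raise exc
    # Handle only the first offending character; the encoder calls the
    # handler again for any further ones.
    char = exc.object[exc.start]
    replacement = char.translate(DEMO_TABLE)
    if replacement == char:
        # No table entry: behave like the built-in 'replace' handler.
        replacement = '?'
    # Return the replacement and the position at which encoding resumes.
    return replacement, exc.start + 1


# Register under an illustrative name that cannot clash with the
# handler names provided by the package.
codecs.register_error('demo/replace/translit', demo_replace_translit)

encoded = 'ż € ☺ 另'.encode('ISO-8859-2', 'demo/replace/translit')
print(encoded.decode('ISO-8859-2'))  # prints: ż EUR :-) ?
```

Here ``ż`` exists in ISO-8859-2 and is encoded directly, so the handler
is only invoked for the three characters the target charset lacks.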
>>> codecs.encode('Zażółć gęślą jaźń € ☺另!@#', 'ISO-8859-2', 'replace/translit/long').decode('ISO-8859-2') 'Zażółć gęślą jaźń EUR :-)?!@#' >>> codecs.encode('Zażółć gęślą jaźń € ☺另!@#', 'ISO-8859-2', 'replace/translit/short').decode('ISO-8859-2') 'Zażółć gęślą jaźń E :-)?!@#' >>> codecs.encode('Zażółć gęślą jaźń € ☺另!@#', 'ISO-8859-2', 'replace/translit/one').decode('ISO-8859-2') 'Zażółć gęślą jaźń E ??!@#' >>> codecs.encode('Zażółć gęślą jaźń € ☺另!@#', 'ISO-8859-2', 'ignore/translit/long').decode('ISO-8859-2') 'Zażółć gęślą jaźń EUR :-)!@#' >>> codecs.encode('Zażółć gęślą jaźń € ☺另!@#', 'ISO-8859-2', 'ignore/translit/short').decode('ISO-8859-2') 'Zażółć gęślą jaźń E :-)!@#' >>> codecs.encode('Zażółć gęślą jaźń € ☺另!@#', 'ISO-8859-2', 'ignore/translit/one').decode('ISO-8859-2') 'Zażółć gęślą jaźń E !@#' translitcodec-version-0.7.0/scripts/000077500000000000000000000000001404555523200175335ustar00rootroot00000000000000translitcodec-version-0.7.0/scripts/update_table.py000066400000000000000000000045171404555523200225450ustar00rootroot00000000000000""" Updates translitcodec/__init__.py with translation table information built from the 'transtab' database. :copyright: the translitcodec authors and developers, see AUTHORS. :license: MIT, see LICENSE for more details. 
""" import csv import os import sys csv.register_dialect('transtab', delimiter=';') def read_table(path='transtab/transtab'): long, short, single = {}, {}, {} with open(path) as fh: for line in fh.readlines(): if not line.startswith('<'): continue from_spec, raw_to = line.strip().split(' ', 1) from_ord = int(from_spec[2:-1], 16) if from_ord <= 128: continue raw = next(csv.reader([raw_to], 'transtab')) long_char = _unpack_uchrs(raw[0]) if len(raw) < 2: short_char = long_char else: short_char = _unpack_uchrs(raw[1]) long[from_ord] = long_char short[from_ord] = short_char if len(short_char) == 1: single[from_ord] = short_char return long, short, single def _unpack_uchrs(packed): chunks = packed.replace(''): bucket = old with open(path, 'w') as fh: fh.writelines(preamble) fh.write("\n") _dump_dict(fh, 'long_table', long) _dump_dict(fh, 'short_table', short) _dump_dict(fh, 'single_table', single) fh.write("\n") fh.writelines(postamble) def _dump_dict(fh, name, data): fh.write("%s = {\n" % name) for pair in sorted(data.items()): fh.write(" %r: %r,\n" % pair) fh.write("}\n\n") if __name__ == '__main__': if not (os.path.exists('translitcodec') and os.path.exists('transtab')): print("Can not find translitcodec/ and transtab/ directories.") sys.exit(-1) tables = read_table() update_inclusion(*tables) print("Updated.") translitcodec-version-0.7.0/setup.py000066400000000000000000000023441404555523200175610ustar00rootroot00000000000000import codecs from setuptools import setup lines = codecs.open('README', 'r', 'utf-8').readlines()[3:] lines.append('\n') lines.extend(codecs.open('CHANGES', 'r', 'utf-8').readlines()[1:]) desc = ''.join(lines).lstrip() import translitcodec version = translitcodec.__version__ setup(name='translitcodec', version=version, description='Unicode to 8-bit charset transliteration codec', long_description=desc, long_description_content_type='text/x-rst', author='Jason Kirtland', author_email='jek@discorporate.us', 
url='https://github.com/claudep/translitcodec', packages=['translitcodec'], license='MIT License', python_requires='>=3', classifiers=[ 'Development Status :: 5 - Production/Stable', 'Intended Audience :: Developers', 'License :: OSI Approved :: MIT License', 'Operating System :: OS Independent', 'Programming Language :: Python', 'Programming Language :: Python :: 3', 'Programming Language :: Python :: 3 :: Only', 'Programming Language :: Python :: Implementation :: CPython', 'Topic :: Software Development :: Libraries', 'Topic :: Utilities', ], ) translitcodec-version-0.7.0/tests/000077500000000000000000000000001404555523200172065ustar00rootroot00000000000000translitcodec-version-0.7.0/tests/test_codec.py000066400000000000000000000066131404555523200217020ustar00rootroot00000000000000"""Very basic codec tests. :copyright: the translitcodec authors and developers, see AUTHORS. :license: MIT, see LICENSE for more details. """ import codecs import translitcodec from unittest import TestCase class CodecTests(TestCase): data = '£ ☹ wøóf méåw' def test_default(self): assert codecs.encode(self.data, 'transliterate') == 'GBP :-( woof meaaw' def test_translit_long(self): assert codecs.encode(self.data, 'translit/long') == 'GBP :-( woof meaaw' def test_translit_short(self): assert codecs.encode(self.data, 'translit/short') == 'GBP :-( woof meaw' def test_translit_one(self): assert codecs.encode(self.data, 'translit/one') == '\u00a3 \u2639 woof meaw' def test_translit_long_ascii(self): assert self.data.encode('translit/long/ascii') == b'GBP :-( woof meaaw' def test_translit_short_ascii(self): assert self.data.encode('translit/short/ascii') == b'GBP :-( woof meaw' def test_translit_one_ascii(self): try: codecs.encode(self.data, 'translit/one/ascii') assert False except UnicodeEncodeError: assert True assert codecs.encode(self.data, 'translit/one/ascii', 'replace') == b'? ? 
woof meaw' def test_ascii_level_characters_remain(self): assert codecs.encode("'", 'translit/long') == "'" def test_zero_width_space(self): try: char = codecs.encode('\u200b', 'translit/long') assert char == '' except TypeError: assert False class AlphabetTests(TestCase): def test_vietnamese(self): alphabet_upper = 'AĂÂBCDĐEÊGHIKLMNOÔƠPQRSTUƯVXY' alphabet_lower = 'aăâbcdđeêghiklmnoôơpqrstuưvxy' self.assertEqual( codecs.encode(alphabet_upper, 'transliterate'), 'AAABCDDEEGHIKLMNOOOPQRSTUUVXY' ) self.assertEqual( codecs.encode(alphabet_lower, 'transliterate'), 'aaabcddeeghiklmnooopqrstuuvxy' ) class ErrorHandlersTests(TestCase): data = 'Zażółć gęślą jaźń € ☺另!@#' page = 'ISO-8859-2' def _process(self, error_handler_name): return codecs.encode(self.data, self.page, error_handler_name).decode(self.page) def test_replace_long(self): assert self._process('replace/translit/long') == 'Zażółć gęślą jaźń EUR :-)?!@#' def test_replace_short(self): assert self._process('replace/translit/short') == 'Zażółć gęślą jaźń E :-)?!@#' def test_replace_one(self): assert self._process('replace/translit/one') == 'Zażółć gęślą jaźń E ??!@#' def test_ignore_long(self): assert self._process('ignore/translit/long') == 'Zażółć gęślą jaźń EUR :-)!@#' def test_ignore_short(self): assert self._process('ignore/translit/short') == 'Zażółć gęślą jaźń E :-)!@#' def test_ignore_one(self): assert self._process('ignore/translit/one') == 'Zażółć gęślą jaźń E !@#' def test_strict_long(self): with self.assertRaises(UnicodeEncodeError): self._process('strict/translit/long') def test_strict_short(self): with self.assertRaises(UnicodeEncodeError): self._process('strict/translit/short') def test_strict_one(self): with self.assertRaises(UnicodeEncodeError): self._process('strict/translit/one') 
translitcodec-version-0.7.0/translitcodec/000077500000000000000000000000001404555523200207025ustar00rootroot00000000000000translitcodec-version-0.7.0/translitcodec/__init__.py000066400000000000000000001130111404555523200230100ustar00rootroot00000000000000"""Unicode to 8-bit charset transliteration codec. This package contains codecs for transliterating ISO 10646 texts into best-effort representations using smaller coded character sets (ASCII, ISO 8859, etc.). The translation tables used by the codecs are from the ``transtab`` collection by Markus Kuhn. :copyright: the translitcodec authors and developers, see AUTHORS. :license: MIT, see LICENSE for more details. """ import codecs import sys import unicodedata __version_info__ = (0, 7, 0) __version__ = '.'.join(str(_) for _ in __version_info__) def long_encode(input, errors='strict'): """Transliterate to 8 bit using as many letters as needed. For example, \u00e4 LATIN SMALL LETTER A WITH DIAERESIS ``ä`` will be replaced with ``ae``. """ if not isinstance(input, str): input = str(input, sys.getdefaultencoding(), errors) length = len(input) input = unicodedata.normalize('NFKC', input) return input.translate(long_table), length def short_encode(input, errors='strict'): """Transliterate to 8 bit using as few letters as possible. For example, \u00e4 LATIN SMALL LETTER A WITH DIAERESIS ``ä`` will be replaced with ``a``. """ if not isinstance(input, str): input = str(input, sys.getdefaultencoding(), errors) length = len(input) input = unicodedata.normalize('NFKC', input) return input.translate(short_table), length def single_encode(input, errors='strict'): """Transliterate to 8 bit using only single letter replacements. For example, \u2639 WHITE FROWNING FACE ``☹`` will be passed through unchanged. 
""" if not isinstance(input, str): input = str(input, sys.getdefaultencoding(), errors) length = len(input) input = unicodedata.normalize('NFKC', input) return input.translate(single_table), length def _error_handle_base(exc, table, unknown_char_cb): if isinstance(exc, UnicodeEncodeError): char = unicodedata.normalize('NFKC', exc.object[exc.start:exc.end])[0] new_char = char.translate(table) if char == new_char: new_char = unknown_char_cb(char, new_char, exc) return new_char, exc.start + 1 else: raise exc def replace_long(exc): """Error handler for transliterate to 8 bit using as many letters as needed. For example, \u00e4 LATIN SMALL LETTER A WITH DIAERESIS ``ä`` will be replaced with ``ae``. If the character is not replaced, then the '?' character is returned. """ return _error_handle_base(exc, long_table, lambda c, n, e: '?') def replace_short(exc): """Error handler for transliterate to 8 bit using as few letters as possible. For example, \u00e4 LATIN SMALL LETTER A WITH DIAERESIS ``ä`` will be replaced with ``a``. If the character is not replaced, then the '?' character is returned. """ return _error_handle_base(exc, short_table, lambda c, n, e: '?') def replace_single(exc): """Error handler for transliterate to 8 bit using only single letter replacements. For example, \u2639 WHITE FROWNING FACE ``☹`` will be passed through unchanged. If the character is not replaced, then the '?' character is returned. """ return _error_handle_base(exc, single_table, lambda c, n, e: '?') def ignore_long(exc): """Error handler for transliterate to 8 bit using as many letters as needed. For example, \u00e4 LATIN SMALL LETTER A WITH DIAERESIS ``ä`` will be replaced with ``ae``. If the character is not replaced, then it will be skipped. """ return _error_handle_base(exc, long_table, lambda c, n, e: '') def ignore_short(exc): """Error handler for transliterate to 8 bit using as few letters as possible. 
For example, \u00e4 LATIN SMALL LETTER A WITH DIAERESIS ``ä`` will be replaced with ``a``. If the character is not replaced, then it will be skipped. """ return _error_handle_base(exc, short_table, lambda c, n, e: '') def ignore_single(exc): """Error handler for transliterate to 8 bit using only single letter replacements. For example, \u2639 WHITE FROWNING FACE ``☹`` will be passed through unchanged. If the character is not replaced, then it will be skipped. """ return _error_handle_base(exc, single_table, lambda c, n, e: '') def re_reaise(c, n, e): raise e def strict_long(exc): """Error handler for transliterate to 8 bit using as many letters as needed. For example, \u00e4 LATIN SMALL LETTER A WITH DIAERESIS ``ä`` will be replaced with ``ae``. If the character is not replaced, then an exception is thrown. """ return _error_handle_base(exc, long_table, re_reaise) def strict_short(exc): """Error handler for transliterate to 8 bit using as few letters as possible. For example, \u00e4 LATIN SMALL LETTER A WITH DIAERESIS ``ä`` will be replaced with ``a``. If the character is not replaced, then an exception is thrown. """ return _error_handle_base(exc, short_table, re_reaise) def strict_single(exc): """Error handler for transliterate to 8 bit using only single letter replacements. For example, \u2639 WHITE FROWNING FACE ``☹`` will be passed through unchanged. If the character is not replaced, then an exception is thrown. 
""" return _error_handle_base(exc, single_table, re_reaise) def no_decode(input, errors='strict'): raise TypeError("transliterating codec does not support decode.") def _double_encoding_factory(encoder, byte_encoder, byte_encoding): """Send the transliterated output to another codec.""" def dbl_encode(input, errors='strict'): uni, length = encoder(input, errors) return byte_encoder(uni, errors)[0], length dbl_encode.__name__ = '%s_%s' % (encoder.__name__, byte_encoding) return dbl_encode def trans_search(encoding): """Lookup transliterating codecs.""" if encoding == 'transliterate': return codecs.CodecInfo(long_encode, no_decode) # translit/long/utf8 # translit/one # translit/short/ascii delim = '/' if sys.version_info > (3, 9): delim = '_' if encoding.startswith('translit' + delim): parts = encoding.split(delim) if parts[1] == 'long': encoder = long_encode elif parts[1] == 'short': encoder = short_encode elif parts[1] == 'one': encoder = single_encode else: return None if len(parts) == 2: pass elif len(parts) == 3: byte_enc = parts[2] byte_encoder = codecs.lookup(byte_enc).encode encoder = _double_encoding_factory(encoder, byte_encoder, byte_enc) else: return None return codecs.CodecInfo(encoder, no_decode) return None codecs.register(trans_search) codecs.register_error('replace/translit/long', replace_long) codecs.register_error('replace/translit/short', replace_short) codecs.register_error('replace/translit/one', replace_single) codecs.register_error('ignore/translit/long', ignore_long) codecs.register_error('ignore/translit/short', ignore_short) codecs.register_error('ignore/translit/one', ignore_single) codecs.register_error('strict/translit/long', strict_long) codecs.register_error('strict/translit/short', strict_short) codecs.register_error('strict/translit/one', strict_single) ### Code below is generated by update_table.py; do not edit. 
### > long_table = { 160: ' ', 161: '!', 162: 'c', 163: 'GBP', 165: 'Y', 166: '|', 167: 'S', 168: '"', 169: '(c)', 170: 'a', 171: '<<', 172: '-', 173: '-', 174: '(R)', 175: '-', 176: ' ', 177: '+/-', 178: '^2', 179: '^3', 180: "'", 181: 'μ', 182: 'P', 183: '.', 184: ',', 185: '^1', 186: 'o', 187: '>>', 188: ' 1/4', 189: ' 1/2', 190: ' 3/4', 191: '?', 192: 'A', 193: 'A', 194: 'A', 195: 'A', 196: 'Ae', 197: 'Aa', 198: 'AE', 199: 'C', 200: 'E', 201: 'E', 202: 'E', 203: 'E', 204: 'I', 205: 'I', 206: 'I', 207: 'I', 208: 'D', 209: 'N', 210: 'O', 211: 'O', 212: 'O', 213: 'O', 214: 'Oe', 215: 'x', 216: 'O', 217: 'U', 218: 'U', 219: 'U', 220: 'Ue', 221: 'Y', 222: 'Th', 223: 'ss', 224: 'a', 225: 'a', 226: 'a', 227: 'a', 228: 'ae', 229: 'aa', 230: 'ae', 231: 'c', 232: 'e', 233: 'e', 234: 'e', 235: 'e', 236: 'i', 237: 'i', 238: 'i', 239: 'i', 240: 'd', 241: 'n', 242: 'o', 243: 'o', 244: 'o', 245: 'o', 246: 'oe', 247: ':', 248: 'o', 249: 'u', 250: 'u', 251: 'u', 252: 'ue', 253: 'y', 254: 'th', 255: 'y', 256: 'A', 257: 'a', 258: 'A', 259: 'a', 260: 'A', 261: 'a', 262: 'C', 263: 'c', 264: 'Ch', 265: 'ch', 266: 'C', 267: 'c', 268: 'C', 269: 'c', 270: 'D', 271: 'd', 272: 'D', 273: 'd', 274: 'E', 275: 'e', 276: 'E', 277: 'e', 278: 'E', 279: 'e', 280: 'E', 281: 'e', 282: 'E', 283: 'e', 284: 'Gh', 285: 'gh', 286: 'G', 287: 'g', 288: 'G', 289: 'g', 290: 'G', 291: 'g', 292: 'Hh', 293: 'hh', 294: 'H', 295: 'h', 296: 'I', 297: 'i', 298: 'I', 299: 'i', 300: 'I', 301: 'i', 302: 'I', 303: 'i', 304: 'I', 305: 'i', 306: 'IJ', 307: 'ij', 308: 'Jh', 309: 'jh', 310: 'K', 311: 'k', 312: 'k', 313: 'L', 314: 'l', 315: 'L', 316: 'l', 317: 'L', 318: 'l', 319: 'L·', 320: 'l·', 321: 'L', 322: 'l', 323: 'N', 324: 'n', 325: 'N', 326: 'n', 327: 'N', 328: 'n', 329: "'n", 330: 'NG', 331: 'ng', 332: 'O', 333: 'o', 334: 'O', 335: 'o', 336: 'O', 337: 'o', 338: 'OE', 339: 'oe', 340: 'R', 341: 'r', 342: 'R', 343: 'r', 344: 'R', 345: 'r', 346: 'S', 347: 's', 348: 'Sh', 349: 'sh', 350: 'S', 351: 's', 352: 'S', 353: 
's', 354: 'T', 355: 't', 356: 'T', 357: 't', 358: 'T', 359: 't', 360: 'U', 361: 'u', 362: 'U', 363: 'u', 364: 'U', 365: 'u', 366: 'U', 367: 'u', 368: 'U', 369: 'u', 370: 'U', 371: 'u', 372: 'W', 373: 'w', 374: 'Y', 375: 'y', 376: 'Y', 377: 'Z', 378: 'z', 379: 'Z', 380: 'z', 381: 'Z', 382: 'z', 383: 's', 402: 'f', 416: 'O', 417: 'o', 431: 'U', 432: 'u', 536: 'Ş', 537: 'ş', 538: 'Ţ', 539: 'ţ', 697: '′', 699: '‘', 700: '’', 701: '‛', 710: '^', 712: "'", 713: '¯', 716: ',', 720: ':', 730: '°', 732: '~', 733: '"', 884: "'", 885: ',', 894: ';', 7682: 'B', 7683: 'b', 7690: 'D', 7691: 'd', 7710: 'F', 7711: 'f', 7744: 'M', 7745: 'm', 7766: 'P', 7767: 'p', 7776: 'S', 7777: 's', 7786: 'T', 7787: 't', 7808: 'W', 7809: 'w', 7810: 'W', 7811: 'w', 7812: 'W', 7813: 'w', 7918: 'U', 7919: 'u', 7922: 'Y', 7923: 'y', 8192: ' ', 8193: ' ', 8194: ' ', 8195: ' ', 8196: ' ', 8197: ' ', 8198: ' ', 8199: ' ', 8200: ' ', 8201: ' ', 8202: '', 8203: '', 8204: '', 8205: '', 8206: '', 8207: '', 8208: '-', 8209: '-', 8210: '-', 8211: '-', 8212: '--', 8213: '--', 8214: '||', 8215: '_', 8216: "'", 8217: "'", 8218: "'", 8219: "'", 8220: '"', 8221: '"', 8222: '"', 8223: '"', 8224: '+', 8225: '++', 8226: 'o', 8227: '>', 8228: '.', 8229: '..', 8230: '...', 8231: '-', 8234: '', 8235: '', 8236: '', 8237: '', 8238: '', 8239: ' ', 8240: ' 0/00', 8242: "'", 8243: '"', 8244: "'''", 8245: '`', 8246: '``', 8247: '```', 8249: '<', 8250: '>', 8252: '!!', 8254: '-', 8259: '-', 8260: '/', 8264: '?!', 8265: '!?', 8266: '7', 8304: '^0', 8308: '^4', 8309: '^5', 8310: '^6', 8311: '^7', 8312: '^8', 8313: '^9', 8314: '^+', 8315: '^-', 8316: '^=', 8317: '^(', 8318: '^)', 8319: '^n', 8320: '_0', 8321: '_1', 8322: '_2', 8323: '_3', 8324: '_4', 8325: '_5', 8326: '_6', 8327: '_7', 8328: '_8', 8329: '_9', 8330: '_+', 8331: '_-', 8332: '_=', 8333: '_(', 8334: '_)', 8364: 'EUR', 8448: 'a/c', 8449: 'a/s', 8451: '°C', 8453: 'c/o', 8454: 'c/u', 8457: '°F', 8467: 'l', 8470: 'Nº', 8471: '(P)', 8480: '[SM]', 8481: 'TEL', 8482: 
'[TM]', 8486: 'Ω', 8490: 'K', 8491: 'Å', 8494: 'e', 8531: ' 1/3', 8532: ' 2/3', 8533: ' 1/5', 8534: ' 2/5', 8535: ' 3/5', 8536: ' 4/5', 8537: ' 1/6', 8538: ' 5/6', 8539: ' 1/8', 8540: ' 3/8', 8541: ' 5/8', 8542: ' 7/8', 8543: ' 1/', 8544: 'I', 8545: 'II', 8546: 'III', 8547: 'IV', 8548: 'V', 8549: 'VI', 8550: 'VII', 8551: 'VIII', 8552: 'IX', 8553: 'X', 8554: 'XI', 8555: 'XII', 8556: 'L', 8557: 'C', 8558: 'D', 8559: 'M', 8560: 'i', 8561: 'ii', 8562: 'iii', 8563: 'iv', 8564: 'v', 8565: 'vi', 8566: 'vii', 8567: 'viii', 8568: 'ix', 8569: 'x', 8570: 'xi', 8571: 'xii', 8572: 'l', 8573: 'c', 8574: 'd', 8575: 'm', 8592: '<-', 8593: '^', 8594: '->', 8595: 'v', 8596: '<->', 8656: '<=', 8658: '=>', 8660: '<=>', 8722: '–', 8725: '/', 8726: '\\', 8727: '*', 8728: 'o', 8729: '·', 8734: 'inf', 8739: '|', 8741: '||', 8758: ':', 8764: '~', 8800: '/=', 8801: '=', 8804: '<=', 8805: '>=', 8810: '<<', 8811: '>>', 8853: '(+)', 8854: '(-)', 8855: '(x)', 8856: '(/)', 8866: '|-', 8867: '-|', 8870: '|-', 8871: '|=', 8872: '|=', 8873: '||-', 8901: '·', 8902: '*', 8917: '#', 8920: '<<<', 8921: '>>>', 8943: '...', 9001: '<', 9002: '>', 9216: 'NUL', 9217: 'SOH', 9218: 'STX', 9219: 'ETX', 9220: 'EOT', 9221: 'ENQ', 9222: 'ACK', 9223: 'BEL', 9224: 'BS', 9225: 'HT', 9226: 'LF', 9227: 'VT', 9228: 'FF', 9229: 'CR', 9230: 'SO', 9231: 'SI', 9232: 'DLE', 9233: 'DC1', 9234: 'DC2', 9235: 'DC3', 9236: 'DC4', 9237: 'NAK', 9238: 'SYN', 9239: 'ETB', 9240: 'CAN', 9241: 'EM', 9242: 'SUB', 9243: 'ESC', 9244: 'FS', 9245: 'GS', 9246: 'RS', 9247: 'US', 9248: 'SP', 9249: 'DEL', 9251: '_', 9252: 'NL', 9253: '///', 9254: '?', 9312: '(1)', 9313: '(2)', 9314: '(3)', 9315: '(4)', 9316: '(5)', 9317: '(6)', 9318: '(7)', 9319: '(8)', 9320: '(9)', 9321: '(10)', 9322: '(11)', 9323: '(12)', 9324: '(13)', 9325: '(14)', 9326: '(15)', 9327: '(16)', 9328: '(17)', 9329: '(18)', 9330: '(19)', 9331: '(20)', 9332: '(1)', 9333: '(2)', 9334: '(3)', 9335: '(4)', 9336: '(5)', 9337: '(6)', 9338: '(7)', 9339: '(8)', 9340: '(9)', 9341: 
'(10)', 9342: '(11)', 9343: '(12)', 9344: '(13)', 9345: '(14)', 9346: '(15)', 9347: '(16)', 9348: '(17)', 9349: '(18)', 9350: '(19)', 9351: '(20)', 9352: '1.', 9353: '2.', 9354: '3.', 9355: '4.', 9356: '5.', 9357: '6.', 9358: '7.', 9359: '8.', 9360: '9.', 9361: '10.', 9362: '11.', 9363: '12.', 9364: '13.', 9365: '14.', 9366: '15.', 9367: '16.', 9368: '17.', 9369: '18.', 9370: '19.', 9371: '20.', 9372: '(a)', 9373: '(b)', 9374: '(c)', 9375: '(d)', 9376: '(e)', 9377: '(f)', 9378: '(g)', 9379: '(h)', 9380: '(i)', 9381: '(j)', 9382: '(k)', 9383: '(l)', 9384: '(m)', 9385: '(n)', 9386: '(o)', 9387: '(p)', 9388: '(q)', 9389: '(r)', 9390: '(s)', 9391: '(t)', 9392: '(u)', 9393: '(v)', 9394: '(w)', 9395: '(x)', 9396: '(y)', 9397: '(z)', 9398: '(A)', 9399: '(B)', 9400: '(C)', 9401: '(D)', 9402: '(E)', 9403: '(F)', 9404: '(G)', 9405: '(H)', 9406: '(I)', 9407: '(J)', 9408: '(K)', 9409: '(L)', 9410: '(M)', 9411: '(N)', 9412: '(O)', 9413: '(P)', 9414: '(Q)', 9415: '(R)', 9416: '(S)', 9417: '(T)', 9418: '(U)', 9419: '(V)', 9420: '(W)', 9421: '(X)', 9422: '(Y)', 9423: '(Z)', 9424: '(a)', 9425: '(b)', 9426: '(c)', 9427: '(d)', 9428: '(e)', 9429: '(f)', 9430: '(g)', 9431: '(h)', 9432: '(i)', 9433: '(j)', 9434: '(k)', 9435: '(l)', 9436: '(m)', 9437: '(n)', 9438: '(o)', 9439: '(p)', 9440: '(q)', 9441: '(r)', 9442: '(s)', 9443: '(t)', 9444: '(u)', 9445: '(v)', 9446: '(w)', 9447: '(x)', 9448: '(y)', 9449: '(z)', 9450: '(0)', 9472: '-', 9473: '=', 9474: '|', 9475: '|', 9476: '-', 9477: '=', 9478: '|', 9479: '|', 9480: '-', 9481: '=', 9482: '|', 9483: '|', 9484: '+', 9485: '+', 9486: '+', 9487: '+', 9488: '+', 9489: '+', 9490: '+', 9491: '+', 9492: '+', 9493: '+', 9494: '+', 9495: '+', 9496: '+', 9497: '+', 9498: '+', 9499: '+', 9500: '+', 9501: '+', 9502: '+', 9503: '+', 9504: '+', 9505: '+', 9506: '+', 9507: '+', 9508: '+', 9509: '+', 9510: '+', 9511: '+', 9512: '+', 9513: '+', 9514: '+', 9515: '+', 9516: '+', 9517: '+', 9518: '+', 9519: '+', 9520: '+', 9521: '+', 9522: '+', 9523: '+', 
9524: '+', 9525: '+', 9526: '+', 9527: '+', 9528: '+', 9529: '+', 9530: '+', 9531: '+', 9532: '+', 9533: '+', 9534: '+', 9535: '+', 9536: '+', 9537: '+', 9538: '+', 9539: '+', 9540: '+', 9541: '+', 9542: '+', 9543: '+', 9544: '+', 9545: '+', 9546: '+', 9547: '+', 9548: '-', 9549: '=', 9550: '|', 9551: '|', 9552: '=', 9553: '|', 9554: '+', 9555: '+', 9556: '+', 9557: '+', 9558: '+', 9559: '+', 9560: '+', 9561: '+', 9562: '+', 9563: '+', 9564: '+', 9565: '+', 9566: '+', 9567: '+', 9568: '+', 9569: '+', 9570: '+', 9571: '+', 9572: '+', 9573: '+', 9574: '+', 9575: '+', 9576: '+', 9577: '+', 9578: '+', 9579: '+', 9580: '+', 9581: '+', 9582: '+', 9583: '+', 9584: '+', 9585: '/', 9586: '\\', 9587: 'X', 9596: '-', 9597: '|', 9598: '-', 9599: '|', 9675: 'o', 9702: 'o', 9733: '*', 9734: '*', 9746: 'X', 9747: 'X', 9785: ':-(', 9786: ':-)', 9787: '(-:', 9837: 'b', 9839: '#', 9985: '%<', 9986: '%<', 9987: '%<', 9988: '%<', 9996: 'V', 10003: '√', 10004: '√', 10005: 'x', 10006: 'x', 10007: 'X', 10008: 'X', 10009: '+', 10010: '+', 10011: '+', 10012: '+', 10013: '+', 10014: '+', 10015: '+', 10016: '+', 10017: '*', 10018: '+', 10019: '+', 10020: '+', 10021: '+', 10022: '+', 10023: '+', 10025: '*', 10026: '*', 10027: '*', 10028: '*', 10029: '*', 10030: '*', 10031: '*', 10032: '*', 10033: '*', 10034: '*', 10035: '*', 10036: '*', 10037: '*', 10038: '*', 10039: '*', 10040: '*', 10041: '*', 10042: '*', 10043: '*', 10044: '*', 10045: '*', 10046: '*', 10047: '*', 10048: '*', 10049: '*', 10050: '*', 10051: '*', 10052: '*', 10053: '*', 10054: '*', 10055: '*', 10056: '*', 10057: '*', 10058: '*', 10059: '*', 64256: 'ff', 64257: 'fi', 64258: 'fl', 64259: 'ffi', 64260: 'ffl', 64261: 'ſt', 64262: 'st', 65279: '', 65533: '?', } short_table = { 160: ' ', 161: '!', 162: 'c', 163: 'GBP', 165: 'Y', 166: '|', 167: 'S', 168: '"', 169: 'c', 170: 'a', 171: '<<', 172: '-', 173: '-', 174: '(R)', 175: '-', 176: ' ', 177: '+/-', 178: '2', 179: '3', 180: "'", 181: 'u', 182: 'P', 183: '.', 184: ',', 185: '1', 
186: 'o', 187: '>>', 188: ' 1/4', 189: ' 1/2', 190: ' 3/4', 191: '?', 192: 'A', 193: 'A', 194: 'A', 195: 'A', 196: 'A', 197: 'A', 198: 'A', 199: 'C', 200: 'E', 201: 'E', 202: 'E', 203: 'E', 204: 'I', 205: 'I', 206: 'I', 207: 'I', 208: 'D', 209: 'N', 210: 'O', 211: 'O', 212: 'O', 213: 'O', 214: 'O', 215: 'x', 216: 'O', 217: 'U', 218: 'U', 219: 'U', 220: 'U', 221: 'Y', 222: 'Th', 223: 'ss', 224: 'a', 225: 'a', 226: 'a', 227: 'a', 228: 'a', 229: 'a', 230: 'a', 231: 'c', 232: 'e', 233: 'e', 234: 'e', 235: 'e', 236: 'i', 237: 'i', 238: 'i', 239: 'i', 240: 'd', 241: 'n', 242: 'o', 243: 'o', 244: 'o', 245: 'o', 246: 'o', 247: ':', 248: 'o', 249: 'u', 250: 'u', 251: 'u', 252: 'u', 253: 'y', 254: 'th', 255: 'y', 256: 'A', 257: 'a', 258: 'A', 259: 'a', 260: 'A', 261: 'a', 262: 'C', 263: 'c', 264: 'C', 265: 'c', 266: 'C', 267: 'c', 268: 'C', 269: 'c', 270: 'D', 271: 'd', 272: 'D', 273: 'd', 274: 'E', 275: 'e', 276: 'E', 277: 'e', 278: 'E', 279: 'e', 280: 'E', 281: 'e', 282: 'E', 283: 'e', 284: 'G', 285: 'g', 286: 'G', 287: 'g', 288: 'G', 289: 'g', 290: 'G', 291: 'g', 292: 'H', 293: 'h', 294: 'H', 295: 'h', 296: 'I', 297: 'i', 298: 'I', 299: 'i', 300: 'I', 301: 'i', 302: 'I', 303: 'i', 304: 'I', 305: 'i', 306: 'IJ', 307: 'ij', 308: 'J', 309: 'j', 310: 'K', 311: 'k', 312: 'k', 313: 'L', 314: 'l', 315: 'L', 316: 'l', 317: 'L', 318: 'l', 319: 'L.', 320: 'l.', 321: 'L', 322: 'l', 323: 'N', 324: 'n', 325: 'N', 326: 'n', 327: 'N', 328: 'n', 329: "'n", 330: 'N', 331: 'n', 332: 'O', 333: 'o', 334: 'O', 335: 'o', 336: 'O', 337: 'o', 338: 'OE', 339: 'oe', 340: 'R', 341: 'r', 342: 'R', 343: 'r', 344: 'R', 345: 'r', 346: 'S', 347: 's', 348: 'S', 349: 's', 350: 'S', 351: 's', 352: 'S', 353: 's', 354: 'T', 355: 't', 356: 'T', 357: 't', 358: 'T', 359: 't', 360: 'U', 361: 'u', 362: 'U', 363: 'u', 364: 'U', 365: 'u', 366: 'U', 367: 'u', 368: 'U', 369: 'u', 370: 'U', 371: 'u', 372: 'W', 373: 'w', 374: 'Y', 375: 'y', 376: 'Y', 377: 'Z', 378: 'z', 379: 'Z', 380: 'z', 381: 'Z', 382: 'z', 383: 's', 
402: 'f', 416: 'O', 417: 'o', 431: 'U', 432: 'u', 536: 'S', 537: 's', 538: 'T', 539: 't', 697: "'", 699: '‘', 700: "'", 701: '‛', 710: '^', 712: "'", 713: '¯', 716: ',', 720: ':', 730: '°', 732: '~', 733: '"', 884: "'", 885: ',', 894: ';', 7682: 'B', 7683: 'b', 7690: 'D', 7691: 'd', 7710: 'F', 7711: 'f', 7744: 'M', 7745: 'm', 7766: 'P', 7767: 'p', 7776: 'S', 7777: 's', 7786: 'T', 7787: 't', 7808: 'W', 7809: 'w', 7810: 'W', 7811: 'w', 7812: 'W', 7813: 'w', 7918: 'U', 7919: 'u', 7922: 'Y', 7923: 'y', 8192: ' ', 8193: ' ', 8194: ' ', 8195: ' ', 8196: ' ', 8197: ' ', 8198: ' ', 8199: ' ', 8200: ' ', 8201: ' ', 8202: '', 8203: '', 8204: '', 8205: '', 8206: '', 8207: '', 8208: '-', 8209: '-', 8210: '-', 8211: '-', 8212: '--', 8213: '--', 8214: '||', 8215: '_', 8216: "'", 8217: "'", 8218: "'", 8219: "'", 8220: '"', 8221: '"', 8222: '"', 8223: '"', 8224: '+', 8225: '++', 8226: 'o', 8227: '>', 8228: '.', 8229: '..', 8230: '...', 8231: '-', 8234: '', 8235: '', 8236: '', 8237: '', 8238: '', 8239: ' ', 8240: ' 0/00', 8242: "'", 8243: '"', 8244: "'''", 8245: '`', 8246: '``', 8247: '```', 8249: '<', 8250: '>', 8252: '!!', 8254: '-', 8259: '-', 8260: '/', 8264: '?!', 8265: '!?', 8266: '7', 8304: '0', 8308: '4', 8309: '5', 8310: '6', 8311: '7', 8312: '8', 8313: '9', 8314: '+', 8315: '-', 8316: '=', 8317: '(', 8318: ')', 8319: 'n', 8320: '0', 8321: '1', 8322: '2', 8323: '3', 8324: '4', 8325: '5', 8326: '6', 8327: '7', 8328: '8', 8329: '9', 8330: '+', 8331: '-', 8332: '=', 8333: '(', 8334: ')', 8364: 'E', 8448: 'a/c', 8449: 'a/s', 8451: 'C', 8453: 'c/o', 8454: 'c/u', 8457: 'F', 8467: 'l', 8470: 'No', 8471: '(P)', 8480: '[SM]', 8481: 'TEL', 8482: '[TM]', 8486: 'ohm', 8490: 'K', 8491: 'Å', 8494: 'e', 8531: ' 1/3', 8532: ' 2/3', 8533: ' 1/5', 8534: ' 2/5', 8535: ' 3/5', 8536: ' 4/5', 8537: ' 1/6', 8538: ' 5/6', 8539: ' 1/8', 8540: ' 3/8', 8541: ' 5/8', 8542: ' 7/8', 8543: ' 1/', 8544: 'I', 8545: 'II', 8546: 'III', 8547: 'IV', 8548: 'V', 8549: 'VI', 8550: 'VII', 8551: 'VIII', 8552: 
'IX', 8553: 'X', 8554: 'XI', 8555: 'XII', 8556: 'L', 8557: 'C', 8558: 'D', 8559: 'M', 8560: 'i', 8561: 'ii', 8562: 'iii', 8563: 'iv', 8564: 'v', 8565: 'vi', 8566: 'vii', 8567: 'viii', 8568: 'ix', 8569: 'x', 8570: 'xi', 8571: 'xii', 8572: 'l', 8573: 'c', 8574: 'd', 8575: 'm', 8592: '<-', 8593: '^', 8594: '->', 8595: 'v', 8596: '<->', 8656: '<=', 8658: '=>', 8660: '<=>', 8722: '-', 8725: '/', 8726: '\\', 8727: '*', 8728: 'o', 8729: '·', 8734: 'inf', 8739: '|', 8741: '||', 8758: ':', 8764: '~', 8800: '/=', 8801: '=', 8804: '<=', 8805: '>=', 8810: '<<', 8811: '>>', 8853: '(+)', 8854: '(-)', 8855: '(x)', 8856: '(/)', 8866: '|-', 8867: '-|', 8870: '|-', 8871: '|=', 8872: '|=', 8873: '||-', 8901: '·', 8902: '*', 8917: '#', 8920: '<<<', 8921: '>>>', 8943: '...', 9001: '<', 9002: '>', 9216: 'NUL', 9217: 'SOH', 9218: 'STX', 9219: 'ETX', 9220: 'EOT', 9221: 'ENQ', 9222: 'ACK', 9223: 'BEL', 9224: 'BS', 9225: 'HT', 9226: 'LF', 9227: 'VT', 9228: 'FF', 9229: 'CR', 9230: 'SO', 9231: 'SI', 9232: 'DLE', 9233: 'DC1', 9234: 'DC2', 9235: 'DC3', 9236: 'DC4', 9237: 'NAK', 9238: 'SYN', 9239: 'ETB', 9240: 'CAN', 9241: 'EM', 9242: 'SUB', 9243: 'ESC', 9244: 'FS', 9245: 'GS', 9246: 'RS', 9247: 'US', 9248: 'SP', 9249: 'DEL', 9251: '_', 9252: 'NL', 9253: '///', 9254: '?', 9312: '1', 9313: '2', 9314: '3', 9315: '4', 9316: '5', 9317: '6', 9318: '7', 9319: '8', 9320: '9', 9321: '(10)', 9322: '(11)', 9323: '(12)', 9324: '(13)', 9325: '(14)', 9326: '(15)', 9327: '(16)', 9328: '(17)', 9329: '(18)', 9330: '(19)', 9331: '(20)', 9332: '1', 9333: '2', 9334: '3', 9335: '4', 9336: '5', 9337: '6', 9338: '7', 9339: '8', 9340: '9', 9341: '(10)', 9342: '(11)', 9343: '(12)', 9344: '(13)', 9345: '(14)', 9346: '(15)', 9347: '(16)', 9348: '(17)', 9349: '(18)', 9350: '(19)', 9351: '(20)', 9352: '1', 9353: '2', 9354: '3', 9355: '4', 9356: '5', 9357: '6', 9358: '7', 9359: '8', 9360: '9', 9361: '10.', 9362: '11.', 9363: '12.', 9364: '13.', 9365: '14.', 9366: '15.', 9367: '16.', 9368: '17.', 9369: '18.', 9370: '19.', 
9371: '20.', 9372: 'a', 9373: 'b', 9374: 'c', 9375: 'd', 9376: 'e', 9377: 'f', 9378: 'g', 9379: 'h', 9380: 'i', 9381: 'j', 9382: 'k', 9383: 'l', 9384: 'm', 9385: 'n', 9386: 'o', 9387: 'p', 9388: 'q', 9389: 'r', 9390: 's', 9391: 't', 9392: 'u', 9393: 'v', 9394: 'w', 9395: 'x', 9396: 'y', 9397: 'z', 9398: 'A', 9399: 'B', 9400: 'C', 9401: 'D', 9402: 'E', 9403: 'F', 9404: 'G', 9405: 'H', 9406: 'I', 9407: 'J', 9408: 'K', 9409: 'L', 9410: 'M', 9411: 'N', 9412: 'O', 9413: 'P', 9414: 'Q', 9415: 'R', 9416: 'S', 9417: 'T', 9418: 'U', 9419: 'V', 9420: 'W', 9421: 'X', 9422: 'Y', 9423: 'Z', 9424: 'a', 9425: 'b', 9426: 'c', 9427: 'd', 9428: 'e', 9429: 'f', 9430: 'g', 9431: 'h', 9432: 'i', 9433: 'j', 9434: 'k', 9435: 'l', 9436: 'm', 9437: 'n', 9438: 'o', 9439: 'p', 9440: 'q', 9441: 'r', 9442: 's', 9443: 't', 9444: 'u', 9445: 'v', 9446: 'w', 9447: 'x', 9448: 'y', 9449: 'z', 9450: '0', 9472: '-', 9473: '=', 9474: '|', 9475: '|', 9476: '-', 9477: '=', 9478: '|', 9479: '|', 9480: '-', 9481: '=', 9482: '|', 9483: '|', 9484: '+', 9485: '+', 9486: '+', 9487: '+', 9488: '+', 9489: '+', 9490: '+', 9491: '+', 9492: '+', 9493: '+', 9494: '+', 9495: '+', 9496: '+', 9497: '+', 9498: '+', 9499: '+', 9500: '+', 9501: '+', 9502: '+', 9503: '+', 9504: '+', 9505: '+', 9506: '+', 9507: '+', 9508: '+', 9509: '+', 9510: '+', 9511: '+', 9512: '+', 9513: '+', 9514: '+', 9515: '+', 9516: '+', 9517: '+', 9518: '+', 9519: '+', 9520: '+', 9521: '+', 9522: '+', 9523: '+', 9524: '+', 9525: '+', 9526: '+', 9527: '+', 9528: '+', 9529: '+', 9530: '+', 9531: '+', 9532: '+', 9533: '+', 9534: '+', 9535: '+', 9536: '+', 9537: '+', 9538: '+', 9539: '+', 9540: '+', 9541: '+', 9542: '+', 9543: '+', 9544: '+', 9545: '+', 9546: '+', 9547: '+', 9548: '-', 9549: '=', 9550: '|', 9551: '|', 9552: '=', 9553: '|', 9554: '+', 9555: '+', 9556: '+', 9557: '+', 9558: '+', 9559: '+', 9560: '+', 9561: '+', 9562: '+', 9563: '+', 9564: '+', 9565: '+', 9566: '+', 9567: '+', 9568: '+', 9569: '+', 9570: '+', 9571: '+', 9572: '+', 9573: 
'+', 9574: '+', 9575: '+', 9576: '+', 9577: '+', 9578: '+', 9579: '+', 9580: '+', 9581: '+', 9582: '+', 9583: '+', 9584: '+', 9585: '/', 9586: '\\', 9587: 'X', 9596: '-', 9597: '|', 9598: '-', 9599: '|', 9675: 'o', 9702: 'o', 9733: '*', 9734: '*', 9746: 'X', 9747: 'X', 9785: ':-(', 9786: ':-)', 9787: '(-:', 9837: 'b', 9839: '#', 9985: '%<', 9986: '%<', 9987: '%<', 9988: '%<', 9996: 'V', 10003: '√', 10004: '√', 10005: 'x', 10006: 'x', 10007: 'X', 10008: 'X', 10009: '+', 10010: '+', 10011: '+', 10012: '+', 10013: '+', 10014: '+', 10015: '+', 10016: '+', 10017: '*', 10018: '+', 10019: '+', 10020: '+', 10021: '+', 10022: '+', 10023: '+', 10025: '*', 10026: '*', 10027: '*', 10028: '*', 10029: '*', 10030: '*', 10031: '*', 10032: '*', 10033: '*', 10034: '*', 10035: '*', 10036: '*', 10037: '*', 10038: '*', 10039: '*', 10040: '*', 10041: '*', 10042: '*', 10043: '*', 10044: '*', 10045: '*', 10046: '*', 10047: '*', 10048: '*', 10049: '*', 10050: '*', 10051: '*', 10052: '*', 10053: '*', 10054: '*', 10055: '*', 10056: '*', 10057: '*', 10058: '*', 10059: '*', 64256: 'ff', 64257: 'fi', 64258: 'fl', 64259: 'ffi', 64260: 'ffl', 64261: 'st', 64262: 'st', 65279: '', 65533: '?', } single_table = { 160: ' ', 161: '!', 162: 'c', 165: 'Y', 166: '|', 167: 'S', 168: '"', 169: 'c', 170: 'a', 172: '-', 173: '-', 175: '-', 176: ' ', 178: '2', 179: '3', 180: "'", 181: 'u', 182: 'P', 183: '.', 184: ',', 185: '1', 186: 'o', 191: '?', 192: 'A', 193: 'A', 194: 'A', 195: 'A', 196: 'A', 197: 'A', 198: 'A', 199: 'C', 200: 'E', 201: 'E', 202: 'E', 203: 'E', 204: 'I', 205: 'I', 206: 'I', 207: 'I', 208: 'D', 209: 'N', 210: 'O', 211: 'O', 212: 'O', 213: 'O', 214: 'O', 215: 'x', 216: 'O', 217: 'U', 218: 'U', 219: 'U', 220: 'U', 221: 'Y', 223: 's', 224: 'a', 225: 'a', 226: 'a', 227: 'a', 228: 'a', 229: 'a', 230: 'a', 231: 'c', 232: 'e', 233: 'e', 234: 'e', 235: 'e', 236: 'i', 237: 'i', 238: 'i', 239: 'i', 240: 'd', 241: 'n', 242: 'o', 243: 'o', 244: 'o', 245: 'o', 246: 'o', 247: ':', 248: 'o', 249: 'u', 
250: 'u', 251: 'u', 252: 'u', 253: 'y', 255: 'y', 256: 'A', 257: 'a', 258: 'A', 259: 'a', 260: 'A', 261: 'a', 262: 'C', 263: 'c', 264: 'C', 265: 'c', 266: 'C', 267: 'c', 268: 'C', 269: 'c', 270: 'D', 271: 'd', 272: 'D', 273: 'd', 274: 'E', 275: 'e', 276: 'E', 277: 'e', 278: 'E', 279: 'e', 280: 'E', 281: 'e', 282: 'E', 283: 'e', 284: 'G', 285: 'g', 286: 'G', 287: 'g', 288: 'G', 289: 'g', 290: 'G', 291: 'g', 292: 'H', 293: 'h', 294: 'H', 295: 'h', 296: 'I', 297: 'i', 298: 'I', 299: 'i', 300: 'I', 301: 'i', 302: 'I', 303: 'i', 304: 'I', 305: 'i', 308: 'J', 309: 'j', 310: 'K', 311: 'k', 312: 'k', 313: 'L', 314: 'l', 315: 'L', 316: 'l', 317: 'L', 318: 'l', 321: 'L', 322: 'l', 323: 'N', 324: 'n', 325: 'N', 326: 'n', 327: 'N', 328: 'n', 330: 'N', 331: 'n', 332: 'O', 333: 'o', 334: 'O', 335: 'o', 336: 'O', 337: 'o', 340: 'R', 341: 'r', 342: 'R', 343: 'r', 344: 'R', 345: 'r', 346: 'S', 347: 's', 348: 'S', 349: 's', 350: 'S', 351: 's', 352: 'S', 353: 's', 354: 'T', 355: 't', 356: 'T', 357: 't', 358: 'T', 359: 't', 360: 'U', 361: 'u', 362: 'U', 363: 'u', 364: 'U', 365: 'u', 366: 'U', 367: 'u', 368: 'U', 369: 'u', 370: 'U', 371: 'u', 372: 'W', 373: 'w', 374: 'Y', 375: 'y', 376: 'Y', 377: 'Z', 378: 'z', 379: 'Z', 380: 'z', 381: 'Z', 382: 'z', 383: 's', 402: 'f', 416: 'O', 417: 'o', 431: 'U', 432: 'u', 536: 'S', 537: 's', 538: 'T', 539: 't', 697: "'", 699: '‘', 700: "'", 701: '‛', 710: '^', 712: "'", 713: '¯', 716: ',', 720: ':', 730: '°', 732: '~', 733: '"', 884: "'", 885: ',', 894: ';', 7682: 'B', 7683: 'b', 7690: 'D', 7691: 'd', 7710: 'F', 7711: 'f', 7744: 'M', 7745: 'm', 7766: 'P', 7767: 'p', 7776: 'S', 7777: 's', 7786: 'T', 7787: 't', 7808: 'W', 7809: 'w', 7810: 'W', 7811: 'w', 7812: 'W', 7813: 'w', 7918: 'U', 7919: 'u', 7922: 'Y', 7923: 'y', 8192: ' ', 8194: ' ', 8196: ' ', 8197: ' ', 8198: ' ', 8199: ' ', 8200: ' ', 8201: ' ', 8208: '-', 8209: '-', 8210: '-', 8211: '-', 8215: '_', 8216: "'", 8217: "'", 8218: "'", 8219: "'", 8220: '"', 8221: '"', 8222: '"', 8223: '"', 
8224: '+', 8226: 'o', 8227: '>', 8228: '.', 8231: '-', 8239: ' ', 8242: "'", 8243: '"', 8245: '`', 8249: '<', 8250: '>', 8254: '-', 8259: '-', 8260: '/', 8266: '7', 8304: '0', 8308: '4', 8309: '5', 8310: '6', 8311: '7', 8312: '8', 8313: '9', 8314: '+', 8315: '-', 8316: '=', 8317: '(', 8318: ')', 8319: 'n', 8320: '0', 8321: '1', 8322: '2', 8323: '3', 8324: '4', 8325: '5', 8326: '6', 8327: '7', 8328: '8', 8329: '9', 8330: '+', 8331: '-', 8332: '=', 8333: '(', 8334: ')', 8364: 'E', 8451: 'C', 8457: 'F', 8467: 'l', 8490: 'K', 8491: 'Å', 8494: 'e', 8544: 'I', 8548: 'V', 8553: 'X', 8556: 'L', 8557: 'C', 8558: 'D', 8559: 'M', 8560: 'i', 8564: 'v', 8569: 'x', 8572: 'l', 8573: 'c', 8574: 'd', 8575: 'm', 8593: '^', 8595: 'v', 8722: '-', 8725: '/', 8726: '\\', 8727: '*', 8728: 'o', 8729: '·', 8739: '|', 8758: ':', 8764: '~', 8801: '=', 8901: '·', 8902: '*', 8917: '#', 9001: '<', 9002: '>', 9251: '_', 9254: '?', 9312: '1', 9313: '2', 9314: '3', 9315: '4', 9316: '5', 9317: '6', 9318: '7', 9319: '8', 9320: '9', 9332: '1', 9333: '2', 9334: '3', 9335: '4', 9336: '5', 9337: '6', 9338: '7', 9339: '8', 9340: '9', 9352: '1', 9353: '2', 9354: '3', 9355: '4', 9356: '5', 9357: '6', 9358: '7', 9359: '8', 9360: '9', 9372: 'a', 9373: 'b', 9374: 'c', 9375: 'd', 9376: 'e', 9377: 'f', 9378: 'g', 9379: 'h', 9380: 'i', 9381: 'j', 9382: 'k', 9383: 'l', 9384: 'm', 9385: 'n', 9386: 'o', 9387: 'p', 9388: 'q', 9389: 'r', 9390: 's', 9391: 't', 9392: 'u', 9393: 'v', 9394: 'w', 9395: 'x', 9396: 'y', 9397: 'z', 9398: 'A', 9399: 'B', 9400: 'C', 9401: 'D', 9402: 'E', 9403: 'F', 9404: 'G', 9405: 'H', 9406: 'I', 9407: 'J', 9408: 'K', 9409: 'L', 9410: 'M', 9411: 'N', 9412: 'O', 9413: 'P', 9414: 'Q', 9415: 'R', 9416: 'S', 9417: 'T', 9418: 'U', 9419: 'V', 9420: 'W', 9421: 'X', 9422: 'Y', 9423: 'Z', 9424: 'a', 9425: 'b', 9426: 'c', 9427: 'd', 9428: 'e', 9429: 'f', 9430: 'g', 9431: 'h', 9432: 'i', 9433: 'j', 9434: 'k', 9435: 'l', 9436: 'm', 9437: 'n', 9438: 'o', 9439: 'p', 9440: 'q', 9441: 'r', 9442: 's', 9443: 
't', 9444: 'u', 9445: 'v', 9446: 'w', 9447: 'x', 9448: 'y', 9449: 'z', 9450: '0', 9472: '-', 9473: '=', 9474: '|', 9475: '|', 9476: '-', 9477: '=', 9478: '|', 9479: '|', 9480: '-', 9481: '=', 9482: '|', 9483: '|', 9484: '+', 9485: '+', 9486: '+', 9487: '+', 9488: '+', 9489: '+', 9490: '+', 9491: '+', 9492: '+', 9493: '+', 9494: '+', 9495: '+', 9496: '+', 9497: '+', 9498: '+', 9499: '+', 9500: '+', 9501: '+', 9502: '+', 9503: '+', 9504: '+', 9505: '+', 9506: '+', 9507: '+', 9508: '+', 9509: '+', 9510: '+', 9511: '+', 9512: '+', 9513: '+', 9514: '+', 9515: '+', 9516: '+', 9517: '+', 9518: '+', 9519: '+', 9520: '+', 9521: '+', 9522: '+', 9523: '+', 9524: '+', 9525: '+', 9526: '+', 9527: '+', 9528: '+', 9529: '+', 9530: '+', 9531: '+', 9532: '+', 9533: '+', 9534: '+', 9535: '+', 9536: '+', 9537: '+', 9538: '+', 9539: '+', 9540: '+', 9541: '+', 9542: '+', 9543: '+', 9544: '+', 9545: '+', 9546: '+', 9547: '+', 9548: '-', 9549: '=', 9550: '|', 9551: '|', 9552: '=', 9553: '|', 9554: '+', 9555: '+', 9556: '+', 9557: '+', 9558: '+', 9559: '+', 9560: '+', 9561: '+', 9562: '+', 9563: '+', 9564: '+', 9565: '+', 9566: '+', 9567: '+', 9568: '+', 9569: '+', 9570: '+', 9571: '+', 9572: '+', 9573: '+', 9574: '+', 9575: '+', 9576: '+', 9577: '+', 9578: '+', 9579: '+', 9580: '+', 9581: '+', 9582: '+', 9583: '+', 9584: '+', 9585: '/', 9586: '\\', 9587: 'X', 9596: '-', 9597: '|', 9598: '-', 9599: '|', 9675: 'o', 9702: 'o', 9733: '*', 9734: '*', 9746: 'X', 9747: 'X', 9837: 'b', 9839: '#', 9996: 'V', 10003: '√', 10004: '√', 10005: 'x', 10006: 'x', 10007: 'X', 10008: 'X', 10009: '+', 10010: '+', 10011: '+', 10012: '+', 10013: '+', 10014: '+', 10015: '+', 10016: '+', 10017: '*', 10018: '+', 10019: '+', 10020: '+', 10021: '+', 10022: '+', 10023: '+', 10025: '*', 10026: '*', 10027: '*', 10028: '*', 10029: '*', 10030: '*', 10031: '*', 10032: '*', 10033: '*', 10034: '*', 10035: '*', 10036: '*', 10037: '*', 10038: '*', 10039: '*', 10040: '*', 10041: '*', 10042: '*', 10043: '*', 10044: '*', 
10045: '*', 10046: '*', 10047: '*', 10048: '*', 10049: '*', 10050: '*', 10051: '*', 10052: '*', 10053: '*', 10054: '*', 10055: '*', 10056: '*', 10057: '*', 10058: '*', 10059: '*', 65533: '?', } ### < translitcodec-version-0.7.0/transtab/000077500000000000000000000000001404555523200176625ustar00rootroot00000000000000translitcodec-version-0.7.0/transtab/Makefile000077500000000000000000000017271404555523200213340ustar00rootroot00000000000000TARGETS=transtab transtab.repertoire transtab.missing-MES-2 transtab.changes all: $(TARGETS) # transtab.utf is the file that should be edited transtab: transtab.utf format=iso ./transcomp $< >$@ format=isoutf ./transcomp transtab >transtab.utf transtab.repertoire: transtab format=utf ./transcomp transtab >$@ transtab.missing-MES-2: transtab.repertoire uniset + ../MES-2 - transtab.repertoire - 0000-007f clean table | \ format=isoutf ./transcomp - >$@ transtab.missing-TARGET1: transtab.repertoire uniset + ../../font/ucs-fonts/TARGET1 - transtab.repertoire \ - 0000-007f clean table | \ format=isoutf ./transcomp - >$@ transtab.changes: transtab.utf rlog $< >$@ distribution: $(TARGETS) ci -l transtab.utf cd .. ; tar cvf transtab.tar \ transtab/README transtab/REFERENCES transtab/Makefile \ transtab/transcomp \ transtab/transtab.utf $(TARGETS:%=transtab/%) ; \ gzip -9f transtab.tar ; \ mv transtab.tar.gz $(HOME)/.www/download/ clean: rm -f *~ translitcodec-version-0.7.0/transtab/README000066400000000000000000000067511404555523200205530ustar00rootroot00000000000000 Unicode to 8-bit charset transliteration table ---------------------------------------------- Markus Kuhn -- 2000-10-09 This package contains a table for transliterating ISO 10646 texts into best-effort representations using smaller coded character sets (ASCII, ISO 8859, etc.). It is primarily intended for inclusion into the GNU C library, but might be of use for other applications as well. The table is freely available to anyone. 
Files:

  transtab                 This is the table in the format suggested in
                           ISO/IEC TR 14652

  transtab.utf             Same as transtab, but with added comments that
                           show the strings encoded in UTF-8. This is the
                           file that should be edited to make changes. The
                           makefile will build the others from this one.

  transtab.repertoire      List of characters covered by transtab suitable
                           for feeding into uniset. Also contains the
                           UTF-8 strings as comments.

  transtab.missing-MES-2   List of characters in CEN MES-2 minus those in
                           transtab.repertoire. Intended to help getting
                           an overview of what is and what is not covered.
                           Transtab does not aim to cover MES-2 completely.
                           It aims to provide transliterations only for
                           those characters where they are feasible.

  transcomp                Perl script to reformat and merge
                           transliteration tables

The transliteration table contains a list of substitution strings for
each member of the covered Unicode subset. Applications are expected
to use this list as follows:

  - Remove all substitution strings that contain Unicode characters
    that are not available in the destination character set.

  - Remove all substitution strings that are longer (or shorter) than
    required by the application (in particular, some applications
    might need substitution strings that are exactly one character
    long).

  - Of the remaining substitution strings, pick the first one in the
    list.

  - If no substitution string remains for a Unicode character, use
    a default character such as for instance "?".

Applications are not required or supposed to recursively substitute
Unicode characters found in substitution strings.

The substitution strings make no use of combining characters, that is
the output will be ISO 10646 Level 1. The input strings should
preferably be normalized into decomposed form first.

The substitution strings in this table aim to be visually or
semantically equivalent to the characters they replace.
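The selection procedure described above can be sketched in Python as
follows. This is only an illustration and not part of transtab or
translitcodec: the names pick_substitution and in_ascii, the parameters,
and the sample candidate lists are all assumptions made for the example.

```python
def pick_substitution(candidates, is_available, max_len=None, default="?"):
    """Pick a fallback string for one character (hypothetical helper).

    candidates   -- substitution strings in table order
    is_available -- predicate telling whether a character exists in the
                    destination character set
    max_len      -- optional upper bound on the substitution length
    default      -- returned when no candidate survives the filters
    """
    for candidate in candidates:
        # Drop strings that use characters missing from the target set.
        if not all(is_available(ch) for ch in candidate):
            continue
        # Drop strings that are too long for the application.
        if max_len is not None and len(candidate) > max_len:
            continue
        # The first surviving entry in table order wins.
        return candidate
    return default


def in_ascii(ch):
    """Destination character set for this example: plain ASCII."""
    return ord(ch) < 128


# Illustrative candidate lists only; the real lists live in transtab.utf.
print(pick_substitution(["\u00c4", "Ae", "A"], in_ascii))             # Ae
print(pick_substitution(["\u00c4", "Ae", "A"], in_ascii, max_len=1))  # A
print(pick_substitution(["\u00e5"], in_ascii))                        # ?
```

Only an upper length bound is modelled here; an application that needs
an exact or minimum length could add an analogous filter.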
Ideally, they should correspond to the fallback notation that people
naturally use in email or on typewriters to substitute unavailable
characters. They are not intended as unique mnemonics for characters
(such as for example those in RFC 1345).

If you use transliteration in C library locales, please make sure that
the X/Open functions wcwidth() and wcswidth() accurately predict how
many character cell positions the cursor will advance, even when
transliteration is used. This is essential to allow applications to
perform correct terminal screen layout even when multi-character
transliterations are used.

The latest version of this package is available from

  http://www.cl.cam.ac.uk/~mgk25/download/transtab.tar.gz

Please send comments and patches (preferably diff -u on transtab.utf) to

  Markus.Kuhn@cl.cam.ac.uk

Acknowledgements: Some parts of this table were inspired and recycled
from the def7_uni.tbl file in lynx-2.8.4.

Enjoy ...

Markus

--
Markus G. Kuhn, Computer Laboratory, University of Cambridge, UK
Email: mkuhn at acm.org, WWW:
translitcodec-version-0.7.0/transtab/REFERENCES000066400000000000000000000032511404555523200212270ustar00rootroot00000000000000
Some Literature References on Transliteration and Transcription
---------------------------------------------------------------

Markus Kuhn -- 2000-10-12

Arabic

  ISO 233:1984
  Documentation -- Transliteration of Arabic characters into Latin
  characters

  ISO 233-2:1993
  Information and documentation -- Transliteration of Arabic
  characters into Latin characters -- Part 2: Arabic language --
  Simplified transliteration

  ISO 233-3:1999
  Information and documentation -- Transliteration of Arabic
  characters into Latin characters -- Part 3: Persian language --
  Simplified transliteration (available in English only)

Armenian

  ISO 9985:1996
  Information and documentation -- Transliteration of Armenian
  characters into Latin characters

Esperanto

  L.L.
Zamenhof: Fundamento de Esperanto, 1905
  http://www.esperanto.net/veb/faq-15.html

Georgian

  ISO 9984:1996
  Information and documentation -- Transliteration of Georgian
  characters into Latin characters

Hebrew

  ISO 259:1984
  Documentation -- Transliteration of Hebrew characters into Latin
  characters

  ISO 259-2:1994
  Information and documentation -- Transliteration of Hebrew
  characters into Latin characters -- Part 2: Simplified
  transliteration

International Phonetic Alphabet

  http://www.hpl.hp.com/personal/Evan_Kirshenbaum/IPA/faq.html

Korean

  ISO/TR 11941:1996
  Information and documentation -- Transliteration of Korean script
  into Latin characters

Russian

  ISO 9:1995
  Information and documentation -- Transliteration of Cyrillic
  characters into Latin characters -- Slavic and non-Slavic languages

Thai

  ISO 11940:1998
  Information and documentation -- Transliteration of Thai
translitcodec-version-0.7.0/transtab/transcomp000077500000000000000000000122071404555523200216200ustar00rootroot00000000000000
#!/usr/bin/perl

# Encode the code point $c as a UTF-8 byte string (1 to 6 bytes).
sub utf8 ($) {
    my $c = shift(@_);

    if ($c < 0x80) {
        return sprintf("%c", $c);
    } elsif ($c < 0x800) {
        return sprintf("%c%c", 0xc0 | ($c >> 6), 0x80 | ($c & 0x3f));
    } elsif ($c < 0x10000) {
        return sprintf("%c%c%c",
                       0xe0 |  ($c >> 12),
                       0x80 | (($c >>  6) & 0x3f),
                       0x80 | ( $c        & 0x3f));
    } elsif ($c < 0x200000) {
        return sprintf("%c%c%c%c",
                       0xf0 |  ($c >> 18),
                       0x80 | (($c >> 12) & 0x3f),
                       0x80 | (($c >>  6) & 0x3f),
                       0x80 | ( $c        & 0x3f));
    } elsif ($c < 0x4000000) {
        return sprintf("%c%c%c%c%c",
                       0xf8 |  ($c >> 24),
                       0x80 | (($c >> 18) & 0x3f),
                       0x80 | (($c >> 12) & 0x3f),
                       0x80 | (($c >>  6) & 0x3f),
                       0x80 | ( $c        & 0x3f));
    } elsif ($c < 0x80000000) {
        # The lead byte of a six-byte sequence is 1111110x, i.e. 0xfc.
        return sprintf("%c%c%c%c%c%c",
                       0xfc |  ($c >> 30),
                       0x80 | (($c >> 24) & 0x3f),
                       0x80 | (($c >> 18) & 0x3f),
                       0x80 | (($c >> 12) & 0x3f),
                       0x80 | (($c >>  6) & 0x3f),
                       0x80 | ( $c        & 0x3f));
    } else {
        # Out of range: fall back to U+FFFD REPLACEMENT CHARACTER.
        return utf8(0xfffd);
    }
}

# Append the hex-encoded substitution string $t to the list for $ucs.
sub append_translit {
    my ($ucs, $t) = @_;
    $ucs =~ /^[0-9A-F]{4}$/ || die("ERROR: append_translit('$ucs','$t')\n");
    $t =~
/^([0-9A-F]{4})*$/ || die("ERROR: append_translit('$ucs','$t')\n");
    #print STDERR "append_translit('$ucs','$t')\n";
    if (!defined($trans{$ucs})) {
        $trans{$ucs} = [];
    }
    push(@{$trans{$ucs}}, $t);
}

$unicodedata = "UnicodeData.txt";
$datadir = "$ENV{HOME}/local/lib/ucs";

# read list of all Unicode names
if (!open(UDATA, $unicodedata) && !open(UDATA, "$datadir/$unicodedata")) {
    die ("Can't open Unicode database '$unicodedata':\n$!\n\n" .
         "Please make sure that you have downloaded the file\n" .
         "ftp://ftp.unicode.org/Public/UNIDATA/UnicodeData.txt\n");
}
while (<UDATA>) {
    if (/^([0-9,A-F]{4,6});([^;]*);([^;]*);([^;]*);([^;]*);([^;]*);([^;]*);([^;]*);([^;]*);([^;]*);([^;]*);([^;]*);([^;]*);([^;]*);([^;]*)$/) {
        $name{$1} = $2;
    } else {
        die("Syntax error in line '$_' in file '$unicodedata'");
    }
}
close(UDATA);

# read the transliteration tables named on the command line (or stdin)
while (<>) {
    next if /^\s*[\%\#]/;
    next if /^\s*$/;
    if (/^([0-9a-fA-F]{4})\s*(\#.*)?$/) {
        # uniset table format
        $ucs = $1; $ucs =~ tr/a-f/A-F/;
        if (!$trans{$ucs}) {
            append_translit($ucs, "");
        }
    } elsif (/^\s*<U([0-9a-fA-F]{4})>\s+(.*?)(\%.*)?$/) {
        # ISO/IEC TR 14652 format
        $ucs = $1; $ucs =~ tr/a-f/A-F/;
        $_ = $2;
        while (1) {
            if (/^<U([0-9a-fA-F]{4})>;?(.*)$/) {
                $t = $1; $t =~ tr/a-f/A-F/; $_=$2;
                append_translit($ucs, $t);
            } elsif (/^\"((?:<U[0-9a-fA-F]{4}>)*)\";?(.*)$/) {
                $t = $1; $_ = $2;
                $t =~ tr/a-f/A-F/;
                $t =~ s/<U([0-9A-F]{4})>/$1/g;
                append_translit($ucs, $t);
            } elsif (/^\"([^<\"]+)\";?(.*)$/) {
                $_ = $2;
                $t = "";
                for ($i = 0; $i < length($1); $i++) {
                    $t .= sprintf("%04X", ord(substr($1,$i,1)));
                }
                append_translit($ucs, $t);
            } elsif (/^\s*\%/ || /^\s*$/) {
                last;
            } else {
                die("parsing problem: '$_'\n");
            }
        }
    } elsif (/^U\+([0-9a-fA-F]{4}):(.*)$/ ||
             /^U\+([0-9a-fA-F]{4})\s*\"(.*)\"\s*(\#.*)?$/) {
        # Lynx format
        $ucs = $1; $ucs =~ tr/a-f/A-F/;
        $t = "";
        for ($i = 0; $i < length($2); $i++) {
            $t .= sprintf("%04X", ord(substr($2,$i,1)));
        }
        append_translit($ucs, $t);
    } elsif (/0x([0-9a-fA-F]{2})\s*(.*?)\s*(\#.*)?$/) {
        # Lynx format
        $t = hex($1); $_=$2;
        while ($_) {
            if (/^U\+([0-9a-fA-F]{4})-U\+([0-9a-fA-F]{4})\s*(.*)$/) {
                $ucs1=$1; $ucs2=$2; $_=$3;
                for
($ucs=hex($ucs1); $ucs <= hex($ucs2); $ucs++) {
                    append_translit(sprintf("%04X", $ucs), sprintf("%04X", $t));
                }
            } elsif (/^U\+([0-9a-fA-F]{4})\s*(.*)$/) {
                # this pattern has only two groups; the rest of the line is $2
                $_=$2;
                append_translit(sprintf("%04X", hex($1)), sprintf("%04X", $t));
            } else {
                print STDERR "Can't handle suffix: '$_'\n";
                last;
            }
        }
    } else {
        print STDERR "Can't handle: $_";
    }
}

$ENV{format} = 'iso' if !$ENV{format};

if ($ENV{format} =~ /^iso/) {
    # output in ISO/IEC DTR 14652 format
    print "% \$Id: \$\n\n";
    for $ucs (sort(keys(%trans))) {
        print "% $name{$ucs}\n";
        if ($ENV{format} eq 'isoutf') {
            print "% " . utf8(hex($ucs)) . " -> ";
            @l = @{$trans{$ucs}};
            while ($t = shift @l) {
                print "'";
                while ($t =~ /^(....)/) {
                    $t = $';
                    print utf8(hex($1));
                }
                print "'";
                print ", " if @l;
            }
            print "\n";
        }
        print "<U$ucs> ";
        @l = @{$trans{$ucs}};
        while (defined($t = shift @l)) {
            if (length($t) == 4) {
                print "<U$t>";
            } else {
                $t =~ s/(....)/<U$1>/g;
                print "\"$t\"";
            }
            print ";" if @l;
        }
        print "\n";
    }
}

if ($ENV{format} eq 'utf') {
    for $ucs (sort(keys(%trans))) {
        print "U+$ucs # " . utf8(hex($ucs)) .
" -> "; @l = @{$trans{$ucs}}; while ($t = shift @l) { print "'"; while ($t =~ /^(....)/) { $t = $'; print utf8(hex($1)); } print "'"; print ", " if @l; } print "\n"; } } translitcodec-version-0.7.0/transtab/transtab000066400000000000000000001307101404555523200214250ustar00rootroot00000000000000% $Id: $ % APOSTROPHE % GRAVE ACCENT ; % NO-BREAK SPACE % INVERTED EXCLAMATION MARK % CENT SIGN % POUND SIGN "" % YEN SIGN % BROKEN BAR % SECTION SIGN % DIAERESIS % COPYRIGHT SIGN ""; % FEMININE ORDINAL INDICATOR % LEFT-POINTING DOUBLE ANGLE QUOTATION MARK "" % NOT SIGN % SOFT HYPHEN % REGISTERED SIGN "" % MACRON % DEGREE SIGN % PLUS-MINUS SIGN "" % SUPERSCRIPT TWO ""; % SUPERSCRIPT THREE ""; % ACUTE ACCENT % MICRO SIGN ; % PILCROW SIGN % MIDDLE DOT % CEDILLA % SUPERSCRIPT ONE ""; % MASCULINE ORDINAL INDICATOR % RIGHT-POINTING DOUBLE ANGLE QUOTATION MARK "" % VULGAR FRACTION ONE QUARTER "" % VULGAR FRACTION ONE HALF "" % VULGAR FRACTION THREE QUARTERS "" % INVERTED QUESTION MARK % LATIN CAPITAL LETTER A WITH GRAVE % LATIN CAPITAL LETTER A WITH ACUTE % LATIN CAPITAL LETTER A WITH CIRCUMFLEX % LATIN CAPITAL LETTER A WITH TILDE % LATIN CAPITAL LETTER A WITH DIAERESIS ""; % LATIN CAPITAL LETTER A WITH RING ABOVE ""; % LATIN CAPITAL LETTER AE ""; % LATIN CAPITAL LETTER C WITH CEDILLA % LATIN CAPITAL LETTER E WITH GRAVE % LATIN CAPITAL LETTER E WITH ACUTE % LATIN CAPITAL LETTER E WITH CIRCUMFLEX % LATIN CAPITAL LETTER E WITH DIAERESIS % LATIN CAPITAL LETTER I WITH GRAVE % LATIN CAPITAL LETTER I WITH ACUTE % LATIN CAPITAL LETTER I WITH CIRCUMFLEX % LATIN CAPITAL LETTER I WITH DIAERESIS % LATIN CAPITAL LETTER ETH % LATIN CAPITAL LETTER N WITH TILDE % LATIN CAPITAL LETTER O WITH GRAVE % LATIN CAPITAL LETTER O WITH ACUTE % LATIN CAPITAL LETTER O WITH CIRCUMFLEX % LATIN CAPITAL LETTER O WITH TILDE % LATIN CAPITAL LETTER O WITH DIAERESIS ""; % MULTIPLICATION SIGN % LATIN CAPITAL LETTER O WITH STROKE % LATIN CAPITAL LETTER U WITH GRAVE % LATIN CAPITAL LETTER U WITH ACUTE % LATIN CAPITAL 
LETTER U WITH CIRCUMFLEX % LATIN CAPITAL LETTER U WITH DIAERESIS ""; % LATIN CAPITAL LETTER Y WITH ACUTE % LATIN CAPITAL LETTER THORN "" % LATIN SMALL LETTER SHARP S ""; % LATIN SMALL LETTER A WITH GRAVE % LATIN SMALL LETTER A WITH ACUTE % LATIN SMALL LETTER A WITH CIRCUMFLEX % LATIN SMALL LETTER A WITH TILDE % LATIN SMALL LETTER A WITH DIAERESIS ""; % LATIN SMALL LETTER A WITH RING ABOVE ""; % LATIN SMALL LETTER AE ""; % LATIN SMALL LETTER C WITH CEDILLA % LATIN SMALL LETTER E WITH GRAVE % LATIN SMALL LETTER E WITH ACUTE % LATIN SMALL LETTER E WITH CIRCUMFLEX % LATIN SMALL LETTER E WITH DIAERESIS % LATIN SMALL LETTER I WITH GRAVE % LATIN SMALL LETTER I WITH ACUTE % LATIN SMALL LETTER I WITH CIRCUMFLEX % LATIN SMALL LETTER I WITH DIAERESIS % LATIN SMALL LETTER ETH % LATIN SMALL LETTER N WITH TILDE % LATIN SMALL LETTER O WITH GRAVE % LATIN SMALL LETTER O WITH ACUTE % LATIN SMALL LETTER O WITH CIRCUMFLEX % LATIN SMALL LETTER O WITH TILDE % LATIN SMALL LETTER O WITH DIAERESIS ""; % DIVISION SIGN % LATIN SMALL LETTER O WITH STROKE % LATIN SMALL LETTER U WITH GRAVE % LATIN SMALL LETTER U WITH ACUTE % LATIN SMALL LETTER U WITH CIRCUMFLEX % LATIN SMALL LETTER U WITH DIAERESIS ""; % LATIN SMALL LETTER Y WITH ACUTE % LATIN SMALL LETTER THORN "" % LATIN SMALL LETTER Y WITH DIAERESIS % LATIN CAPITAL LETTER A WITH MACRON % LATIN SMALL LETTER A WITH MACRON % LATIN CAPITAL LETTER A WITH BREVE % LATIN SMALL LETTER A WITH BREVE % LATIN CAPITAL LETTER A WITH OGONEK % LATIN SMALL LETTER A WITH OGONEK % LATIN CAPITAL LETTER C WITH ACUTE % LATIN SMALL LETTER C WITH ACUTE % LATIN CAPITAL LETTER C WITH CIRCUMFLEX ""; % LATIN SMALL LETTER C WITH CIRCUMFLEX ""; % LATIN CAPITAL LETTER C WITH DOT ABOVE % LATIN SMALL LETTER C WITH DOT ABOVE % LATIN CAPITAL LETTER C WITH CARON % LATIN SMALL LETTER C WITH CARON % LATIN CAPITAL LETTER D WITH CARON % LATIN SMALL LETTER D WITH CARON % LATIN CAPITAL LETTER D WITH STROKE % LATIN SMALL LETTER D WITH STROKE % LATIN CAPITAL LETTER E WITH MACRON % 
LATIN SMALL LETTER E WITH MACRON % LATIN CAPITAL LETTER E WITH BREVE % LATIN SMALL LETTER E WITH BREVE % LATIN CAPITAL LETTER E WITH DOT ABOVE % LATIN SMALL LETTER E WITH DOT ABOVE % LATIN CAPITAL LETTER E WITH OGONEK % LATIN SMALL LETTER E WITH OGONEK % LATIN CAPITAL LETTER E WITH CARON % LATIN SMALL LETTER E WITH CARON % LATIN CAPITAL LETTER G WITH CIRCUMFLEX ""; % LATIN SMALL LETTER G WITH CIRCUMFLEX ""; % LATIN CAPITAL LETTER G WITH BREVE % LATIN SMALL LETTER G WITH BREVE % LATIN CAPITAL LETTER G WITH DOT ABOVE % LATIN SMALL LETTER G WITH DOT ABOVE % LATIN CAPITAL LETTER G WITH CEDILLA % LATIN SMALL LETTER G WITH CEDILLA % LATIN CAPITAL LETTER H WITH CIRCUMFLEX ""; % LATIN SMALL LETTER H WITH CIRCUMFLEX ""; % LATIN CAPITAL LETTER H WITH STROKE % LATIN SMALL LETTER H WITH STROKE % LATIN CAPITAL LETTER I WITH TILDE % LATIN SMALL LETTER I WITH TILDE % LATIN CAPITAL LETTER I WITH MACRON % LATIN SMALL LETTER I WITH MACRON % LATIN CAPITAL LETTER I WITH BREVE % LATIN SMALL LETTER I WITH BREVE % LATIN CAPITAL LETTER I WITH OGONEK % LATIN SMALL LETTER I WITH OGONEK % LATIN CAPITAL LETTER I WITH DOT ABOVE % LATIN SMALL LETTER DOTLESS I % LATIN CAPITAL LIGATURE IJ "" % LATIN SMALL LIGATURE IJ "" % LATIN CAPITAL LETTER J WITH CIRCUMFLEX ""; % LATIN SMALL LETTER J WITH CIRCUMFLEX ""; % LATIN CAPITAL LETTER K WITH CEDILLA % LATIN SMALL LETTER K WITH CEDILLA % LATIN SMALL LETTER KRA % LATIN CAPITAL LETTER L WITH ACUTE % LATIN SMALL LETTER L WITH ACUTE % LATIN CAPITAL LETTER L WITH CEDILLA % LATIN SMALL LETTER L WITH CEDILLA % LATIN CAPITAL LETTER L WITH CARON % LATIN SMALL LETTER L WITH CARON % LATIN CAPITAL LETTER L WITH MIDDLE DOT "";""; % LATIN SMALL LETTER L WITH MIDDLE DOT "";""; % LATIN CAPITAL LETTER L WITH STROKE % LATIN SMALL LETTER L WITH STROKE % LATIN CAPITAL LETTER N WITH ACUTE % LATIN SMALL LETTER N WITH ACUTE % LATIN CAPITAL LETTER N WITH CEDILLA % LATIN SMALL LETTER N WITH CEDILLA % LATIN CAPITAL LETTER N WITH CARON % LATIN SMALL LETTER N WITH CARON % LATIN 
SMALL LETTER N PRECEDED BY APOSTROPHE "" % LATIN CAPITAL LETTER ENG ""; % LATIN SMALL LETTER ENG ""; % LATIN CAPITAL LETTER O WITH MACRON % LATIN SMALL LETTER O WITH MACRON % LATIN CAPITAL LETTER O WITH BREVE % LATIN SMALL LETTER O WITH BREVE % LATIN CAPITAL LETTER O WITH DOUBLE ACUTE % LATIN SMALL LETTER O WITH DOUBLE ACUTE % LATIN CAPITAL LIGATURE OE "" % LATIN SMALL LIGATURE OE "" % LATIN CAPITAL LETTER R WITH ACUTE % LATIN SMALL LETTER R WITH ACUTE % LATIN CAPITAL LETTER R WITH CEDILLA % LATIN SMALL LETTER R WITH CEDILLA % LATIN CAPITAL LETTER R WITH CARON % LATIN SMALL LETTER R WITH CARON % LATIN CAPITAL LETTER S WITH ACUTE % LATIN SMALL LETTER S WITH ACUTE % LATIN CAPITAL LETTER S WITH CIRCUMFLEX ""; % LATIN SMALL LETTER S WITH CIRCUMFLEX ""; % LATIN CAPITAL LETTER S WITH CEDILLA % LATIN SMALL LETTER S WITH CEDILLA % LATIN CAPITAL LETTER S WITH CARON % LATIN SMALL LETTER S WITH CARON % LATIN CAPITAL LETTER T WITH CEDILLA % LATIN SMALL LETTER T WITH CEDILLA % LATIN CAPITAL LETTER T WITH CARON % LATIN SMALL LETTER T WITH CARON % LATIN CAPITAL LETTER T WITH STROKE % LATIN SMALL LETTER T WITH STROKE % LATIN CAPITAL LETTER U WITH TILDE % LATIN SMALL LETTER U WITH TILDE % LATIN CAPITAL LETTER U WITH MACRON % LATIN SMALL LETTER U WITH MACRON % LATIN CAPITAL LETTER U WITH BREVE % LATIN SMALL LETTER U WITH BREVE % LATIN CAPITAL LETTER U WITH RING ABOVE % LATIN SMALL LETTER U WITH RING ABOVE % LATIN CAPITAL LETTER U WITH DOUBLE ACUTE % LATIN SMALL LETTER U WITH DOUBLE ACUTE % LATIN CAPITAL LETTER U WITH OGONEK % LATIN SMALL LETTER U WITH OGONEK % LATIN CAPITAL LETTER W WITH CIRCUMFLEX % LATIN SMALL LETTER W WITH CIRCUMFLEX % LATIN CAPITAL LETTER Y WITH CIRCUMFLEX % LATIN SMALL LETTER Y WITH CIRCUMFLEX % LATIN CAPITAL LETTER Y WITH DIAERESIS % LATIN CAPITAL LETTER Z WITH ACUTE % LATIN SMALL LETTER Z WITH ACUTE % LATIN CAPITAL LETTER Z WITH DOT ABOVE % LATIN SMALL LETTER Z WITH DOT ABOVE % LATIN CAPITAL LETTER Z WITH CARON % LATIN SMALL LETTER Z WITH CARON % LATIN SMALL 
LETTER LONG S % LATIN SMALL LETTER F WITH HOOK % LATIN CAPITAL LETTER O WITH HORN % LATIN SMALL LETTER O WITH HORN % LATIN CAPITAL LETTER U WITH HORN % LATIN SMALL LETTER U WITH HORN % LATIN CAPITAL LETTER S WITH COMMA BELOW ; % LATIN SMALL LETTER S WITH COMMA BELOW ; % LATIN CAPITAL LETTER T WITH COMMA BELOW ; % LATIN SMALL LETTER T WITH COMMA BELOW ; % MODIFIER LETTER PRIME ; % MODIFIER LETTER TURNED COMMA % MODIFIER LETTER APOSTROPHE ; % MODIFIER LETTER REVERSED COMMA % MODIFIER LETTER CIRCUMFLEX ACCENT % MODIFIER LETTER VERTICAL LINE % MODIFIER LETTER MACRON % MODIFIER LETTER LOW VERTICAL LINE % MODIFIER LETTER TRIANGULAR COLON % RING ABOVE % SMALL TILDE % DOUBLE ACUTE ACCENT % GREEK NUMERAL SIGN % GREEK LOWER NUMERAL SIGN % GREEK QUESTION MARK % LATIN CAPITAL LETTER B WITH DOT ABOVE % LATIN SMALL LETTER B WITH DOT ABOVE % LATIN CAPITAL LETTER D WITH DOT ABOVE % LATIN SMALL LETTER D WITH DOT ABOVE % LATIN CAPITAL LETTER F WITH DOT ABOVE % LATIN SMALL LETTER F WITH DOT ABOVE % LATIN CAPITAL LETTER M WITH DOT ABOVE % LATIN SMALL LETTER M WITH DOT ABOVE % LATIN CAPITAL LETTER P WITH DOT ABOVE % LATIN SMALL LETTER P WITH DOT ABOVE % LATIN CAPITAL LETTER S WITH DOT ABOVE % LATIN SMALL LETTER S WITH DOT ABOVE % LATIN CAPITAL LETTER T WITH DOT ABOVE % LATIN SMALL LETTER T WITH DOT ABOVE % LATIN CAPITAL LETTER W WITH GRAVE % LATIN SMALL LETTER W WITH GRAVE % LATIN CAPITAL LETTER W WITH ACUTE % LATIN SMALL LETTER W WITH ACUTE % LATIN CAPITAL LETTER W WITH DIAERESIS % LATIN SMALL LETTER W WITH DIAERESIS % LATIN CAPITAL LETTER U WITH HORN AND TILDE % LATIN SMALL LETTER U WITH HORN AND TILDE % LATIN CAPITAL LETTER Y WITH GRAVE % LATIN SMALL LETTER Y WITH GRAVE % EN QUAD % EM QUAD "" % EN SPACE % EM SPACE "" % THREE-PER-EM SPACE % FOUR-PER-EM SPACE % SIX-PER-EM SPACE % FIGURE SPACE % PUNCTUATION SPACE % THIN SPACE % HAIR SPACE "" % ZERO WIDTH SPACE "" % ZERO WIDTH NON-JOINER "" % ZERO WIDTH JOINER "" % LEFT-TO-RIGHT MARK "" % RIGHT-TO-LEFT MARK "" % HYPHEN % NON-BREAKING 
HYPHEN % FIGURE DASH % EN DASH % EM DASH "" % HORIZONTAL BAR "" % DOUBLE VERTICAL LINE "" % DOUBLE LOW LINE % LEFT SINGLE QUOTATION MARK % RIGHT SINGLE QUOTATION MARK % SINGLE LOW-9 QUOTATION MARK % SINGLE HIGH-REVERSED-9 QUOTATION MARK % LEFT DOUBLE QUOTATION MARK % RIGHT DOUBLE QUOTATION MARK % DOUBLE LOW-9 QUOTATION MARK % DOUBLE HIGH-REVERSED-9 QUOTATION MARK % DAGGER % DOUBLE DAGGER "" % BULLET % TRIANGULAR BULLET % ONE DOT LEADER % TWO DOT LEADER "" % HORIZONTAL ELLIPSIS "" % HYPHENATION POINT % LEFT-TO-RIGHT EMBEDDING "" % RIGHT-TO-LEFT EMBEDDING "" % POP DIRECTIONAL FORMATTING "" % LEFT-TO-RIGHT OVERRIDE "" % RIGHT-TO-LEFT OVERRIDE "" % NARROW NO-BREAK SPACE % PER MILLE SIGN "" % PRIME % DOUBLE PRIME % TRIPLE PRIME "" % REVERSED PRIME % REVERSED DOUBLE PRIME "" % REVERSED TRIPLE PRIME "" % SINGLE LEFT-POINTING ANGLE QUOTATION MARK % SINGLE RIGHT-POINTING ANGLE QUOTATION MARK % DOUBLE EXCLAMATION MARK "" % OVERLINE % HYPHEN BULLET % FRACTION SLASH % QUESTION EXCLAMATION MARK "" % EXCLAMATION QUESTION MARK "" % TIRONIAN SIGN ET % SUPERSCRIPT ZERO ""; % SUPERSCRIPT FOUR ""; % SUPERSCRIPT FIVE ""; % SUPERSCRIPT SIX ""; % SUPERSCRIPT SEVEN ""; % SUPERSCRIPT EIGHT ""; % SUPERSCRIPT NINE ""; % SUPERSCRIPT PLUS SIGN ""; % SUPERSCRIPT MINUS ""; % SUPERSCRIPT EQUALS SIGN ""; % SUPERSCRIPT LEFT PARENTHESIS ""; % SUPERSCRIPT RIGHT PARENTHESIS ""; % SUPERSCRIPT LATIN SMALL LETTER N ""; % SUBSCRIPT ZERO ""; % SUBSCRIPT ONE ""; % SUBSCRIPT TWO ""; % SUBSCRIPT THREE ""; % SUBSCRIPT FOUR ""; % SUBSCRIPT FIVE ""; % SUBSCRIPT SIX ""; % SUBSCRIPT SEVEN ""; % SUBSCRIPT EIGHT ""; % SUBSCRIPT NINE ""; % SUBSCRIPT PLUS SIGN ""; % SUBSCRIPT MINUS ""; % SUBSCRIPT EQUALS SIGN ""; % SUBSCRIPT LEFT PARENTHESIS ""; % SUBSCRIPT RIGHT PARENTHESIS ""; % EURO SIGN ""; % ACCOUNT OF "" % ADDRESSED TO THE SUBJECT "" % DEGREE CELSIUS ""; % CARE OF "" % CADA UNA "" % DEGREE FAHRENHEIT ""; % SCRIPT SMALL L % NUMERO SIGN "";"" % SOUND RECORDING COPYRIGHT "" % SERVICE MARK "" % TELEPHONE SIGN "" % 
TRADE MARK SIGN "" % OHM SIGN ;""; % KELVIN SIGN % ANGSTROM SIGN % ESTIMATED SYMBOL % VULGAR FRACTION ONE THIRD "" % VULGAR FRACTION TWO THIRDS "" % VULGAR FRACTION ONE FIFTH "" % VULGAR FRACTION TWO FIFTHS "" % VULGAR FRACTION THREE FIFTHS "" % VULGAR FRACTION FOUR FIFTHS "" % VULGAR FRACTION ONE SIXTH "" % VULGAR FRACTION FIVE SIXTHS "" % VULGAR FRACTION ONE EIGHTH "" % VULGAR FRACTION THREE EIGHTHS "" % VULGAR FRACTION FIVE EIGHTHS "" % VULGAR FRACTION SEVEN EIGHTHS "" % FRACTION NUMERATOR ONE "" % ROMAN NUMERAL ONE % ROMAN NUMERAL TWO "" % ROMAN NUMERAL THREE "" % ROMAN NUMERAL FOUR "" % ROMAN NUMERAL FIVE % ROMAN NUMERAL SIX "" % ROMAN NUMERAL SEVEN "" % ROMAN NUMERAL EIGHT "" % ROMAN NUMERAL NINE "" % ROMAN NUMERAL TEN % ROMAN NUMERAL ELEVEN "" % ROMAN NUMERAL TWELVE "" % ROMAN NUMERAL FIFTY % ROMAN NUMERAL ONE HUNDRED % ROMAN NUMERAL FIVE HUNDRED % ROMAN NUMERAL ONE THOUSAND % SMALL ROMAN NUMERAL ONE % SMALL ROMAN NUMERAL TWO "" % SMALL ROMAN NUMERAL THREE "" % SMALL ROMAN NUMERAL FOUR "" % SMALL ROMAN NUMERAL FIVE % SMALL ROMAN NUMERAL SIX "" % SMALL ROMAN NUMERAL SEVEN "" % SMALL ROMAN NUMERAL EIGHT "" % SMALL ROMAN NUMERAL NINE "" % SMALL ROMAN NUMERAL TEN % SMALL ROMAN NUMERAL ELEVEN "" % SMALL ROMAN NUMERAL TWELVE "" % SMALL ROMAN NUMERAL FIFTY % SMALL ROMAN NUMERAL ONE HUNDRED % SMALL ROMAN NUMERAL FIVE HUNDRED % SMALL ROMAN NUMERAL ONE THOUSAND % LEFTWARDS ARROW "" % UPWARDS ARROW % RIGHTWARDS ARROW "" % DOWNWARDS ARROW % LEFT RIGHT ARROW "" % LEFTWARDS DOUBLE ARROW "" % RIGHTWARDS DOUBLE ARROW "" % LEFT RIGHT DOUBLE ARROW "" % MINUS SIGN ; % DIVISION SLASH % SET MINUS % ASTERISK OPERATOR % RING OPERATOR % BULLET OPERATOR % INFINITY "" % DIVIDES % PARALLEL TO "" % RATIO % TILDE OPERATOR % NOT EQUAL TO "" % IDENTICAL TO % LESS-THAN OR EQUAL TO "" % GREATER-THAN OR EQUAL TO "" % MUCH LESS-THAN "" % MUCH GREATER-THAN "" % CIRCLED PLUS "" % CIRCLED MINUS "" % CIRCLED TIMES "" % CIRCLED DIVISION SLASH "" % RIGHT TACK "" % LEFT TACK "" % ASSERTION "" % 
MODELS "" % TRUE "" % FORCES "" % DOT OPERATOR % STAR OPERATOR % EQUAL AND PARALLEL TO % VERY MUCH LESS-THAN "" % VERY MUCH GREATER-THAN "" % MIDLINE HORIZONTAL ELLIPSIS "" % LEFT-POINTING ANGLE BRACKET % RIGHT-POINTING ANGLE BRACKET % SYMBOL FOR NULL "" % SYMBOL FOR START OF HEADING "" % SYMBOL FOR START OF TEXT "" % SYMBOL FOR END OF TEXT "" % SYMBOL FOR END OF TRANSMISSION "" % SYMBOL FOR ENQUIRY "" % SYMBOL FOR ACKNOWLEDGE "" % SYMBOL FOR BELL "" % SYMBOL FOR BACKSPACE "" % SYMBOL FOR HORIZONTAL TABULATION "" % SYMBOL FOR LINE FEED "" % SYMBOL FOR VERTICAL TABULATION "" % SYMBOL FOR FORM FEED "" % SYMBOL FOR CARRIAGE RETURN "" % SYMBOL FOR SHIFT OUT "" % SYMBOL FOR SHIFT IN "" % SYMBOL FOR DATA LINK ESCAPE "" % SYMBOL FOR DEVICE CONTROL ONE "" % SYMBOL FOR DEVICE CONTROL TWO "" % SYMBOL FOR DEVICE CONTROL THREE "" % SYMBOL FOR DEVICE CONTROL FOUR "" % SYMBOL FOR NEGATIVE ACKNOWLEDGE "" % SYMBOL FOR SYNCHRONOUS IDLE "" % SYMBOL FOR END OF TRANSMISSION BLOCK "" % SYMBOL FOR CANCEL "" % SYMBOL FOR END OF MEDIUM "" % SYMBOL FOR SUBSTITUTE "" % SYMBOL FOR ESCAPE "" % SYMBOL FOR FILE SEPARATOR "" % SYMBOL FOR GROUP SEPARATOR "" % SYMBOL FOR RECORD SEPARATOR "" % SYMBOL FOR UNIT SEPARATOR "" % SYMBOL FOR SPACE "" % SYMBOL FOR DELETE "" % OPEN BOX % SYMBOL FOR NEWLINE "" % SYMBOL FOR DELETE FORM TWO "" % SYMBOL FOR SUBSTITUTE FORM TWO % CIRCLED DIGIT ONE ""; % CIRCLED DIGIT TWO ""; % CIRCLED DIGIT THREE ""; % CIRCLED DIGIT FOUR ""; % CIRCLED DIGIT FIVE ""; % CIRCLED DIGIT SIX ""; % CIRCLED DIGIT SEVEN ""; % CIRCLED DIGIT EIGHT ""; % CIRCLED DIGIT NINE ""; % CIRCLED NUMBER TEN "" % CIRCLED NUMBER ELEVEN "" % CIRCLED NUMBER TWELVE "" % CIRCLED NUMBER THIRTEEN "" % CIRCLED NUMBER FOURTEEN "" % CIRCLED NUMBER FIFTEEN "" % CIRCLED NUMBER SIXTEEN "" % CIRCLED NUMBER SEVENTEEN "" % CIRCLED NUMBER EIGHTEEN "" % CIRCLED NUMBER NINETEEN "" % CIRCLED NUMBER TWENTY "" % PARENTHESIZED DIGIT ONE ""; % PARENTHESIZED DIGIT TWO ""; % PARENTHESIZED DIGIT THREE ""; % PARENTHESIZED DIGIT 
FOUR ""; % PARENTHESIZED DIGIT FIVE ""; % PARENTHESIZED DIGIT SIX ""; % PARENTHESIZED DIGIT SEVEN ""; % PARENTHESIZED DIGIT EIGHT ""; % PARENTHESIZED DIGIT NINE ""; % PARENTHESIZED NUMBER TEN "" % PARENTHESIZED NUMBER ELEVEN "" % PARENTHESIZED NUMBER TWELVE "" % PARENTHESIZED NUMBER THIRTEEN "" % PARENTHESIZED NUMBER FOURTEEN "" % PARENTHESIZED NUMBER FIFTEEN "" % PARENTHESIZED NUMBER SIXTEEN "" % PARENTHESIZED NUMBER SEVENTEEN "" % PARENTHESIZED NUMBER EIGHTEEN "" % PARENTHESIZED NUMBER NINETEEN "" % PARENTHESIZED NUMBER TWENTY "" % DIGIT ONE FULL STOP ""; % DIGIT TWO FULL STOP ""; % DIGIT THREE FULL STOP ""; % DIGIT FOUR FULL STOP ""; % DIGIT FIVE FULL STOP ""; % DIGIT SIX FULL STOP ""; % DIGIT SEVEN FULL STOP ""; % DIGIT EIGHT FULL STOP ""; % DIGIT NINE FULL STOP ""; % NUMBER TEN FULL STOP "" % NUMBER ELEVEN FULL STOP "" % NUMBER TWELVE FULL STOP "" % NUMBER THIRTEEN FULL STOP "" % NUMBER FOURTEEN FULL STOP "" % NUMBER FIFTEEN FULL STOP "" % NUMBER SIXTEEN FULL STOP "" % NUMBER SEVENTEEN FULL STOP "" % NUMBER EIGHTEEN FULL STOP "" % NUMBER NINETEEN FULL STOP "" % NUMBER TWENTY FULL STOP "" % PARENTHESIZED LATIN SMALL LETTER A ""; % PARENTHESIZED LATIN SMALL LETTER B ""; % PARENTHESIZED LATIN SMALL LETTER C ""; % PARENTHESIZED LATIN SMALL LETTER D ""; % PARENTHESIZED LATIN SMALL LETTER E ""; % PARENTHESIZED LATIN SMALL LETTER F ""; % PARENTHESIZED LATIN SMALL LETTER G ""; % PARENTHESIZED LATIN SMALL LETTER H ""; % PARENTHESIZED LATIN SMALL LETTER I ""; % PARENTHESIZED LATIN SMALL LETTER J ""; % PARENTHESIZED LATIN SMALL LETTER K ""; % PARENTHESIZED LATIN SMALL LETTER L ""; % PARENTHESIZED LATIN SMALL LETTER M ""; % PARENTHESIZED LATIN SMALL LETTER N ""; % PARENTHESIZED LATIN SMALL LETTER O ""; % PARENTHESIZED LATIN SMALL LETTER P ""; % PARENTHESIZED LATIN SMALL LETTER Q ""; % PARENTHESIZED LATIN SMALL LETTER R ""; % PARENTHESIZED LATIN SMALL LETTER S ""; % PARENTHESIZED LATIN SMALL LETTER T ""; % PARENTHESIZED LATIN SMALL LETTER U ""; % PARENTHESIZED LATIN SMALL 
LETTER V ""; % PARENTHESIZED LATIN SMALL LETTER W ""; % PARENTHESIZED LATIN SMALL LETTER X ""; % PARENTHESIZED LATIN SMALL LETTER Y ""; % PARENTHESIZED LATIN SMALL LETTER Z ""; % CIRCLED LATIN CAPITAL LETTER A ""; % CIRCLED LATIN CAPITAL LETTER B ""; % CIRCLED LATIN CAPITAL LETTER C ""; % CIRCLED LATIN CAPITAL LETTER D ""; % CIRCLED LATIN CAPITAL LETTER E ""; % CIRCLED LATIN CAPITAL LETTER F ""; % CIRCLED LATIN CAPITAL LETTER G ""; % CIRCLED LATIN CAPITAL LETTER H ""; % CIRCLED LATIN CAPITAL LETTER I ""; % CIRCLED LATIN CAPITAL LETTER J ""; % CIRCLED LATIN CAPITAL LETTER K ""; % CIRCLED LATIN CAPITAL LETTER L ""; % CIRCLED LATIN CAPITAL LETTER M ""; % CIRCLED LATIN CAPITAL LETTER N ""; % CIRCLED LATIN CAPITAL LETTER O ""; % CIRCLED LATIN CAPITAL LETTER P ""; % CIRCLED LATIN CAPITAL LETTER Q ""; % CIRCLED LATIN CAPITAL LETTER R ""; % CIRCLED LATIN CAPITAL LETTER S ""; % CIRCLED LATIN CAPITAL LETTER T ""; % CIRCLED LATIN CAPITAL LETTER U ""; % CIRCLED LATIN CAPITAL LETTER V ""; % CIRCLED LATIN CAPITAL LETTER W ""; % CIRCLED LATIN CAPITAL LETTER X ""; % CIRCLED LATIN CAPITAL LETTER Y ""; % CIRCLED LATIN CAPITAL LETTER Z ""; % CIRCLED LATIN SMALL LETTER A ""; % CIRCLED LATIN SMALL LETTER B ""; % CIRCLED LATIN SMALL LETTER C ""; % CIRCLED LATIN SMALL LETTER D ""; % CIRCLED LATIN SMALL LETTER E ""; % CIRCLED LATIN SMALL LETTER F ""; % CIRCLED LATIN SMALL LETTER G ""; % CIRCLED LATIN SMALL LETTER H ""; % CIRCLED LATIN SMALL LETTER I ""; % CIRCLED LATIN SMALL LETTER J ""; % CIRCLED LATIN SMALL LETTER K ""; % CIRCLED LATIN SMALL LETTER L ""; % CIRCLED LATIN SMALL LETTER M ""; % CIRCLED LATIN SMALL LETTER N ""; % CIRCLED LATIN SMALL LETTER O ""; % CIRCLED LATIN SMALL LETTER P ""; % CIRCLED LATIN SMALL LETTER Q ""; % CIRCLED LATIN SMALL LETTER R ""; % CIRCLED LATIN SMALL LETTER S ""; % CIRCLED LATIN SMALL LETTER T ""; % CIRCLED LATIN SMALL LETTER U ""; % CIRCLED LATIN SMALL LETTER V ""; % CIRCLED LATIN SMALL LETTER W ""; % CIRCLED LATIN SMALL LETTER X ""; % CIRCLED LATIN 
SMALL LETTER Y ""; % CIRCLED LATIN SMALL LETTER Z ""; % CIRCLED DIGIT ZERO ""; % BOX DRAWINGS LIGHT HORIZONTAL % BOX DRAWINGS HEAVY HORIZONTAL % BOX DRAWINGS LIGHT VERTICAL % BOX DRAWINGS HEAVY VERTICAL % BOX DRAWINGS LIGHT TRIPLE DASH HORIZONTAL % BOX DRAWINGS HEAVY TRIPLE DASH HORIZONTAL % BOX DRAWINGS LIGHT TRIPLE DASH VERTICAL % BOX DRAWINGS HEAVY TRIPLE DASH VERTICAL % BOX DRAWINGS LIGHT QUADRUPLE DASH HORIZONTAL % BOX DRAWINGS HEAVY QUADRUPLE DASH HORIZONTAL % BOX DRAWINGS LIGHT QUADRUPLE DASH VERTICAL % BOX DRAWINGS HEAVY QUADRUPLE DASH VERTICAL % BOX DRAWINGS LIGHT DOWN AND RIGHT % BOX DRAWINGS DOWN LIGHT AND RIGHT HEAVY % BOX DRAWINGS DOWN HEAVY AND RIGHT LIGHT % BOX DRAWINGS HEAVY DOWN AND RIGHT % BOX DRAWINGS LIGHT DOWN AND LEFT % BOX DRAWINGS DOWN LIGHT AND LEFT HEAVY % BOX DRAWINGS DOWN HEAVY AND LEFT LIGHT % BOX DRAWINGS HEAVY DOWN AND LEFT % BOX DRAWINGS LIGHT UP AND RIGHT % BOX DRAWINGS UP LIGHT AND RIGHT HEAVY % BOX DRAWINGS UP HEAVY AND RIGHT LIGHT % BOX DRAWINGS HEAVY UP AND RIGHT % BOX DRAWINGS LIGHT UP AND LEFT % BOX DRAWINGS UP LIGHT AND LEFT HEAVY % BOX DRAWINGS UP HEAVY AND LEFT LIGHT % BOX DRAWINGS HEAVY UP AND LEFT % BOX DRAWINGS LIGHT VERTICAL AND RIGHT % BOX DRAWINGS VERTICAL LIGHT AND RIGHT HEAVY % BOX DRAWINGS UP HEAVY AND RIGHT DOWN LIGHT % BOX DRAWINGS DOWN HEAVY AND RIGHT UP LIGHT % BOX DRAWINGS VERTICAL HEAVY AND RIGHT LIGHT % BOX DRAWINGS DOWN LIGHT AND RIGHT UP HEAVY % BOX DRAWINGS UP LIGHT AND RIGHT DOWN HEAVY % BOX DRAWINGS HEAVY VERTICAL AND RIGHT % BOX DRAWINGS LIGHT VERTICAL AND LEFT % BOX DRAWINGS VERTICAL LIGHT AND LEFT HEAVY % BOX DRAWINGS UP HEAVY AND LEFT DOWN LIGHT % BOX DRAWINGS DOWN HEAVY AND LEFT UP LIGHT % BOX DRAWINGS VERTICAL HEAVY AND LEFT LIGHT % BOX DRAWINGS DOWN LIGHT AND LEFT UP HEAVY % BOX DRAWINGS UP LIGHT AND LEFT DOWN HEAVY % BOX DRAWINGS HEAVY VERTICAL AND LEFT % BOX DRAWINGS LIGHT DOWN AND HORIZONTAL % BOX DRAWINGS LEFT HEAVY AND RIGHT DOWN LIGHT % BOX DRAWINGS RIGHT HEAVY AND LEFT DOWN LIGHT % BOX 
DRAWINGS DOWN LIGHT AND HORIZONTAL HEAVY % BOX DRAWINGS DOWN HEAVY AND HORIZONTAL LIGHT % BOX DRAWINGS RIGHT LIGHT AND LEFT DOWN HEAVY % BOX DRAWINGS LEFT LIGHT AND RIGHT DOWN HEAVY % BOX DRAWINGS HEAVY DOWN AND HORIZONTAL % BOX DRAWINGS LIGHT UP AND HORIZONTAL % BOX DRAWINGS LEFT HEAVY AND RIGHT UP LIGHT % BOX DRAWINGS RIGHT HEAVY AND LEFT UP LIGHT % BOX DRAWINGS UP LIGHT AND HORIZONTAL HEAVY % BOX DRAWINGS UP HEAVY AND HORIZONTAL LIGHT % BOX DRAWINGS RIGHT LIGHT AND LEFT UP HEAVY % BOX DRAWINGS LEFT LIGHT AND RIGHT UP HEAVY % BOX DRAWINGS HEAVY UP AND HORIZONTAL % BOX DRAWINGS LIGHT VERTICAL AND HORIZONTAL % BOX DRAWINGS LEFT HEAVY AND RIGHT VERTICAL LIGHT % BOX DRAWINGS RIGHT HEAVY AND LEFT VERTICAL LIGHT % BOX DRAWINGS VERTICAL LIGHT AND HORIZONTAL HEAVY % BOX DRAWINGS UP HEAVY AND DOWN HORIZONTAL LIGHT % BOX DRAWINGS DOWN HEAVY AND UP HORIZONTAL LIGHT % BOX DRAWINGS VERTICAL HEAVY AND HORIZONTAL LIGHT % BOX DRAWINGS LEFT UP HEAVY AND RIGHT DOWN LIGHT % BOX DRAWINGS RIGHT UP HEAVY AND LEFT DOWN LIGHT % BOX DRAWINGS LEFT DOWN HEAVY AND RIGHT UP LIGHT % BOX DRAWINGS RIGHT DOWN HEAVY AND LEFT UP LIGHT % BOX DRAWINGS DOWN LIGHT AND UP HORIZONTAL HEAVY % BOX DRAWINGS UP LIGHT AND DOWN HORIZONTAL HEAVY % BOX DRAWINGS RIGHT LIGHT AND LEFT VERTICAL HEAVY % BOX DRAWINGS LEFT LIGHT AND RIGHT VERTICAL HEAVY % BOX DRAWINGS HEAVY VERTICAL AND HORIZONTAL % BOX DRAWINGS LIGHT DOUBLE DASH HORIZONTAL % BOX DRAWINGS HEAVY DOUBLE DASH HORIZONTAL % BOX DRAWINGS LIGHT DOUBLE DASH VERTICAL % BOX DRAWINGS HEAVY DOUBLE DASH VERTICAL % BOX DRAWINGS DOUBLE HORIZONTAL % BOX DRAWINGS DOUBLE VERTICAL % BOX DRAWINGS DOWN SINGLE AND RIGHT DOUBLE % BOX DRAWINGS DOWN DOUBLE AND RIGHT SINGLE % BOX DRAWINGS DOUBLE DOWN AND RIGHT % BOX DRAWINGS DOWN SINGLE AND LEFT DOUBLE % BOX DRAWINGS DOWN DOUBLE AND LEFT SINGLE % BOX DRAWINGS DOUBLE DOWN AND LEFT % BOX DRAWINGS UP SINGLE AND RIGHT DOUBLE % BOX DRAWINGS UP DOUBLE AND RIGHT SINGLE % BOX DRAWINGS DOUBLE UP AND RIGHT % BOX DRAWINGS UP SINGLE AND 
LEFT DOUBLE % BOX DRAWINGS UP DOUBLE AND LEFT SINGLE % BOX DRAWINGS DOUBLE UP AND LEFT % BOX DRAWINGS VERTICAL SINGLE AND RIGHT DOUBLE % BOX DRAWINGS VERTICAL DOUBLE AND RIGHT SINGLE % BOX DRAWINGS DOUBLE VERTICAL AND RIGHT % BOX DRAWINGS VERTICAL SINGLE AND LEFT DOUBLE % BOX DRAWINGS VERTICAL DOUBLE AND LEFT SINGLE % BOX DRAWINGS DOUBLE VERTICAL AND LEFT % BOX DRAWINGS DOWN SINGLE AND HORIZONTAL DOUBLE % BOX DRAWINGS DOWN DOUBLE AND HORIZONTAL SINGLE % BOX DRAWINGS DOUBLE DOWN AND HORIZONTAL % BOX DRAWINGS UP SINGLE AND HORIZONTAL DOUBLE % BOX DRAWINGS UP DOUBLE AND HORIZONTAL SINGLE % BOX DRAWINGS DOUBLE UP AND HORIZONTAL % BOX DRAWINGS VERTICAL SINGLE AND HORIZONTAL DOUBLE % BOX DRAWINGS VERTICAL DOUBLE AND HORIZONTAL SINGLE % BOX DRAWINGS DOUBLE VERTICAL AND HORIZONTAL % BOX DRAWINGS LIGHT ARC DOWN AND RIGHT % BOX DRAWINGS LIGHT ARC DOWN AND LEFT % BOX DRAWINGS LIGHT ARC UP AND LEFT % BOX DRAWINGS LIGHT ARC UP AND RIGHT % BOX DRAWINGS LIGHT DIAGONAL UPPER RIGHT TO LOWER LEFT % BOX DRAWINGS LIGHT DIAGONAL UPPER LEFT TO LOWER RIGHT % BOX DRAWINGS LIGHT DIAGONAL CROSS % BOX DRAWINGS LIGHT LEFT AND HEAVY RIGHT % BOX DRAWINGS LIGHT UP AND HEAVY DOWN % BOX DRAWINGS HEAVY LEFT AND LIGHT RIGHT % BOX DRAWINGS HEAVY UP AND LIGHT DOWN % WHITE CIRCLE % WHITE BULLET % BLACK STAR % WHITE STAR % BALLOT BOX WITH X % SALTIRE % WHITE FROWNING FACE "" % WHITE SMILING FACE "" % BLACK SMILING FACE "" % MUSIC FLAT SIGN % MUSIC SHARP SIGN % UPPER BLADE SCISSORS "" % BLACK SCISSORS "" % LOWER BLADE SCISSORS "" % WHITE SCISSORS "" % VICTORY HAND % CHECK MARK % HEAVY CHECK MARK % MULTIPLICATION X % HEAVY MULTIPLICATION X % BALLOT X % HEAVY BALLOT X % OUTLINED GREEK CROSS % HEAVY GREEK CROSS % OPEN CENTRE CROSS % HEAVY OPEN CENTRE CROSS % LATIN CROSS % SHADOWED WHITE LATIN CROSS % OUTLINED LATIN CROSS % MALTESE CROSS % STAR OF DAVID % FOUR TEARDROP-SPOKED ASTERISK % FOUR BALLOON-SPOKED ASTERISK % HEAVY FOUR BALLOON-SPOKED ASTERISK % FOUR CLUB-SPOKED ASTERISK % BLACK FOUR POINTED STAR % 
WHITE FOUR POINTED STAR % STRESS OUTLINED WHITE STAR % CIRCLED WHITE STAR % OPEN CENTRE BLACK STAR % BLACK CENTRE WHITE STAR % OUTLINED BLACK STAR % HEAVY OUTLINED BLACK STAR % PINWHEEL STAR % SHADOWED WHITE STAR % HEAVY ASTERISK % OPEN CENTRE ASTERISK % EIGHT SPOKED ASTERISK % EIGHT POINTED BLACK STAR % EIGHT POINTED PINWHEEL STAR % SIX POINTED BLACK STAR % EIGHT POINTED RECTILINEAR BLACK STAR % HEAVY EIGHT POINTED RECTILINEAR BLACK STAR % TWELVE POINTED BLACK STAR % SIXTEEN POINTED ASTERISK % TEARDROP-SPOKED ASTERISK % OPEN CENTRE TEARDROP-SPOKED ASTERISK % HEAVY TEARDROP-SPOKED ASTERISK % SIX PETALLED BLACK AND WHITE FLORETTE % BLACK FLORETTE % WHITE FLORETTE % EIGHT PETALLED OUTLINED BLACK FLORETTE % CIRCLED OPEN CENTRE EIGHT POINTED STAR % HEAVY TEARDROP-SPOKED PINWHEEL ASTERISK % SNOWFLAKE % TIGHT TRIFOLIATE SNOWFLAKE % HEAVY CHEVRON SNOWFLAKE % SPARKLE % HEAVY SPARKLE % BALLOON-SPOKED ASTERISK % EIGHT TEARDROP-SPOKED PROPELLER ASTERISK % HEAVY EIGHT TEARDROP-SPOKED PROPELLER ASTERISK % LATIN SMALL LIGATURE FF "" % LATIN SMALL LIGATURE FI "" % LATIN SMALL LIGATURE FL "" % LATIN SMALL LIGATURE FFI "" % LATIN SMALL LIGATURE FFL "" % LATIN SMALL LIGATURE LONG S T "";"" % LATIN SMALL LIGATURE ST "" % ZERO WIDTH NO-BREAK SPACE "" % REPLACEMENT CHARACTER
translitcodec-version-0.7.0/transtab/transtab.changes
RCS file: RCS/transtab.utf,v
Working file: transtab.utf
head: 1.8
branch:
locks: strict
    mgk25: 1.8
access list:
symbolic names:
keyword substitution: kv
total revisions: 8; selected revisions: 8
description:
Transliteration table in ISO/IEC TR 14652 format
----------------------------
revision 1.8 locked by: mgk25;
date: 2000-10-12 11:01:28+01; author: mgk25; state: Exp; lines: +2 -0
RCS id added
----------------------------
revision 1.7
date: 2000-10-12 09:38:41+01; author: mgk25; state: Exp; lines: +4 -4
added ae->a
----------------------------
revision 1.6
date: 2000-10-10 09:13:10+01; author: mgk25; state: Exp; lines: +20 -20
Byrial Jensen added transliterations for Esperanto, such that C, G, H, J, S with circumflex are presented by the base character followed by an H.
----------------------------
revision 1.5
date: 2000-10-09 11:33:26+01; author: mgk25; state: Exp; lines: +2 -2
*** empty log message ***
----------------------------
revision 1.4
date: 2000-10-09 11:23:38+01; author: mgk25; state: Exp; lines: +541 -232
*** empty log message ***
----------------------------
revision 1.3
date: 2000-10-09 00:35:12+01; author: mgk25; state: Exp; lines: +372 -0
*** empty log message ***
----------------------------
revision 1.2
date: 2000-10-08 23:43:06+01; author: mgk25; state: Exp; lines: +88 -4
*** empty log message ***
----------------------------
revision 1.1
date: 2000-10-08 23:19:22+01; author: mgk25; state: Exp;
Initial revision
=============================================================================
translitcodec-version-0.7.0/transtab/transtab.missing-MES-2
% $Id: $ % CURRENCY SIGN % ¤ -> "" % LATIN CAPITAL LETTER SCHWA % Ə -> "" % LATIN CAPITAL LETTER EZH % Ʒ -> "" % LATIN CAPITAL LETTER A WITH DIAERESIS AND MACRON % Ǟ -> "" % LATIN SMALL LETTER A WITH DIAERESIS AND MACRON % ǟ -> "" % LATIN CAPITAL LETTER A WITH DOT ABOVE AND MACRON % Ǡ -> "" % LATIN SMALL LETTER A WITH DOT ABOVE AND MACRON % ǡ -> "" % LATIN CAPITAL LETTER AE WITH MACRON % Ǣ -> "" % LATIN SMALL LETTER AE WITH MACRON % ǣ -> "" % LATIN CAPITAL LETTER G WITH STROKE % Ǥ -> "" % LATIN SMALL LETTER G WITH STROKE % ǥ -> "" % LATIN CAPITAL LETTER G WITH CARON % Ǧ -> "" % LATIN SMALL LETTER G WITH CARON % ǧ -> "" % LATIN CAPITAL LETTER K WITH CARON % Ǩ -> "" % LATIN SMALL LETTER K WITH CARON % ǩ -> "" % LATIN CAPITAL LETTER O WITH OGONEK % Ǫ -> "" % LATIN SMALL LETTER O WITH OGONEK % ǫ -> "" % LATIN CAPITAL LETTER O WITH OGONEK AND MACRON % Ǭ -> "" % LATIN SMALL LETTER O WITH OGONEK AND
MACRON % ǭ -> "" % LATIN CAPITAL LETTER EZH WITH CARON % Ǯ -> "" % LATIN SMALL LETTER EZH WITH CARON % ǯ -> "" % LATIN CAPITAL LETTER A WITH RING ABOVE AND ACUTE % Ǻ -> "" % LATIN SMALL LETTER A WITH RING ABOVE AND ACUTE % ǻ -> "" % LATIN CAPITAL LETTER AE WITH ACUTE % Ǽ -> "" % LATIN SMALL LETTER AE WITH ACUTE % ǽ -> "" % LATIN CAPITAL LETTER O WITH STROKE AND ACUTE % Ǿ -> "" % LATIN SMALL LETTER O WITH STROKE AND ACUTE % ǿ -> "" % LATIN CAPITAL LETTER H WITH CARON % Ȟ -> "" % LATIN SMALL LETTER H WITH CARON % ȟ -> "" % LATIN SMALL LETTER SCHWA % ə -> "" % LATIN SMALL LETTER R WITH LONG LEG % ɼ -> "" % LATIN SMALL LETTER EZH % ʒ -> "" % CARON % ˇ -> "" % BREVE % ˘ -> "" % DOT ABOVE % ˙ -> "" % OGONEK % ˛ -> "" % MODIFIER LETTER DOUBLE APOSTROPHE % ˮ -> "" % GREEK YPOGEGRAMMENI % ͺ -> "" % GREEK TONOS % ΄ -> "" % GREEK DIALYTIKA TONOS % ΅ -> "" % GREEK CAPITAL LETTER ALPHA WITH TONOS % Ά -> "" % GREEK ANO TELEIA % · -> "" % GREEK CAPITAL LETTER EPSILON WITH TONOS % Έ -> "" % GREEK CAPITAL LETTER ETA WITH TONOS % Ή -> "" % GREEK CAPITAL LETTER IOTA WITH TONOS % Ί -> "" % GREEK CAPITAL LETTER OMICRON WITH TONOS % Ό -> "" % GREEK CAPITAL LETTER UPSILON WITH TONOS % Ύ -> "" % GREEK CAPITAL LETTER OMEGA WITH TONOS % Ώ -> "" % GREEK SMALL LETTER IOTA WITH DIALYTIKA AND TONOS % ΐ -> "" % GREEK CAPITAL LETTER ALPHA % Α -> "" % GREEK CAPITAL LETTER BETA % Β -> "" % GREEK CAPITAL LETTER GAMMA % Γ -> "" % GREEK CAPITAL LETTER DELTA % Δ -> "" % GREEK CAPITAL LETTER EPSILON % Ε -> "" % GREEK CAPITAL LETTER ZETA % Ζ -> "" % GREEK CAPITAL LETTER ETA % Η -> "" % GREEK CAPITAL LETTER THETA % Θ -> "" % GREEK CAPITAL LETTER IOTA % Ι -> "" % GREEK CAPITAL LETTER KAPPA % Κ -> "" % GREEK CAPITAL LETTER LAMDA % Λ -> "" % GREEK CAPITAL LETTER MU % Μ -> "" % GREEK CAPITAL LETTER NU % Ν -> "" % GREEK CAPITAL LETTER XI % Ξ -> "" % GREEK CAPITAL LETTER OMICRON % Ο -> "" % GREEK CAPITAL LETTER PI % Π -> "" % GREEK CAPITAL LETTER RHO % Ρ -> "" % GREEK CAPITAL LETTER SIGMA % Σ -> "" % GREEK 
CAPITAL LETTER TAU % Τ -> "" % GREEK CAPITAL LETTER UPSILON % Υ -> "" % GREEK CAPITAL LETTER PHI % Φ -> "" % GREEK CAPITAL LETTER CHI % Χ -> "" % GREEK CAPITAL LETTER PSI % Ψ -> "" % GREEK CAPITAL LETTER OMEGA % Ω -> "" % GREEK CAPITAL LETTER IOTA WITH DIALYTIKA % Ϊ -> "" % GREEK CAPITAL LETTER UPSILON WITH DIALYTIKA % Ϋ -> "" % GREEK SMALL LETTER ALPHA WITH TONOS % ά -> "" % GREEK SMALL LETTER EPSILON WITH TONOS % έ -> "" % GREEK SMALL LETTER ETA WITH TONOS % ή -> "" % GREEK SMALL LETTER IOTA WITH TONOS % ί -> "" % GREEK SMALL LETTER UPSILON WITH DIALYTIKA AND TONOS % ΰ -> "" % GREEK SMALL LETTER ALPHA % α -> "" % GREEK SMALL LETTER BETA % β -> "" % GREEK SMALL LETTER GAMMA % γ -> "" % GREEK SMALL LETTER DELTA % δ -> "" % GREEK SMALL LETTER EPSILON % ε -> "" % GREEK SMALL LETTER ZETA % ζ -> "" % GREEK SMALL LETTER ETA % η -> "" % GREEK SMALL LETTER THETA % θ -> "" % GREEK SMALL LETTER IOTA % ι -> "" % GREEK SMALL LETTER KAPPA % κ -> "" % GREEK SMALL LETTER LAMDA % λ -> "" % GREEK SMALL LETTER MU % μ -> "" % GREEK SMALL LETTER NU % ν -> "" % GREEK SMALL LETTER XI % ξ -> "" % GREEK SMALL LETTER OMICRON % ο -> "" % GREEK SMALL LETTER PI % π -> "" % GREEK SMALL LETTER RHO % ρ -> "" % GREEK SMALL LETTER FINAL SIGMA % ς -> "" % GREEK SMALL LETTER SIGMA % σ -> "" % GREEK SMALL LETTER TAU % τ -> "" % GREEK SMALL LETTER UPSILON % υ -> "" % GREEK SMALL LETTER PHI % φ -> "" % GREEK SMALL LETTER CHI % χ -> "" % GREEK SMALL LETTER PSI % ψ -> "" % GREEK SMALL LETTER OMEGA % ω -> "" % GREEK SMALL LETTER IOTA WITH DIALYTIKA % ϊ -> "" % GREEK SMALL LETTER UPSILON WITH DIALYTIKA % ϋ -> "" % GREEK SMALL LETTER OMICRON WITH TONOS % ό -> "" % GREEK SMALL LETTER UPSILON WITH TONOS % ύ -> "" % GREEK SMALL LETTER OMEGA WITH TONOS % ώ -> "" % GREEK KAI SYMBOL % ϗ -> "" % GREEK LETTER STIGMA % Ϛ -> "" % GREEK SMALL LETTER STIGMA % ϛ -> "" % GREEK LETTER DIGAMMA % Ϝ -> "" % GREEK SMALL LETTER DIGAMMA % ϝ -> "" % GREEK LETTER KOPPA % Ϟ -> "" % GREEK SMALL LETTER KOPPA % ϟ -> "" % GREEK 
LETTER SAMPI % Ϡ -> "" % GREEK SMALL LETTER SAMPI % ϡ -> "" % CYRILLIC CAPITAL LETTER IE WITH GRAVE % Ѐ -> "" % CYRILLIC CAPITAL LETTER IO % Ё -> "" % CYRILLIC CAPITAL LETTER DJE % Ђ -> "" % CYRILLIC CAPITAL LETTER GJE % Ѓ -> "" % CYRILLIC CAPITAL LETTER UKRAINIAN IE % Є -> "" % CYRILLIC CAPITAL LETTER DZE % Ѕ -> "" % CYRILLIC CAPITAL LETTER BYELORUSSIAN-UKRAINIAN I % І -> "" % CYRILLIC CAPITAL LETTER YI % Ї -> "" % CYRILLIC CAPITAL LETTER JE % Ј -> "" % CYRILLIC CAPITAL LETTER LJE % Љ -> "" % CYRILLIC CAPITAL LETTER NJE % Њ -> "" % CYRILLIC CAPITAL LETTER TSHE % Ћ -> "" % CYRILLIC CAPITAL LETTER KJE % Ќ -> "" % CYRILLIC CAPITAL LETTER I WITH GRAVE % Ѝ -> "" % CYRILLIC CAPITAL LETTER SHORT U % Ў -> "" % CYRILLIC CAPITAL LETTER DZHE % Џ -> "" % CYRILLIC CAPITAL LETTER A % А -> "" % CYRILLIC CAPITAL LETTER BE % Б -> "" % CYRILLIC CAPITAL LETTER VE % В -> "" % CYRILLIC CAPITAL LETTER GHE % Г -> "" % CYRILLIC CAPITAL LETTER DE % Д -> "" % CYRILLIC CAPITAL LETTER IE % Е -> "" % CYRILLIC CAPITAL LETTER ZHE % Ж -> "" % CYRILLIC CAPITAL LETTER ZE % З -> "" % CYRILLIC CAPITAL LETTER I % И -> "" % CYRILLIC CAPITAL LETTER SHORT I % Й -> "" % CYRILLIC CAPITAL LETTER KA % К -> "" % CYRILLIC CAPITAL LETTER EL % Л -> "" % CYRILLIC CAPITAL LETTER EM % М -> "" % CYRILLIC CAPITAL LETTER EN % Н -> "" % CYRILLIC CAPITAL LETTER O % О -> "" % CYRILLIC CAPITAL LETTER PE % П -> "" % CYRILLIC CAPITAL LETTER ER % Р -> "" % CYRILLIC CAPITAL LETTER ES % С -> "" % CYRILLIC CAPITAL LETTER TE % Т -> "" % CYRILLIC CAPITAL LETTER U % У -> "" % CYRILLIC CAPITAL LETTER EF % Ф -> "" % CYRILLIC CAPITAL LETTER HA % Х -> "" % CYRILLIC CAPITAL LETTER TSE % Ц -> "" % CYRILLIC CAPITAL LETTER CHE % Ч -> "" % CYRILLIC CAPITAL LETTER SHA % Ш -> "" % CYRILLIC CAPITAL LETTER SHCHA % Щ -> "" % CYRILLIC CAPITAL LETTER HARD SIGN % Ъ -> "" % CYRILLIC CAPITAL LETTER YERU % Ы -> "" % CYRILLIC CAPITAL LETTER SOFT SIGN % Ь -> "" % CYRILLIC CAPITAL LETTER E % Э -> "" % CYRILLIC CAPITAL LETTER YU % Ю -> "" % CYRILLIC 
CAPITAL LETTER YA % Я -> "" % CYRILLIC SMALL LETTER A % а -> "" % CYRILLIC SMALL LETTER BE % б -> "" % CYRILLIC SMALL LETTER VE % в -> "" % CYRILLIC SMALL LETTER GHE % г -> "" % CYRILLIC SMALL LETTER DE % д -> "" % CYRILLIC SMALL LETTER IE % е -> "" % CYRILLIC SMALL LETTER ZHE % ж -> "" % CYRILLIC SMALL LETTER ZE % з -> "" % CYRILLIC SMALL LETTER I % и -> "" % CYRILLIC SMALL LETTER SHORT I % й -> "" % CYRILLIC SMALL LETTER KA % к -> "" % CYRILLIC SMALL LETTER EL % л -> "" % CYRILLIC SMALL LETTER EM % м -> "" % CYRILLIC SMALL LETTER EN % н -> "" % CYRILLIC SMALL LETTER O % о -> "" % CYRILLIC SMALL LETTER PE % п -> "" % CYRILLIC SMALL LETTER ER % р -> "" % CYRILLIC SMALL LETTER ES % с -> "" % CYRILLIC SMALL LETTER TE % т -> "" % CYRILLIC SMALL LETTER U % у -> "" % CYRILLIC SMALL LETTER EF % ф -> "" % CYRILLIC SMALL LETTER HA % х -> "" % CYRILLIC SMALL LETTER TSE % ц -> "" % CYRILLIC SMALL LETTER CHE % ч -> "" % CYRILLIC SMALL LETTER SHA % ш -> "" % CYRILLIC SMALL LETTER SHCHA % щ -> "" % CYRILLIC SMALL LETTER HARD SIGN % ъ -> "" % CYRILLIC SMALL LETTER YERU % ы -> "" % CYRILLIC SMALL LETTER SOFT SIGN % ь -> "" % CYRILLIC SMALL LETTER E % э -> "" % CYRILLIC SMALL LETTER YU % ю -> "" % CYRILLIC SMALL LETTER YA % я -> "" % CYRILLIC SMALL LETTER IE WITH GRAVE % ѐ -> "" % CYRILLIC SMALL LETTER IO % ё -> "" % CYRILLIC SMALL LETTER DJE % ђ -> "" % CYRILLIC SMALL LETTER GJE % ѓ -> "" % CYRILLIC SMALL LETTER UKRAINIAN IE % є -> "" % CYRILLIC SMALL LETTER DZE % ѕ -> "" % CYRILLIC SMALL LETTER BYELORUSSIAN-UKRAINIAN I % і -> "" % CYRILLIC SMALL LETTER YI % ї -> "" % CYRILLIC SMALL LETTER JE % ј -> "" % CYRILLIC SMALL LETTER LJE % љ -> "" % CYRILLIC SMALL LETTER NJE % њ -> "" % CYRILLIC SMALL LETTER TSHE % ћ -> "" % CYRILLIC SMALL LETTER KJE % ќ -> "" % CYRILLIC SMALL LETTER I WITH GRAVE % ѝ -> "" % CYRILLIC SMALL LETTER SHORT U % ў -> "" % CYRILLIC SMALL LETTER DZHE % џ -> "" % CYRILLIC CAPITAL LETTER GHE WITH UPTURN % Ґ -> "" % CYRILLIC SMALL LETTER GHE WITH UPTURN % ґ -> "" % 
CYRILLIC CAPITAL LETTER GHE WITH STROKE % Ғ -> "" % CYRILLIC SMALL LETTER GHE WITH STROKE % ғ -> "" % CYRILLIC CAPITAL LETTER GHE WITH MIDDLE HOOK % Ҕ -> "" % CYRILLIC SMALL LETTER GHE WITH MIDDLE HOOK % ҕ -> "" % CYRILLIC CAPITAL LETTER ZHE WITH DESCENDER % Җ -> "" % CYRILLIC SMALL LETTER ZHE WITH DESCENDER % җ -> "" % CYRILLIC CAPITAL LETTER ZE WITH DESCENDER % Ҙ -> "" % CYRILLIC SMALL LETTER ZE WITH DESCENDER % ҙ -> "" % CYRILLIC CAPITAL LETTER KA WITH DESCENDER % Қ -> "" % CYRILLIC SMALL LETTER KA WITH DESCENDER % қ -> "" % CYRILLIC CAPITAL LETTER KA WITH VERTICAL STROKE % Ҝ -> "" % CYRILLIC SMALL LETTER KA WITH VERTICAL STROKE % ҝ -> "" % CYRILLIC CAPITAL LETTER KA WITH STROKE % Ҟ -> "" % CYRILLIC SMALL LETTER KA WITH STROKE % ҟ -> "" % CYRILLIC CAPITAL LETTER BASHKIR KA % Ҡ -> "" % CYRILLIC SMALL LETTER BASHKIR KA % ҡ -> "" % CYRILLIC CAPITAL LETTER EN WITH DESCENDER % Ң -> "" % CYRILLIC SMALL LETTER EN WITH DESCENDER % ң -> "" % CYRILLIC CAPITAL LIGATURE EN GHE % Ҥ -> "" % CYRILLIC SMALL LIGATURE EN GHE % ҥ -> "" % CYRILLIC CAPITAL LETTER PE WITH MIDDLE HOOK % Ҧ -> "" % CYRILLIC SMALL LETTER PE WITH MIDDLE HOOK % ҧ -> "" % CYRILLIC CAPITAL LETTER ABKHASIAN HA % Ҩ -> "" % CYRILLIC SMALL LETTER ABKHASIAN HA % ҩ -> "" % CYRILLIC CAPITAL LETTER ES WITH DESCENDER % Ҫ -> "" % CYRILLIC SMALL LETTER ES WITH DESCENDER % ҫ -> "" % CYRILLIC CAPITAL LETTER TE WITH DESCENDER % Ҭ -> "" % CYRILLIC SMALL LETTER TE WITH DESCENDER % ҭ -> "" % CYRILLIC CAPITAL LETTER STRAIGHT U % Ү -> "" % CYRILLIC SMALL LETTER STRAIGHT U % ү -> "" % CYRILLIC CAPITAL LETTER STRAIGHT U WITH STROKE % Ұ -> "" % CYRILLIC SMALL LETTER STRAIGHT U WITH STROKE % ұ -> "" % CYRILLIC CAPITAL LETTER HA WITH DESCENDER % Ҳ -> "" % CYRILLIC SMALL LETTER HA WITH DESCENDER % ҳ -> "" % CYRILLIC CAPITAL LIGATURE TE TSE % Ҵ -> "" % CYRILLIC SMALL LIGATURE TE TSE % ҵ -> "" % CYRILLIC CAPITAL LETTER CHE WITH DESCENDER % Ҷ -> "" % CYRILLIC SMALL LETTER CHE WITH DESCENDER % ҷ -> "" % CYRILLIC CAPITAL LETTER CHE WITH 
VERTICAL STROKE % Ҹ -> "" % CYRILLIC SMALL LETTER CHE WITH VERTICAL STROKE % ҹ -> "" % CYRILLIC CAPITAL LETTER SHHA % Һ -> "" % CYRILLIC SMALL LETTER SHHA % һ -> "" % CYRILLIC CAPITAL LETTER ABKHASIAN CHE % Ҽ -> "" % CYRILLIC SMALL LETTER ABKHASIAN CHE % ҽ -> "" % CYRILLIC CAPITAL LETTER ABKHASIAN CHE WITH DESCENDER % Ҿ -> "" % CYRILLIC SMALL LETTER ABKHASIAN CHE WITH DESCENDER % ҿ -> "" % CYRILLIC LETTER PALOCHKA % Ӏ -> "" % CYRILLIC CAPITAL LETTER ZHE WITH BREVE % Ӂ -> "" % CYRILLIC SMALL LETTER ZHE WITH BREVE % ӂ -> "" % CYRILLIC CAPITAL LETTER KA WITH HOOK % Ӄ -> "" % CYRILLIC SMALL LETTER KA WITH HOOK % ӄ -> "" % CYRILLIC CAPITAL LETTER EN WITH HOOK % Ӈ -> "" % CYRILLIC SMALL LETTER EN WITH HOOK % ӈ -> "" % CYRILLIC CAPITAL LETTER KHAKASSIAN CHE % Ӌ -> "" % CYRILLIC SMALL LETTER KHAKASSIAN CHE % ӌ -> "" % CYRILLIC CAPITAL LETTER A WITH BREVE % Ӑ -> "" % CYRILLIC SMALL LETTER A WITH BREVE % ӑ -> "" % CYRILLIC CAPITAL LETTER A WITH DIAERESIS % Ӓ -> "" % CYRILLIC SMALL LETTER A WITH DIAERESIS % ӓ -> "" % CYRILLIC CAPITAL LIGATURE A IE % Ӕ -> "" % CYRILLIC SMALL LIGATURE A IE % ӕ -> "" % CYRILLIC CAPITAL LETTER IE WITH BREVE % Ӗ -> "" % CYRILLIC SMALL LETTER IE WITH BREVE % ӗ -> "" % CYRILLIC CAPITAL LETTER SCHWA % Ә -> "" % CYRILLIC SMALL LETTER SCHWA % ә -> "" % CYRILLIC CAPITAL LETTER SCHWA WITH DIAERESIS % Ӛ -> "" % CYRILLIC SMALL LETTER SCHWA WITH DIAERESIS % ӛ -> "" % CYRILLIC CAPITAL LETTER ZHE WITH DIAERESIS % Ӝ -> "" % CYRILLIC SMALL LETTER ZHE WITH DIAERESIS % ӝ -> "" % CYRILLIC CAPITAL LETTER ZE WITH DIAERESIS % Ӟ -> "" % CYRILLIC SMALL LETTER ZE WITH DIAERESIS % ӟ -> "" % CYRILLIC CAPITAL LETTER ABKHASIAN DZE % Ӡ -> "" % CYRILLIC SMALL LETTER ABKHASIAN DZE % ӡ -> "" % CYRILLIC CAPITAL LETTER I WITH MACRON % Ӣ -> "" % CYRILLIC SMALL LETTER I WITH MACRON % ӣ -> "" % CYRILLIC CAPITAL LETTER I WITH DIAERESIS % Ӥ -> "" % CYRILLIC SMALL LETTER I WITH DIAERESIS % ӥ -> "" % CYRILLIC CAPITAL LETTER O WITH DIAERESIS % Ӧ -> "" % CYRILLIC SMALL LETTER O WITH 
DIAERESIS % ӧ -> "" % CYRILLIC CAPITAL LETTER BARRED O % Ө -> "" % CYRILLIC SMALL LETTER BARRED O % ө -> "" % CYRILLIC CAPITAL LETTER BARRED O WITH DIAERESIS % Ӫ -> "" % CYRILLIC SMALL LETTER BARRED O WITH DIAERESIS % ӫ -> "" % CYRILLIC CAPITAL LETTER U WITH MACRON % Ӯ -> "" % CYRILLIC SMALL LETTER U WITH MACRON % ӯ -> "" % CYRILLIC CAPITAL LETTER U WITH DIAERESIS % Ӱ -> "" % CYRILLIC SMALL LETTER U WITH DIAERESIS % ӱ -> "" % CYRILLIC CAPITAL LETTER U WITH DOUBLE ACUTE % Ӳ -> "" % CYRILLIC SMALL LETTER U WITH DOUBLE ACUTE % ӳ -> "" % CYRILLIC CAPITAL LETTER CHE WITH DIAERESIS % Ӵ -> "" % CYRILLIC SMALL LETTER CHE WITH DIAERESIS % ӵ -> "" % CYRILLIC CAPITAL LETTER YERU WITH DIAERESIS % Ӹ -> "" % CYRILLIC SMALL LETTER YERU WITH DIAERESIS % ӹ -> "" % LATIN SMALL LETTER LONG S WITH DOT ABOVE % ẛ -> "" % GREEK SMALL LETTER ALPHA WITH PSILI % ἀ -> "" % GREEK SMALL LETTER ALPHA WITH DASIA % ἁ -> "" % GREEK SMALL LETTER ALPHA WITH PSILI AND VARIA % ἂ -> "" % GREEK SMALL LETTER ALPHA WITH DASIA AND VARIA % ἃ -> "" % GREEK SMALL LETTER ALPHA WITH PSILI AND OXIA % ἄ -> "" % GREEK SMALL LETTER ALPHA WITH DASIA AND OXIA % ἅ -> "" % GREEK SMALL LETTER ALPHA WITH PSILI AND PERISPOMENI % ἆ -> "" % GREEK SMALL LETTER ALPHA WITH DASIA AND PERISPOMENI % ἇ -> "" % GREEK CAPITAL LETTER ALPHA WITH PSILI % Ἀ -> "" % GREEK CAPITAL LETTER ALPHA WITH DASIA % Ἁ -> "" % GREEK CAPITAL LETTER ALPHA WITH PSILI AND VARIA % Ἂ -> "" % GREEK CAPITAL LETTER ALPHA WITH DASIA AND VARIA % Ἃ -> "" % GREEK CAPITAL LETTER ALPHA WITH PSILI AND OXIA % Ἄ -> "" % GREEK CAPITAL LETTER ALPHA WITH DASIA AND OXIA % Ἅ -> "" % GREEK CAPITAL LETTER ALPHA WITH PSILI AND PERISPOMENI % Ἆ -> "" % GREEK CAPITAL LETTER ALPHA WITH DASIA AND PERISPOMENI % Ἇ -> "" % GREEK SMALL LETTER EPSILON WITH PSILI % ἐ -> "" % GREEK SMALL LETTER EPSILON WITH DASIA % ἑ -> "" % GREEK SMALL LETTER EPSILON WITH PSILI AND VARIA % ἒ -> "" % GREEK SMALL LETTER EPSILON WITH DASIA AND VARIA % ἓ -> "" % GREEK SMALL LETTER EPSILON WITH PSILI AND 
OXIA % ἔ -> "" % GREEK SMALL LETTER EPSILON WITH DASIA AND OXIA % ἕ -> "" % GREEK CAPITAL LETTER EPSILON WITH PSILI % Ἐ -> "" % GREEK CAPITAL LETTER EPSILON WITH DASIA % Ἑ -> "" % GREEK CAPITAL LETTER EPSILON WITH PSILI AND VARIA % Ἒ -> "" % GREEK CAPITAL LETTER EPSILON WITH DASIA AND VARIA % Ἓ -> "" % GREEK CAPITAL LETTER EPSILON WITH PSILI AND OXIA % Ἔ -> "" % GREEK CAPITAL LETTER EPSILON WITH DASIA AND OXIA % Ἕ -> "" % GREEK SMALL LETTER ETA WITH PSILI % ἠ -> "" % GREEK SMALL LETTER ETA WITH DASIA % ἡ -> "" % GREEK SMALL LETTER ETA WITH PSILI AND VARIA % ἢ -> "" % GREEK SMALL LETTER ETA WITH DASIA AND VARIA % ἣ -> "" % GREEK SMALL LETTER ETA WITH PSILI AND OXIA % ἤ -> "" % GREEK SMALL LETTER ETA WITH DASIA AND OXIA % ἥ -> "" % GREEK SMALL LETTER ETA WITH PSILI AND PERISPOMENI % ἦ -> "" % GREEK SMALL LETTER ETA WITH DASIA AND PERISPOMENI % ἧ -> "" % GREEK CAPITAL LETTER ETA WITH PSILI % Ἠ -> "" % GREEK CAPITAL LETTER ETA WITH DASIA % Ἡ -> "" % GREEK CAPITAL LETTER ETA WITH PSILI AND VARIA % Ἢ -> "" % GREEK CAPITAL LETTER ETA WITH DASIA AND VARIA % Ἣ -> "" % GREEK CAPITAL LETTER ETA WITH PSILI AND OXIA % Ἤ -> "" % GREEK CAPITAL LETTER ETA WITH DASIA AND OXIA % Ἥ -> "" % GREEK CAPITAL LETTER ETA WITH PSILI AND PERISPOMENI % Ἦ -> "" % GREEK CAPITAL LETTER ETA WITH DASIA AND PERISPOMENI % Ἧ -> "" % GREEK SMALL LETTER IOTA WITH PSILI % ἰ -> "" % GREEK SMALL LETTER IOTA WITH DASIA % ἱ -> "" % GREEK SMALL LETTER IOTA WITH PSILI AND VARIA % ἲ -> "" % GREEK SMALL LETTER IOTA WITH DASIA AND VARIA % ἳ -> "" % GREEK SMALL LETTER IOTA WITH PSILI AND OXIA % ἴ -> "" % GREEK SMALL LETTER IOTA WITH DASIA AND OXIA % ἵ -> "" % GREEK SMALL LETTER IOTA WITH PSILI AND PERISPOMENI % ἶ -> "" % GREEK SMALL LETTER IOTA WITH DASIA AND PERISPOMENI % ἷ -> "" % GREEK CAPITAL LETTER IOTA WITH PSILI % Ἰ -> "" % GREEK CAPITAL LETTER IOTA WITH DASIA % Ἱ -> "" % GREEK CAPITAL LETTER IOTA WITH PSILI AND VARIA % Ἲ -> "" % GREEK CAPITAL LETTER IOTA WITH DASIA AND VARIA % Ἳ -> "" % GREEK CAPITAL 
LETTER IOTA WITH PSILI AND OXIA % Ἴ -> "" % GREEK CAPITAL LETTER IOTA WITH DASIA AND OXIA % Ἵ -> "" % GREEK CAPITAL LETTER IOTA WITH PSILI AND PERISPOMENI % Ἶ -> "" % GREEK CAPITAL LETTER IOTA WITH DASIA AND PERISPOMENI % Ἷ -> "" % GREEK SMALL LETTER OMICRON WITH PSILI % ὀ -> "" % GREEK SMALL LETTER OMICRON WITH DASIA % ὁ -> "" % GREEK SMALL LETTER OMICRON WITH PSILI AND VARIA % ὂ -> "" % GREEK SMALL LETTER OMICRON WITH DASIA AND VARIA % ὃ -> "" % GREEK SMALL LETTER OMICRON WITH PSILI AND OXIA % ὄ -> "" % GREEK SMALL LETTER OMICRON WITH DASIA AND OXIA % ὅ -> "" % GREEK CAPITAL LETTER OMICRON WITH PSILI % Ὀ -> "" % GREEK CAPITAL LETTER OMICRON WITH DASIA % Ὁ -> "" % GREEK CAPITAL LETTER OMICRON WITH PSILI AND VARIA % Ὂ -> "" % GREEK CAPITAL LETTER OMICRON WITH DASIA AND VARIA % Ὃ -> "" % GREEK CAPITAL LETTER OMICRON WITH PSILI AND OXIA % Ὄ -> "" % GREEK CAPITAL LETTER OMICRON WITH DASIA AND OXIA % Ὅ -> "" % GREEK SMALL LETTER UPSILON WITH PSILI % ὐ -> "" % GREEK SMALL LETTER UPSILON WITH DASIA % ὑ -> "" % GREEK SMALL LETTER UPSILON WITH PSILI AND VARIA % ὒ -> "" % GREEK SMALL LETTER UPSILON WITH DASIA AND VARIA % ὓ -> "" % GREEK SMALL LETTER UPSILON WITH PSILI AND OXIA % ὔ -> "" % GREEK SMALL LETTER UPSILON WITH DASIA AND OXIA % ὕ -> "" % GREEK SMALL LETTER UPSILON WITH PSILI AND PERISPOMENI % ὖ -> "" % GREEK SMALL LETTER UPSILON WITH DASIA AND PERISPOMENI % ὗ -> "" % GREEK CAPITAL LETTER UPSILON WITH DASIA % Ὑ -> "" % GREEK CAPITAL LETTER UPSILON WITH DASIA AND VARIA % Ὓ -> "" % GREEK CAPITAL LETTER UPSILON WITH DASIA AND OXIA % Ὕ -> "" % GREEK CAPITAL LETTER UPSILON WITH DASIA AND PERISPOMENI % Ὗ -> "" % GREEK SMALL LETTER OMEGA WITH PSILI % ὠ -> "" % GREEK SMALL LETTER OMEGA WITH DASIA % ὡ -> "" % GREEK SMALL LETTER OMEGA WITH PSILI AND VARIA % ὢ -> "" % GREEK SMALL LETTER OMEGA WITH DASIA AND VARIA % ὣ -> "" % GREEK SMALL LETTER OMEGA WITH PSILI AND OXIA % ὤ -> "" % GREEK SMALL LETTER OMEGA WITH DASIA AND OXIA % ὥ -> "" % GREEK SMALL LETTER OMEGA WITH PSILI AND 
PERISPOMENI % ὦ -> "" % GREEK SMALL LETTER OMEGA WITH DASIA AND PERISPOMENI % ὧ -> "" % GREEK CAPITAL LETTER OMEGA WITH PSILI % Ὠ -> "" % GREEK CAPITAL LETTER OMEGA WITH DASIA % Ὡ -> "" % GREEK CAPITAL LETTER OMEGA WITH PSILI AND VARIA % Ὢ -> "" % GREEK CAPITAL LETTER OMEGA WITH DASIA AND VARIA % Ὣ -> "" % GREEK CAPITAL LETTER OMEGA WITH PSILI AND OXIA % Ὤ -> "" % GREEK CAPITAL LETTER OMEGA WITH DASIA AND OXIA % Ὥ -> "" % GREEK CAPITAL LETTER OMEGA WITH PSILI AND PERISPOMENI % Ὦ -> "" % GREEK CAPITAL LETTER OMEGA WITH DASIA AND PERISPOMENI % Ὧ -> "" % GREEK SMALL LETTER ALPHA WITH VARIA % ὰ -> "" % GREEK SMALL LETTER ALPHA WITH OXIA % ά -> "" % GREEK SMALL LETTER EPSILON WITH VARIA % ὲ -> "" % GREEK SMALL LETTER EPSILON WITH OXIA % έ -> "" % GREEK SMALL LETTER ETA WITH VARIA % ὴ -> "" % GREEK SMALL LETTER ETA WITH OXIA % ή -> "" % GREEK SMALL LETTER IOTA WITH VARIA % ὶ -> "" % GREEK SMALL LETTER IOTA WITH OXIA % ί -> "" % GREEK SMALL LETTER OMICRON WITH VARIA % ὸ -> "" % GREEK SMALL LETTER OMICRON WITH OXIA % ό -> "" % GREEK SMALL LETTER UPSILON WITH VARIA % ὺ -> "" % GREEK SMALL LETTER UPSILON WITH OXIA % ύ -> "" % GREEK SMALL LETTER OMEGA WITH VARIA % ὼ -> "" % GREEK SMALL LETTER OMEGA WITH OXIA % ώ -> "" % GREEK SMALL LETTER ALPHA WITH PSILI AND YPOGEGRAMMENI % ᾀ -> "" % GREEK SMALL LETTER ALPHA WITH DASIA AND YPOGEGRAMMENI % ᾁ -> "" % GREEK SMALL LETTER ALPHA WITH PSILI AND VARIA AND YPOGEGRAMMENI % ᾂ -> "" % GREEK SMALL LETTER ALPHA WITH DASIA AND VARIA AND YPOGEGRAMMENI % ᾃ -> "" % GREEK SMALL LETTER ALPHA WITH PSILI AND OXIA AND YPOGEGRAMMENI % ᾄ -> "" % GREEK SMALL LETTER ALPHA WITH DASIA AND OXIA AND YPOGEGRAMMENI % ᾅ -> "" % GREEK SMALL LETTER ALPHA WITH PSILI AND PERISPOMENI AND YPOGEGRAMMENI % ᾆ -> "" % GREEK SMALL LETTER ALPHA WITH DASIA AND PERISPOMENI AND YPOGEGRAMMENI % ᾇ -> "" % GREEK CAPITAL LETTER ALPHA WITH PSILI AND PROSGEGRAMMENI % ᾈ -> "" % GREEK CAPITAL LETTER ALPHA WITH DASIA AND PROSGEGRAMMENI % ᾉ -> "" % GREEK CAPITAL LETTER ALPHA WITH 
PSILI AND VARIA AND PROSGEGRAMMENI % ᾊ -> "" % GREEK CAPITAL LETTER ALPHA WITH DASIA AND VARIA AND PROSGEGRAMMENI % ᾋ -> "" % GREEK CAPITAL LETTER ALPHA WITH PSILI AND OXIA AND PROSGEGRAMMENI % ᾌ -> "" % GREEK CAPITAL LETTER ALPHA WITH DASIA AND OXIA AND PROSGEGRAMMENI % ᾍ -> "" % GREEK CAPITAL LETTER ALPHA WITH PSILI AND PERISPOMENI AND PROSGEGRAMMENI % ᾎ -> "" % GREEK CAPITAL LETTER ALPHA WITH DASIA AND PERISPOMENI AND PROSGEGRAMMENI % ᾏ -> "" % GREEK SMALL LETTER ETA WITH PSILI AND YPOGEGRAMMENI % ᾐ -> "" % GREEK SMALL LETTER ETA WITH DASIA AND YPOGEGRAMMENI % ᾑ -> "" % GREEK SMALL LETTER ETA WITH PSILI AND VARIA AND YPOGEGRAMMENI % ᾒ -> "" % GREEK SMALL LETTER ETA WITH DASIA AND VARIA AND YPOGEGRAMMENI % ᾓ -> "" % GREEK SMALL LETTER ETA WITH PSILI AND OXIA AND YPOGEGRAMMENI % ᾔ -> "" % GREEK SMALL LETTER ETA WITH DASIA AND OXIA AND YPOGEGRAMMENI % ᾕ -> "" % GREEK SMALL LETTER ETA WITH PSILI AND PERISPOMENI AND YPOGEGRAMMENI % ᾖ -> "" % GREEK SMALL LETTER ETA WITH DASIA AND PERISPOMENI AND YPOGEGRAMMENI % ᾗ -> "" % GREEK CAPITAL LETTER ETA WITH PSILI AND PROSGEGRAMMENI % ᾘ -> "" % GREEK CAPITAL LETTER ETA WITH DASIA AND PROSGEGRAMMENI % ᾙ -> "" % GREEK CAPITAL LETTER ETA WITH PSILI AND VARIA AND PROSGEGRAMMENI % ᾚ -> "" % GREEK CAPITAL LETTER ETA WITH DASIA AND VARIA AND PROSGEGRAMMENI % ᾛ -> "" % GREEK CAPITAL LETTER ETA WITH PSILI AND OXIA AND PROSGEGRAMMENI % ᾜ -> "" % GREEK CAPITAL LETTER ETA WITH DASIA AND OXIA AND PROSGEGRAMMENI % ᾝ -> "" % GREEK CAPITAL LETTER ETA WITH PSILI AND PERISPOMENI AND PROSGEGRAMMENI % ᾞ -> "" % GREEK CAPITAL LETTER ETA WITH DASIA AND PERISPOMENI AND PROSGEGRAMMENI % ᾟ -> "" % GREEK SMALL LETTER OMEGA WITH PSILI AND YPOGEGRAMMENI % ᾠ -> "" % GREEK SMALL LETTER OMEGA WITH DASIA AND YPOGEGRAMMENI % ᾡ -> "" % GREEK SMALL LETTER OMEGA WITH PSILI AND VARIA AND YPOGEGRAMMENI % ᾢ -> "" % GREEK SMALL LETTER OMEGA WITH DASIA AND VARIA AND YPOGEGRAMMENI % ᾣ -> "" % GREEK SMALL LETTER OMEGA WITH PSILI AND OXIA AND YPOGEGRAMMENI % ᾤ -> "" % 
GREEK SMALL LETTER OMEGA WITH DASIA AND OXIA AND YPOGEGRAMMENI % ᾥ -> "" % GREEK SMALL LETTER OMEGA WITH PSILI AND PERISPOMENI AND YPOGEGRAMMENI % ᾦ -> "" % GREEK SMALL LETTER OMEGA WITH DASIA AND PERISPOMENI AND YPOGEGRAMMENI % ᾧ -> "" % GREEK CAPITAL LETTER OMEGA WITH PSILI AND PROSGEGRAMMENI % ᾨ -> "" % GREEK CAPITAL LETTER OMEGA WITH DASIA AND PROSGEGRAMMENI % ᾩ -> "" % GREEK CAPITAL LETTER OMEGA WITH PSILI AND VARIA AND PROSGEGRAMMENI % ᾪ -> "" % GREEK CAPITAL LETTER OMEGA WITH DASIA AND VARIA AND PROSGEGRAMMENI % ᾫ -> "" % GREEK CAPITAL LETTER OMEGA WITH PSILI AND OXIA AND PROSGEGRAMMENI % ᾬ -> "" % GREEK CAPITAL LETTER OMEGA WITH DASIA AND OXIA AND PROSGEGRAMMENI % ᾭ -> "" % GREEK CAPITAL LETTER OMEGA WITH PSILI AND PERISPOMENI AND PROSGEGRAMMENI % ᾮ -> "" % GREEK CAPITAL LETTER OMEGA WITH DASIA AND PERISPOMENI AND PROSGEGRAMMENI % ᾯ -> "" % GREEK SMALL LETTER ALPHA WITH VRACHY % ᾰ -> "" % GREEK SMALL LETTER ALPHA WITH MACRON % ᾱ -> "" % GREEK SMALL LETTER ALPHA WITH VARIA AND YPOGEGRAMMENI % ᾲ -> "" % GREEK SMALL LETTER ALPHA WITH YPOGEGRAMMENI % ᾳ -> "" % GREEK SMALL LETTER ALPHA WITH OXIA AND YPOGEGRAMMENI % ᾴ -> "" % GREEK SMALL LETTER ALPHA WITH PERISPOMENI % ᾶ -> "" % GREEK SMALL LETTER ALPHA WITH PERISPOMENI AND YPOGEGRAMMENI % ᾷ -> "" % GREEK CAPITAL LETTER ALPHA WITH VRACHY % Ᾰ -> "" % GREEK CAPITAL LETTER ALPHA WITH MACRON % Ᾱ -> "" % GREEK CAPITAL LETTER ALPHA WITH VARIA % Ὰ -> "" % GREEK CAPITAL LETTER ALPHA WITH OXIA % Ά -> "" % GREEK CAPITAL LETTER ALPHA WITH PROSGEGRAMMENI % ᾼ -> "" % GREEK KORONIS % ᾽ -> "" % GREEK PROSGEGRAMMENI % ι -> "" % GREEK PSILI % ᾿ -> "" % GREEK PERISPOMENI % ῀ -> "" % GREEK DIALYTIKA AND PERISPOMENI % ῁ -> "" % GREEK SMALL LETTER ETA WITH VARIA AND YPOGEGRAMMENI % ῂ -> "" % GREEK SMALL LETTER ETA WITH YPOGEGRAMMENI % ῃ -> "" % GREEK SMALL LETTER ETA WITH OXIA AND YPOGEGRAMMENI % ῄ -> "" % GREEK SMALL LETTER ETA WITH PERISPOMENI % ῆ -> "" % GREEK SMALL LETTER ETA WITH PERISPOMENI AND YPOGEGRAMMENI % ῇ -> "" % GREEK 
CAPITAL LETTER EPSILON WITH VARIA % Ὲ -> "" % GREEK CAPITAL LETTER EPSILON WITH OXIA % Έ -> "" % GREEK CAPITAL LETTER ETA WITH VARIA % Ὴ -> "" % GREEK CAPITAL LETTER ETA WITH OXIA % Ή -> "" % GREEK CAPITAL LETTER ETA WITH PROSGEGRAMMENI % ῌ -> "" % GREEK PSILI AND VARIA % ῍ -> "" % GREEK PSILI AND OXIA % ῎ -> "" % GREEK PSILI AND PERISPOMENI % ῏ -> "" % GREEK SMALL LETTER IOTA WITH VRACHY % ῐ -> "" % GREEK SMALL LETTER IOTA WITH MACRON % ῑ -> "" % GREEK SMALL LETTER IOTA WITH DIALYTIKA AND VARIA % ῒ -> "" % GREEK SMALL LETTER IOTA WITH DIALYTIKA AND OXIA % ΐ -> "" % GREEK SMALL LETTER IOTA WITH PERISPOMENI % ῖ -> "" % GREEK SMALL LETTER IOTA WITH DIALYTIKA AND PERISPOMENI % ῗ -> "" % GREEK CAPITAL LETTER IOTA WITH VRACHY % Ῐ -> "" % GREEK CAPITAL LETTER IOTA WITH MACRON % Ῑ -> "" % GREEK CAPITAL LETTER IOTA WITH VARIA % Ὶ -> "" % GREEK CAPITAL LETTER IOTA WITH OXIA % Ί -> "" % GREEK DASIA AND VARIA % ῝ -> "" % GREEK DASIA AND OXIA % ῞ -> "" % GREEK DASIA AND PERISPOMENI % ῟ -> "" % GREEK SMALL LETTER UPSILON WITH VRACHY % ῠ -> "" % GREEK SMALL LETTER UPSILON WITH MACRON % ῡ -> "" % GREEK SMALL LETTER UPSILON WITH DIALYTIKA AND VARIA % ῢ -> "" % GREEK SMALL LETTER UPSILON WITH DIALYTIKA AND OXIA % ΰ -> "" % GREEK SMALL LETTER RHO WITH PSILI % ῤ -> "" % GREEK SMALL LETTER RHO WITH DASIA % ῥ -> "" % GREEK SMALL LETTER UPSILON WITH PERISPOMENI % ῦ -> "" % GREEK SMALL LETTER UPSILON WITH DIALYTIKA AND PERISPOMENI % ῧ -> "" % GREEK CAPITAL LETTER UPSILON WITH VRACHY % Ῠ -> "" % GREEK CAPITAL LETTER UPSILON WITH MACRON % Ῡ -> "" % GREEK CAPITAL LETTER UPSILON WITH VARIA % Ὺ -> "" % GREEK CAPITAL LETTER UPSILON WITH OXIA % Ύ -> "" % GREEK CAPITAL LETTER RHO WITH DASIA % Ῥ -> "" % GREEK DIALYTIKA AND VARIA % ῭ -> "" % GREEK DIALYTIKA AND OXIA % ΅ -> "" % GREEK VARIA % ` -> "" % GREEK SMALL LETTER OMEGA WITH VARIA AND YPOGEGRAMMENI % ῲ -> "" % GREEK SMALL LETTER OMEGA WITH YPOGEGRAMMENI % ῳ -> "" % GREEK SMALL LETTER OMEGA WITH OXIA AND YPOGEGRAMMENI % ῴ -> "" % GREEK SMALL 
LETTER OMEGA WITH PERISPOMENI % ῶ -> "" % GREEK SMALL LETTER OMEGA WITH PERISPOMENI AND YPOGEGRAMMENI % ῷ -> "" % GREEK CAPITAL LETTER OMICRON WITH VARIA % Ὸ -> "" % GREEK CAPITAL LETTER OMICRON WITH OXIA % Ό -> "" % GREEK CAPITAL LETTER OMEGA WITH VARIA % Ὼ -> "" % GREEK CAPITAL LETTER OMEGA WITH OXIA % Ώ -> "" % GREEK CAPITAL LETTER OMEGA WITH PROSGEGRAMMENI % ῼ -> "" % GREEK OXIA % ´ -> "" % GREEK DASIA % ῾ -> "" % FRENCH FRANC SIGN % ₣ -> "" % LIRA SIGN % ₤ -> "" % PESETA SIGN % ₧ -> "" % DRACHMA SIGN % ₯ -> "" % UP DOWN ARROW % ↕ -> "" % UP DOWN ARROW WITH BASE % ↨ -> "" % FOR ALL % ∀ -> "" % PARTIAL DIFFERENTIAL % ∂ -> "" % THERE EXISTS % ∃ -> "" % INCREMENT % ∆ -> "" % ELEMENT OF % ∈ -> "" % NOT AN ELEMENT OF % ∉ -> "" % N-ARY PRODUCT % ∏ -> "" % N-ARY SUMMATION % ∑ -> "" % SQUARE ROOT % √ -> "" % RIGHT ANGLE % ∟ -> "" % LOGICAL AND % ∧ -> "" % LOGICAL OR % ∨ -> "" % INTERSECTION % ∩ -> "" % UNION % ∪ -> "" % INTEGRAL % ∫ -> "" % ALMOST EQUAL TO % ≈ -> "" % ESTIMATES % ≙ -> "" % SUBSET OF % ⊂ -> "" % SUPERSET OF % ⊃ -> "" % HOUSE % ⌂ -> "" % REVERSED NOT SIGN % ⌐ -> "" % TOP HALF INTEGRAL % ⌠ -> "" % BOTTOM HALF INTEGRAL % ⌡ -> "" % UPPER HALF BLOCK % ▀ -> "" % LOWER HALF BLOCK % ▄ -> "" % FULL BLOCK % █ -> "" % LEFT HALF BLOCK % ▌ -> "" % RIGHT HALF BLOCK % ▐ -> "" % LIGHT SHADE % ░ -> "" % MEDIUM SHADE % ▒ -> "" % DARK SHADE % ▓ -> "" % BLACK SQUARE % ■ -> "" % BLACK RECTANGLE % ▬ -> "" % BLACK UP-POINTING TRIANGLE % ▲ -> "" % BLACK RIGHT-POINTING POINTER % ► -> "" % BLACK DOWN-POINTING TRIANGLE % ▼ -> "" % BLACK LEFT-POINTING POINTER % ◄ -> "" % LOZENGE % ◊ -> "" % INVERSE BULLET % ◘ -> "" % INVERSE WHITE CIRCLE % ◙ -> "" % WHITE SUN WITH RAYS % ☼ -> "" % FEMALE SIGN % ♀ -> "" % MALE SIGN % ♂ -> "" % BLACK SPADE SUIT % ♠ -> "" % BLACK CLUB SUIT % ♣ -> "" % BLACK HEART SUIT % ♥ -> "" % BLACK DIAMOND SUIT % ♦ -> "" % EIGHTH NOTE % ♪ -> "" % BEAMED EIGHTH NOTES % ♫ -> "" 
translitcodec-version-0.7.0/transtab/transtab.repertoire000066400000000000000000000434021404555523200236050ustar00rootroot00000000000000U+0027 # ' -> '’' U+0060 # ` -> '‛', '‘' U+00A0 #   -> ' ' U+00A1 # ¡ -> '!' U+00A2 # ¢ -> 'c' U+00A3 # £ -> 'GBP' U+00A5 # ¥ -> 'Y' U+00A6 # ¦ -> '|' U+00A7 # § -> 'S' U+00A8 # ¨ -> '"' U+00A9 # © -> '(c)', 'c' U+00AA # ª -> 'a' U+00AB # « -> '<<' U+00AC # ¬ -> '-' U+00AD # ­ -> '-' U+00AE # ® -> '(R)' U+00AF # ¯ -> '-' U+00B0 # ° -> ' ' U+00B1 # ± -> '+/-' U+00B2 # ² -> '^2', '2' U+00B3 # ³ -> '^3', '3' U+00B4 # ´ -> ''' U+00B5 # µ -> 'μ', 'u' U+00B6 # ¶ -> 'P' U+00B7 # · -> '.' U+00B8 # ¸ -> ',' U+00B9 # ¹ -> '^1', '1' U+00BA # º -> 'o' U+00BB # » -> '>>' U+00BC # ¼ -> ' 1/4' U+00BD # ½ -> ' 1/2' U+00BE # ¾ -> ' 3/4' U+00BF # ¿ -> '?' U+00C0 # À -> 'A' U+00C1 # Á -> 'A' U+00C2 #  -> 'A' U+00C3 # à -> 'A' U+00C4 # Ä -> 'Ae', 'A' U+00C5 # Å -> 'Aa', 'A' U+00C6 # Æ -> 'AE', 'A' U+00C7 # Ç -> 'C' U+00C8 # È -> 'E' U+00C9 # É -> 'E' U+00CA # Ê -> 'E' U+00CB # Ë -> 'E' U+00CC # Ì -> 'I' U+00CD # Í -> 'I' U+00CE # Î -> 'I' U+00CF # Ï -> 'I' U+00D0 # Ð -> 'D' U+00D1 # Ñ -> 'N' U+00D2 # Ò -> 'O' U+00D3 # Ó -> 'O' U+00D4 # Ô -> 'O' U+00D5 # Õ -> 'O' U+00D6 # Ö -> 'Oe', 'O' U+00D7 # × -> 'x' U+00D8 # Ø -> 'O' U+00D9 # Ù -> 'U' U+00DA # Ú -> 'U' U+00DB # Û -> 'U' U+00DC # Ü -> 'Ue', 'U' U+00DD # Ý -> 'Y' U+00DE # Þ -> 'Th' U+00DF # ß -> 'ss', 'β' U+00E0 # à -> 'a' U+00E1 # á -> 'a' U+00E2 # â -> 'a' U+00E3 # ã -> 'a' U+00E4 # ä -> 'ae', 'a' U+00E5 # å -> 'aa', 'a' U+00E6 # æ -> 'ae', 'a' U+00E7 # ç -> 'c' U+00E8 # è -> 'e' U+00E9 # é -> 'e' U+00EA # ê -> 'e' U+00EB # ë -> 'e' U+00EC # ì -> 'i' U+00ED # í -> 'i' U+00EE # î -> 'i' U+00EF # ï -> 'i' U+00F0 # ð -> 'd' U+00F1 # ñ -> 'n' U+00F2 # ò -> 'o' U+00F3 # ó -> 'o' U+00F4 # ô -> 'o' U+00F5 # õ -> 'o' U+00F6 # ö -> 'oe', 'o' U+00F7 # ÷ -> ':' U+00F8 # ø -> 'o' U+00F9 # ù -> 'u' U+00FA # ú -> 'u' U+00FB # û -> 'u' U+00FC # ü -> 'ue', 'u' U+00FD # ý -> 'y' U+00FE # þ -> 'th' U+00FF # ÿ -> 
'y' U+0100 # Ā -> 'A' U+0101 # ā -> 'a' U+0102 # Ă -> 'A' U+0103 # ă -> 'a' U+0104 # Ą -> 'A' U+0105 # ą -> 'a' U+0106 # Ć -> 'C' U+0107 # ć -> 'c' U+0108 # Ĉ -> 'Ch', 'C' U+0109 # ĉ -> 'ch', 'c' U+010A # Ċ -> 'C' U+010B # ċ -> 'c' U+010C # Č -> 'C' U+010D # č -> 'c' U+010E # Ď -> 'D' U+010F # ď -> 'd' U+0110 # Đ -> 'D' U+0111 # đ -> 'd' U+0112 # Ē -> 'E' U+0113 # ē -> 'e' U+0114 # Ĕ -> 'E' U+0115 # ĕ -> 'e' U+0116 # Ė -> 'E' U+0117 # ė -> 'e' U+0118 # Ę -> 'E' U+0119 # ę -> 'e' U+011A # Ě -> 'E' U+011B # ě -> 'e' U+011C # Ĝ -> 'Gh', 'G' U+011D # ĝ -> 'gh', 'g' U+011E # Ğ -> 'G' U+011F # ğ -> 'g' U+0120 # Ġ -> 'G' U+0121 # ġ -> 'g' U+0122 # Ģ -> 'G' U+0123 # ģ -> 'g' U+0124 # Ĥ -> 'Hh', 'H' U+0125 # ĥ -> 'hh', 'h' U+0126 # Ħ -> 'H' U+0127 # ħ -> 'h' U+0128 # Ĩ -> 'I' U+0129 # ĩ -> 'i' U+012A # Ī -> 'I' U+012B # ī -> 'i' U+012C # Ĭ -> 'I' U+012D # ĭ -> 'i' U+012E # Į -> 'I' U+012F # į -> 'i' U+0130 # İ -> 'I' U+0131 # ı -> 'i' U+0132 # IJ -> 'IJ' U+0133 # ij -> 'ij' U+0134 # Ĵ -> 'Jh', 'J' U+0135 # ĵ -> 'jh', 'j' U+0136 # Ķ -> 'K' U+0137 # ķ -> 'k' U+0138 # ĸ -> 'k' U+0139 # Ĺ -> 'L' U+013A # ĺ -> 'l' U+013B # Ļ -> 'L' U+013C # ļ -> 'l' U+013D # Ľ -> 'L' U+013E # ľ -> 'l' U+013F # Ŀ -> 'L·', 'L.', 'L' U+0140 # ŀ -> 'l·', 'l.', 'l' U+0141 # Ł -> 'L' U+0142 # ł -> 'l' U+0143 # Ń -> 'N' U+0144 # ń -> 'n' U+0145 # Ņ -> 'N' U+0146 # ņ -> 'n' U+0147 # Ň -> 'N' U+0148 # ň -> 'n' U+0149 # ʼn -> ''n' U+014A # Ŋ -> 'NG', 'N' U+014B # ŋ -> 'ng', 'n' U+014C # Ō -> 'O' U+014D # ō -> 'o' U+014E # Ŏ -> 'O' U+014F # ŏ -> 'o' U+0150 # Ő -> 'O' U+0151 # ő -> 'o' U+0152 # Œ -> 'OE' U+0153 # œ -> 'oe' U+0154 # Ŕ -> 'R' U+0155 # ŕ -> 'r' U+0156 # Ŗ -> 'R' U+0157 # ŗ -> 'r' U+0158 # Ř -> 'R' U+0159 # ř -> 'r' U+015A # Ś -> 'S' U+015B # ś -> 's' U+015C # Ŝ -> 'Sh', 'S' U+015D # ŝ -> 'sh', 's' U+015E # Ş -> 'S' U+015F # ş -> 's' U+0160 # Š -> 'S' U+0161 # š -> 's' U+0162 # Ţ -> 'T' U+0163 # ţ -> 't' U+0164 # Ť -> 'T' U+0165 # ť -> 't' U+0166 # Ŧ -> 'T' U+0167 # ŧ -> 't' U+0168 # Ũ -> 'U' 
U+0169 # ũ -> 'u' U+016A # Ū -> 'U' U+016B # ū -> 'u' U+016C # Ŭ -> 'U' U+016D # ŭ -> 'u' U+016E # Ů -> 'U' U+016F # ů -> 'u' U+0170 # Ű -> 'U' U+0171 # ű -> 'u' U+0172 # Ų -> 'U' U+0173 # ų -> 'u' U+0174 # Ŵ -> 'W' U+0175 # ŵ -> 'w' U+0176 # Ŷ -> 'Y' U+0177 # ŷ -> 'y' U+0178 # Ÿ -> 'Y' U+0179 # Ź -> 'Z' U+017A # ź -> 'z' U+017B # Ż -> 'Z' U+017C # ż -> 'z' U+017D # Ž -> 'Z' U+017E # ž -> 'z' U+017F # ſ -> 's' U+0192 # ƒ -> 'f' U+01A0 # Ơ -> 'O' U+01A1 # ơ -> 'o' U+01AF # Ư -> 'U' U+01B0 # ư -> 'u' U+0218 # Ș -> 'Ş', 'S' U+0219 # ș -> 'ş', 's' U+021A # Ț -> 'Ţ', 'T' U+021B # ț -> 'ţ', 't' U+02B9 # ʹ -> '′', ''' U+02BB # ʻ -> '‘' U+02BC # ʼ -> '’', ''' U+02BD # ʽ -> '‛' U+02C6 # ˆ -> '^' U+02C8 # ˈ -> ''' U+02C9 # ˉ -> '¯' U+02CC # ˌ -> ',' U+02D0 # ː -> ':' U+02DA # ˚ -> '°' U+02DC # ˜ -> '~' U+02DD # ˝ -> '"' U+0374 # ʹ -> ''' U+0375 # ͵ -> ',' U+037E # ; -> ';' U+1E02 # Ḃ -> 'B' U+1E03 # ḃ -> 'b' U+1E0A # Ḋ -> 'D' U+1E0B # ḋ -> 'd' U+1E1E # Ḟ -> 'F' U+1E1F # ḟ -> 'f' U+1E40 # Ṁ -> 'M' U+1E41 # ṁ -> 'm' U+1E56 # Ṗ -> 'P' U+1E57 # ṗ -> 'p' U+1E60 # Ṡ -> 'S' U+1E61 # ṡ -> 's' U+1E6A # Ṫ -> 'T' U+1E6B # ṫ -> 't' U+1E80 # Ẁ -> 'W' U+1E81 # ẁ -> 'w' U+1E82 # Ẃ -> 'W' U+1E83 # ẃ -> 'w' U+1E84 # Ẅ -> 'W' U+1E85 # ẅ -> 'w' U+1EEE # Ữ -> 'U' U+1EEF # ữ -> 'u' U+1EF2 # Ỳ -> 'Y' U+1EF3 # ỳ -> 'y' U+2000 #   -> ' ' U+2001 #   -> ' ' U+2002 #   -> ' ' U+2003 #   -> ' ' U+2004 #   -> ' ' U+2005 #   -> ' ' U+2006 #   -> ' ' U+2007 #   -> ' ' U+2008 #   -> ' ' U+2009 #   -> ' ' U+200A #   -> U+200B # ​ -> U+200C # ‌ -> U+200D # ‍ -> U+200E # ‎ -> U+200F # ‏ -> U+2010 # ‐ -> '-' U+2011 # ‑ -> '-' U+2012 # ‒ -> '-' U+2013 # – -> '-' U+2014 # — -> '--' U+2015 # ― -> '--' U+2016 # ‖ -> '||' U+2017 # ‗ -> '_' U+2018 # ‘ -> ''' U+2019 # ’ -> ''' U+201A # ‚ -> ''' U+201B # ‛ -> ''' U+201C # “ -> '"' U+201D # ” -> '"' U+201E # „ -> '"' U+201F # ‟ -> '"' U+2020 # † -> '+' U+2021 # ‡ -> '++' U+2022 # • -> 'o' U+2023 # ‣ -> '>' U+2024 # ․ -> '.' U+2025 # ‥ -> '..' U+2026 # … -> '...' 
U+2027 # ‧ -> '-' U+202A # ‪ -> U+202B # ‫ -> U+202C # ‬ -> U+202D # ‭ -> U+202E # ‮ -> U+202F #   -> ' ' U+2030 # ‰ -> ' 0/00' U+2032 # ′ -> ''' U+2033 # ″ -> '"' U+2034 # ‴ -> ''''' U+2035 # ‵ -> '`' U+2036 # ‶ -> '``' U+2037 # ‷ -> '```' U+2039 # ‹ -> '<' U+203A # › -> '>' U+203C # ‼ -> '!!' U+203E # ‾ -> '-' U+2043 # ⁃ -> '-' U+2044 # ⁄ -> '/' U+2048 # ⁈ -> '?!' U+2049 # ⁉ -> '!?' U+204A # ⁊ -> '7' U+2070 # ⁰ -> '^0', '0' U+2074 # ⁴ -> '^4', '4' U+2075 # ⁵ -> '^5', '5' U+2076 # ⁶ -> '^6', '6' U+2077 # ⁷ -> '^7', '7' U+2078 # ⁸ -> '^8', '8' U+2079 # ⁹ -> '^9', '9' U+207A # ⁺ -> '^+', '+' U+207B # ⁻ -> '^-', '-' U+207C # ⁼ -> '^=', '=' U+207D # ⁽ -> '^(', '(' U+207E # ⁾ -> '^)', ')' U+207F # ⁿ -> '^n', 'n' U+2080 # ₀ -> '_0', '0' U+2081 # ₁ -> '_1', '1' U+2082 # ₂ -> '_2', '2' U+2083 # ₃ -> '_3', '3' U+2084 # ₄ -> '_4', '4' U+2085 # ₅ -> '_5', '5' U+2086 # ₆ -> '_6', '6' U+2087 # ₇ -> '_7', '7' U+2088 # ₈ -> '_8', '8' U+2089 # ₉ -> '_9', '9' U+208A # ₊ -> '_+', '+' U+208B # ₋ -> '_-', '-' U+208C # ₌ -> '_=', '=' U+208D # ₍ -> '_(', '(' U+208E # ₎ -> '_)', ')' U+20AC # € -> 'EUR', 'E' U+2100 # ℀ -> 'a/c' U+2101 # ℁ -> 'a/s' U+2103 # ℃ -> '°C', 'C' U+2105 # ℅ -> 'c/o' U+2106 # ℆ -> 'c/u' U+2109 # ℉ -> '°F', 'F' U+2113 # ℓ -> 'l' U+2116 # № -> 'Nº', 'No' U+2117 # ℗ -> '(P)' U+2120 # ℠ -> '[SM]' U+2121 # ℡ -> 'TEL' U+2122 # ™ -> '[TM]' U+2126 # Ω -> 'Ω', 'ohm', 'O' U+212A # K -> 'K' U+212B # Å -> 'Å' U+212E # ℮ -> 'e' U+2153 # ⅓ -> ' 1/3' U+2154 # ⅔ -> ' 2/3' U+2155 # ⅕ -> ' 1/5' U+2156 # ⅖ -> ' 2/5' U+2157 # ⅗ -> ' 3/5' U+2158 # ⅘ -> ' 4/5' U+2159 # ⅙ -> ' 1/6' U+215A # ⅚ -> ' 5/6' U+215B # ⅛ -> ' 1/8' U+215C # ⅜ -> ' 3/8' U+215D # ⅝ -> ' 5/8' U+215E # ⅞ -> ' 7/8' U+215F # ⅟ -> ' 1/' U+2160 # Ⅰ -> 'I' U+2161 # Ⅱ -> 'II' U+2162 # Ⅲ -> 'III' U+2163 # Ⅳ -> 'IV' U+2164 # Ⅴ -> 'V' U+2165 # Ⅵ -> 'VI' U+2166 # Ⅶ -> 'VII' U+2167 # Ⅷ -> 'VIII' U+2168 # Ⅸ -> 'IX' U+2169 # Ⅹ -> 'X' U+216A # Ⅺ -> 'XI' U+216B # Ⅻ -> 'XII' U+216C # Ⅼ -> 'L' U+216D # Ⅽ -> 'C' U+216E # Ⅾ -> 'D' 
U+216F # Ⅿ -> 'M' U+2170 # ⅰ -> 'i' U+2171 # ⅱ -> 'ii' U+2172 # ⅲ -> 'iii' U+2173 # ⅳ -> 'iv' U+2174 # ⅴ -> 'v' U+2175 # ⅵ -> 'vi' U+2176 # ⅶ -> 'vii' U+2177 # ⅷ -> 'viii' U+2178 # ⅸ -> 'ix' U+2179 # ⅹ -> 'x' U+217A # ⅺ -> 'xi' U+217B # ⅻ -> 'xii' U+217C # ⅼ -> 'l' U+217D # ⅽ -> 'c' U+217E # ⅾ -> 'd' U+217F # ⅿ -> 'm' U+2190 # ← -> '<-' U+2191 # ↑ -> '^' U+2192 # → -> '->' U+2193 # ↓ -> 'v' U+2194 # ↔ -> '<->' U+21D0 # ⇐ -> '<=' U+21D2 # ⇒ -> '=>' U+21D4 # ⇔ -> '<=>' U+2212 # − -> '–', '-' U+2215 # ∕ -> '/' U+2216 # ∖ -> '\' U+2217 # ∗ -> '*' U+2218 # ∘ -> 'o' U+2219 # ∙ -> '·' U+221E # ∞ -> 'inf' U+2223 # ∣ -> '|' U+2225 # ∥ -> '||' U+2236 # ∶ -> ':' U+223C # ∼ -> '~' U+2260 # ≠ -> '/=' U+2261 # ≡ -> '=' U+2264 # ≤ -> '<=' U+2265 # ≥ -> '>=' U+226A # ≪ -> '<<' U+226B # ≫ -> '>>' U+2295 # ⊕ -> '(+)' U+2296 # ⊖ -> '(-)' U+2297 # ⊗ -> '(x)' U+2298 # ⊘ -> '(/)' U+22A2 # ⊢ -> '|-' U+22A3 # ⊣ -> '-|' U+22A6 # ⊦ -> '|-' U+22A7 # ⊧ -> '|=' U+22A8 # ⊨ -> '|=' U+22A9 # ⊩ -> '||-' U+22C5 # ⋅ -> '·' U+22C6 # ⋆ -> '*' U+22D5 # ⋕ -> '#' U+22D8 # ⋘ -> '<<<' U+22D9 # ⋙ -> '>>>' U+22EF # ⋯ -> '...' U+2329 # 〈 -> '<' U+232A # 〉 -> '>' U+2400 # ␀ -> 'NUL' U+2401 # ␁ -> 'SOH' U+2402 # ␂ -> 'STX' U+2403 # ␃ -> 'ETX' U+2404 # ␄ -> 'EOT' U+2405 # ␅ -> 'ENQ' U+2406 # ␆ -> 'ACK' U+2407 # ␇ -> 'BEL' U+2408 # ␈ -> 'BS' U+2409 # ␉ -> 'HT' U+240A # ␊ -> 'LF' U+240B # ␋ -> 'VT' U+240C # ␌ -> 'FF' U+240D # ␍ -> 'CR' U+240E # ␎ -> 'SO' U+240F # ␏ -> 'SI' U+2410 # ␐ -> 'DLE' U+2411 # ␑ -> 'DC1' U+2412 # ␒ -> 'DC2' U+2413 # ␓ -> 'DC3' U+2414 # ␔ -> 'DC4' U+2415 # ␕ -> 'NAK' U+2416 # ␖ -> 'SYN' U+2417 # ␗ -> 'ETB' U+2418 # ␘ -> 'CAN' U+2419 # ␙ -> 'EM' U+241A # ␚ -> 'SUB' U+241B # ␛ -> 'ESC' U+241C # ␜ -> 'FS' U+241D # ␝ -> 'GS' U+241E # ␞ -> 'RS' U+241F # ␟ -> 'US' U+2420 # ␠ -> 'SP' U+2421 # ␡ -> 'DEL' U+2423 # ␣ -> '_' U+2424 # ␤ -> 'NL' U+2425 # ␥ -> '///' U+2426 # ␦ -> '?' 
U+2460 # ① -> '(1)', '1' U+2461 # ② -> '(2)', '2' U+2462 # ③ -> '(3)', '3' U+2463 # ④ -> '(4)', '4' U+2464 # ⑤ -> '(5)', '5' U+2465 # ⑥ -> '(6)', '6' U+2466 # ⑦ -> '(7)', '7' U+2467 # ⑧ -> '(8)', '8' U+2468 # ⑨ -> '(9)', '9' U+2469 # ⑩ -> '(10)' U+246A # ⑪ -> '(11)' U+246B # ⑫ -> '(12)' U+246C # ⑬ -> '(13)' U+246D # ⑭ -> '(14)' U+246E # ⑮ -> '(15)' U+246F # ⑯ -> '(16)' U+2470 # ⑰ -> '(17)' U+2471 # ⑱ -> '(18)' U+2472 # ⑲ -> '(19)' U+2473 # ⑳ -> '(20)' U+2474 # ⑴ -> '(1)', '1' U+2475 # ⑵ -> '(2)', '2' U+2476 # ⑶ -> '(3)', '3' U+2477 # ⑷ -> '(4)', '4' U+2478 # ⑸ -> '(5)', '5' U+2479 # ⑹ -> '(6)', '6' U+247A # ⑺ -> '(7)', '7' U+247B # ⑻ -> '(8)', '8' U+247C # ⑼ -> '(9)', '9' U+247D # ⑽ -> '(10)' U+247E # ⑾ -> '(11)' U+247F # ⑿ -> '(12)' U+2480 # ⒀ -> '(13)' U+2481 # ⒁ -> '(14)' U+2482 # ⒂ -> '(15)' U+2483 # ⒃ -> '(16)' U+2484 # ⒄ -> '(17)' U+2485 # ⒅ -> '(18)' U+2486 # ⒆ -> '(19)' U+2487 # ⒇ -> '(20)' U+2488 # ⒈ -> '1.', '1' U+2489 # ⒉ -> '2.', '2' U+248A # ⒊ -> '3.', '3' U+248B # ⒋ -> '4.', '4' U+248C # ⒌ -> '5.', '5' U+248D # ⒍ -> '6.', '6' U+248E # ⒎ -> '7.', '7' U+248F # ⒏ -> '8.', '8' U+2490 # ⒐ -> '9.', '9' U+2491 # ⒑ -> '10.' U+2492 # ⒒ -> '11.' U+2493 # ⒓ -> '12.' U+2494 # ⒔ -> '13.' U+2495 # ⒕ -> '14.' U+2496 # ⒖ -> '15.' U+2497 # ⒗ -> '16.' U+2498 # ⒘ -> '17.' U+2499 # ⒙ -> '18.' U+249A # ⒚ -> '19.' U+249B # ⒛ -> '20.' 
U+249C # ⒜ -> '(a)', 'a' U+249D # ⒝ -> '(b)', 'b' U+249E # ⒞ -> '(c)', 'c' U+249F # ⒟ -> '(d)', 'd' U+24A0 # ⒠ -> '(e)', 'e' U+24A1 # ⒡ -> '(f)', 'f' U+24A2 # ⒢ -> '(g)', 'g' U+24A3 # ⒣ -> '(h)', 'h' U+24A4 # ⒤ -> '(i)', 'i' U+24A5 # ⒥ -> '(j)', 'j' U+24A6 # ⒦ -> '(k)', 'k' U+24A7 # ⒧ -> '(l)', 'l' U+24A8 # ⒨ -> '(m)', 'm' U+24A9 # ⒩ -> '(n)', 'n' U+24AA # ⒪ -> '(o)', 'o' U+24AB # ⒫ -> '(p)', 'p' U+24AC # ⒬ -> '(q)', 'q' U+24AD # ⒭ -> '(r)', 'r' U+24AE # ⒮ -> '(s)', 's' U+24AF # ⒯ -> '(t)', 't' U+24B0 # ⒰ -> '(u)', 'u' U+24B1 # ⒱ -> '(v)', 'v' U+24B2 # ⒲ -> '(w)', 'w' U+24B3 # ⒳ -> '(x)', 'x' U+24B4 # ⒴ -> '(y)', 'y' U+24B5 # ⒵ -> '(z)', 'z' U+24B6 # Ⓐ -> '(A)', 'A' U+24B7 # Ⓑ -> '(B)', 'B' U+24B8 # Ⓒ -> '(C)', 'C' U+24B9 # Ⓓ -> '(D)', 'D' U+24BA # Ⓔ -> '(E)', 'E' U+24BB # Ⓕ -> '(F)', 'F' U+24BC # Ⓖ -> '(G)', 'G' U+24BD # Ⓗ -> '(H)', 'H' U+24BE # Ⓘ -> '(I)', 'I' U+24BF # Ⓙ -> '(J)', 'J' U+24C0 # Ⓚ -> '(K)', 'K' U+24C1 # Ⓛ -> '(L)', 'L' U+24C2 # Ⓜ -> '(M)', 'M' U+24C3 # Ⓝ -> '(N)', 'N' U+24C4 # Ⓞ -> '(O)', 'O' U+24C5 # Ⓟ -> '(P)', 'P' U+24C6 # Ⓠ -> '(Q)', 'Q' U+24C7 # Ⓡ -> '(R)', 'R' U+24C8 # Ⓢ -> '(S)', 'S' U+24C9 # Ⓣ -> '(T)', 'T' U+24CA # Ⓤ -> '(U)', 'U' U+24CB # Ⓥ -> '(V)', 'V' U+24CC # Ⓦ -> '(W)', 'W' U+24CD # Ⓧ -> '(X)', 'X' U+24CE # Ⓨ -> '(Y)', 'Y' U+24CF # Ⓩ -> '(Z)', 'Z' U+24D0 # ⓐ -> '(a)', 'a' U+24D1 # ⓑ -> '(b)', 'b' U+24D2 # ⓒ -> '(c)', 'c' U+24D3 # ⓓ -> '(d)', 'd' U+24D4 # ⓔ -> '(e)', 'e' U+24D5 # ⓕ -> '(f)', 'f' U+24D6 # ⓖ -> '(g)', 'g' U+24D7 # ⓗ -> '(h)', 'h' U+24D8 # ⓘ -> '(i)', 'i' U+24D9 # ⓙ -> '(j)', 'j' U+24DA # ⓚ -> '(k)', 'k' U+24DB # ⓛ -> '(l)', 'l' U+24DC # ⓜ -> '(m)', 'm' U+24DD # ⓝ -> '(n)', 'n' U+24DE # ⓞ -> '(o)', 'o' U+24DF # ⓟ -> '(p)', 'p' U+24E0 # ⓠ -> '(q)', 'q' U+24E1 # ⓡ -> '(r)', 'r' U+24E2 # ⓢ -> '(s)', 's' U+24E3 # ⓣ -> '(t)', 't' U+24E4 # ⓤ -> '(u)', 'u' U+24E5 # ⓥ -> '(v)', 'v' U+24E6 # ⓦ -> '(w)', 'w' U+24E7 # ⓧ -> '(x)', 'x' U+24E8 # ⓨ -> '(y)', 'y' U+24E9 # ⓩ -> '(z)', 'z' U+24EA # ⓪ -> '(0)', '0' U+2500 # ─ -> '-' U+2501 
# ━ -> '=' U+2502 # │ -> '|' U+2503 # ┃ -> '|' U+2504 # ┄ -> '-' U+2505 # ┅ -> '=' U+2506 # ┆ -> '|' U+2507 # ┇ -> '|' U+2508 # ┈ -> '-' U+2509 # ┉ -> '=' U+250A # ┊ -> '|' U+250B # ┋ -> '|' U+250C # ┌ -> '+' U+250D # ┍ -> '+' U+250E # ┎ -> '+' U+250F # ┏ -> '+' U+2510 # ┐ -> '+' U+2511 # ┑ -> '+' U+2512 # ┒ -> '+' U+2513 # ┓ -> '+' U+2514 # └ -> '+' U+2515 # ┕ -> '+' U+2516 # ┖ -> '+' U+2517 # ┗ -> '+' U+2518 # ┘ -> '+' U+2519 # ┙ -> '+' U+251A # ┚ -> '+' U+251B # ┛ -> '+' U+251C # ├ -> '+' U+251D # ┝ -> '+' U+251E # ┞ -> '+' U+251F # ┟ -> '+' U+2520 # ┠ -> '+' U+2521 # ┡ -> '+' U+2522 # ┢ -> '+' U+2523 # ┣ -> '+' U+2524 # ┤ -> '+' U+2525 # ┥ -> '+' U+2526 # ┦ -> '+' U+2527 # ┧ -> '+' U+2528 # ┨ -> '+' U+2529 # ┩ -> '+' U+252A # ┪ -> '+' U+252B # ┫ -> '+' U+252C # ┬ -> '+' U+252D # ┭ -> '+' U+252E # ┮ -> '+' U+252F # ┯ -> '+' U+2530 # ┰ -> '+' U+2531 # ┱ -> '+' U+2532 # ┲ -> '+' U+2533 # ┳ -> '+' U+2534 # ┴ -> '+' U+2535 # ┵ -> '+' U+2536 # ┶ -> '+' U+2537 # ┷ -> '+' U+2538 # ┸ -> '+' U+2539 # ┹ -> '+' U+253A # ┺ -> '+' U+253B # ┻ -> '+' U+253C # ┼ -> '+' U+253D # ┽ -> '+' U+253E # ┾ -> '+' U+253F # ┿ -> '+' U+2540 # ╀ -> '+' U+2541 # ╁ -> '+' U+2542 # ╂ -> '+' U+2543 # ╃ -> '+' U+2544 # ╄ -> '+' U+2545 # ╅ -> '+' U+2546 # ╆ -> '+' U+2547 # ╇ -> '+' U+2548 # ╈ -> '+' U+2549 # ╉ -> '+' U+254A # ╊ -> '+' U+254B # ╋ -> '+' U+254C # ╌ -> '-' U+254D # ╍ -> '=' U+254E # ╎ -> '|' U+254F # ╏ -> '|' U+2550 # ═ -> '=' U+2551 # ║ -> '|' U+2552 # ╒ -> '+' U+2553 # ╓ -> '+' U+2554 # ╔ -> '+' U+2555 # ╕ -> '+' U+2556 # ╖ -> '+' U+2557 # ╗ -> '+' U+2558 # ╘ -> '+' U+2559 # ╙ -> '+' U+255A # ╚ -> '+' U+255B # ╛ -> '+' U+255C # ╜ -> '+' U+255D # ╝ -> '+' U+255E # ╞ -> '+' U+255F # ╟ -> '+' U+2560 # ╠ -> '+' U+2561 # ╡ -> '+' U+2562 # ╢ -> '+' U+2563 # ╣ -> '+' U+2564 # ╤ -> '+' U+2565 # ╥ -> '+' U+2566 # ╦ -> '+' U+2567 # ╧ -> '+' U+2568 # ╨ -> '+' U+2569 # ╩ -> '+' U+256A # ╪ -> '+' U+256B # ╫ -> '+' U+256C # ╬ -> '+' U+256D # ╭ -> '+' U+256E # ╮ -> '+' U+256F # ╯ -> '+' U+2570 # 
╰ -> '+' U+2571 # ╱ -> '/' U+2572 # ╲ -> '\' U+2573 # ╳ -> 'X' U+257C # ╼ -> '-' U+257D # ╽ -> '|' U+257E # ╾ -> '-' U+257F # ╿ -> '|' U+25CB # ○ -> 'o' U+25E6 # ◦ -> 'o' U+2605 # ★ -> '*' U+2606 # ☆ -> '*' U+2612 # ☒ -> 'X' U+2613 # ☓ -> 'X' U+2639 # ☹ -> ':-(' U+263A # ☺ -> ':-)' U+263B # ☻ -> '(-:' U+266D # ♭ -> 'b' U+266F # ♯ -> '#' U+2701 # ✁ -> '%<' U+2702 # ✂ -> '%<' U+2703 # ✃ -> '%<' U+2704 # ✄ -> '%<' U+270C # ✌ -> 'V' U+2713 # ✓ -> '√' U+2714 # ✔ -> '√' U+2715 # ✕ -> 'x' U+2716 # ✖ -> 'x' U+2717 # ✗ -> 'X' U+2718 # ✘ -> 'X' U+2719 # ✙ -> '+' U+271A # ✚ -> '+' U+271B # ✛ -> '+' U+271C # ✜ -> '+' U+271D # ✝ -> '+' U+271E # ✞ -> '+' U+271F # ✟ -> '+' U+2720 # ✠ -> '+' U+2721 # ✡ -> '*' U+2722 # ✢ -> '+' U+2723 # ✣ -> '+' U+2724 # ✤ -> '+' U+2725 # ✥ -> '+' U+2726 # ✦ -> '+' U+2727 # ✧ -> '+' U+2729 # ✩ -> '*' U+272A # ✪ -> '*' U+272B # ✫ -> '*' U+272C # ✬ -> '*' U+272D # ✭ -> '*' U+272E # ✮ -> '*' U+272F # ✯ -> '*' U+2730 # ✰ -> '*' U+2731 # ✱ -> '*' U+2732 # ✲ -> '*' U+2733 # ✳ -> '*' U+2734 # ✴ -> '*' U+2735 # ✵ -> '*' U+2736 # ✶ -> '*' U+2737 # ✷ -> '*' U+2738 # ✸ -> '*' U+2739 # ✹ -> '*' U+273A # ✺ -> '*' U+273B # ✻ -> '*' U+273C # ✼ -> '*' U+273D # ✽ -> '*' U+273E # ✾ -> '*' U+273F # ✿ -> '*' U+2740 # ❀ -> '*' U+2741 # ❁ -> '*' U+2742 # ❂ -> '*' U+2743 # ❃ -> '*' U+2744 # ❄ -> '*' U+2745 # ❅ -> '*' U+2746 # ❆ -> '*' U+2747 # ❇ -> '*' U+2748 # ❈ -> '*' U+2749 # ❉ -> '*' U+274A # ❊ -> '*' U+274B # ❋ -> '*' U+FB00 # ff -> 'ff' U+FB01 # fi -> 'fi' U+FB02 # fl -> 'fl' U+FB03 # ffi -> 'ffi' U+FB04 # ffl -> 'ffl' U+FB05 # ſt -> 'ſt', 'st' U+FB06 # st -> 'st' U+FEFF #  -> U+FFFD # � -> '?' translitcodec-version-0.7.0/transtab/transtab.utf000066400000000000000000001607341404555523200222330ustar00rootroot00000000000000% $Id: transtab.utf,v 1.8.1 2020-01-18 19:00:12+01 cpa Exp $ % APOSTROPHE % ' -> '’' % GRAVE ACCENT % ` -> '‛', '‘' ; % NO-BREAK SPACE %   -> ' ' % INVERTED EXCLAMATION MARK % ¡ -> '!' 
% CENT SIGN % ¢ -> 'c' % POUND SIGN % £ -> 'GBP' "" % YEN SIGN % ¥ -> 'Y' % BROKEN BAR % ¦ -> '|' % SECTION SIGN % § -> 'S' % DIAERESIS % ¨ -> '"' % COPYRIGHT SIGN % © -> '(c)', 'c' ""; % FEMININE ORDINAL INDICATOR % ª -> 'a' % LEFT-POINTING DOUBLE ANGLE QUOTATION MARK % « -> '<<' "" % NOT SIGN % ¬ -> '-' % SOFT HYPHEN % ­ -> '-' % REGISTERED SIGN % ® -> '(R)' "" % MACRON % ¯ -> '-' % DEGREE SIGN % ° -> ' ' % PLUS-MINUS SIGN % ± -> '+/-' "" % SUPERSCRIPT TWO % ² -> '^2', '2' ""; % SUPERSCRIPT THREE % ³ -> '^3', '3' ""; % ACUTE ACCENT % ´ -> ''' % MICRO SIGN % µ -> 'μ', 'u' ; % PILCROW SIGN % ¶ -> 'P' % MIDDLE DOT % · -> '.' % CEDILLA % ¸ -> ',' % SUPERSCRIPT ONE % ¹ -> '^1', '1' ""; % MASCULINE ORDINAL INDICATOR % º -> 'o' % RIGHT-POINTING DOUBLE ANGLE QUOTATION MARK % » -> '>>' "" % VULGAR FRACTION ONE QUARTER % ¼ -> ' 1/4' "" % VULGAR FRACTION ONE HALF % ½ -> ' 1/2' "" % VULGAR FRACTION THREE QUARTERS % ¾ -> ' 3/4' "" % INVERTED QUESTION MARK % ¿ -> '?' % LATIN CAPITAL LETTER A WITH GRAVE % À -> 'A' % LATIN CAPITAL LETTER A WITH ACUTE % Á -> 'A' % LATIN CAPITAL LETTER A WITH CIRCUMFLEX %  -> 'A' % LATIN CAPITAL LETTER A WITH TILDE % à -> 'A' % LATIN CAPITAL LETTER A WITH DIAERESIS % Ä -> 'Ae', 'A' ""; % LATIN CAPITAL LETTER A WITH RING ABOVE % Å -> 'Aa', 'A' ""; % LATIN CAPITAL LETTER AE % Æ -> 'AE', 'A' ""; % LATIN CAPITAL LETTER C WITH CEDILLA % Ç -> 'C' % LATIN CAPITAL LETTER E WITH GRAVE % È -> 'E' % LATIN CAPITAL LETTER E WITH ACUTE % É -> 'E' % LATIN CAPITAL LETTER E WITH CIRCUMFLEX % Ê -> 'E' % LATIN CAPITAL LETTER E WITH DIAERESIS % Ë -> 'E' % LATIN CAPITAL LETTER I WITH GRAVE % Ì -> 'I' % LATIN CAPITAL LETTER I WITH ACUTE % Í -> 'I' % LATIN CAPITAL LETTER I WITH CIRCUMFLEX % Î -> 'I' % LATIN CAPITAL LETTER I WITH DIAERESIS % Ï -> 'I' % LATIN CAPITAL LETTER ETH % Ð -> 'D' % LATIN CAPITAL LETTER N WITH TILDE % Ñ -> 'N' % LATIN CAPITAL LETTER O WITH GRAVE % Ò -> 'O' % LATIN CAPITAL LETTER O WITH ACUTE % Ó -> 'O' % LATIN CAPITAL LETTER O WITH CIRCUMFLEX % Ô 
-> 'O' % LATIN CAPITAL LETTER O WITH TILDE % Õ -> 'O' % LATIN CAPITAL LETTER O WITH DIAERESIS % Ö -> 'Oe', 'O' ""; % MULTIPLICATION SIGN % × -> 'x' % LATIN CAPITAL LETTER O WITH STROKE % Ø -> 'O' % LATIN CAPITAL LETTER U WITH GRAVE % Ù -> 'U' % LATIN CAPITAL LETTER U WITH ACUTE % Ú -> 'U' % LATIN CAPITAL LETTER U WITH CIRCUMFLEX % Û -> 'U' % LATIN CAPITAL LETTER U WITH DIAERESIS % Ü -> 'Ue', 'U' ""; % LATIN CAPITAL LETTER Y WITH ACUTE % Ý -> 'Y' % LATIN CAPITAL LETTER THORN % Þ -> 'Th' "" % LATIN SMALL LETTER SHARP S % ß -> 'ss', 'β' ""; % LATIN SMALL LETTER A WITH GRAVE % à -> 'a' % LATIN SMALL LETTER A WITH ACUTE % á -> 'a' % LATIN SMALL LETTER A WITH CIRCUMFLEX % â -> 'a' % LATIN SMALL LETTER A WITH TILDE % ã -> 'a' % LATIN SMALL LETTER A WITH DIAERESIS % ä -> 'ae', 'a' ""; % LATIN SMALL LETTER A WITH RING ABOVE % å -> 'aa', 'a' ""; % LATIN SMALL LETTER AE % æ -> 'ae', 'a' ""; % LATIN SMALL LETTER C WITH CEDILLA % ç -> 'c' % LATIN SMALL LETTER E WITH GRAVE % è -> 'e' % LATIN SMALL LETTER E WITH ACUTE % é -> 'e' % LATIN SMALL LETTER E WITH CIRCUMFLEX % ê -> 'e' % LATIN SMALL LETTER E WITH DIAERESIS % ë -> 'e' % LATIN SMALL LETTER I WITH GRAVE % ì -> 'i' % LATIN SMALL LETTER I WITH ACUTE % í -> 'i' % LATIN SMALL LETTER I WITH CIRCUMFLEX % î -> 'i' % LATIN SMALL LETTER I WITH DIAERESIS % ï -> 'i' % LATIN SMALL LETTER ETH % ð -> 'd' % LATIN SMALL LETTER N WITH TILDE % ñ -> 'n' % LATIN SMALL LETTER O WITH GRAVE % ò -> 'o' % LATIN SMALL LETTER O WITH ACUTE % ó -> 'o' % LATIN SMALL LETTER O WITH CIRCUMFLEX % ô -> 'o' % LATIN SMALL LETTER O WITH TILDE % õ -> 'o' % LATIN SMALL LETTER O WITH DIAERESIS % ö -> 'oe', 'o' ""; % DIVISION SIGN % ÷ -> ':' % LATIN SMALL LETTER O WITH STROKE % ø -> 'o' % LATIN SMALL LETTER U WITH GRAVE % ù -> 'u' % LATIN SMALL LETTER U WITH ACUTE % ú -> 'u' % LATIN SMALL LETTER U WITH CIRCUMFLEX % û -> 'u' % LATIN SMALL LETTER U WITH DIAERESIS % ü -> 'ue', 'u' ""; % LATIN SMALL LETTER Y WITH ACUTE % ý -> 'y' % LATIN SMALL LETTER THORN % þ -> 'th' 
"" % LATIN SMALL LETTER Y WITH DIAERESIS % ÿ -> 'y' % LATIN CAPITAL LETTER A WITH MACRON % Ā -> 'A' % LATIN SMALL LETTER A WITH MACRON % ā -> 'a' % LATIN CAPITAL LETTER A WITH BREVE % Ă -> 'A' % LATIN SMALL LETTER A WITH BREVE % ă -> 'a' % LATIN CAPITAL LETTER A WITH OGONEK % Ą -> 'A' % LATIN SMALL LETTER A WITH OGONEK % ą -> 'a' % LATIN CAPITAL LETTER C WITH ACUTE % Ć -> 'C' % LATIN SMALL LETTER C WITH ACUTE % ć -> 'c' % LATIN CAPITAL LETTER C WITH CIRCUMFLEX % Ĉ -> 'Ch', 'C' ""; % LATIN SMALL LETTER C WITH CIRCUMFLEX % ĉ -> 'ch', 'c' ""; % LATIN CAPITAL LETTER C WITH DOT ABOVE % Ċ -> 'C' % LATIN SMALL LETTER C WITH DOT ABOVE % ċ -> 'c' % LATIN CAPITAL LETTER C WITH CARON % Č -> 'C' % LATIN SMALL LETTER C WITH CARON % č -> 'c' % LATIN CAPITAL LETTER D WITH CARON % Ď -> 'D' % LATIN SMALL LETTER D WITH CARON % ď -> 'd' % LATIN CAPITAL LETTER D WITH STROKE % Đ -> 'D' % LATIN SMALL LETTER D WITH STROKE % đ -> 'd' % LATIN CAPITAL LETTER E WITH MACRON % Ē -> 'E' % LATIN SMALL LETTER E WITH MACRON % ē -> 'e' % LATIN CAPITAL LETTER E WITH BREVE % Ĕ -> 'E' % LATIN SMALL LETTER E WITH BREVE % ĕ -> 'e' % LATIN CAPITAL LETTER E WITH DOT ABOVE % Ė -> 'E' % LATIN SMALL LETTER E WITH DOT ABOVE % ė -> 'e' % LATIN CAPITAL LETTER E WITH OGONEK % Ę -> 'E' % LATIN SMALL LETTER E WITH OGONEK % ę -> 'e' % LATIN CAPITAL LETTER E WITH CARON % Ě -> 'E' % LATIN SMALL LETTER E WITH CARON % ě -> 'e' % LATIN CAPITAL LETTER G WITH CIRCUMFLEX % Ĝ -> 'Gh', 'G' ""; % LATIN SMALL LETTER G WITH CIRCUMFLEX % ĝ -> 'gh', 'g' ""; % LATIN CAPITAL LETTER G WITH BREVE % Ğ -> 'G' % LATIN SMALL LETTER G WITH BREVE % ğ -> 'g' % LATIN CAPITAL LETTER G WITH DOT ABOVE % Ġ -> 'G' % LATIN SMALL LETTER G WITH DOT ABOVE % ġ -> 'g' % LATIN CAPITAL LETTER G WITH CEDILLA % Ģ -> 'G' % LATIN SMALL LETTER G WITH CEDILLA % ģ -> 'g' % LATIN CAPITAL LETTER H WITH CIRCUMFLEX % Ĥ -> 'Hh', 'H' ""; % LATIN SMALL LETTER H WITH CIRCUMFLEX % ĥ -> 'hh', 'h' ""; % LATIN CAPITAL LETTER H WITH STROKE % Ħ -> 'H' % LATIN SMALL LETTER H 
WITH STROKE % ħ -> 'h' % LATIN CAPITAL LETTER I WITH TILDE % Ĩ -> 'I' % LATIN SMALL LETTER I WITH TILDE % ĩ -> 'i' % LATIN CAPITAL LETTER I WITH MACRON % Ī -> 'I' % LATIN SMALL LETTER I WITH MACRON % ī -> 'i' % LATIN CAPITAL LETTER I WITH BREVE % Ĭ -> 'I' % LATIN SMALL LETTER I WITH BREVE % ĭ -> 'i' % LATIN CAPITAL LETTER I WITH OGONEK % Į -> 'I' % LATIN SMALL LETTER I WITH OGONEK % į -> 'i' % LATIN CAPITAL LETTER I WITH DOT ABOVE % İ -> 'I' % LATIN SMALL LETTER DOTLESS I % ı -> 'i' % LATIN CAPITAL LIGATURE IJ % IJ -> 'IJ' "" % LATIN SMALL LIGATURE IJ % ij -> 'ij' "" % LATIN CAPITAL LETTER J WITH CIRCUMFLEX % Ĵ -> 'Jh', 'J' ""; % LATIN SMALL LETTER J WITH CIRCUMFLEX % ĵ -> 'jh', 'j' ""; % LATIN CAPITAL LETTER K WITH CEDILLA % Ķ -> 'K' % LATIN SMALL LETTER K WITH CEDILLA % ķ -> 'k' % LATIN SMALL LETTER KRA % ĸ -> 'k' % LATIN CAPITAL LETTER L WITH ACUTE % Ĺ -> 'L' % LATIN SMALL LETTER L WITH ACUTE % ĺ -> 'l' % LATIN CAPITAL LETTER L WITH CEDILLA % Ļ -> 'L' % LATIN SMALL LETTER L WITH CEDILLA % ļ -> 'l' % LATIN CAPITAL LETTER L WITH CARON % Ľ -> 'L' % LATIN SMALL LETTER L WITH CARON % ľ -> 'l' % LATIN CAPITAL LETTER L WITH MIDDLE DOT % Ŀ -> 'L·', 'L.', 'L' "";""; % LATIN SMALL LETTER L WITH MIDDLE DOT % ŀ -> 'l·', 'l.', 'l' "";""; % LATIN CAPITAL LETTER L WITH STROKE % Ł -> 'L' % LATIN SMALL LETTER L WITH STROKE % ł -> 'l' % LATIN CAPITAL LETTER N WITH ACUTE % Ń -> 'N' % LATIN SMALL LETTER N WITH ACUTE % ń -> 'n' % LATIN CAPITAL LETTER N WITH CEDILLA % Ņ -> 'N' % LATIN SMALL LETTER N WITH CEDILLA % ņ -> 'n' % LATIN CAPITAL LETTER N WITH CARON % Ň -> 'N' % LATIN SMALL LETTER N WITH CARON % ň -> 'n' % LATIN SMALL LETTER N PRECEDED BY APOSTROPHE % ʼn -> ''n' "" % LATIN CAPITAL LETTER ENG % Ŋ -> 'NG', 'N' ""; % LATIN SMALL LETTER ENG % ŋ -> 'ng', 'n' ""; % LATIN CAPITAL LETTER O WITH MACRON % Ō -> 'O' % LATIN SMALL LETTER O WITH MACRON % ō -> 'o' % LATIN CAPITAL LETTER O WITH BREVE % Ŏ -> 'O' % LATIN SMALL LETTER O WITH BREVE % ŏ -> 'o' % LATIN CAPITAL LETTER O WITH 
DOUBLE ACUTE % Ő -> 'O' % LATIN SMALL LETTER O WITH DOUBLE ACUTE % ő -> 'o' % LATIN CAPITAL LIGATURE OE % Œ -> 'OE' "" % LATIN SMALL LIGATURE OE % œ -> 'oe' "" % LATIN CAPITAL LETTER R WITH ACUTE % Ŕ -> 'R' % LATIN SMALL LETTER R WITH ACUTE % ŕ -> 'r' % LATIN CAPITAL LETTER R WITH CEDILLA % Ŗ -> 'R' % LATIN SMALL LETTER R WITH CEDILLA % ŗ -> 'r' % LATIN CAPITAL LETTER R WITH CARON % Ř -> 'R' % LATIN SMALL LETTER R WITH CARON % ř -> 'r' % LATIN CAPITAL LETTER S WITH ACUTE % Ś -> 'S' % LATIN SMALL LETTER S WITH ACUTE % ś -> 's' % LATIN CAPITAL LETTER S WITH CIRCUMFLEX % Ŝ -> 'Sh', 'S' ""; % LATIN SMALL LETTER S WITH CIRCUMFLEX % ŝ -> 'sh', 's' ""; % LATIN CAPITAL LETTER S WITH CEDILLA % Ş -> 'S' % LATIN SMALL LETTER S WITH CEDILLA % ş -> 's' % LATIN CAPITAL LETTER S WITH CARON % Š -> 'S' % LATIN SMALL LETTER S WITH CARON % š -> 's' % LATIN CAPITAL LETTER T WITH CEDILLA % Ţ -> 'T' % LATIN SMALL LETTER T WITH CEDILLA % ţ -> 't' % LATIN CAPITAL LETTER T WITH CARON % Ť -> 'T' % LATIN SMALL LETTER T WITH CARON % ť -> 't' % LATIN CAPITAL LETTER T WITH STROKE % Ŧ -> 'T' % LATIN SMALL LETTER T WITH STROKE % ŧ -> 't' % LATIN CAPITAL LETTER U WITH TILDE % Ũ -> 'U' % LATIN SMALL LETTER U WITH TILDE % ũ -> 'u' % LATIN CAPITAL LETTER U WITH MACRON % Ū -> 'U' % LATIN SMALL LETTER U WITH MACRON % ū -> 'u' % LATIN CAPITAL LETTER U WITH BREVE % Ŭ -> 'U' % LATIN SMALL LETTER U WITH BREVE % ŭ -> 'u' % LATIN CAPITAL LETTER U WITH RING ABOVE % Ů -> 'U' % LATIN SMALL LETTER U WITH RING ABOVE % ů -> 'u' % LATIN CAPITAL LETTER U WITH DOUBLE ACUTE % Ű -> 'U' % LATIN SMALL LETTER U WITH DOUBLE ACUTE % ű -> 'u' % LATIN CAPITAL LETTER U WITH OGONEK % Ų -> 'U' % LATIN SMALL LETTER U WITH OGONEK % ų -> 'u' % LATIN CAPITAL LETTER W WITH CIRCUMFLEX % Ŵ -> 'W' % LATIN SMALL LETTER W WITH CIRCUMFLEX % ŵ -> 'w' % LATIN CAPITAL LETTER Y WITH CIRCUMFLEX % Ŷ -> 'Y' % LATIN SMALL LETTER Y WITH CIRCUMFLEX % ŷ -> 'y' % LATIN CAPITAL LETTER Y WITH DIAERESIS % Ÿ -> 'Y' % LATIN CAPITAL LETTER Z WITH ACUTE % Ź 
-> 'Z' % LATIN SMALL LETTER Z WITH ACUTE % ź -> 'z' % LATIN CAPITAL LETTER Z WITH DOT ABOVE % Ż -> 'Z' % LATIN SMALL LETTER Z WITH DOT ABOVE % ż -> 'z' % LATIN CAPITAL LETTER Z WITH CARON % Ž -> 'Z' % LATIN SMALL LETTER Z WITH CARON % ž -> 'z' % LATIN SMALL LETTER LONG S % ſ -> 's' % LATIN SMALL LETTER F WITH HOOK % ƒ -> 'f' % LATIN CAPITAL LETTER O WITH HORN % Ơ -> 'O' % LATIN SMALL LETTER O WITH HORN % ơ -> 'o' % LATIN CAPITAL LETTER U WITH HORN % Ư -> 'U' % LATIN SMALL LETTER U WITH HORN % ư -> 'u' % LATIN CAPITAL LETTER S WITH COMMA BELOW % Ș -> 'Ş', 'S' ; % LATIN SMALL LETTER S WITH COMMA BELOW % ș -> 'ş', 's' ; % LATIN CAPITAL LETTER T WITH COMMA BELOW % Ț -> 'Ţ', 'T' ; % LATIN SMALL LETTER T WITH COMMA BELOW % ț -> 'ţ', 't' ; % MODIFIER LETTER PRIME % ʹ -> '′', ''' ; % MODIFIER LETTER TURNED COMMA % ʻ -> '‘' % MODIFIER LETTER APOSTROPHE % ʼ -> '’', ''' ; % MODIFIER LETTER REVERSED COMMA % ʽ -> '‛' % MODIFIER LETTER CIRCUMFLEX ACCENT % ˆ -> '^' % MODIFIER LETTER VERTICAL LINE % ˈ -> ''' % MODIFIER LETTER MACRON % ˉ -> '¯' % MODIFIER LETTER LOW VERTICAL LINE % ˌ -> ',' % MODIFIER LETTER TRIANGULAR COLON % ː -> ':' % RING ABOVE % ˚ -> '°' % SMALL TILDE % ˜ -> '~' % DOUBLE ACUTE ACCENT % ˝ -> '"' % GREEK NUMERAL SIGN % ʹ -> ''' % GREEK LOWER NUMERAL SIGN % ͵ -> ',' % GREEK QUESTION MARK % ; -> ';' % LATIN CAPITAL LETTER B WITH DOT ABOVE % Ḃ -> 'B' % LATIN SMALL LETTER B WITH DOT ABOVE % ḃ -> 'b' % LATIN CAPITAL LETTER D WITH DOT ABOVE % Ḋ -> 'D' % LATIN SMALL LETTER D WITH DOT ABOVE % ḋ -> 'd' % LATIN CAPITAL LETTER F WITH DOT ABOVE % Ḟ -> 'F' % LATIN SMALL LETTER F WITH DOT ABOVE % ḟ -> 'f' % LATIN CAPITAL LETTER M WITH DOT ABOVE % Ṁ -> 'M' % LATIN SMALL LETTER M WITH DOT ABOVE % ṁ -> 'm' % LATIN CAPITAL LETTER P WITH DOT ABOVE % Ṗ -> 'P' % LATIN SMALL LETTER P WITH DOT ABOVE % ṗ -> 'p' % LATIN CAPITAL LETTER S WITH DOT ABOVE % Ṡ -> 'S' % LATIN SMALL LETTER S WITH DOT ABOVE % ṡ -> 's' % LATIN CAPITAL LETTER T WITH DOT ABOVE % Ṫ -> 'T' % LATIN SMALL LETTER T 
WITH DOT ABOVE % ṫ -> 't' % LATIN CAPITAL LETTER W WITH GRAVE % Ẁ -> 'W' % LATIN SMALL LETTER W WITH GRAVE % ẁ -> 'w' % LATIN CAPITAL LETTER W WITH ACUTE % Ẃ -> 'W' % LATIN SMALL LETTER W WITH ACUTE % ẃ -> 'w' % LATIN CAPITAL LETTER W WITH DIAERESIS % Ẅ -> 'W' % LATIN SMALL LETTER W WITH DIAERESIS % ẅ -> 'w' % LATIN CAPITAL LETTER U WITH HORN AND TILDE % Ữ -> 'U' % LATIN SMALL LETTER U WITH HORN AND TILDE % ữ -> 'u' % LATIN CAPITAL LETTER Y WITH GRAVE % Ỳ -> 'Y' % LATIN SMALL LETTER Y WITH GRAVE % ỳ -> 'y' % EN QUAD %   -> ' ' % EM QUAD %   -> ' ' "" % EN SPACE %   -> ' ' % EM SPACE %   -> ' ' "" % THREE-PER-EM SPACE %   -> ' ' % FOUR-PER-EM SPACE %   -> ' ' % SIX-PER-EM SPACE %   -> ' ' % FIGURE SPACE %   -> ' ' % PUNCTUATION SPACE %   -> ' ' % THIN SPACE %   -> ' ' % HAIR SPACE %   -> "" % ZERO WIDTH SPACE % ​ -> "" % ZERO WIDTH NON-JOINER % ‌ -> "" % ZERO WIDTH JOINER % ‍ -> "" % LEFT-TO-RIGHT MARK % ‎ -> "" % RIGHT-TO-LEFT MARK % ‏ -> "" % HYPHEN % ‐ -> '-' % NON-BREAKING HYPHEN % ‑ -> '-' % FIGURE DASH % ‒ -> '-' % EN DASH % – -> '-' % EM DASH % — -> '--' "" % HORIZONTAL BAR % ― -> '--' "" % DOUBLE VERTICAL LINE % ‖ -> '||' "" % DOUBLE LOW LINE % ‗ -> '_' % LEFT SINGLE QUOTATION MARK % ‘ -> ''' % RIGHT SINGLE QUOTATION MARK % ’ -> ''' % SINGLE LOW-9 QUOTATION MARK % ‚ -> ''' % SINGLE HIGH-REVERSED-9 QUOTATION MARK % ‛ -> ''' % LEFT DOUBLE QUOTATION MARK % “ -> '"' % RIGHT DOUBLE QUOTATION MARK % ” -> '"' % DOUBLE LOW-9 QUOTATION MARK % „ -> '"' % DOUBLE HIGH-REVERSED-9 QUOTATION MARK % ‟ -> '"' % DAGGER % † -> '+' % DOUBLE DAGGER % ‡ -> '++' "" % BULLET % • -> 'o' % TRIANGULAR BULLET % ‣ -> '>' % ONE DOT LEADER % ․ -> '.' % TWO DOT LEADER % ‥ -> '..' "" % HORIZONTAL ELLIPSIS % … -> '...' 
"" % HYPHENATION POINT % ‧ -> '-' % LEFT-TO-RIGHT EMBEDDING % ‪ -> "" % RIGHT-TO-LEFT EMBEDDING % ‫ -> "" % POP DIRECTIONAL FORMATTING % ‬ -> "" % LEFT-TO-RIGHT OVERRIDE % ‭ -> "" % RIGHT-TO-LEFT OVERRIDE % ‮ -> "" % NARROW NO-BREAK SPACE %   -> ' ' % PER MILLE SIGN % ‰ -> ' 0/00' "" % PRIME % ′ -> ''' % DOUBLE PRIME % ″ -> '"' % TRIPLE PRIME % ‴ -> ''''' "" % REVERSED PRIME % ‵ -> '`' % REVERSED DOUBLE PRIME % ‶ -> '``' "" % REVERSED TRIPLE PRIME % ‷ -> '```' "" % SINGLE LEFT-POINTING ANGLE QUOTATION MARK % ‹ -> '<' % SINGLE RIGHT-POINTING ANGLE QUOTATION MARK % › -> '>' % DOUBLE EXCLAMATION MARK % ‼ -> '!!' "" % OVERLINE % ‾ -> '-' % HYPHEN BULLET % ⁃ -> '-' % FRACTION SLASH % ⁄ -> '/' % QUESTION EXCLAMATION MARK % ⁈ -> '?!' "" % EXCLAMATION QUESTION MARK % ⁉ -> '!?' "" % TIRONIAN SIGN ET % ⁊ -> '7' % SUPERSCRIPT ZERO % ⁰ -> '^0', '0' ""; % SUPERSCRIPT FOUR % ⁴ -> '^4', '4' ""; % SUPERSCRIPT FIVE % ⁵ -> '^5', '5' ""; % SUPERSCRIPT SIX % ⁶ -> '^6', '6' ""; % SUPERSCRIPT SEVEN % ⁷ -> '^7', '7' ""; % SUPERSCRIPT EIGHT % ⁸ -> '^8', '8' ""; % SUPERSCRIPT NINE % ⁹ -> '^9', '9' ""; % SUPERSCRIPT PLUS SIGN % ⁺ -> '^+', '+' ""; % SUPERSCRIPT MINUS % ⁻ -> '^-', '-' ""; % SUPERSCRIPT EQUALS SIGN % ⁼ -> '^=', '=' ""; % SUPERSCRIPT LEFT PARENTHESIS % ⁽ -> '^(', '(' ""; % SUPERSCRIPT RIGHT PARENTHESIS % ⁾ -> '^)', ')' ""; % SUPERSCRIPT LATIN SMALL LETTER N % ⁿ -> '^n', 'n' ""; % SUBSCRIPT ZERO % ₀ -> '_0', '0' ""; % SUBSCRIPT ONE % ₁ -> '_1', '1' ""; % SUBSCRIPT TWO % ₂ -> '_2', '2' ""; % SUBSCRIPT THREE % ₃ -> '_3', '3' ""; % SUBSCRIPT FOUR % ₄ -> '_4', '4' ""; % SUBSCRIPT FIVE % ₅ -> '_5', '5' ""; % SUBSCRIPT SIX % ₆ -> '_6', '6' ""; % SUBSCRIPT SEVEN % ₇ -> '_7', '7' ""; % SUBSCRIPT EIGHT % ₈ -> '_8', '8' ""; % SUBSCRIPT NINE % ₉ -> '_9', '9' ""; % SUBSCRIPT PLUS SIGN % ₊ -> '_+', '+' ""; % SUBSCRIPT MINUS % ₋ -> '_-', '-' ""; % SUBSCRIPT EQUALS SIGN % ₌ -> '_=', '=' ""; % SUBSCRIPT LEFT PARENTHESIS % ₍ -> '_(', '(' ""; % SUBSCRIPT RIGHT PARENTHESIS % ₎ -> '_)', ')' ""; % 
EURO SIGN % € -> 'EUR', 'E' ""; % ACCOUNT OF % ℀ -> 'a/c' "" % ADDRESSED TO THE SUBJECT % ℁ -> 'a/s' "" % DEGREE CELSIUS % ℃ -> '°C', 'C' ""; % CARE OF % ℅ -> 'c/o' "" % CADA UNA % ℆ -> 'c/u' "" % DEGREE FAHRENHEIT % ℉ -> '°F', 'F' ""; % SCRIPT SMALL L % ℓ -> 'l' % NUMERO SIGN % № -> 'Nº', 'No' "";"" % SOUND RECORDING COPYRIGHT % ℗ -> '(P)' "" % SERVICE MARK % ℠ -> '[SM]' "" % TELEPHONE SIGN % ℡ -> 'TEL' "" % TRADE MARK SIGN % ™ -> '[TM]' "" % OHM SIGN % Ω -> 'Ω', 'ohm', 'O' ;""; % KELVIN SIGN % K -> 'K' % ANGSTROM SIGN % Å -> 'Å' % ESTIMATED SYMBOL % ℮ -> 'e' % VULGAR FRACTION ONE THIRD % ⅓ -> ' 1/3' "" % VULGAR FRACTION TWO THIRDS % ⅔ -> ' 2/3' "" % VULGAR FRACTION ONE FIFTH % ⅕ -> ' 1/5' "" % VULGAR FRACTION TWO FIFTHS % ⅖ -> ' 2/5' "" % VULGAR FRACTION THREE FIFTHS % ⅗ -> ' 3/5' "" % VULGAR FRACTION FOUR FIFTHS % ⅘ -> ' 4/5' "" % VULGAR FRACTION ONE SIXTH % ⅙ -> ' 1/6' "" % VULGAR FRACTION FIVE SIXTHS % ⅚ -> ' 5/6' "" % VULGAR FRACTION ONE EIGHTH % ⅛ -> ' 1/8' "" % VULGAR FRACTION THREE EIGHTHS % ⅜ -> ' 3/8' "" % VULGAR FRACTION FIVE EIGHTHS % ⅝ -> ' 5/8' "" % VULGAR FRACTION SEVEN EIGHTHS % ⅞ -> ' 7/8' "" % FRACTION NUMERATOR ONE % ⅟ -> ' 1/' "" % ROMAN NUMERAL ONE % Ⅰ -> 'I' % ROMAN NUMERAL TWO % Ⅱ -> 'II' "" % ROMAN NUMERAL THREE % Ⅲ -> 'III' "" % ROMAN NUMERAL FOUR % Ⅳ -> 'IV' "" % ROMAN NUMERAL FIVE % Ⅴ -> 'V' % ROMAN NUMERAL SIX % Ⅵ -> 'VI' "" % ROMAN NUMERAL SEVEN % Ⅶ -> 'VII' "" % ROMAN NUMERAL EIGHT % Ⅷ -> 'VIII' "" % ROMAN NUMERAL NINE % Ⅸ -> 'IX' "" % ROMAN NUMERAL TEN % Ⅹ -> 'X' % ROMAN NUMERAL ELEVEN % Ⅺ -> 'XI' "" % ROMAN NUMERAL TWELVE % Ⅻ -> 'XII' "" % ROMAN NUMERAL FIFTY % Ⅼ -> 'L' % ROMAN NUMERAL ONE HUNDRED % Ⅽ -> 'C' % ROMAN NUMERAL FIVE HUNDRED % Ⅾ -> 'D' % ROMAN NUMERAL ONE THOUSAND % Ⅿ -> 'M' % SMALL ROMAN NUMERAL ONE % ⅰ -> 'i' % SMALL ROMAN NUMERAL TWO % ⅱ -> 'ii' "" % SMALL ROMAN NUMERAL THREE % ⅲ -> 'iii' "" % SMALL ROMAN NUMERAL FOUR % ⅳ -> 'iv' "" % SMALL ROMAN NUMERAL FIVE % ⅴ -> 'v' % SMALL ROMAN NUMERAL SIX % ⅵ -> 'vi' "" % SMALL 
ROMAN NUMERAL SEVEN % ⅶ -> 'vii' "" % SMALL ROMAN NUMERAL EIGHT % ⅷ -> 'viii' "" % SMALL ROMAN NUMERAL NINE % ⅸ -> 'ix' "" % SMALL ROMAN NUMERAL TEN % ⅹ -> 'x' % SMALL ROMAN NUMERAL ELEVEN % ⅺ -> 'xi' "" % SMALL ROMAN NUMERAL TWELVE % ⅻ -> 'xii' "" % SMALL ROMAN NUMERAL FIFTY % ⅼ -> 'l' % SMALL ROMAN NUMERAL ONE HUNDRED % ⅽ -> 'c' % SMALL ROMAN NUMERAL FIVE HUNDRED % ⅾ -> 'd' % SMALL ROMAN NUMERAL ONE THOUSAND % ⅿ -> 'm' % LEFTWARDS ARROW % ← -> '<-' "" % UPWARDS ARROW % ↑ -> '^' % RIGHTWARDS ARROW % → -> '->' "" % DOWNWARDS ARROW % ↓ -> 'v' % LEFT RIGHT ARROW % ↔ -> '<->' "" % LEFTWARDS DOUBLE ARROW % ⇐ -> '<=' "" % RIGHTWARDS DOUBLE ARROW % ⇒ -> '=>' "" % LEFT RIGHT DOUBLE ARROW % ⇔ -> '<=>' "" % MINUS SIGN % − -> '–', '-' ; % DIVISION SLASH % ∕ -> '/' % SET MINUS % ∖ -> '\' % ASTERISK OPERATOR % ∗ -> '*' % RING OPERATOR % ∘ -> 'o' % BULLET OPERATOR % ∙ -> '·' % INFINITY % ∞ -> 'inf' "" % DIVIDES % ∣ -> '|' % PARALLEL TO % ∥ -> '||' "" % RATIO % ∶ -> ':' % TILDE OPERATOR % ∼ -> '~' % NOT EQUAL TO % ≠ -> '/=' "" % IDENTICAL TO % ≡ -> '=' % LESS-THAN OR EQUAL TO % ≤ -> '<=' "" % GREATER-THAN OR EQUAL TO % ≥ -> '>=' "" % MUCH LESS-THAN % ≪ -> '<<' "" % MUCH GREATER-THAN % ≫ -> '>>' "" % CIRCLED PLUS % ⊕ -> '(+)' "" % CIRCLED MINUS % ⊖ -> '(-)' "" % CIRCLED TIMES % ⊗ -> '(x)' "" % CIRCLED DIVISION SLASH % ⊘ -> '(/)' "" % RIGHT TACK % ⊢ -> '|-' "" % LEFT TACK % ⊣ -> '-|' "" % ASSERTION % ⊦ -> '|-' "" % MODELS % ⊧ -> '|=' "" % TRUE % ⊨ -> '|=' "" % FORCES % ⊩ -> '||-' "" % DOT OPERATOR % ⋅ -> '·' % STAR OPERATOR % ⋆ -> '*' % EQUAL AND PARALLEL TO % ⋕ -> '#' % VERY MUCH LESS-THAN % ⋘ -> '<<<' "" % VERY MUCH GREATER-THAN % ⋙ -> '>>>' "" % MIDLINE HORIZONTAL ELLIPSIS % ⋯ -> '...' 
"" % LEFT-POINTING ANGLE BRACKET % 〈 -> '<' % RIGHT-POINTING ANGLE BRACKET % 〉 -> '>' % SYMBOL FOR NULL % ␀ -> 'NUL' "" % SYMBOL FOR START OF HEADING % ␁ -> 'SOH' "" % SYMBOL FOR START OF TEXT % ␂ -> 'STX' "" % SYMBOL FOR END OF TEXT % ␃ -> 'ETX' "" % SYMBOL FOR END OF TRANSMISSION % ␄ -> 'EOT' "" % SYMBOL FOR ENQUIRY % ␅ -> 'ENQ' "" % SYMBOL FOR ACKNOWLEDGE % ␆ -> 'ACK' "" % SYMBOL FOR BELL % ␇ -> 'BEL' "" % SYMBOL FOR BACKSPACE % ␈ -> 'BS' "" % SYMBOL FOR HORIZONTAL TABULATION % ␉ -> 'HT' "" % SYMBOL FOR LINE FEED % ␊ -> 'LF' "" % SYMBOL FOR VERTICAL TABULATION % ␋ -> 'VT' "" % SYMBOL FOR FORM FEED % ␌ -> 'FF' "" % SYMBOL FOR CARRIAGE RETURN % ␍ -> 'CR' "" % SYMBOL FOR SHIFT OUT % ␎ -> 'SO' "" % SYMBOL FOR SHIFT IN % ␏ -> 'SI' "" % SYMBOL FOR DATA LINK ESCAPE % ␐ -> 'DLE' "" % SYMBOL FOR DEVICE CONTROL ONE % ␑ -> 'DC1' "" % SYMBOL FOR DEVICE CONTROL TWO % ␒ -> 'DC2' "" % SYMBOL FOR DEVICE CONTROL THREE % ␓ -> 'DC3' "" % SYMBOL FOR DEVICE CONTROL FOUR % ␔ -> 'DC4' "" % SYMBOL FOR NEGATIVE ACKNOWLEDGE % ␕ -> 'NAK' "" % SYMBOL FOR SYNCHRONOUS IDLE % ␖ -> 'SYN' "" % SYMBOL FOR END OF TRANSMISSION BLOCK % ␗ -> 'ETB' "" % SYMBOL FOR CANCEL % ␘ -> 'CAN' "" % SYMBOL FOR END OF MEDIUM % ␙ -> 'EM' "" % SYMBOL FOR SUBSTITUTE % ␚ -> 'SUB' "" % SYMBOL FOR ESCAPE % ␛ -> 'ESC' "" % SYMBOL FOR FILE SEPARATOR % ␜ -> 'FS' "" % SYMBOL FOR GROUP SEPARATOR % ␝ -> 'GS' "" % SYMBOL FOR RECORD SEPARATOR % ␞ -> 'RS' "" % SYMBOL FOR UNIT SEPARATOR % ␟ -> 'US' "" % SYMBOL FOR SPACE % ␠ -> 'SP' "" % SYMBOL FOR DELETE % ␡ -> 'DEL' "" % OPEN BOX % ␣ -> '_' % SYMBOL FOR NEWLINE % ␤ -> 'NL' "" % SYMBOL FOR DELETE FORM TWO % ␥ -> '///' "" % SYMBOL FOR SUBSTITUTE FORM TWO % ␦ -> '?' 
% CIRCLED DIGIT ONE % ① -> '(1)', '1' ""; % CIRCLED DIGIT TWO % ② -> '(2)', '2' ""; % CIRCLED DIGIT THREE % ③ -> '(3)', '3' ""; % CIRCLED DIGIT FOUR % ④ -> '(4)', '4' ""; % CIRCLED DIGIT FIVE % ⑤ -> '(5)', '5' ""; % CIRCLED DIGIT SIX % ⑥ -> '(6)', '6' ""; % CIRCLED DIGIT SEVEN % ⑦ -> '(7)', '7' ""; % CIRCLED DIGIT EIGHT % ⑧ -> '(8)', '8' ""; % CIRCLED DIGIT NINE % ⑨ -> '(9)', '9' ""; % CIRCLED NUMBER TEN % ⑩ -> '(10)' "" % CIRCLED NUMBER ELEVEN % ⑪ -> '(11)' "" % CIRCLED NUMBER TWELVE % ⑫ -> '(12)' "" % CIRCLED NUMBER THIRTEEN % ⑬ -> '(13)' "" % CIRCLED NUMBER FOURTEEN % ⑭ -> '(14)' "" % CIRCLED NUMBER FIFTEEN % ⑮ -> '(15)' "" % CIRCLED NUMBER SIXTEEN % ⑯ -> '(16)' "" % CIRCLED NUMBER SEVENTEEN % ⑰ -> '(17)' "" % CIRCLED NUMBER EIGHTEEN % ⑱ -> '(18)' "" % CIRCLED NUMBER NINETEEN % ⑲ -> '(19)' "" % CIRCLED NUMBER TWENTY % ⑳ -> '(20)' "" % PARENTHESIZED DIGIT ONE % ⑴ -> '(1)', '1' ""; % PARENTHESIZED DIGIT TWO % ⑵ -> '(2)', '2' ""; % PARENTHESIZED DIGIT THREE % ⑶ -> '(3)', '3' ""; % PARENTHESIZED DIGIT FOUR % ⑷ -> '(4)', '4' ""; % PARENTHESIZED DIGIT FIVE % ⑸ -> '(5)', '5' ""; % PARENTHESIZED DIGIT SIX % ⑹ -> '(6)', '6' ""; % PARENTHESIZED DIGIT SEVEN % ⑺ -> '(7)', '7' ""; % PARENTHESIZED DIGIT EIGHT % ⑻ -> '(8)', '8' ""; % PARENTHESIZED DIGIT NINE % ⑼ -> '(9)', '9' ""; % PARENTHESIZED NUMBER TEN % ⑽ -> '(10)' "" % PARENTHESIZED NUMBER ELEVEN % ⑾ -> '(11)' "" % PARENTHESIZED NUMBER TWELVE % ⑿ -> '(12)' "" % PARENTHESIZED NUMBER THIRTEEN % ⒀ -> '(13)' "" % PARENTHESIZED NUMBER FOURTEEN % ⒁ -> '(14)' "" % PARENTHESIZED NUMBER FIFTEEN % ⒂ -> '(15)' "" % PARENTHESIZED NUMBER SIXTEEN % ⒃ -> '(16)' "" % PARENTHESIZED NUMBER SEVENTEEN % ⒄ -> '(17)' "" % PARENTHESIZED NUMBER EIGHTEEN % ⒅ -> '(18)' "" % PARENTHESIZED NUMBER NINETEEN % ⒆ -> '(19)' "" % PARENTHESIZED NUMBER TWENTY % ⒇ -> '(20)' "" % DIGIT ONE FULL STOP % ⒈ -> '1.', '1' ""; % DIGIT TWO FULL STOP % ⒉ -> '2.', '2' ""; % DIGIT THREE FULL STOP % ⒊ -> '3.', '3' ""; % DIGIT FOUR FULL STOP % ⒋ -> '4.', '4' ""; % DIGIT 
FIVE FULL STOP % ⒌ -> '5.', '5' ""; % DIGIT SIX FULL STOP % ⒍ -> '6.', '6' ""; % DIGIT SEVEN FULL STOP % ⒎ -> '7.', '7' ""; % DIGIT EIGHT FULL STOP % ⒏ -> '8.', '8' ""; % DIGIT NINE FULL STOP % ⒐ -> '9.', '9' ""; % NUMBER TEN FULL STOP % ⒑ -> '10.' "" % NUMBER ELEVEN FULL STOP % ⒒ -> '11.' "" % NUMBER TWELVE FULL STOP % ⒓ -> '12.' "" % NUMBER THIRTEEN FULL STOP % ⒔ -> '13.' "" % NUMBER FOURTEEN FULL STOP % ⒕ -> '14.' "" % NUMBER FIFTEEN FULL STOP % ⒖ -> '15.' "" % NUMBER SIXTEEN FULL STOP % ⒗ -> '16.' "" % NUMBER SEVENTEEN FULL STOP % ⒘ -> '17.' "" % NUMBER EIGHTEEN FULL STOP % ⒙ -> '18.' "" % NUMBER NINETEEN FULL STOP % ⒚ -> '19.' "" % NUMBER TWENTY FULL STOP % ⒛ -> '20.' "" % PARENTHESIZED LATIN SMALL LETTER A % ⒜ -> '(a)', 'a' ""; % PARENTHESIZED LATIN SMALL LETTER B % ⒝ -> '(b)', 'b' ""; % PARENTHESIZED LATIN SMALL LETTER C % ⒞ -> '(c)', 'c' ""; % PARENTHESIZED LATIN SMALL LETTER D % ⒟ -> '(d)', 'd' ""; % PARENTHESIZED LATIN SMALL LETTER E % ⒠ -> '(e)', 'e' ""; % PARENTHESIZED LATIN SMALL LETTER F % ⒡ -> '(f)', 'f' ""; % PARENTHESIZED LATIN SMALL LETTER G % ⒢ -> '(g)', 'g' ""; % PARENTHESIZED LATIN SMALL LETTER H % ⒣ -> '(h)', 'h' ""; % PARENTHESIZED LATIN SMALL LETTER I % ⒤ -> '(i)', 'i' ""; % PARENTHESIZED LATIN SMALL LETTER J % ⒥ -> '(j)', 'j' ""; % PARENTHESIZED LATIN SMALL LETTER K % ⒦ -> '(k)', 'k' ""; % PARENTHESIZED LATIN SMALL LETTER L % ⒧ -> '(l)', 'l' ""; % PARENTHESIZED LATIN SMALL LETTER M % ⒨ -> '(m)', 'm' ""; % PARENTHESIZED LATIN SMALL LETTER N % ⒩ -> '(n)', 'n' ""; % PARENTHESIZED LATIN SMALL LETTER O % ⒪ -> '(o)', 'o' ""; % PARENTHESIZED LATIN SMALL LETTER P % ⒫ -> '(p)', 'p' ""; % PARENTHESIZED LATIN SMALL LETTER Q % ⒬ -> '(q)', 'q' ""; % PARENTHESIZED LATIN SMALL LETTER R % ⒭ -> '(r)', 'r' ""; % PARENTHESIZED LATIN SMALL LETTER S % ⒮ -> '(s)', 's' ""; % PARENTHESIZED LATIN SMALL LETTER T % ⒯ -> '(t)', 't' ""; % PARENTHESIZED LATIN SMALL LETTER U % ⒰ -> '(u)', 'u' ""; % PARENTHESIZED LATIN SMALL LETTER V % ⒱ -> '(v)', 'v' ""; % PARENTHESIZED 
LATIN SMALL LETTER W % ⒲ -> '(w)', 'w' ""; % PARENTHESIZED LATIN SMALL LETTER X % ⒳ -> '(x)', 'x' ""; % PARENTHESIZED LATIN SMALL LETTER Y % ⒴ -> '(y)', 'y' ""; % PARENTHESIZED LATIN SMALL LETTER Z % ⒵ -> '(z)', 'z' ""; % CIRCLED LATIN CAPITAL LETTER A % Ⓐ -> '(A)', 'A' ""; % CIRCLED LATIN CAPITAL LETTER B % Ⓑ -> '(B)', 'B' ""; % CIRCLED LATIN CAPITAL LETTER C % Ⓒ -> '(C)', 'C' ""; % CIRCLED LATIN CAPITAL LETTER D % Ⓓ -> '(D)', 'D' ""; % CIRCLED LATIN CAPITAL LETTER E % Ⓔ -> '(E)', 'E' ""; % CIRCLED LATIN CAPITAL LETTER F % Ⓕ -> '(F)', 'F' ""; % CIRCLED LATIN CAPITAL LETTER G % Ⓖ -> '(G)', 'G' ""; % CIRCLED LATIN CAPITAL LETTER H % Ⓗ -> '(H)', 'H' ""; % CIRCLED LATIN CAPITAL LETTER I % Ⓘ -> '(I)', 'I' ""; % CIRCLED LATIN CAPITAL LETTER J % Ⓙ -> '(J)', 'J' ""; % CIRCLED LATIN CAPITAL LETTER K % Ⓚ -> '(K)', 'K' ""; % CIRCLED LATIN CAPITAL LETTER L % Ⓛ -> '(L)', 'L' ""; % CIRCLED LATIN CAPITAL LETTER M % Ⓜ -> '(M)', 'M' ""; % CIRCLED LATIN CAPITAL LETTER N % Ⓝ -> '(N)', 'N' ""; % CIRCLED LATIN CAPITAL LETTER O % Ⓞ -> '(O)', 'O' ""; % CIRCLED LATIN CAPITAL LETTER P % Ⓟ -> '(P)', 'P' ""; % CIRCLED LATIN CAPITAL LETTER Q % Ⓠ -> '(Q)', 'Q' ""; % CIRCLED LATIN CAPITAL LETTER R % Ⓡ -> '(R)', 'R' ""; % CIRCLED LATIN CAPITAL LETTER S % Ⓢ -> '(S)', 'S' ""; % CIRCLED LATIN CAPITAL LETTER T % Ⓣ -> '(T)', 'T' ""; % CIRCLED LATIN CAPITAL LETTER U % Ⓤ -> '(U)', 'U' ""; % CIRCLED LATIN CAPITAL LETTER V % Ⓥ -> '(V)', 'V' ""; % CIRCLED LATIN CAPITAL LETTER W % Ⓦ -> '(W)', 'W' ""; % CIRCLED LATIN CAPITAL LETTER X % Ⓧ -> '(X)', 'X' ""; % CIRCLED LATIN CAPITAL LETTER Y % Ⓨ -> '(Y)', 'Y' ""; % CIRCLED LATIN CAPITAL LETTER Z % Ⓩ -> '(Z)', 'Z' ""; % CIRCLED LATIN SMALL LETTER A % ⓐ -> '(a)', 'a' ""; % CIRCLED LATIN SMALL LETTER B % ⓑ -> '(b)', 'b' ""; % CIRCLED LATIN SMALL LETTER C % ⓒ -> '(c)', 'c' ""; % CIRCLED LATIN SMALL LETTER D % ⓓ -> '(d)', 'd' ""; % CIRCLED LATIN SMALL LETTER E % ⓔ -> '(e)', 'e' ""; % CIRCLED LATIN SMALL LETTER F % ⓕ -> '(f)', 'f' ""; % CIRCLED LATIN SMALL LETTER G 
% ⓖ -> '(g)', 'g' ""; % CIRCLED LATIN SMALL LETTER H % ⓗ -> '(h)', 'h' ""; % CIRCLED LATIN SMALL LETTER I % ⓘ -> '(i)', 'i' ""; % CIRCLED LATIN SMALL LETTER J % ⓙ -> '(j)', 'j' ""; % CIRCLED LATIN SMALL LETTER K % ⓚ -> '(k)', 'k' ""; % CIRCLED LATIN SMALL LETTER L % ⓛ -> '(l)', 'l' ""; % CIRCLED LATIN SMALL LETTER M % ⓜ -> '(m)', 'm' ""; % CIRCLED LATIN SMALL LETTER N % ⓝ -> '(n)', 'n' ""; % CIRCLED LATIN SMALL LETTER O % ⓞ -> '(o)', 'o' ""; % CIRCLED LATIN SMALL LETTER P % ⓟ -> '(p)', 'p' ""; % CIRCLED LATIN SMALL LETTER Q % ⓠ -> '(q)', 'q' ""; % CIRCLED LATIN SMALL LETTER R % ⓡ -> '(r)', 'r' ""; % CIRCLED LATIN SMALL LETTER S % ⓢ -> '(s)', 's' ""; % CIRCLED LATIN SMALL LETTER T % ⓣ -> '(t)', 't' ""; % CIRCLED LATIN SMALL LETTER U % ⓤ -> '(u)', 'u' ""; % CIRCLED LATIN SMALL LETTER V % ⓥ -> '(v)', 'v' ""; % CIRCLED LATIN SMALL LETTER W % ⓦ -> '(w)', 'w' ""; % CIRCLED LATIN SMALL LETTER X % ⓧ -> '(x)', 'x' ""; % CIRCLED LATIN SMALL LETTER Y % ⓨ -> '(y)', 'y' ""; % CIRCLED LATIN SMALL LETTER Z % ⓩ -> '(z)', 'z' ""; % CIRCLED DIGIT ZERO % ⓪ -> '(0)', '0' ""; % BOX DRAWINGS LIGHT HORIZONTAL % ─ -> '-' % BOX DRAWINGS HEAVY HORIZONTAL % ━ -> '=' % BOX DRAWINGS LIGHT VERTICAL % │ -> '|' % BOX DRAWINGS HEAVY VERTICAL % ┃ -> '|' % BOX DRAWINGS LIGHT TRIPLE DASH HORIZONTAL % ┄ -> '-' % BOX DRAWINGS HEAVY TRIPLE DASH HORIZONTAL % ┅ -> '=' % BOX DRAWINGS LIGHT TRIPLE DASH VERTICAL % ┆ -> '|' % BOX DRAWINGS HEAVY TRIPLE DASH VERTICAL % ┇ -> '|' % BOX DRAWINGS LIGHT QUADRUPLE DASH HORIZONTAL % ┈ -> '-' % BOX DRAWINGS HEAVY QUADRUPLE DASH HORIZONTAL % ┉ -> '=' % BOX DRAWINGS LIGHT QUADRUPLE DASH VERTICAL % ┊ -> '|' % BOX DRAWINGS HEAVY QUADRUPLE DASH VERTICAL % ┋ -> '|' % BOX DRAWINGS LIGHT DOWN AND RIGHT % ┌ -> '+' % BOX DRAWINGS DOWN LIGHT AND RIGHT HEAVY % ┍ -> '+' % BOX DRAWINGS DOWN HEAVY AND RIGHT LIGHT % ┎ -> '+' % BOX DRAWINGS HEAVY DOWN AND RIGHT % ┏ -> '+' % BOX DRAWINGS LIGHT DOWN AND LEFT % ┐ -> '+' % BOX DRAWINGS DOWN LIGHT AND LEFT HEAVY % ┑ -> '+' % BOX DRAWINGS 
DOWN HEAVY AND LEFT LIGHT % ┒ -> '+' % BOX DRAWINGS HEAVY DOWN AND LEFT % ┓ -> '+' % BOX DRAWINGS LIGHT UP AND RIGHT % └ -> '+' % BOX DRAWINGS UP LIGHT AND RIGHT HEAVY % ┕ -> '+' % BOX DRAWINGS UP HEAVY AND RIGHT LIGHT % ┖ -> '+' % BOX DRAWINGS HEAVY UP AND RIGHT % ┗ -> '+' % BOX DRAWINGS LIGHT UP AND LEFT % ┘ -> '+' % BOX DRAWINGS UP LIGHT AND LEFT HEAVY % ┙ -> '+' % BOX DRAWINGS UP HEAVY AND LEFT LIGHT % ┚ -> '+' % BOX DRAWINGS HEAVY UP AND LEFT % ┛ -> '+' % BOX DRAWINGS LIGHT VERTICAL AND RIGHT % ├ -> '+' % BOX DRAWINGS VERTICAL LIGHT AND RIGHT HEAVY % ┝ -> '+' % BOX DRAWINGS UP HEAVY AND RIGHT DOWN LIGHT % ┞ -> '+' % BOX DRAWINGS DOWN HEAVY AND RIGHT UP LIGHT % ┟ -> '+' % BOX DRAWINGS VERTICAL HEAVY AND RIGHT LIGHT % ┠ -> '+' % BOX DRAWINGS DOWN LIGHT AND RIGHT UP HEAVY % ┡ -> '+' % BOX DRAWINGS UP LIGHT AND RIGHT DOWN HEAVY % ┢ -> '+' % BOX DRAWINGS HEAVY VERTICAL AND RIGHT % ┣ -> '+' % BOX DRAWINGS LIGHT VERTICAL AND LEFT % ┤ -> '+' % BOX DRAWINGS VERTICAL LIGHT AND LEFT HEAVY % ┥ -> '+' % BOX DRAWINGS UP HEAVY AND LEFT DOWN LIGHT % ┦ -> '+' % BOX DRAWINGS DOWN HEAVY AND LEFT UP LIGHT % ┧ -> '+' % BOX DRAWINGS VERTICAL HEAVY AND LEFT LIGHT % ┨ -> '+' % BOX DRAWINGS DOWN LIGHT AND LEFT UP HEAVY % ┩ -> '+' % BOX DRAWINGS UP LIGHT AND LEFT DOWN HEAVY % ┪ -> '+' % BOX DRAWINGS HEAVY VERTICAL AND LEFT % ┫ -> '+' % BOX DRAWINGS LIGHT DOWN AND HORIZONTAL % ┬ -> '+' % BOX DRAWINGS LEFT HEAVY AND RIGHT DOWN LIGHT % ┭ -> '+' % BOX DRAWINGS RIGHT HEAVY AND LEFT DOWN LIGHT % ┮ -> '+' % BOX DRAWINGS DOWN LIGHT AND HORIZONTAL HEAVY % ┯ -> '+' % BOX DRAWINGS DOWN HEAVY AND HORIZONTAL LIGHT % ┰ -> '+' % BOX DRAWINGS RIGHT LIGHT AND LEFT DOWN HEAVY % ┱ -> '+' % BOX DRAWINGS LEFT LIGHT AND RIGHT DOWN HEAVY % ┲ -> '+' % BOX DRAWINGS HEAVY DOWN AND HORIZONTAL % ┳ -> '+' % BOX DRAWINGS LIGHT UP AND HORIZONTAL % ┴ -> '+' % BOX DRAWINGS LEFT HEAVY AND RIGHT UP LIGHT % ┵ -> '+' % BOX DRAWINGS RIGHT HEAVY AND LEFT UP LIGHT % ┶ -> '+' % BOX DRAWINGS UP LIGHT AND HORIZONTAL HEAVY % ┷ 
-> '+' % BOX DRAWINGS UP HEAVY AND HORIZONTAL LIGHT % ┸ -> '+' % BOX DRAWINGS RIGHT LIGHT AND LEFT UP HEAVY % ┹ -> '+' % BOX DRAWINGS LEFT LIGHT AND RIGHT UP HEAVY % ┺ -> '+' % BOX DRAWINGS HEAVY UP AND HORIZONTAL % ┻ -> '+' % BOX DRAWINGS LIGHT VERTICAL AND HORIZONTAL % ┼ -> '+' % BOX DRAWINGS LEFT HEAVY AND RIGHT VERTICAL LIGHT % ┽ -> '+' % BOX DRAWINGS RIGHT HEAVY AND LEFT VERTICAL LIGHT % ┾ -> '+' % BOX DRAWINGS VERTICAL LIGHT AND HORIZONTAL HEAVY % ┿ -> '+' % BOX DRAWINGS UP HEAVY AND DOWN HORIZONTAL LIGHT % ╀ -> '+' % BOX DRAWINGS DOWN HEAVY AND UP HORIZONTAL LIGHT % ╁ -> '+' % BOX DRAWINGS VERTICAL HEAVY AND HORIZONTAL LIGHT % ╂ -> '+' % BOX DRAWINGS LEFT UP HEAVY AND RIGHT DOWN LIGHT % ╃ -> '+' % BOX DRAWINGS RIGHT UP HEAVY AND LEFT DOWN LIGHT % ╄ -> '+' % BOX DRAWINGS LEFT DOWN HEAVY AND RIGHT UP LIGHT % ╅ -> '+' % BOX DRAWINGS RIGHT DOWN HEAVY AND LEFT UP LIGHT % ╆ -> '+' % BOX DRAWINGS DOWN LIGHT AND UP HORIZONTAL HEAVY % ╇ -> '+' % BOX DRAWINGS UP LIGHT AND DOWN HORIZONTAL HEAVY % ╈ -> '+' % BOX DRAWINGS RIGHT LIGHT AND LEFT VERTICAL HEAVY % ╉ -> '+' % BOX DRAWINGS LEFT LIGHT AND RIGHT VERTICAL HEAVY % ╊ -> '+' % BOX DRAWINGS HEAVY VERTICAL AND HORIZONTAL % ╋ -> '+' % BOX DRAWINGS LIGHT DOUBLE DASH HORIZONTAL % ╌ -> '-' % BOX DRAWINGS HEAVY DOUBLE DASH HORIZONTAL % ╍ -> '=' % BOX DRAWINGS LIGHT DOUBLE DASH VERTICAL % ╎ -> '|' % BOX DRAWINGS HEAVY DOUBLE DASH VERTICAL % ╏ -> '|' % BOX DRAWINGS DOUBLE HORIZONTAL % ═ -> '=' % BOX DRAWINGS DOUBLE VERTICAL % ║ -> '|' % BOX DRAWINGS DOWN SINGLE AND RIGHT DOUBLE % ╒ -> '+' % BOX DRAWINGS DOWN DOUBLE AND RIGHT SINGLE % ╓ -> '+' % BOX DRAWINGS DOUBLE DOWN AND RIGHT % ╔ -> '+' % BOX DRAWINGS DOWN SINGLE AND LEFT DOUBLE % ╕ -> '+' % BOX DRAWINGS DOWN DOUBLE AND LEFT SINGLE % ╖ -> '+' % BOX DRAWINGS DOUBLE DOWN AND LEFT % ╗ -> '+' % BOX DRAWINGS UP SINGLE AND RIGHT DOUBLE % ╘ -> '+' % BOX DRAWINGS UP DOUBLE AND RIGHT SINGLE % ╙ -> '+' % BOX DRAWINGS DOUBLE UP AND RIGHT % ╚ -> '+' % BOX DRAWINGS UP SINGLE AND LEFT 
DOUBLE % ╛ -> '+' % BOX DRAWINGS UP DOUBLE AND LEFT SINGLE % ╜ -> '+' % BOX DRAWINGS DOUBLE UP AND LEFT % ╝ -> '+' % BOX DRAWINGS VERTICAL SINGLE AND RIGHT DOUBLE % ╞ -> '+' % BOX DRAWINGS VERTICAL DOUBLE AND RIGHT SINGLE % ╟ -> '+' % BOX DRAWINGS DOUBLE VERTICAL AND RIGHT % ╠ -> '+' % BOX DRAWINGS VERTICAL SINGLE AND LEFT DOUBLE % ╡ -> '+' % BOX DRAWINGS VERTICAL DOUBLE AND LEFT SINGLE % ╢ -> '+' % BOX DRAWINGS DOUBLE VERTICAL AND LEFT % ╣ -> '+' % BOX DRAWINGS DOWN SINGLE AND HORIZONTAL DOUBLE % ╤ -> '+' % BOX DRAWINGS DOWN DOUBLE AND HORIZONTAL SINGLE % ╥ -> '+' % BOX DRAWINGS DOUBLE DOWN AND HORIZONTAL % ╦ -> '+' % BOX DRAWINGS UP SINGLE AND HORIZONTAL DOUBLE % ╧ -> '+' % BOX DRAWINGS UP DOUBLE AND HORIZONTAL SINGLE % ╨ -> '+' % BOX DRAWINGS DOUBLE UP AND HORIZONTAL % ╩ -> '+' % BOX DRAWINGS VERTICAL SINGLE AND HORIZONTAL DOUBLE % ╪ -> '+' % BOX DRAWINGS VERTICAL DOUBLE AND HORIZONTAL SINGLE % ╫ -> '+' % BOX DRAWINGS DOUBLE VERTICAL AND HORIZONTAL % ╬ -> '+' % BOX DRAWINGS LIGHT ARC DOWN AND RIGHT % ╭ -> '+' % BOX DRAWINGS LIGHT ARC DOWN AND LEFT % ╮ -> '+' % BOX DRAWINGS LIGHT ARC UP AND LEFT % ╯ -> '+' % BOX DRAWINGS LIGHT ARC UP AND RIGHT % ╰ -> '+' % BOX DRAWINGS LIGHT DIAGONAL UPPER RIGHT TO LOWER LEFT % ╱ -> '/' % BOX DRAWINGS LIGHT DIAGONAL UPPER LEFT TO LOWER RIGHT % ╲ -> '\' % BOX DRAWINGS LIGHT DIAGONAL CROSS % ╳ -> 'X' % BOX DRAWINGS LIGHT LEFT AND HEAVY RIGHT % ╼ -> '-' % BOX DRAWINGS LIGHT UP AND HEAVY DOWN % ╽ -> '|' % BOX DRAWINGS HEAVY LEFT AND LIGHT RIGHT % ╾ -> '-' % BOX DRAWINGS HEAVY UP AND LIGHT DOWN % ╿ -> '|' % WHITE CIRCLE % ○ -> 'o' % WHITE BULLET % ◦ -> 'o' % BLACK STAR % ★ -> '*' % WHITE STAR % ☆ -> '*' % BALLOT BOX WITH X % ☒ -> 'X' % SALTIRE % ☓ -> 'X' % WHITE FROWNING FACE % ☹ -> ':-(' "" % WHITE SMILING FACE % ☺ -> ':-)' "" % BLACK SMILING FACE % ☻ -> '(-:' "" % MUSIC FLAT SIGN % ♭ -> 'b' % MUSIC SHARP SIGN % ♯ -> '#' % UPPER BLADE SCISSORS % ✁ -> '%<' "" % BLACK SCISSORS % ✂ -> '%<' "" % LOWER BLADE SCISSORS % ✃ -> '%<' "" % 
WHITE SCISSORS % ✄ -> '%<' "" % VICTORY HAND % ✌ -> 'V' % CHECK MARK % ✓ -> '√' % HEAVY CHECK MARK % ✔ -> '√' % MULTIPLICATION X % ✕ -> 'x' % HEAVY MULTIPLICATION X % ✖ -> 'x' % BALLOT X % ✗ -> 'X' % HEAVY BALLOT X % ✘ -> 'X' % OUTLINED GREEK CROSS % ✙ -> '+' % HEAVY GREEK CROSS % ✚ -> '+' % OPEN CENTRE CROSS % ✛ -> '+' % HEAVY OPEN CENTRE CROSS % ✜ -> '+' % LATIN CROSS % ✝ -> '+' % SHADOWED WHITE LATIN CROSS % ✞ -> '+' % OUTLINED LATIN CROSS % ✟ -> '+' % MALTESE CROSS % ✠ -> '+' % STAR OF DAVID % ✡ -> '*' % FOUR TEARDROP-SPOKED ASTERISK % ✢ -> '+' % FOUR BALLOON-SPOKED ASTERISK % ✣ -> '+' % HEAVY FOUR BALLOON-SPOKED ASTERISK % ✤ -> '+' % FOUR CLUB-SPOKED ASTERISK % ✥ -> '+' % BLACK FOUR POINTED STAR % ✦ -> '+' % WHITE FOUR POINTED STAR % ✧ -> '+' % STRESS OUTLINED WHITE STAR % ✩ -> '*' % CIRCLED WHITE STAR % ✪ -> '*' % OPEN CENTRE BLACK STAR % ✫ -> '*' % BLACK CENTRE WHITE STAR % ✬ -> '*' % OUTLINED BLACK STAR % ✭ -> '*' % HEAVY OUTLINED BLACK STAR % ✮ -> '*' % PINWHEEL STAR % ✯ -> '*' % SHADOWED WHITE STAR % ✰ -> '*' % HEAVY ASTERISK % ✱ -> '*' % OPEN CENTRE ASTERISK % ✲ -> '*' % EIGHT SPOKED ASTERISK % ✳ -> '*' % EIGHT POINTED BLACK STAR % ✴ -> '*' % EIGHT POINTED PINWHEEL STAR % ✵ -> '*' % SIX POINTED BLACK STAR % ✶ -> '*' % EIGHT POINTED RECTILINEAR BLACK STAR % ✷ -> '*' % HEAVY EIGHT POINTED RECTILINEAR BLACK STAR % ✸ -> '*' % TWELVE POINTED BLACK STAR % ✹ -> '*' % SIXTEEN POINTED ASTERISK % ✺ -> '*' % TEARDROP-SPOKED ASTERISK % ✻ -> '*' % OPEN CENTRE TEARDROP-SPOKED ASTERISK % ✼ -> '*' % HEAVY TEARDROP-SPOKED ASTERISK % ✽ -> '*' % SIX PETALLED BLACK AND WHITE FLORETTE % ✾ -> '*' % BLACK FLORETTE % ✿ -> '*' % WHITE FLORETTE % ❀ -> '*' % EIGHT PETALLED OUTLINED BLACK FLORETTE % ❁ -> '*' % CIRCLED OPEN CENTRE EIGHT POINTED STAR % ❂ -> '*' % HEAVY TEARDROP-SPOKED PINWHEEL ASTERISK % ❃ -> '*' % SNOWFLAKE % ❄ -> '*' % TIGHT TRIFOLIATE SNOWFLAKE % ❅ -> '*' % HEAVY CHEVRON SNOWFLAKE % ❆ -> '*' % SPARKLE % ❇ -> '*' % HEAVY SPARKLE % ❈ -> '*' % BALLOON-SPOKED ASTERISK 
% ❉ -> '*' % EIGHT TEARDROP-SPOKED PROPELLER ASTERISK % ❊ -> '*' % HEAVY EIGHT TEARDROP-SPOKED PROPELLER ASTERISK % ❋ -> '*' % LATIN SMALL LIGATURE FF % ﬀ -> 'ff' "" % LATIN SMALL LIGATURE FI % ﬁ -> 'fi' "" % LATIN SMALL LIGATURE FL % ﬂ -> 'fl' "" % LATIN SMALL LIGATURE FFI % ﬃ -> 'ffi' "" % LATIN SMALL LIGATURE FFL % ﬄ -> 'ffl' "" % LATIN SMALL LIGATURE LONG S T % ﬅ -> 'ſt', 'st' "";"" % LATIN SMALL LIGATURE ST % ﬆ -> 'st' "" % ZERO WIDTH NO-BREAK SPACE % ﻿ -> "" % REPLACEMENT CHARACTER % � -> '?'