Character code conversion

Character Code Conversion is a site that provides tools for converting (encoding/decoding) text from one character code to another. Simply paste your text to convert it to another character code (UTF-8, UTF-16, Unicode, Shift_JIS, JIS, EUC). It also supports URL encoding/decoding and HTML encoding/decoding. It is free to use.

URL encode

URL decode

HTML encode
(Char entity references)

HTML decode
(Char entity references)

HTML encode
(Numeric Char Reference)

HTML decode
(Numeric Char Reference)

Unicode encode
(UTF-8)

Unicode decode
(UTF-8)

Unicode encode
(UTF-16BE)

Unicode decode
(UTF-16BE)

Shift_JIS encode

Shift_JIS decode

JIS encode

JIS decode

EUC encode

EUC decode

Character Code

A character code is the number assigned to a character. By replacing characters with numbers, such as A=1, B=2, C=3, ..., Z=26, computers, which can only handle numbers, can work with the characters that humans read. (The A=1 mapping is a simplified illustration; in ASCII, for example, A is 65.)

Well-known character codes include "ASCII", "Shift_JIS", "UTF-8", and "UTF-16".

What is encoding and decoding?

  • Encoding: Converting data into another format
  • Decoding: Returning converted data to the original data
  • URL encoding and URL decoding: URL encoding converts characters that cannot be used in URLs into a form that can be used in URLs (for example, a space becomes %20); URL decoding restores the original characters
  • HTML encoding and HTML decoding: HTML encoding converts characters that cannot be used as-is in HTML, such as < and &, into a form that can be used in HTML; HTML decoding restores the original characters
  • Numeric character reference: Representing a specific character by its numeric character code (for example, &#60; for <)
  • Character entity reference: Representing a specific character by a predefined name (for example, &lt; for <)
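
The conversions listed above can be sketched with the Python standard library. This is only an illustration, not the tool's own implementation (the tool runs on PHP); the helper name to_numeric_refs is hypothetical, and the URL example assumes the text is first encoded as UTF-8.

```python
import html
from urllib.parse import quote, unquote


def to_numeric_refs(text: str) -> str:
    """Write every character as a decimal numeric character reference."""
    return "".join(f"&#{ord(ch)};" for ch in text)


# URL encoding: each byte of the UTF-8 form that is not URL-safe becomes %XX.
url_encoded = quote("a b&c=日本", safe="")
print(url_encoded)
print(unquote(url_encoded))             # URL decoding restores the text

# HTML encoding with character entity references (&lt;, &amp;, ...).
html_encoded = html.escape("<b>Tom & Jerry</b>")
print(html_encoded)
print(html.unescape(html_encoded))      # HTML decoding restores the text

# HTML encoding with numeric character references (&#NNNN;).
numeric = to_numeric_refs("日本")
print(numeric)
print(html.unescape(numeric))           # html.unescape handles both kinds
```

Note that html.escape only replaces the characters that have a special meaning in HTML, while to_numeric_refs in this sketch replaces every character.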

Types of character codes

  • ASCII: American Standard Code for Information Interchange. Represents English letters, numbers, and some symbols with 7 bits per character; it does not cover characters of other languages.
  • Shift_JIS: A character code for Japanese. It was jointly developed by Microsoft, ASCII Corporation (the company, not the ASCII character code), Mitsubishi Electric, and others, and is also known as "SJIS."
  • Unicode: Developed to be used as a universal character code. It is an international standard, and all major languages can be expressed in Unicode. It also supports symbols, emojis, currency symbols, and more.
  • UTF-8: One of the encoding methods for Unicode. It was designed so that ASCII can be used as is in Unicode: ASCII characters keep their single-byte values, and all other characters are expressed in 2 to 4 bytes.
  • UTF-16: One of the encoding methods for Unicode. In UTF-16, most characters are expressed in 2 bytes (16 bits); the remaining characters, such as many emojis, are expressed in 4 bytes.
  • EUC: Extended Unix Code, a character code established by AT&T. It supports the languages of various countries, and the variant that assigns the Japanese character set is called Japanese EUC (EUC-JP).

How to use

1) Paste the text whose character code you want to convert.
2) The text is automatically converted (encoded/decoded) into each type of character code.
3) Each converted (encoded/decoded) result can be copied or downloaded by clicking the corresponding icon in the upper right.

Notes

This tool is available for free.

※This program was created and its operation confirmed with PHP 8.1.22.
※If you have any inquiries, opinions, or requests, please fill out the following form.