Free Online Unicode Converter

Convert text to Unicode escape sequences (\uXXXX format) and back. Supports emoji, surrogate pairs, and multiple output formats, including \u{XXXXX} (ES6) and HTML entities.

Supported formats: \uXXXX (e.g., \u0048), \u{XXXXX} (e.g., \u{1F600}), &#xXXXX; (HTML entity). Emojis are automatically handled via surrogate pairs.

Surrogate Pairs: Unicode code points above U+FFFF (such as most emoji) cannot be represented with a single \uXXXX escape. UTF-16 encodes them as two 16-bit code units: a high surrogate in the range U+D800–U+DBFF followed by a low surrogate in the range U+DC00–U+DFFF. This tool automatically combines and splits surrogate pairs.

Example: 😀 (U+1F600) is encoded as \uD83D\uDE00 (two \uXXXX sequences forming a surrogate pair).
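The split described above is plain arithmetic, and a minimal Python sketch may make it concrete. The function names `to_surrogate_pair` and `escape_pair` are hypothetical and are not part of this tool; the sketch only assumes the standard UTF-16 rule (subtract 0x10000, then divide the remaining 20 bits into two 10-bit halves).

```python
from typing import Tuple


def to_surrogate_pair(code_point: int) -> Tuple[int, int]:
    """Split a code point above U+FFFF into a (high, low) UTF-16 surrogate pair."""
    if not 0x10000 <= code_point <= 0x10FFFF:
        raise ValueError("only code points U+10000..U+10FFFF use a surrogate pair")
    offset = code_point - 0x10000        # 20-bit value
    high = 0xD800 + (offset >> 10)       # top 10 bits -> U+D800..U+DBFF
    low = 0xDC00 + (offset & 0x3FF)      # bottom 10 bits -> U+DC00..U+DFFF
    return high, low


def escape_pair(code_point: int) -> str:
    """Format the surrogate pair as two 4-digit escapes."""
    high, low = to_surrogate_pair(code_point)
    return f"\\u{high:04X}\\u{low:04X}"


print(escape_pair(0x1F600))  # prints \uD83D\uDE00
```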

Character	Unicode Code Point	Escape Sequence
Space	U+0020	\u0020
A	U+0041	\u0041
a	U+0061	\u0061
0	U+0030	\u0030
Euro: €	U+20AC	\u20AC
Greek: α	U+03B1	\u03B1
Emoji: 😀	U+1F600	\uD83D\uDE00 or \u{1F600}
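The rows above can be reproduced with a short Python sketch. This is an illustration under stated assumptions, not the tool's actual implementation: the function name `encode` and the format keys `"utf16"`, `"es6"`, and `"html"` are hypothetical, and the ES6 and HTML outputs here use the shortest hex form rather than a fixed width.

```python
def encode(text: str, fmt: str = "utf16") -> str:
    # fmt is a hypothetical option name: "utf16", "es6", or "html".
    parts = []
    for ch in text:                      # Python iterates by code point
        cp = ord(ch)
        if fmt == "es6":
            parts.append(f"\\u{{{cp:X}}}")           # brace form, e.g. for U+1F600
        elif fmt == "html":
            parts.append(f"&#x{cp:X};")              # hexadecimal HTML entity
        elif cp > 0xFFFF:
            offset = cp - 0x10000                    # needs a surrogate pair
            high = 0xD800 + (offset >> 10)
            low = 0xDC00 + (offset & 0x3FF)
            parts.append(f"\\u{high:04X}\\u{low:04X}")
        else:
            parts.append(f"\\u{cp:04X}")             # single 4-digit escape
    return "".join(parts)


print(encode("A"))                       # prints \u0041
print(encode(chr(0x20AC)))               # prints \u20AC
print(encode(chr(0x1F600)))              # prints \uD83D\uDE00
print(encode(chr(0x1F600), "es6"))       # prints \u{1F600}
```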

Key Features

Instant Processing

All operations run locally in your browser, so results appear immediately with no network round trip.

Privacy Protected

Your data never leaves your browser. No server storage or tracking.

Frequently Asked Questions

Basics

Why does the emoji appear as two \uXXXX sequences instead of one?

Code points above U+FFFF (such as most emoji) cannot be represented by a single 4-digit \uXXXX escape, so UTF-16 represents them as a surrogate pair written as two escapes. Use the \u{XXXXX} (ES6) format for a single-escape representation.
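Going the other way, a surrogate pair can be combined back into one code point and then written in the single-escape form. The Python sketch below is an illustration only; `from_surrogate_pair` is a hypothetical helper name, not something this tool exposes.

```python
def from_surrogate_pair(high: int, low: int) -> int:
    """Combine a high and low UTF-16 surrogate into one code point."""
    if not (0xD800 <= high <= 0xDBFF and 0xDC00 <= low <= 0xDFFF):
        raise ValueError("not a valid high/low surrogate pair")
    # Each surrogate carries 10 bits of the 20-bit offset above U+10000.
    return 0x10000 + ((high - 0xD800) << 10) + (low - 0xDC00)


cp = from_surrogate_pair(0xD83D, 0xDE00)
print(f"U+{cp:X}")           # prints U+1F600
print(f"\\u{{{cp:X}}}")      # prints \u{1F600}
```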

How do I decode \uXXXX sequences from a JSON string?

Paste your JSON string into this tool's Decode tab. The converter recognizes \uXXXX, \u{XXXXX}, and &#xXXXX; formats and converts them back to the original characters. JavaScript's JSON.parse() also handles \uXXXX natively.
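For readers working outside the browser, here is a hedged Python sketch of the same idea. Python's `json.loads` plays the role that JSON.parse() plays in JavaScript and decodes \uXXXX escapes, including surrogate pairs. The `decode` function and its regular expression are hypothetical illustrations of how the three formats might be recognized, not this tool's actual code.

```python
import json
import re

# A JSON parser handles \uXXXX escapes (and surrogate pairs) on its own.
print(json.loads('"\\u0048\\u0069 \\uD83D\\uDE00"'))

# Hypothetical pattern covering the three formats named above.
_ESCAPE = re.compile(
    r"\\u\{([0-9A-Fa-f]{1,6})\}"     # \u{XXXXX}
    r"|\\u([0-9A-Fa-f]{4})"          # \uXXXX
    r"|&#x([0-9A-Fa-f]{1,6});"       # &#xXXXX;
)


def decode(text: str) -> str:
    def replace(match):
        hex_digits = match.group(1) or match.group(2) or match.group(3)
        return chr(int(hex_digits, 16))

    raw = _ESCAPE.sub(replace, text)
    # \uD83D\uDE00 is now two separate surrogate characters; a UTF-16
    # round trip merges each valid pair into a single code point.
    # An unpaired surrogate makes the final decode raise UnicodeDecodeError.
    return raw.encode("utf-16-le", "surrogatepass").decode("utf-16-le")


print(decode("\\u0048\\u0069"))      # prints Hi
```

Note that the \u{XXXXX} form is JavaScript source syntax and is not valid inside JSON, which is why the JSON parser is shown only with 4-digit escapes.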

Technical Details

What is the difference between \uXXXX and U+XXXX notation?

U+XXXX is a documentation convention for naming Unicode code points. It is not an escape sequence that these languages accept in source code. The \uXXXX format is the actual escape syntax used in JavaScript, Java, C#, and JSON.

Can I use this to find the Unicode code point of any character?

Yes. Paste any character into the Encode tab and select Pure Hex format. The output is the character's Unicode code point in hexadecimal. You can also expand the "View Encoding Mapping" panel for a per-code-point breakdown.
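The same lookup can be sketched in a few lines of Python. The function names `pure_hex` and `sizes` are hypothetical; the sketch assumes only that "Pure Hex" means the code point written in uppercase hexadecimal without a prefix.

```python
def pure_hex(text: str) -> list:
    """Return each character's code point as bare uppercase hex."""
    return [f"{ord(ch):X}" for ch in text]


def sizes(text: str) -> dict:
    """Count code points, UTF-16 code units, and UTF-8 bytes."""
    return {
        "code_points": len(text),
        "utf16_units": len(text.encode("utf-16-le")) // 2,
        "utf8_bytes": len(text.encode("utf-8")),
    }


sample = "A" + chr(0x20AC) + chr(0x1F600)   # A, euro sign, grinning face
print(pure_hex(sample))   # prints ['41', '20AC', '1F600']
print(sizes(sample))      # prints {'code_points': 3, 'utf16_units': 4, 'utf8_bytes': 8}
```

The emoji counts as one code point but two UTF-16 code units, which is the surrogate pair discussed earlier.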
