Free Online Unicode Converter

Convert text to Unicode escape sequences (\\uXXXX format) and back. Supports emoji, surrogate pairs, and multiple output formats including \\u{XXXXX} (ES6) and HTML entities.

Supported formats: \uXXXX (e.g., \u0048), \u{XXXXX} (e.g., \u{1F600}), &#xXXXX; (HTML entity). Emojis are automatically handled via surrogate pairs.
text_fields Input
code Unicode Output
Characters: 0 Code Points: 0 Bytes (UTF-8): 0
Character Code Point (Hex) Escape Sequence

Surrogate Pairs: Unicode code points above U+FFFF (like emoji) cannot be represented with a single \\uXXXX. UTF-16 uses two 16-bit code units (high surrogate + low surrogate) to encode them, with ranges U+D800–U+DBFF and U+DC00–U+DFFF respectively. This tool automatically handles surrogate pair combination and splitting.

Example: 😀 (U+1F600) is encoded as \\uD83D\\uDE00 (two \\uXXXX sequences forming a surrogate pair).

Character Unicode Code Point Escape Sequence
SpaceU+0020\u0020
AU+0041\u0041
aU+0061\u0061
0U+0030\u0030
Euro: €U+20AC\u20AC
Greek: αU+03B1\u03B1
Emoji: 😀U+1F600\uD83D\uDE00 or \u{1F600}

Key Features

bolt

Instant Processing

All operations run locally in your browser for instant, zero-latency results.

lock

Privacy Protected

Your data never leaves your browser. No server storage or tracking.

Frequently Asked Questions

Basics
Why does the emoji appear as two \\uXXXX sequences instead of one?expand_more
Code points above U+FFFF (such as emoji) cannot be represented by a single 4-digit \\uXXXX escape. UTF-16 encoding represents these characters using surrogate pairs. Use the \\u{XXXXX} (ES6) format for a single-escape representation.
How do I decode \\uXXXX sequences from a JSON string?expand_more
Paste your JSON string into this tool's Decode tab. The converter recognizes \\uXXXX, \\u{XXXXX}, and &#xXXXX; formats and converts them back to the original characters. JavaScript's JSON.parse() also handles \\uXXXX natively.
Technical Details
What is the difference between \\uXXXX and U+XXXX notation?expand_more
U+XXXX is a documentation convention for naming Unicode code points. It is not a valid escape sequence in any programming language. The \\uXXXX format is the actual escape syntax used in JavaScript, Java, C#, and JSON.
Can I use this to find the Unicode code point of any character?expand_more
Yes. Paste any character into the Encode tab and select Pure Hex format. The output is the character's Unicode code point in hexadecimal. You can also expand the "View Encoding Mapping" panel for a per-code-point breakdown.