Skip to content
World Cup Kickoff Time Converter is liveSee kickoff times in your time zone and add calendar reminders.

Unicode Converter

Convert text and emoji to Unicode code points, decode U+, 0x, simple \u, or decimal tokens, and inspect UTF-8 bytes, UTF-16 units, and surrogate pairs locally.

Input Text

Characters

0 / 50,000

Unicode Converter

Quick Start

1
Enter text or emoji, then click Encode.
2
Enter U+, 0x, simple \u, or decimal tokens, then click Decode.
3
Separate multiple tokens with spaces, commas, semicolons, or new lines.
4
Review the same editor result, then copy it.

Common Scenarios

Programming

handle encodings and debug string issues

Data analysis

inspect special characters/encodings in text

Protocol text

inspect pasted payload text as code points and UTF-8/UTF-16 representations without decoding packet bytes.

Education

learn fundamentals of character encodings

Security review

spot-check zero-width, variation selector, ZWJ, or control code points without a confusable or bidi report.

Format conversion

switch between readable text and code point tokens

Conversion Parameters & Formats

Code point: U+4E2D, U+0041
Simple \u token: \u4E2D, \u0041. This parses tokens, not JavaScript string literals.
Decimal: 20013, 65
Hexadecimal: 0x4E2D, 0x0041
Separators: spaces, commas, semicolons, or new lines. Use U+, 0x, or \u for hex; unprefixed numeric tokens are decimal.

Code Point Boundaries

Encode always outputs a U+ code point list. Decimal values, UTF-8 bytes, UTF-16 units, and surrogate pairs appear in details, not as alternate output formats.
Decode accepts U+, 0x, simple \u, and decimal tokens. Use prefixes for hexadecimal input; bare hex like 1F600 is not a supported input style.
Simple \u token parsing is not a JavaScript string parser: \u{1F600}, \U0001F600, \xNN, CSS escapes, HTML entities, and URL percent-encoding are outside this tool.
Character details show the first 100 unique code points with occurrence counts. Long emoji sequences and combining marks are inspected as code points, not grapheme clusters.
Block, category, and emoji labels are lightweight local hints. The page is not an official Unicode name, version, script, bidi, or normalization database.
You can spot-check zero-width, variation, control, or unusual code points, but this is not a confusable, bidi, Trojan Source, IDN, or font coverage scanner.
Unicode code point conversion runs locally in your browser. Source text may stay as a browser draft. If a saved workspace or WebDAV sync is enabled, that text may be saved through that sync. Encode, Decode, and copied text all use the same editor content; character details are recalculated from that content and are not saved as a separate result state. This tool does not upload files, create attachments, or export files.

Usage Advice

Text input: type or paste the content
Emoji specifics: some emoji are composed of multiple code points (base + modifiers + variation selector + ZWJ)
Surrogate awareness: characters beyond the BMP (> U+FFFF) appear as UTF‑16 surrogate pairs in JavaScript strings

Limitations & Compatibility

Invalid range: values <0 or >0x10FFFF are rejected. Lone surrogate code points may display inconsistently across browsers.
Rendering: display varies by platform and font.
Block and category labels are lightweight local hints, not a full Unicode database with official names, scripts, or bidi data.

Privacy & Security

Unicode code point conversion runs locally in your browser. Source text may stay as a browser draft. If a saved workspace or WebDAV sync is enabled, that text may be saved through that sync. Encode, Decode, and copied text all use the same editor content; character details are recalculated from that content and are not saved as a separate result state. This tool does not upload files, create attachments, or export files.

FAQ

6

Continue with these related tools for the next step.

All tool processing happens locally in your browser.