Text to binary (UTF-8)

Encode text to 0/1 per UTF-8 byte or decode back.

{{ textBinary.message }}

Overview

If you have ever spent hours debugging a character encoding issue — that classic situation where `ã` becomes `ã` or where a JSON that looks correct in the editor mysteriously breaks at the API — you have probably wondered how computers actually store letters. The answer is straightforward: they do not store letters. They store numbers. Every key you press is mapped to an integer, which in turn is stored in memory as a sequence of bits — zeros and ones. The concept goes back to the first electrical computers: the ENIAC, completed in 1945, operated with decimal numbers in hardware, but the IBM 701 of 1952 and its successors popularized binary representation as the universal standard of computing. Since then, everything in a computer — text, images, sounds, executable code — is ultimately a sequence of bits.

The mapping between letters and numbers was first standardized in 1963 with ASCII (American Standard Code for Information Interchange), which assigned values from 0 to 127 to Latin alphabet letters, digits, punctuation, and control characters. `A` became 65, `a` became 97, space became 32. It worked well for English, but the world has over 7,000 active languages. In the years that followed, countless regional extensions emerged — ISO 8859-1 for Western Europe, Microsoft's Windows-1252, Japan's Shift-JIS — and the incompatibilities among them created the chaos that Unicode was designed to fix. In 1992, Ken Thompson and Rob Pike created UTF-8: a variable-length encoding where ASCII characters occupy exactly 1 byte and characters outside ASCII use 2 to 4 bytes. Today, over 98% of web pages use UTF-8.

This tool converts each UTF-8 byte of the text into its 8-bit binary representation and also decodes back. In practice, for English-only text with ASCII characters, each character produces exactly one 8-bit group. For text with accented characters — like `ã` (U+00E3), which in UTF-8 is encoded as two bytes: 0xC3 and 0xA3 — each character can produce two or more groups. This is informative and sometimes surprising: a 5-letter string with accents can turn into 10 or 15 bytes in binary. Real use cases include visualizing exactly what a given text represents in bytes to debug serialization problems, creating educational examples to explain encoding, solving CTF (Capture The Flag) challenges where data is transmitted in binary, and simply satisfying the curiosity of seeing text the way the machine sees it.

Technical deep dive

Common questions summarized

  • What is this tool for?: It runs fully in your browser: useful to validate, format, or convert data in everyday development.
  • Are my inputs sent to a server?: Processing happens locally with JavaScript. We do not store what you paste into the text areas.
  • Can I use this for real production data?: Use at your own risk. For secrets (passwords, tokens), prefer controlled environments and your company policies. And always review the generated contents. Never trust blindly things you see on the internet.

Sample payload to try

  • See also the larger "Code Snippets" sample; paste this excerpt to try locally: Sample — A → 01000001 (byte único em ASCII)

Tool guide

  • What “binary” means here The text’s bytes as zeros and ones (8 bits per UTF-8 byte).

  • What the tool does Encodes text to 0/1 strings per byte; decodes while ignoring free spaces and line breaks between digits.

  • Why use it Teaching, low-level debugging, character-encoding exercises.

Code Snippets

Code example
A → 01000001 (byte único em ASCII)

Sample

A → 01000001 (byte único em ASCII)

FAQ

What is this tool for?

It runs fully in your browser: useful to validate, format, or convert data in everyday development.

Are my inputs sent to a server?

Processing happens locally with JavaScript. We do not store what you paste into the text areas.

Can I use this for real production data?

Use at your own risk. For secrets (passwords, tokens), prefer controlled environments and your company policies. And always review the generated contents. Never trust blindly things you see on the internet.