What is Base32? Characters, Tables, Regular Expressions, Z-Base32, Base32Hex and Crockford’s Base32

In this article, we will go over Base32 encoding in detail, focusing on its unique characteristics in data manipulation. We will look at its character set, provide a reference table, and look at various character groups. In addition, we’ll look at the differences introduced by Z-Base32, talk about case sensitivity, and show how regular expressions can be used to detect Base32. Join us on this educational journey as we explore the complexities of Base32 encoding and its applications.

The Base32 Character Set

In contrast to its sibling Base64, which uses 64 characters, Base32 uses a smaller set of 32 characters. This decision is motivated by the desire to create encoding that avoids similar-looking characters in order to avoid transcription or display errors.

The Base32 character set includes the following symbols: A–Z and 2–7. These characters have been carefully chosen to avoid confusion between visually similar characters such as ‘0’ and ‘O’ or ‘1’ and ‘I’. As a result, the character set ensures that data is reliably represented and transmitted.

Online Base32 Translator

Base32 Character Table and Reference

A reference table is required for a clear understanding of Base32 encoding. A comprehensive table demonstrating the correspondence between binary, decimal, and Base32 values is provided below, providing a quick and convenient way to look up character representations for various data.

Binary	Decimal	Base32
00000	0	A
00001	1	B
00010	2	C
00011	3	D
00100	4	E
00101	5	F
00110	6	G
00111	7	H
01000	8	I
01001	9	J
01010	10	K
01011	11	L
01100	12	M
01101	13	N
01110	14	O
01111	15	P
10000	16	Q
10001	17	R
10010	18	S
10011	19	T
10100	20	U
10101	21	V
10110	22	W
10111	23	X
11000	24	Y
11001	25	Z
11010	26	2
11011	27	3
11100	28	4
11101	29	5
11110	30	6
11111	31	7

This reference table is a handy tool for quickly reading Base32-encoded characters and their related values, which will aid you in a variety of encoding and decoding activities.

Base32 Characters Group

Characters in Base32 encoding are classified into distinct categories based on their indices within the character set. These are some of the categories:

Uppercase Letters (Indices 0-25): A, B, C, D, E, F, G, H, I, J, K, L, M, N, O, P, Q, R, S, T, U, V, W, X, Y, Z

Numbers (Indices 26-31): 2, 3, 4, 5, 6, 7

Z-Base32 Differences

While the traditional Base32 encoding uses uppercase letters and numbers, the Z-Base32 variant takes a different approach. The character set in Z-Base32 is designed to reduce potential reading errors caused by visual similarity between characters.

Decimal	Z-Base32	Binary
0	y	00000
1	b	00001
2	n	00010
3	d	00011
4	r	00100
5	f	00101
6	g	00110
7	8	00111
8	e	01000
9	j	01001
10	k	01010
11	m	01011
12	c	01100
13	p	01101
14	q	01110
15	x	01111
16	o	10000
17	t	10001
18	1	10010
19	u	10011
20	w	10100
21	i	10101
22	s	10110
23	z	10111
24	a	11000
25	3	11001
26	4	11010
27	5	11011
28	h	11100
29	7	11101
30	6	11110
31	9	11111

Online Z-Base32 Translator

Case Sensitivity in Base32 Encoding

In Base32 encoding, characters are case-sensitive, which means that lowercase and uppercase letters represent different values. Here are some examples:

B64ENCODER = II3DIRKOINHUIRKS
b64encoder = MI3DIZLOMNXWIZLS
B64encoder = II3DIZLOMNXWIZLS
B64Encoder = II3DIRLOMNXWIZLS

Using Regular Expressions for Base32 Detection

Regular expressions can be an extremely effective tool for pattern matching and detecting specific formats within a string. Regular expressions can help identify and extract Base32 encoded strings within a larger body of text when it comes to detecting Base32 encoded strings within a larger body of text.

Here’s a simple regular expression pattern that can be used to detect Base32 encoded strings:

\b[A-Z2-7]+=*\b

Let’s dissect this pattern’s components:

\b: This is a word boundary anchor that ensures the pattern matches whole words.
[A-Z2-7]+: This character class matches uppercase letters from A to Z, and the digits 2 to 7. The + indicates that one or more of these characters must be present.

=*: This matches any number of equal signs (padding characters) that may be present at the end of the Base32 encoded string.

Base32Hex and Crockford’s Base32

Base32Hex is an alternative to the standard Base32. It is a notation for expressing numbers that can be conveniently and accurately transmitted between humans and computer systems. This textual 32-symbol notation is human readable, machine readable, compact, error resistant, and pronounceable.

On the other hand, Crockford’s Base32 is another alternative design for Base32 created by Douglas Crockford. It has been designed with certain modifications to avoid confusion and accidental obscenity. It excludes the letters I, L, O to avoid confusion with digits and U to prevent accidental obscenity. When decoding, upper and lower case letters are accepted, and i and l will be treated as 1 and o will be treated as 0. When encoding, only upper case letters are used. Crockford also proposes using additional characters for a mod-37 checksum.

Base32Hex Encoder

Base32Hex Decoder