How to read C declarations

The C programming language is notorious for its type declarations. The programming language was designed more than 50 years ago. The designers of the language, apparently, didn’t pay much attention to making it easier to understand declarations. Consider the following declaration.

int *p[4];

How should we read it? Is the above statement declaring p to be an array of four elements with each element pointing to an integer, or is it a pointer to an array of four elements each of which is an integer?

In this blog, we will learn how to read C declarations and apply that knowledge to convert the above declarations into simple English. We will first define some terminology and then outline the rules which will enable us to convert any declaration into a simple English sentence.

Declarator#

A declarator is a simple identifier (also called variable name), an array identifier (also called array variable name), a function name, or a pointer to any of the above, optionally followed by an equal sign and initial value or values. For example, first = 4, second[4] = {1, 1, 2, 3}, third(), *fourth, *fifth[4] and *sixth() are all valid declarators in the following declarations.

There may be any number of pointers, such as ***seventh, any number of array dimensions, such as eighth[4][5][6]; but only one pair of function parentheses. The declarator ninth()() is invalid. The declarators (*p)()[] , and(*p)[]() are also invalid.

An identifier, an identifier with array square brackets, or an identifier with function parentheses is also called a direct declarator. In the above examples, first, second[4], third(), fourth, fifth[4], and sixth() are direct declarators.

Type Specifier#

Type specifiers are char, double, float, int, long, signed, unsigned, enum, struct, and union. The keywords enum, struct and union are usually followed by what is called a tag. The keywords struct and union declare complex types.

Storage Class#

The storage class of a variable tells a compiler how to allocate memory for that variable. There are five storage classes, auto, extern, register, static, and typedef. The typedef storage class doesn't tell a compiler about memory allocation. It only defines a new name for a data type.

Declarations vs. definitions and linkage#

Decoding is easier when you also know what the line does.

Declaration#

Introduces a name and type to a scope (may or may not allocate storage).

Definition#

Allocates storage (for objects) or provides a body (for functions).

Rules of thumb#

extern int x; → declaration only (no storage).
int x; at file scope → definition (storage is allocated).
int x = 42; → definition with initialization.
typedef unsigned long u64; → not a variable; creates an alias for a type.

Linkage#

static at file scope → internal linkage (name not visible outside the translation unit).
extern → refers to an entity defined elsewhere (external linkage).
register is largely obsolete in modern compilers.

Knowing this helps you decode and understand why the declaration exists.

Type Qualifier#

As of this writing, there are four type-qualifiers; const, restrict, volatile, and _Atomic. The type qualifiers restrict and _Atomic were introduced in C99 and C11 standards. _Atomic is not only a type qualifier, but it is also a type specifier when used with standard type specifiers. For example, _Atomic(int) is a type specifier and not a type qualifier. We will discuss _Atomic in detail in another other blog.

Type Qualifier Rule#

If a type qualifier or qualifiers appear next to a type specifier ( int, char, float, double, etc.) it applies to that type-specifier. Otherwise, it applies to the asterisk pointer to its immediate left. The type qualifier restrict only applies to pointers.

We will apply this rule several times in decoding C declarations so that it becomes clear.

Explanation of Type Qualifier Rule#

Consider the following declaration.

Const-correctness quick reference#

A few high-leverage patterns to remember when working with const in C and C++:

const int *p ≡ int const *p → pointer to const int (the pointee is const; p can change to point elsewhere).
int * const p → const pointer to int (the pointer is fixed; the int can change).
const int * const p → const pointer to const int.

Tip:
Qualifiers bind to the thing they’re next to.

If adjacent to *, they qualify the pointer.
If adjacent to the base type, they qualify the pointee.

This matches the logic of Rule 3 from the declarator reading order.

For function pointers, qualifiers apply the same way

Rules for Understanding C Declarations#

We locate the first identifier reading from the left and then follow the precedence rules.

Precedence Rules#

Rule 1. Read the postfix operators (square brackets indicating an array and parentheses indicating a function) from left to right, till the semicolon or the closing unmatched parenthesis is reached.
Rule 2. Read the prefix asterisk operators indicating a pointer, till the beginning of the declaration or the opening parenthesis, corresponding to the closing parenthesis of Rule 1, is reached.
Rule 3. If a type qualifier or qualifiers appear next to a type specifier ( int, char, float, double, etc.) it applies to that type-specifier. Otherwise, it applies to the asterisk pointer to its immediate left. The type qualifier restrict only applies to pointers.

The clockwise/spiral rule (a second, visual algorithm)#

Many engineers like the clockwise/spiral rule as a tactile way to read a declarator:

Start at the identifier (e.g., p in int *p[4];).
Move right as far as you can, consuming any postfix (() function, [] array).
Spiral left to consume a single prefix * (pointer).
Repeat steps 2–3, jumping across parentheses when you hit them, until you run out of tokens.
Finally, prepend the base type you see to the left (int, struct T, etc.).

Examples#

int *p[4];
→ p right: [] ⇒ “array[4]”; spiral left: * ⇒ “pointer to”; prepend int ⇒
“p is array[4] of pointer to int.”
int (*p)[4];
→ p right: ) stops; left: * ⇒ “pointer to”; right outside parens: [] ⇒ “array[4]”; prepend int ⇒
“pointer to array[4] of int.”

The spiral rule is equivalent to standard precedence rules but is often faster to do in your head.

Arrays, pointers, and function parameters (decay, VLAs)#

In parameter lists, arrays decay to pointers:

This distinction — p[N] vs (*p)[N] — is one of the most common decoding pitfalls.

Application of Rules#

Let's apply the above rules to understand the very first declaration we talked about, i.e. int *p[4];.

In the following illustrations, the red arrow indicates starting position and the green arrow indicates the ending position. The rule under consideration is applicable to the text between the two arrows. The purple color is used to indicate the text that has already been processed.

The first identifier in the above declaration is p.

We apply Rule 1 and read the postfix operator (in this case square brackets indicating an array) till we reach the semicolon, " p is an array of 4 . . .".

On 64-bit machines, all pointers ( char *, char **, char ***, and so on) are 8-byte long. Compiling and executing the above program produces output like shown below.

Size of pointer on this machine: 8 bytes q: 0x600001e90d20 q+1: 0x600001e90d70

We observe that even though q is an 8-byte pointer, advancing it by 1 changes the address by 0x50 or 80 bytes. This confirms that q is indeed a pointer to an array of 10 pointers to characters, exactly as we found by decoding it.

Let us decode one more complex C declaration, which will require applying Rule 3If a type qualifier or qualifiers appear next to a type specifier ( int, char, float, double, etc.) it applies to that type-specifier. Otherwise, it applies to the asterisk pointer to its immediate left. The type qualifier restrict only applies to pointers.. Here is the declaration we want to convert to simple English.
char ** const * volatile x;

We find the first identifier in the declaration, which is x.

There is a semicolon to the immediate right of x hence we cannot apply Rule 1Read the postfix operators (square brackets indicating an array and parentheses indicating a function) from left to right, till the semicolon or the closing unmatched parenthesis is reached. to this declaration. To the immediate left of x is the type qualifier volatile which means we have to apply Rule 3If a type qualifier or qualifiers appear next to a type specifier ( int, char, float, double, etc.) it applies to that type-specifier. Otherwise, it applies to the asterisk pointer to its immediate left. The type qualifier restrict only applies to pointers.. In this case, according to Rule 3If a type qualifier or qualifiers appear next to a type specifier ( int, char, float, double, etc.) it applies to that type-specifier. Otherwise, it applies to the asterisk pointer to its immediate left. The type qualifier restrict only applies to pointers., the type qualifier volatile applies to the asterisk (pointer) to its immediate left.

Since the type qualifier applies to the asterisk to its immediate left, we stop here temporarily and read till this point. " x is a volatile pointer to . . .".
We find const to the left of the constant pointer. According to Rule 3If a type qualifier or qualifiers appear next to a type specifier ( int, char, float, double, etc.) it applies to that type-specifier. Otherwise, it applies to the asterisk pointer to its immediate left. The type qualifier restrict only applies to pointers., it applies to the asterisk (pointer) to its immediate left. We read, " x is a volatile pointer to a constant pointer . . .".

We still haven't reached a semicolon or an unmatched parenthesis, so we continue applying Rule 1Read the postfix operators (square brackets indicating an array and parentheses indicating a function) from left to right, till the semicolon or the closing unmatched parenthesis is reached.. We find an equal sign ( = ) indicating an initializer. Let's handle it at the end.
We apply Rule 2Read the prefix asterisk operators indicating a pointer, till the beginning of the declaration or the opening parenthesis, corresponding to the closing parenthesis of Rule 1, is reached. looking for the prefix operators. We find int to the immediate left of (*cmp) which is a type specifier. We read, " cmp is a pointer to a function (which has two parameters, both are pointers to constant void) and returns an integer."

The initialization part of the declaration stores the value of the variable ascending (which must be a function of the appropriate type, as mentioned in the declaration) in the identifier cmp.

Finally, let's look at the most complicated declaration in the list given at the beginning.

struct IMAGE *(*(*(*fp)[5]))(const char *, int);

On applying the rules, we obtain the following simple English representation:

" fp is a pointer to an array of 5 pointers to pointer to functions (whose first parameter is a pointer to a constant character and the second parameter is an integer) and returns a pointer to struct IMAGE."

The figure below shows the sequence in which this complex declaration is handled, by numbering its various parts.

Every C declaration begins with a type specifier, such as char, int, double, etc, or a type qualifier const or volatile. The type qualifier restrict cannot begin a declaration as it applies to pointers only. The type specifier could be one keyword, such as int, or multiple keywords, such as unsigned long int, or long double. Type specifier may have the struct, union, and enum keywords.

We start with the first identifier from left, applying Rule 1 (postfix operators) till we encounter an unmatched closing parenthesis or a semicolon indicating the end of the declaration. Then we apply Rule 2 (prefix operators) till we encounter an opening parenthesis or reach the beginning of the declaration.

We alternate between Rule 1Read the postfix operators (square brackets indicating an array and parentheses indicating a function) from left to right, till the semicolon or the closing unmatched parenthesis is reached. and Rule 2Read the prefix asterisk operators indicating a pointer, till the beginning of the declaration or the opening parenthesis, corresponding to the closing parenthesis of Rule 1, is reached. (alternating from right to left and back to right, starting with the first identifier from left) till the entire declaration has been read. We apply Rule 3If a type qualifier or qualifiers appear next to a type specifier ( int, char, float, double, etc.) it applies to that type-specifier. Otherwise, it applies to the asterisk pointer to its immediate left. The type qualifier restrict only applies to pointers. when we encounter any type qualifier along the way.

With this knowledge, we can decode any valid complex C declaration into simple English.

Mini cheat sheet of frequent patterns#

T *p → pointer to T
T **p → pointer to pointer to T
T p[N] → array [N] of T
T (*p)[N] → pointer to array [N] of T
T *p[N] → array [N] of pointer to T
T (*p)(args) → pointer to function taking args returning T
T (*p[])(args) → array of pointer to function taking args returning T
T (*(*p)[N])(args) → pointer to array [N] of pointer to function taking args returning T
const T *p / T const *p → pointer to const T
T * const p → const pointer to T

Keep this near your editor until the shapes become second nature.

Tooling: check your reading with cdecl and the compiler#

Two practical ways to validate a tricky line:

cdecl (online or CLI)#

English → C:
declare a as pointer to function (void) returning pointer to array 10 of pointer to char
C → English:
explain char *(*(*a)())[10]

Your compiler#

Put the declaration in a tiny .c file and compile with -Wall -Wextra -Wpedantic (GCC/Clang).
Try taking sizeof of sub-expressions via helper code to confirm “pointer to array” vs “array of pointers” behavior.

This feedback loop cements the decoding skill fast.

Exercises#

Please decode the following declarations for more practice. Answers are provided to verify your work.

Answers#

b is array 8 of pointers to const pointers to function which takes no parameters and returns a pointer to int
c is a pointer to a function with no parameters returning a pointer to const pointer to char
p is an array 4 of pointer to functions that has a pointer to char parameter returning a pointer to an array of pointers to char
s is a function that takes two parameters, the first one is an int and the second one is a pointer to a function that takes an int and returns void, returning a pointer to a function that has an int parameter and returns void
f is a function that takes an int parameter and returns a pointer to a function that takes an int parameter and returns a pointer to void
x is volatile pointer to const pointer to pointer to char
f is a two-dimensional array (second dimension is 4) of pointer to pointer to function, that takes on parameters, returning pointer to array of pointer to char

The Next Steps#

Browse the following courses to learn more about C programming language.

References#

https://www.iso-9899.info/wiki/The_Standard
C Programming Language, 2nd Edition, Brian W. Kernighan, Dennis M. Ritchie
C: A Reference Manual, 5th Edition, Samuel Harbison, Guy Steele Jr.
Expert C Programming: Deep C Secrets, Peter van der Linden

Table of Contents