In C, we have something called a struct
, which is a user-definable type that holds multiple pieces of data, potentially of different types.
It’s a convenient way to bundle multiple variables into a single one. This can be beneficial for passing variables to functions (so you just have to pass one instead of many), and useful for organizing data and making code more readable.
If you’ve come from another language, you might be familiar with the idea of classes and objects. These don’t exist in C, natively67. You can think of a struct
as a class with only data members, and no methods.
You can declare a struct
in your code like so:
struct car {
char *name;
float price;
int speed;
};
This is often done at the global scope outside any functions so that the struct
is globally available.
When you do this, you’re making a new type. The full type name is struct car
. (Not just car
—that won’t work.)
There aren’t any variables of that type yet, but we can declare some:
struct car saturn; // Variable "saturn" of type "struct car"
And now we have an uninitialized variable saturn
68 of type struct car
.
We should initialize it! But how do we set the values of those individual fields?
Like in many other languages that stole it from C, we’re going to use the dot operator (.
) to access the individual fields.
.name = "Saturn SL/2";
saturn.price = 15999.99;
saturn.speed = 175;
saturn
("Name: %s\n", saturn.name);
printf("Price (USD): %f\n", saturn.price);
printf("Top Speed (km): %d\n", saturn.speed); printf
There on the first lines, we set the values in the struct car
, and then in the next bit, we print those values out.
That example in the previous section was a little unwieldy. There must be a better way to initialize that struct
variable!
You can do it with an initializer by putting values in for the fields in the order they appear in the struct
when you define the variable. (This won’t work after the variable has been defined—it has to happen in the definition).
struct car {
char *name;
float price;
int speed;
};
// Now with an initializer! Same field order as in the struct declaration:
struct car saturn = {"Saturn SL/2", 16000.99, 175};
("Name: %s\n", saturn.name);
printf("Price: %f\n", saturn.price);
printf("Top Speed: %d km\n", saturn.speed); printf
The fact that the fields in the initializer need to be in the same order is a little freaky. If someone changes the order in struct car
, it could break all the other code!
We can be more specific with our initializers:
struct car saturn = {.speed=175, .name="Saturn SL/2"};
Now it’s independent of the order in the struct
declaration. Which is safer code, for sure.
Similar to array initializers, any missing field designators are initialized to zero (in this case, that would be .price
, which I’ve omitted).
You can do a couple things to pass a struct
to a function.
struct
.struct
.Recall that when you pass something to a function, a copy of that thing gets made for the function to operate on, whether it’s a copy of a pointer, an int
, a struct
, or anything.
There are basically two cases when you’d want to pass a pointer to the struct
:
struct
that was passed in, and have those changes show in the caller.struct
is somewhat large and it’s more expensive to copy that onto the stack than it is to just copy a pointer69.For those two reasons, it’s far more common to pass a pointer to a struct
to a function, though its by no means illegal to pass the struct
itself.
Let’s try passing in a pointer, making a function that will allow you to set the .price
field of the struct car
:
#include <stdio.h>
struct car {
char *name;
float price;
int speed;
};
int main(void)
{
struct car saturn = {.speed=175, .name="Saturn SL/2"};
// Pass a pointer to this struct car, along with a new,
// more realistic, price:
set_price(&saturn, 799.99);
printf("Price: %f\n", saturn.price);
}
You should be able to come up with the function signature for set_price()
just by looking at the types of the arguments we have there.
saturn
is a struct car
, so &saturn
must be the address of the struct car
, AKA a pointer to a struct car
, namely a struct car*
.
And 799.99
is a float
.
So the function declaration must look like this:
void set_price(struct car *c, float new_price)
We just need to write the body. One attempt might be:
void set_price(struct car *c, float new_price) {
.price = new_price; // ERROR!!
c}
That won’t work because the dot operator only works on struct
s… it doesn’t work on pointers to struct
s.
Ok, so we can dereference the variable c
to de-pointer it to get to the struct
itself. Dereferencing a struct car*
results in the struct car
that the pointer points to, which we should be able to use the dot operator on:
void set_price(struct car *c, float new_price) {
(*c).price = new_price; // Works, but is ugly and non-idiomatic :(
}
And that works! But it’s a little clunky to type all those parens and the asterisk. C has some syntactic sugar called the arrow operator that helps with that.
The arrow operator helps refer to fields in pointers to struct
s.
void set_price(struct car *c, float new_price) {
// (*c).price = new_price; // Works, but non-idiomatic :(
//
// The line above is 100% equivalent to the one below:
->price = new_price; // That's the one!
c}
So when accessing fields, when do we use dot and when do we use arrow?
struct
, use dot (.
).struct
, use arrow (->
). struct
sHere’s an easy one for you!
Just assign from one to the other!
struct car a, b;
= a; // Copy the struct b
And returning a struct
(as opposed to a pointer to one) from a function also makes a similar copy to the receiving variable.
This is not a “deep copy”70. All fields are copied as-is, including pointers to things.
struct
sThere’s only one safe way to do it: compare each field one at a time.
You might think you could use memcmp()
71, but that doesn’t handle the case of the possible padding bytes that might be in there.
If you clear the struct
to zero first with memset()
72, then it might work, though there could be weird elements that might not compare as you expect73.