6.2 Move Semantics, Cloning, and Copying
Rust primarily uses move semantics for data stored on the heap, while also providing cloning for explicit deep copies and a light copy trait for small, stack-only types. Let’s clarify a few terms first:
- Move: Transferring ownership of a resource from one variable to another without duplicating the underlying data.
- Shallow copy: Copying only the “outer” parts of a value (for example, a pointer) while leaving the heap-allocated data it points to untouched.
- Deep copy: Copying both the outer data (such as a pointer) and the resource(s) on the heap to which it refers.
6.2.1 Move Semantics
In Rust, many types that manage heap-allocated resources (like String
) employ move semantics. When you assign one variable to another or pass it to a function, ownership is moved rather than copied. Rust doesn’t create a deep copy—or even a shallow copy—of heap data by default; it simply transfers control of that data to the new variable. This ensures that only one variable is responsible for freeing the memory.
Rust Example
fn main() { let s1 = String::from("hello"); let s2 = s1; // Ownership moves from s1 to s2 // println!("{}", s1); // Error: s1 is no longer valid println!("{}", s2); // Prints: hello }
Once ownership moves to s2
, s1
becomes invalid and cannot be used. Rust disallows accidental uses of s1
, avoiding a class of memory errors upfront.
Comparison with C++ and C
In C++, assigning one std::string
to another typically does a deep copy, creating a distinct instance with its own buffer. You must explicitly use std::move
to achieve something akin to Rust’s move semantics:
#include <iostream>
#include <string>
int main() {
std::string s1 = "hello";
std::string s2 = std::move(s1); // Conceptually moves ownership to s2
// std::cout << s1 << std::endl; // UB if accessed
std::cout << s2 << std::endl; // Prints: hello
return 0;
}
In Rust, assigning s1
to s2
automatically moves ownership. By contrast, in C++, you must call std::move(s1)
explicitly, and s1
is left in an unspecified state.
Meanwhile, C has no built-in ownership model. When two pointers reference the same block of heap memory, the compiler does not enforce which pointer frees it:
#include <stdlib.h>
#include <string.h>
int main() {
char *s1 = malloc(6);
strcpy(s1, "hello");
char *s2 = s1; // Both pointers refer to the same memory
// free(s1);
// Using either s1 or s2 now leads to undefined behavior
return 0;
}
This can easily cause double frees, dangling pointers, or memory leaks. Rust prevents such problems via strict ownership transfer.
6.2.2 Shallow vs. Deep Copy and the clone()
Method
A shallow copy duplicates only metadata—pointers, sizes, or capacities—without cloning the underlying data. Rust’s design discourages shallow copies by enforcing ownership transfer and encouraging an explicit .clone()
method for a full deep copy. Nonetheless, in unsafe contexts, programmers can bypass these safeguards and create shallow copies manually, risking double frees if two entities both believe they own the same resource.
To create a true duplicate, call .clone()
, which performs a deep copy. This allocates new memory on the heap and copies the original data:
Example: Difference Between Move and Clone
fn main() { let s1 = String::from("hello"); let s2 = s1; // Move // println!("{}", s1); // Error: s1 has been moved let s3 = String::from("world"); let s4 = s3.clone(); // Clone println!("s3: {}, s4: {}", s3, s4); // Both valid }
Here, s3
and s4
each contain their own heap-allocated buffer with the content "world"
. Because .clone()
can be expensive for large data, use it sparingly.
- Move: Transfers ownership; the original variable is invalidated.
- Clone: Both variables own distinct copies of the data.
6.2.3 Copying Scalar Types
Some types in Rust (e.g., integers, floats, and other fixed-size, stack-only data) are so simple that a bitwise copy suffices. These types implement the Copy
trait. When you assign them, they are simply copied, and the original remains valid:
fn main() { let x = 5; let y = x; // Copy println!("x: {}, y: {}", x, y); }
This mirrors copying basic values in C:
int x = 5;
int y = x; // Copy
Since these types do not manage heap data, there is no risk of double frees or dangling pointers.