Interface Values Representation
Interface values are represented as a two-word pair, defined by a struct runtime.iface
:
The first word tab
is a pointer to an itable, which contains information about the type of the interface, the type of the data it points to and a virtual method table
The second word data
is a pointer to the value (copy of the original) held by the interface. Values stored in interfaces might be arbitrarily large, but only one word is dedicated to holding the value in the interface structure, so the assignment allocates a chunk of memory on the heap and records the pointer in the data
field
Value Boxing
When value of type T
is assigned to an interface value:
- If type
T
is a non-interface type, then thedata
field points to the copy of theT
value. A copy is used because if the variable later changes, the pointer should have the old value, not the new one - If type
T
is also an interface type, then thedata
field points directly to the same value stored inT
sdata
field
Code example: Go Playground - Value Boxing
Example
Given:
On a 32-bit word size machine the interface Stringer
storing the Binary
is:
Itable
The itable is represented by a struct runtime.itab
:
inter
describes type information of the interface itself_type
describes the type information of the value an interface containsfun
is a variable-sized array of function pointers - the dispatch table of the interface
Type information about the interface data type is represented by runtime.interfacetype
which is an alias for abi.InterfaceType
struct:
Note that, the itab
corresponds to the interface type, not the dynamic type. In our example, it contains pointer only to String
method, not Get
To check a type of the actual value an interface points to, compiler generates the code equivalent to s.tab->_type
To call s.String()
, compiler generates the code equivalent to s.tab->fun[0](s.data)
Note that, the function in itable is being passed the pointer, not the actual value. Thus the function pointer in our example is (*Binary).String
not Binary.String
Computing the itab
The itab
gets computed during the assignment (conversion) of the value to the interface variable: s := any.(Stringer)
There is a compile-time generated type description struct for each concrete type like Binary
. Among other metadata, the type description struct contains a list of methods implemented by that type
Similarly, there is compile-time generated abi.InterfaceType
struct for each interface type like Stringer
It too contains a list of methods
At run-time the itab
is computed by looking for each method listed in the interface type’s method table in the concrete type’s method table. It then caches it, so the itab
is computed only once. Note that, both tables are sorted, so the mapping is found in time
Optimizations
If an interface has no methods, itab
is dropped and the first word points at the type directly. That is, runtime.eface
is used instead of runtime.iface
:
In Russ Cox blog post it is stated that if the actual value fits in a single word, it is stored in the second word directly without indirection or heap allocation. However, this is no longer true, as it caused race conditions in concurrent GC and ambiguity about whether the data word holds a pointer or scalar (todo elaborate further). So interfaces always store the pointer in the data
field
Heap Allocations and Escape Analysis
When a method is called via an interface value instead of directly through a struct, the compiler generally lacks knowledge of the method’s implementation at compile time. Consequently, escape analysis cannot confirm that the value doesn’t escape, leading to heap allocations (there are optimizations) even for scalar types like int
s, float
s, string
s
In some cases, the compiler can prove the concrete type of the value stored in the interface and devirtualize a method call, avoiding heap allocations
The Go runtime has a special static array of the first 256 integers (0 to 255), and when it would normally have to allocate memory to store an integer on the heap as part of converting it to an interface, it first checks to see if it can just return a pointer to the appropriate element in the array instead
Putting a zero-width type, e.g. struct{}
, in an interface value doesn’t allocate
Addressability
The concrete value stored in an interface is not addressable (it’s a copy), in the same way that a map element is not addressable
Therefore, when you call a method on an interface, it must either have an identical receiver type or it must be directly discernible from the concrete type:
- Pointer and value receiver methods can be called with pointers and values respectively
- Value receiver methods can be called with pointer values because they can be dereferenced first
- Pointer receiver methods cannot be called with values
References
- research!rsc: Go Data Structures: Interfaces
- Source code: runtime2.go
- Source code: iface.go
- How are interfaces implemented in Go? : r/golang
- Chapter II: Interfaces - Go Internals
- Internals of Interfaces in Golang | Intermediate level - YouTube
- Source code: type.go
- Lec08 Allocation Strategies - YouTube
- Lec09 Implicit Allocators Indirection And References - YouTube
- GopherCon Europe 2023: Jonathan Amsterdam - A Fast Structured Logging Package - YouTube
- Generics can make your Go code slower
- Go Wiki: Compiler And Runtime Optimizations
- Go Wiki: MethodSets - The Go Programming Language
- Interface method calls with the Go register ABI - Eli Bendersky’s website
- Why is a value stored in an interface not addressable?: r/golang
- Frequently Asked Questions (FAQ) - The Go Programming Language
- Interfaces in Go: Go 101
- Ice cream makers and data races | Dave Cheney
- Go Optimizations 101. Tapir Liu
- GitHub - akutz/go-interface-values: When storing a value in a Go interface allocates memory on the heap.TODO
- Interface Internals - Keith Randall - YouTube