Package java.lang.foreign
Provides low-level access to memory and functions outside the Java runtime.
Foreign memory access
The main abstraction introduced to support foreign memory access is
MemorySegment
, that models a contiguous region of memory,
residing either inside or outside the Java heap. Memory segments are typically
allocated using an Arena
, which controls the lifetime of
the regions of memory backing the segments it allocates. The contents of a
memory segment can be described using a memory layout
,
which provides basic operations to query sizes, offsets, and alignment constraints.
Memory layouts also provide an alternate, more abstract way, to
access memory segments using
var handles,
which can be computed using layout paths.
For example, to allocate an off-heap region of memory big enough to hold 10 values of
the primitive type int
, and fill it with values ranging from 0
to
9
, we can use the following code:
try (Arena arena = Arena.ofConfined()) {
MemorySegment segment = arena.allocate(10 * 4);
for (int i = 0 ; i < 10 ; i++) {
segment.setAtIndex(ValueLayout.JAVA_INT, i, i);
}
}
int
. The native segment is allocated using a
confined arena. As such, access to
the native segment is restricted to the current thread (the thread that created the
arena). Moreover, when the arena is closed, the native segment is invalidated, and
its backing region of memory is deallocated. Note the use of the try-with-resources
construct: this idiom ensures that the off-heap region of memory backing the native
segment will be released at the end of the block, according to the semantics described
in Section 14.20.3 of The Java Language Specification.
Memory segments provide strong safety guarantees when it comes to memory access. First, when accessing a memory segment, the access coordinates are validated (upon access), to make sure that access does not occur at any address that resides outside the boundaries of the memory segment used by the access operation. We call this guarantee spatial safety; in other words, access to memory segments is bounds-checked, in the same way as array access is, as described in Section 15.10.4 of The Java Language Specification.
Additionally, to prevent a region of memory from being accessed after it has been deallocated (i.e. use-after-free), a segment is also validated (upon access) to make sure that the arena from which it has been obtained has not been closed. We call this guarantee temporal safety.
Together, spatial and temporal safety ensure that each memory access operation either succeeds - and accesses a valid location within the region of memory backing the memory segment - or fails.
Foreign function access
The key abstractions introduced to support foreign function access areSymbolLookup
, FunctionDescriptor
and
Linker
. The first is used to look up symbols inside
libraries; the second is used to model the signature of foreign functions, while the
third is used to link foreign functions as MethodHandle
instances, so that clients can perform foreign function calls directly in Java,
without the need for intermediate layers of C/C++ code (as is the case with the
Java Native Interface (JNI)).
For example, to compute the length of a string using the C standard library function
strlen
on a Linux/x64 platform, we can use the following code:
Linker linker = Linker.nativeLinker();
SymbolLookup stdlib = linker.defaultLookup();
MethodHandle strlen = linker.downcallHandle(
stdlib.find("strlen").orElseThrow(),
FunctionDescriptor.of(ValueLayout.JAVA_LONG, ValueLayout.ADDRESS)
);
try (Arena arena = Arena.ofConfined()) {
MemorySegment cString = arena.allocateFrom("Hello");
long len = (long)strlen.invokeExact(cString); // 5
}
strlen
function in the standard C library; a downcall method handle
targeting said function is subsequently
obtainedRESTRICTED.
To complete the linking successfully, we must provide a
FunctionDescriptor
instance, describing the signature of the
strlen
function. From this information, the linker will uniquely determine
the sequence of steps which will turn the method handle invocation (here performed
using MethodHandle.invokeExact(java.lang.Object...)
)
into a foreign function call, according to the rules specified by the ABI of the
underlying platform.
The Arena
class also provides many useful methods for
interacting with foreign code, such as
converting
Java strings into zero-terminated, UTF-8 strings, as demonstrated in the above example.
Restricted methods
Some methods in this package are considered restricted. Restricted methods are typically used to bind native foreign data and/or functions to first-class Java API elements which can then be used directly by clients. For instance the restricted methodMemorySegment.reinterpret(long)
RESTRICTED can be
used to create a fresh segment with the same address and temporal bounds, but with
the provided size. This can be useful to resize memory segments obtained when
interacting with native functions.
Binding foreign data and/or functions is generally unsafe and, if done incorrectly,
can result in VM crashes, or memory corruption when the bound Java API element
is accessed. For instance, incorrectly resizing a native memory segment using
MemorySegment.reinterpret(long)
RESTRICTED can lead to a JVM crash, or,
worse, lead to silent memory corruption when attempting to access the resized segment.
For these reasons, it is crucial for code that calls a restricted method to never pass
arguments that might cause incorrect binding of foreign data and/or functions to
a Java API.
Given the potential danger of restricted methods, the Java runtime issues a warning on
the standard error stream every time a restricted method is invoked. Such warnings can
be disabled by granting access to restricted methods to selected modules. This can be
done either via implementation-specific command line options or programmatically, e.g.
by calling ModuleLayer.Controller.enableNativeAccess(java.lang.Module)
RESTRICTED.
For every class in this package, unless specified otherwise, any method arguments of
reference type must not be null
, and any null argument will elicit a
NullPointerException
. This fact is not individually documented for methods of
this API.
- API Note:
- Usual memory model guarantees (see 17.4) do not apply when accessing native memory segments as these segments are backed by off-heap regions of memory.
- Implementation Note:
- In the reference implementation, access to restricted methods can be granted to
specific modules using the command line option
--enable-native-access=M1,M2, ... Mn
, whereM1
,M2
,... Mn
are module names (for the unnamed module, the special valueALL-UNNAMED
can be used). If this option is specified, access to restricted methods are only granted to the modules listed by that option. If this option is not specified, access to restricted methods is enabled for all modules, but access to restricted methods will result in runtime warnings. - Since:
- 22
- External Specifications
-
ClassDescriptionA value layout used to model the address of some region of memory.An arena controls the lifecycle of native memory segments, providing both flexible allocation and timely deallocation.A function descriptor models the signature of a foreign function.A compound layout that is an aggregation of multiple, heterogeneous member layouts.A linker provides access to foreign functions from Java code, and access to Java code from foreign functions.A linker option is used to provide additional parameters to a linkage request.A memory layout describes the contents of a memory segment.An element in a layout path.A memory segment provides access to a contiguous region of memory.A scope models the lifetime of all the memory segments associated with it.A padding layout.An object that may be used to allocate memory segments.A compound layout that denotes a homogeneous repetition of a given element layout.A group layout whose member layouts are laid out one after the other.A symbol lookup retrieves the address of a symbol in one or more libraries.A group layout whose member layouts are laid out at the same starting offset.A layout that models values of basic data types.A value layout whose carrier is
boolean.class
.A value layout whose carrier isbyte.class
.A value layout whose carrier ischar.class
.A value layout whose carrier isdouble.class
.A value layout whose carrier isfloat.class
.A value layout whose carrier isint.class
.A value layout whose carrier islong.class
.A value layout whose carrier isshort.class
.