Nov 17, 2025 Current Resources

Linux mode setting, from the comfort of OCaml

Linux provides the KMS (Kernel Mode Setting) API to let applications query and configure display settings. It's used by Wayland compositors and other programs that need to configure the hardware directly. I found the C API a little verbose and hard to follow so I made libdrm-ocaml, which lets us run commands interactively in a REPL.

We'll start by discovering what hardware is available and how it's currently configured, then configure a monitor to display a simple bitmap, and then finally render a 3D animation. The post should be a useful introduction to KMS even if you don't know OCaml.

( this post also appeared on Hacker News )

Table of Contents

Running it yourself
Querying the current state
Making changes
3D rendering
Linux VTs
Debugging
Conclusions

Running it yourself

If you want to follow along, you'll need to install libdrm-ocaml and an interactive REPL like utop. With Nix, you can set everything up like this:

git clone https://github.com/talex5/libdrm-ocaml
cd libdrm-ocaml
nix develop
dune utop

You should see a utop # prompt, where you can enter OCaml expressions. Use ;; to tell the REPL you've finished typing and it's time to evaluate, e.g.

1 2	`utop # 1+1;; - : int = 2`

Alternatively, you can install things using opam (OCaml's package manager):

opam install libdrm utop
utop

Then, at the utop prompt enter #require "libdrm";; (including the leading #).

Querying the current state

Before changing anything, we'll start by discovering what hardware is available.

I'll introduce the API as we go along, but you can check the API reference docs if you want more information.

Finding devices

To list available graphics devices:

utop # Drm.Device.list ();;
- : Drm.Device.Info.t list =
[{primary_node = Some "/dev/dri/card0";
  render_node = Some "/dev/dri/renderD128";
  info = PCI {bus = {domain = 0; bus = 1; dev = 0; func = 0};
              dev = {vendor_id = 0x1002;
                     device_id = 0x67ff;
                     subvendor_id = 0x1458;
                     subdevice_id = 0x230b;
                     revision_id = 0xff}}}]

libdrm scans the /dev/dri/ directory looking for devices. It uses stat to find the device major and minor numbers and uses the virtual /sys filesystem to get information about each one. This is a PCI device, and the information corresponds to the values from lspci, e.g.

$ lspci -nns 0:1:0.0
01:00.0 VGA compatible controller [0300]: Advanced Micro Devices, Inc. [AMD/ATI]
  Baffin [Radeon RX 550 640SP / RX 560/560X] [1002:67ff] (rev ff)

Each graphics device can have a primary and a render node. The primary node gives full access to the device, including configuring monitors, while the render node just allows applications to render scenes to memory. In the last post I was using the render to node to create a 3D image, and then sending it to the Wayland compositor for display. This time we'll be doing the display ourselves, so we need to open the primary node:

1 2	`utop # let dev = Unix.openfile "/dev/dri/card0" [O_CLOEXEC; O_RDWR] 0;; val dev : Unix.file_descr = <abstr>`

To check the driver version:

1
2
3

utop # Drm.Device.Version.get dev;;
- : Drm.Device.Version.t =
{version = 3.61.0; name = "amdgpu"; date = "0"; desc = "AMD GPU"}

If you're familiar with the C API, this corresponds to the drmGetVersion function, and Drm.Device.list corresponds to drmGetDevices2; I reorganised things a bit to make better use of OCaml's modules.

Listing resources

Let's see what resources we've got to play with:

utop # let resources = Drm.Kms.Resources.get dev;;
val resources : K.Resources.t =
  {fbs = [];
   crtcs = [57; 60; 63; 66; 69];
   connectors = [71; 78; 84];
   encoders = [70; 76; 83; 86; 87; 88; 89; 90];
   min_width,max_width = 0,16384;
   min_height,max_height = 0,16384}

Note: The Kernel Mode Setting functions are in the Drm.Kms module. The C API calls these functions drmMode*, but I found that confusing as e.g. drmModeGetResources sounds like you're asking for the resources of a mode.

A CRTC is a CRT Controller, and typically controls a single monitor (known as a Cathode Ray Tube for historical reasons). Framebuffers provide image data to a CRTC (we create framebuffers as needed). Connectors correspond to physical connectors (e.g. where you plug in a monitor cable). An Encoder encodes data from the CRTC for a particular connector.

Resources diagram (simplified)

Connectors

To save a bit of typing, I'll create an alias for the Drm.Kms module:

1	`utop # module K = Drm.Kms;;`

You could also open Drm.Kms to avoid needing any prefix, but I'll keep using K for clarity.

To get details for the first connector (the head of the list):

utop # K.Connector.get dev (List.hd resources.connectors);;
- : K.Connector.t =
{connector_id = 71; (* DP-1 *)
 connector_type = DisplayPort;
 connector_type_id = 1;
 connection = Connected;
 mm_width,mm_height = 700,390;
 subpixel = Unknown;
 modes = [3840x2160 60.00Hz;
          3840x2160 30.00Hz;
          3840x2160 29.97Hz;
          2560x1440 59.95Hz;
          ...];
 props = [1:77; 2:0; 5:0; 6:0; 4:0; 34:0; 35:0; 36:0; 37:0; 72:8; 73:0; 
          7:0; 74:0; 75:15];
 encoder_id = Some 70;
 encoders = [70]}

This is DisplayPort connector 1 (usually called DP-1) and it's currently Connected. The connector also says which modes are available on the connected monitor.

I was lucky in that the first connector was the one I'm using, but really we should get all the connectors and filter them to find the connected ones. List.map can be used to run get on each of them:

utop # let connectors = List.map (K.Connector.get dev) resources.connectors;;
val connectors : K.Connector.t list =
  [{connector_id = 71; (* DP-1 *) ...};
   {connector_id = 78; (* HDMI-A-1 *) ...};
   {connector_id = 84; (* DVI-D-1 *) ...}]

Then to filter:

utop # let is_connected (c : K.Connector.t) = (c.connection = Connected);;
val is_connected : K.Connector.t -> bool = <fun>

utop # let connected = List.filter is_connected connectors;;
val connected : K.Connector.t list =
  [{connector_id = 71; (* DP-1 *) ...}]

We'll investigate c, the first connected one:

1
2
3

utop # let c = List.hd connected;;
val c : K.Connector.t =
  {connector_id = 71; (* DP-1 *) ...}

A note on IDs

In the libdrm C API, IDs are just integers. To avoid mix-ups, I made them distinct types in the OCaml API. For example, if you try to use an encoder ID as a connector ID:

utop # K.Connector.get dev (List.hd resources.encoders);;
                           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^
Error: This expression has type Drm.Kms.Encoder.id = [ `Encoder ] Drm.Id.t
       but an expression was expected of type
         K.Connector.id = [ `Connector ] Drm.Id.t
       These two variant types have no intersection

Normally this is what you want, but for interactive use it's annoying that you can't just pass a plain integer. e.g.

utop # K.Connector.get dev 71;;
                           ^^
Error: The constant 71 has type int but an expression was expected of type
         K.Connector.id = [ `Connector ] Drm.Id.t

You can get any kind of ID with Drm.Id.of_int (e.g. K.Connector.get dev (Drm.Id.of_int 71)), but that's still a bit verbose, so you might prefer to (re)define a prefix operator for it, e.g.

1
2
3

utop # let ( ! ) = Drm.Id.of_int;;
utop # K.Connector.get dev !71;;
- : K.Connector.t = {connector_id = 71; ...}

(note: ! is the only single-character prefix operator available in OCaml)

Modes

Modes are shown in abbreviated form in the connector output. To see the full list:

utop # c.modes;;
- : K.Mode_info.t list =
[3840x2160 60.00Hz; 3840x2160 30.00Hz; 3840x2160 29.97Hz; 2560x1440 59.95Hz;
 1920x1200 60.00Hz; 1920x1080 60.00Hz; 1920x1080 59.94Hz; 1600x1200 60.00Hz;
 1680x1050 59.95Hz; 1600x900 60.00Hz; 1280x1024 75.02Hz; 1280x1024 60.02Hz;
 1440x900 59.89Hz; 1280x800 59.81Hz; 1152x864 75.00Hz; 1280x720 60.00Hz;
 1280x720 59.94Hz; 1024x768 75.03Hz; 1024x768 70.07Hz; 1024x768 60.00Hz;
 832x624 74.55Hz; 800x600 75.00Hz; 800x600 72.19Hz; 800x600 60.32Hz;
 800x600 56.25Hz; 640x480 75.00Hz; 640x480 72.81Hz; 640x480 66.67Hz;
 640x480 60.00Hz; 640x480 59.94Hz; 720x400 70.08Hz]

Note: I annotated various pretty-printer functions with [@@ocaml.toplevel_printer], which causes utop to use them by default to display values of the corresponding type. For example, showing a list of modes uses this short summary form. Displaying an individual mode shows all the information. Here's the first mode:

# List.hd c.modes;;
- : K.Mode_info.t =
{name = "3840x2160";
 typ = preferred+driver;
 flags = phsync+nvsync;
 stereo_mode = None;
 aspect_ratio = None;
 clock = 533250;
 hdisplay,vdisplay = 3840,2160;
 hsync_start = 3888;
 hsync_end = 3920;
 htotal = 4000;
 hskew = 0;
 vsync_start = 2163;
 vsync_end = 2168;
 vtotal = 2222;
 vscan = 0;
 vrefresh = 60}

Properties

Some resources can also have extra properties. Use get_properties to fetch them:

utop # K.Connector.get_properties dev c.connector_id;;
- : [ `Connector ] K.Properties.t =
{EDID = 92; DPMS = On; TILE = None; link-status = Good; non-desktop = 0;
 HDR_OUTPUT_METADATA = None; scaling mode = None; underscan = off;
 underscan hborder = 0; underscan vborder = 0; max bpc = 8;
 Colorspace = Default; vrr_capable = 0; subconnector = Native}

Linux only returns a subset of the properties until you enable the atomic feature. Let's turn that on now:

1 2	`utop # Drm.Client_cap.(set atomic) dev true;; - : (unit, Unix.error) result = Ok ()`

(Module.(expr) is a short-hand that brings all of Module's symbols into scope for expr, so we don't have to repeat the module name for both set and atomic)

And getting the properties again, we now have an extra CRTC_ID, telling us which controller this connector is currently using:

utop # let c_props = K.Connector.get_properties dev c.connector_id;;
val c_props : [ `Connector ] K.Properties.t =
{EDID = 92; DPMS = On; TILE = None; link-status = Good; non-desktop = 0;
 HDR_OUTPUT_METADATA = None; CRTC_ID = 57; scaling mode = None;
 underscan = off; underscan hborder = 0; underscan vborder = 0; max bpc = 8;
 Colorspace = Default; vrr_capable = 0; subconnector = Native}

Encoders

The Linux documentation says:

Those are really just internal artifacts of the helper libraries used to implement KMS drivers. Besides that they make it unnecessarily more complicated for userspace to figure out which connections between a CRTC and a connector are possible, and what kind of cloning is supported, they serve no purpose in the userspace API. Unfortunately encoders have been exposed to userspace, hence can’t remove them at this point. Furthermore the exposed restrictions are often wrongly set by drivers, and in many cases not powerful enough to express the real restrictions.

OK. Well, let's take a look anyway:

utop # let e = K.Encoder.get dev (Option.get c.encoder_id);;
val e : K.Encoder.t =
  {encoder_id = 70;
   encoder_type = TMDS;
   crtc_id = Some 57;
   possible_crtcs = 0x1f;
   possible_clones = 0x1}

Note: We need Option.get here because a connector might not have an encoder set yet. Where the C API uses 0 to indicate no resource, the OCaml API uses None to force us to think about that case.

As the documentation says, the encoder is mainly useful to get the CRTC ID:

1 2	`utop # let crtc_id = Option.get e.crtc_id;; val crtc_id : Drm.Kms.Crtc.id = 57`

We could instead have got that directly from the connector using its properties:

1 2	utop # K.Properties.Values.get_value_exn c_props K.Connector.crtc_id;; - : [ `Crtc ] Drm.Id.t option = Some 57

CRT Controllers

utop # let crtc = K.Crtc.get dev crtc_id;;
val crtc : K.Crtc.t =
  {crtc_id = 57;
   fb_id = Some 93;
   x,y = 0,0;
   width,height = 3840,2160;
   mode = Some 3840x2160 60.00Hz}

An active CRTC has a mode set (presumably from the connector's list of supported modes), and a framebuffer with the image to be displayed.

If I keep calling Crtc.get, I see that it is sometimes showing framebuffer 93 and sometimes 94. My Wayland compositor (Sway) updates one framebuffer while the other is being shown, then switches which one is displayed.

Framebuffers

My CRTC is currently displaying the contents of framebuffer 93:

1 2	`utop # let fb_id = Option.get crtc.fb_id;; val fb_id : Drm.Kms.Fb.id = 93`

utop # let fb = K.Fb.get dev fb_id;;
val fb : K.Fb.t =
  {fb_id = 93;
   width,height = 3840,2160;
   pixel_format, modifier = XR24, None;
   interlaced = false;
   planes = [{handle = None; pitch = 15360; offset = 0}]}

A framebuffer has up to 4 framebuffer planes (not to be confused with CRTC planes; see later), each of which references a buffer object (also known as a BO and referenced with a GEM handle).

This framebuffer is using the XR24 format, where there is a single BO with 32 bits for each pixel (8 for red, 8 green, 8 blue and 8 unused). Some formats use e.g. a separate buffer for each component (or a different part of the same buffer, using offset).

Modern graphics cards also support format modifiers, but my card is too old so I just get None. Linux's fourcc.h header file describes the various formats and modifiers. Modifiers seem to be mainly used to specify the tiling.

I don't have permission to see the buffer object, so it appears as (handle = None). The pitch is the number of bytes from one row to the next (also known as the stride). Here, the 15360 is simply the width (3840) multiplied by the 4 bytes per pixel.

CRTC planes

In fact, Crtc.get is an old API that only covers the basic case of a single framebuffer. In reality, a CRTC can combine multiple CRTC planes, which for some reason aren't returned with the other resources and must be requested separately:

1 2	`utop # let plane_ids = K.Plane.list dev;; val plane_ids : K.Plane.id list = [40; 43; 46; 49; 52; 55; 58; 61; 64; 67]`

(note: you need to enable "atomic" mode before requesting planes; we already did that above)

utop # let planes = List.map (K.Plane.get dev) plane_ids;;
val planes : K.Plane.t list =
  [{formats = [XR24; AR24; RA24; XR30; XB30; AR30; AB30; XR48; XB48; 
               AR48; AB48; XB24; AB24; RG16; XR4H; AR4H; XB4H; AB4H];
    plane_id = 40;
    crtc_id = None;
    fb_id = None;
    crtc_x,crtc_y = 0,0;
    x,y = 0,0;
    possible_crtcs = 0x10};
   ...
  ]

A lot of these planes aren't being used (don't have a CRTC), which we can check for with a helper function:

1 2	`utop # let has_crtc (x : K.Plane.t) = (x.crtc_id <> None);; val has_crtc : K.Plane.t -> bool = <fun>`

Looks like Sway is using two planes at the moment:

utop # let active_planes = List.filter has_crtc planes;;
val active_planes : K.Plane.t list =
  [{formats = [XR24; AR24; RA24; XR30; XB30; AR30; AB30; XR48; XB48; 
               AR48; AB48; XB24; AB24; RG16; XR4H; AR4H; XB4H; AB4H];
    plane_id = 52;
    crtc_id = Some 57;
    fb_id = Some 94;
    crtc_x,crtc_y = 0,0;
    x,y = 0,0;
    possible_crtcs = 0x1};
   {formats = [AR24];
    plane_id = 55;
    crtc_id = Some 57;
    fb_id = Some 98;
    crtc_x,crtc_y = 0,0;
    x,y = 0,0;
    possible_crtcs = 0x1}]

More information is available as properties:

utop # let active_plane_ids = List.map K.Plane.id active_planes;;
val active_plane_ids : K.Plane.id list = [52; 55]

utop # List.map (K.Plane.get_properties dev) active_plane_ids;;
- : [ `Plane ] K.Properties.t list =
[{CRTC_H = 2160; CRTC_ID = 57; CRTC_W = 3840; CRTC_X = 0; CRTC_Y = 0;
  FB_ID = 93; IN_FENCE_FD = -1; SRC_H = 141557760; SRC_W = 251658240;
  SRC_X = 0; SRC_Y = 0; rotation = [rotate-0]; type = Primary; zpos = 0};
 {CRTC_H = 128; CRTC_ID = 57; CRTC_W = 128; CRTC_X = 3105; CRTC_Y = 1518;
  FB_ID = 98; IN_FENCE_FD = -1; SRC_H = 8388608; SRC_W = 8388608; SRC_X = 0;
  SRC_Y = 0; type = Cursor; zpos = 255}]

Plane 52 is a Primary plane and is using framebuffer 93 (as we saw before).
Plane 55 is a Cursor plane, using framebuffer 98 (and the AR24 format, with alpha/transparency).

A plane chooses which part of the frame buffer to show (SRC_X, SRC_Y, SRC_W and SRC_H) and where it should appear on the screen (CRTC_X, CRTC_Y, CRTC_W and CRTC_H). The source values are in 16.16 format (i.e. shifted left 16 bits).

Oddly, Plane.get returned crtc_x,crtc_y = 0,0 for both planes, but the properties show the correct cursor location (CRTC_X = 3105; CRTC_Y = 1518;).

Having the cursor on a separate plane avoids having to modify the main screen image whenever the mouse pointer moves, which is good for low latency (especially if the GPU is busy rendering something else at the time), power consumption (the GPU can stay powered down), and allows showing an application's buffer full screen without the compositor needing to modify the application's buffer.

You might also have some Overlay planes, which can be useful for displaying video. My graphics card seems to be too old for that.

Expanded resources diagram

Here's an expanded diagram showing some more possibilities:

Expanded resources diagram

Some framebuffer formats take the input data from multiple buffers.
A framebuffer can be shared by multiple CRTCs (perhaps with each plane showing a different part of it).
A CRTC can have multiple planes (e.g. primary and cursor).
A single CRTC can show the same image on multiple monitors.

Making changes

If I try turning off the CRTC (by setting the mode to None) from my desktop environment it fails:

1 2	`utop # K.Crtc.set dev crtc_id ~pos:(0,0) ~connectors:[] None;; Exception: Unix.Unix_error(Unix.EACCES, "drmModeSetCrtc", "")`

The reason is that I'm currently running a graphical desktop and Sway owns the device (so my dev is not the DRM "master"):

1 2	`utop # Drm.Device.is_master dev;; - : bool = false`

That can be fixed by switching to a different VT (e.g. with Ctrl-Alt-F2) and running it there. However, this will result in a second problem: I won't be able to see what I'm doing!

If you have a second computer then you can SSH in and test things out from there, but for simplicity we'll leave the utop REPL at this point and write some programs instead.

For example, query.ml shows the information we discovered above:

dune exec -- ./examples/query.exe

devices:                              
  [{primary_node = Some "/dev/dri/card0";
    render_node = Some "/dev/dri/renderD128";
...

Non-atomic mode setting

Linux provides two ways to configure modes: the old non-atomic API and the newer atomic one.

examples/nonatomic.ml contains a simple example of the older (but simpler) API. It starts by finding a device (the first one with a primary node supporting KMS), then finds all connected connectors (as we did above), and calls show_test_page on each one:

let () =
  Utils.with_device @@ fun t ->
  let connected = List.filter Utils.is_connected t.connectors in
  Utils.restoring_afterwards t @@ fun () ->
  List.iter (show_test_page t) connected;
  Unix.sleep 2

restoring_afterwards stores the current configuration, runs the callback, and then puts things back to normal when that finishes (or you press Ctrl-C).

The program waits for 2 seconds after showing the test page before exiting.

show_test_page finds the CRTC (as we did above), takes the first supported mode, creates a test framebuffer of that size, and configures the CRTC to display it:

let show_test_page (t : Resources.t) (c : K.Connector.t) =
  match c.encoder_id with
  | None -> println "%a has no encoder (skipping)" K.Connector.pp_name c
  | Some encoder_id ->
    match K.Encoder.get t.dev encoder_id with
    | { crtc_id = None; _ } ->
      println "%a's encoder has no CRTC (skipping)" K.Connector.pp_name c
    | { crtc_id = Some crtc_id; _ } ->
      println "Showing test page on %a" K.Connector.pp_name c;
      let mode = List.hd c.modes in
      let size = (mode.hdisplay, mode.vdisplay) in
      let fb = Test_image.create t.dev size in
      K.Crtc.set t.dev crtc_id (Some mode) ~fb ~pos:(0,0)
        ~connectors:[c.connector_id]

If the connector doesn't have a CRTC, we could find a suitable one and use that, but for simplicity the example just skips such connectors.

To run the example (switch away from any graphical desktop first or it won't work):

dune exec -- ./examples/nonatomic.exe

Dumb buffers

Typically the pixel data to be displayed comes from some complex rendering pipeline, but Linux also provides dumb buffers for simple cases such as testing. The Test_image.create function used above creates a dumb buffer with a test pattern: