Freedom is the essential principle around which Linux (a.k.a. GNU/Linux) as an operating system revolves. You're free to choose your distribution. You're free to change your office suite. You're free to change your kernel! It is only natural this extends to the visual appearance of your system. Virtually anything can be swapped with something else and configured to the nth degree to achieve any imaginable style. Customising the appearance of your system is known as ricing.

Before we attempt any of these customisations, it is important to understand what are the major elements in this and what role they play in yielding the final rendering of your desktop. I'm going to give a bottom-up overview of these elements and their most popular implementations. Please do bear in mind this is not intended by any means to be an exhaustive or authoritative explanation of the Linux display stack. If you wish to obtain more details about a specific topic covered here, refer to the references section at the end of this article, or consult the appropriate documentation.

Even though most of the ideas covered here apply equally to all distros, I'm assuming an installation specific to an Arch Linux system. You will find that the equivalent steps in your preferred distro are nearly the same (or even easier), but that much I'll leave you to figure out.

The Display Server

Text-based consoles can be very useful and fun. But, fortunately, staring at a flickering monochromatic screen stopped being the only way to use a computer long ago. The invention of the Graphical User Interface -or GUI- in the Palo Alto-based company Xerox was a big turning point, transforming how computers were regarded by the mainstream population and helped make computers useful, not only to academics that make this possible, but the very first one (just above raw graphics rendering) is the display server, the first protagonist of our whirlwind tour of the stack.

A display server, or windowing system, is a (Userland applications run in user mode as opposed to privileged or kernel mode, so they have a much more restricted control of the hardware resources) userland software layer responsible for capturing interactions from the user's mouse and keyboard as well as communicating with the kernel to render graphical artifacts on the screen, namely WIMP elements and fonts. As its name implies, it follows a client-server approach whereby all applications must send requests to it to draw to the screen, as only the display server is allowed to talk directly to the kernel to utilise the system's graphics resources. In the next section we'll explain in more detail how this is actually done but, let us simply say at this point that, what this means is that when an application wants to render its container window on the screen, it must do so via the display server. The server will provide it with the screen real estate it needs, a fixed position on the screen, alongside graphic primitives necessary to draw the window borders and other usual elements like the close, minimise or expand buttons at the top. The display server doesn't have any real knowledge about what it's drawing, it doesn't understand what a pointer or a window is, conceptually speaking. It doesn't need to, as that is someone else's job: all it cares about is shapes, boundaries and the intersections between these (a technique known as compositing, which we'll also explain later).

=> /assets/images/article/2016/3_1.png

The X Window System which, at its 11th revision -commonly referred to as X11-, is by far the most ubiquitous type of display server protocol, and X.Org Server is its de facto implementation.

X.Org Server can be installed with the xorg-server, xorg-server-utils and xorg-server-apps packages in Arch Linux, which include functionality for debugging and further configuration. Even though xorg-server is the main one, the other two will provide you with some indispensable tools like the debugging tool xev and xinit, which is what you use to start a new instance of X in your system, that is, transform that flickering one-dimensional terminal interface into a colourful gateway to your computer. I recommend you install all of these packages to save yourself some trouble.

With X, installing it is just half the battle. You'll have to go through a configuration process which involves generating and/or editing X.Org's settings file, xorg.conf. This file contains all the information X needs to know about your hardware: screens, keyboard, mouse... This task may be more or less painstaking, depending on how many tools are available to you in your particular environment. There are two very straightforward ways to configure X:

Writing X config files is no easy task and I'm not going to explain how to write one in this guide either as I'm far from being an expert on that myself but just to give you an idea of what a working X config file looks like, here's mine

{{< highlight conf >}}

Section "ServerLayout"

    Identifier "layout"

    Screen 0 "nvidia"

    Inactive "intel"

EndSection

Section "Device"

    Identifier "nvidia"

    Driver "nvidia"

    BusID "PCI:1:0:0"

EndSection

Section "Screen"

    Identifier "nvidia"

    Device "nvidia"

    Option "AllowEmptyInitialConfiguration" "Yes"

EndSection

Section "Device"

    Identifier "intel"

    Driver "modesetting"

    Option "AccelMethod" "none"

EndSection

Section "Screen"

    Identifier "intel"

    Device "intel"

EndSection

{{< / highlight >}}

There are several other additional config files for X.Org you'll see mentioned at some point or another, like .Xdefaults, the newer .Xauthority, .xprofile, xinitrc, etc. These files control how X is initialised when launched, system fonts, console appearance and much more. I'm not getting into any more detail with these, just wanted to create some awareness on them because I'm sure you'll come across them sooner or later if you're setting up X from scratch.

Architecture of the X Window System

The X Window System exposes a protocol to communicate asynchronously with its clients (there can be any number of them) either via a network or locally. You can go and have a read at its specification

=> here

if you want (not that you have to). In order to illustrate X's functioning, let's assume a common scenario where a user is interacting with an application and has just clicked a radio button. The actions that will ensue are as follows:

As stated before, X is not aware of what it's drawing to the screen nor does it remember what it's sent to the framebuffer, so every time something needs to be redrawn on the screen, X sends a request to the appropriate client application to take care of that. This is known as running in legacy mode. Nowadays, it is preferred to use an external composite manager or compositor. A composite manager's main job is to receive the next window frame to render on the screen for every individual X client (i.e. every windowed application) and then combine all of these individual windows, calculating their intersections, z-index (visibility or depth), etc. to obtain the final scene to display on the screen. Composite managers can add many more features like menu and window animations, transparency and shadows. Notable examples of compositors are compiz, compton, xcompmgr or Cairo Compmgr.

=> /assets/images/article/2016/3_2.png

X is capable of drawing 2D and 3D elements onto the screen via the Cairo and GLX (OpenGL) libraries, respectively. If you're installing a system from scratch it might be worth checking if you already have the necessary graphics driver installed. The mesa package is fairly standard and will work just fine on most architectures and with most graphics cards. It will not be very fast though. For that, you'll need drivers specific to your graphics adapter. For example, in the case of Nvidia GPUs, you'd need the nouveau package or Nvidia's proprietary drivers (nvidia) to squeeze the maximum performance out of your card.

Unsurprisingly, X.Org is written in C. Even though it is possible to directly code against it with Xlib and the newer XCB interface, you'll rarely need to do such a thing to develop GUI applications in Unix systems. Instead, you code using a GUI toolkit. The three most popular GUI toolkits are GTK+, Qt and Tk. GTK+ is the most widespread one whereas Qt is the toolkit for KDE applications. Tk is the oldest of the three as well as the easiest one to program with, although it looks very arcane compared to its modern counterparts.

=> /assets/images/article/2016/3_3.png

You can read more about X's architecture

=> on its wiki

.

Apart from X11, there are other well-known display server protocols in very active development, such as Ubuntu's Mir and Wayland. Wayland and Weston, its reference implementation, look quite promising as they're being rolled out to the latest versions of Red Hat's cutting-edge distribution starting from Fedora 21. Both Wayland and Mir have many reasons for existing but they both spawned off the same desire to modernise GNU/Linux by removing the clutter and quirky artifacts inherited from its Unix roots. One of these jewels from the past is the topic in discussion, the X Window System, which has been around since the mid 80's.

Now, how does Wayland work and in what areas does it bring improvement over X? These are manifold but can be boiled down to one: Wayland removes the middle-man.

Enter Wayland

If you paid attention to the example event process outlined a moment ago, you'd have noticed X plays an unavoidable central role in the chain. Nowadays, most components of the architecture (udev, evdev, the compositor) have become self-sufficient in accomplishing their individual tasks and therefore do not need X for nothing else than as a dumb data relay. The proponents of Wayland realised this and decided it would make more sense to replace X altogether with the compositor, thus making the display server and the composite manager become the same thing.

=> /assets/images/article/2016/3_4.png

In Wayland, the compositor talks directly with the clients and the kernel to manipulate the appearance of the screen. The implications of this approach are substantially positive. Performance is increased as the overhead of having every component communicating via a piece of middleware is eliminated. Also, the overall structure and workflows of the system are greatly simplified as well as the codebase which gets an extreme makeover, going from X's ancient source to Wayland's, or properly speaking, Weston's. The net result of this is that GUI applications should render quicker and cleaner, especially when animations and interactions between different windows are involved.

Unfortunately, the fact that Wayland exists doesn't mean it has been adopted all across the board. Migrating from X, which is deeply intertwined with all the existing GUI participants, to Wayland is no easy task, as the change cannot occur only on a single layer of the chain: it's not as simple as uninstalling X.Org and then installing Weston. X is a dependency in one way or another for most tiers of the desktop rendering hierarchy: toolkits, WMs and naturally DEs, all have been built on top of X. Thus, the switch was not going to happen quick and painlessly. GNOME and KDE maintainers have done a great job at incorporating Wayland support in the latest revisions of their software (as I said before, Fedora can work on Wayland out of the box now) but there's much work to be done still, especially for standalone WMs like awesome, which doesn't even have plans to support it. But wait... what was a WM again?

Window Managers

Window managers are key backend components of a GUI environment. They control the location, appearance and animations (moving, resizing...) of the windows shown on the screen and collaborate closely with the display server to achieve this purpose. WMs are devoted to the task of manipulating the size and position of the windows on your desktop as well as rendering their borders, title bars and icons. They also control other elements such as desktop menus, dialogs and popups (e.g. notify-send popups).

Managing and rendering the visual appearance of a window frame, that is, everything surrounding the application window, is known as decoration. Decoration can be performed by two different actors in the display model: the client (client-side decoration or CSD) or the server (server-side decoration or SSD).

Both CSD and SSD have their pros and cons and have been embraced by one DE or another.

CSD's strengths are that it offers much more freedom to the application developer, as he can tailor the look of his application all around, including an element which is usually off-bounds, as is the window frame itself. The problem with CSD is that, should the application crash at any point, the whole window will become stuck and will be a pain to remove from the screen. This is a common scenario on Windows and its infamous Application Not Responding problem, which is often provoked due to a bad design in which the UI is not given its own thread separate from the backend logic.

On the other hand, SSD is safer and performs better, as the display server or WM are in full control of the window frame of any application. This means that, even if the application crashes or becomes unresponsive for a while, at least you will be able to move, close or minimise the window, and shutting down the application becomes less of a fuss.

There was once mention of a novel approach to rendering window frames, namely Dynamic Window Decorations, which looked promising, attempting to bring the best of both worlds, but I guess it died out over time as not much has been said recently on the matter. The basic idea behind it was that client applications published a specification of the widgets and custom elements they wanted drawn and then between the display server and another participant called the DWD console, would render native controls on the screen according to this.

Since WMs play such a vital role in how your desktop appearance is composed, they are capable of transforming your experience in many different ways. There are three major types of WM:

Installing a window manager should be as easy as grabbing the package through your package manager and then appending this line to your ~/.xinitrc file once it's installed:

Proxy Information
Original URL
gemini://dvejmz.srht.site/2016-08-22-display-server-primer.gmi
Status Code
Success (20)
Meta
text/gemini
Capsule Response Time
188.138412 milliseconds
Gemini-to-HTML Time
1.425421 milliseconds

This content has been proxied by September (ba2dc).