Rickard Lindberg

May 2, 2025 ∞

Newsletter April 2025: projects2

This month I’ve done a lot of programming. I ended up working more on my own code hosting platform. I call it projects2. Why two? Because it’s my second attempt at implementing this idea. Second attempt in recent times at least.

In my 2017 blog post, A new home for Timeline, I wrote

My suggested way forward is therefore to develop a new platform whose core features are registration free discussions and pull requests. In addition, it would need features common to many platforms like hosting of releases and a project web page.

In my previous attempt I focused on registration free discussions. This time, I decided to instead focus on creating the minimal possible software that allowed us to move away from SourceForge for Timeline and also get rid of the Jenkins instance that I run. That way, the infrastructure for running Timeline would not depend on proprietary systems or “complicated” third party software (Jenkins) which is overkill for our needs.

What follows is a demo of the current state of projects2.

Requirements

To use projects2 we need the following:

A machine running Fedora Linux that we have root access to
A domain that resolves to that machine
An SSL certificate for that domain

I use DNSimple to purchase domains and SSL certificates and Linode to provision Fedora servers.

Initial setup

projects2 is implemented as a single Python script, projects2.py, which is used to configure a single Fedora Linux machine to act as a code hosting platform.

We configure our server in a config.ini file. Let’s use this for the demo:

[Global]
InstanceName = projectsdemo
Domain = projectsdemo.rickardlindberg.me
Title = A demo site for projects2.
Description = This site showcases the project2 code hosting platform.

[User:admin]
DisplayName = Rickard
SshKeys = <my public ssh key>
Projects = *

[WildcardCertificate]
pem = <my ssl certificate>
key = <my ssl private key>
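
As a rough illustration of how a file in this shape can be read, here is a minimal sketch using Python's standard configparser. It is not taken from projects2.py: the function name read_config and the idea of grouping sections by the text before the colon are assumptions made for this example.

```python
import configparser

# A shortened copy of the demo config above (certificate section left out).
CONFIG_TEXT = """
[Global]
InstanceName = projectsdemo
Domain = projectsdemo.rickardlindberg.me

[User:admin]
DisplayName = Rickard
Projects = *
"""


def read_config(text):
    """Group sections by the part before ':' (hypothetical helper)."""
    parser = configparser.ConfigParser()
    parser.read_string(text)
    grouped = {}
    for section in parser.sections():
        # "User:admin" -> kind "User", name "admin"; "Global" -> name "".
        kind, _, name = section.partition(":")
        grouped.setdefault(kind, {})[name] = dict(parser[section])
    return grouped


config = read_config(CONFIG_TEXT)
# configparser lowercases option names by default.
print(config["Global"][""]["domain"])
print(config["User"]["admin"]["displayname"])
```

Grouping by prefix means that additional "Kind:name" sections would be picked up without changing the reader.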

Next we run the bootstrap command, which should only be run once on a fresh Fedora install:

$ path/to/projects2.py bootstrap
ssh root login prompt
...
Ensuring user scm...
Ensuring passwordless sudo for scm...
Ensuring folder /home/scm/.ssh...
Ensuring authorized keys...
Ensuring folder /opt/projectsdemo...
Ensuring folder /opt/projectsdemo/web/artifacts...
Ensuring myself...
Ensuring myself api...
Ensuring config.ini...
Ensuring folder /home/scm/.ssh...
Ensuring authorized keys...
Ensuring SSH configured...
Ensuring sshd is restarted...
Ensuring hostname is projectsdemo.rickardlindberg.me...
Ensuring folder /opt/projectsdemo/web...
Ensuring folder /opt/projectsdemo/web/artifacts...
Ensuring folder /opt/projectsdemo/web/scm...
Ensuring folder /opt/projectsdemo/events...
Ensuring pem...
Ensuring key...
Setting up tools...
Setting up CI...
Setting up timezone...
Building /opt/projectsdemo/web/index.html...

We now have our Fedora server configured as a code hosting platform! But it looks a little empty at the moment:

The initial projectsdemo website which shows only
the instance title, descriptions, and empty placeholders for projects and
events.

Adding a project

Let’s add a project to our config.ini and also fix two typos that I made in the title and description:

[Global]
...
Title = A demo site for projects2
Description = This site showcases the projects2 code hosting platform.

[Project:demo]
Scm = git
Description = A demo project.

To apply these changes, we run the update command:

$ path/to/projects2.py update
Ensuring myself...
Ensuring myself api...
Ensuring config.ini...
Ensuring folder /home/scm/.ssh...
Ensuring authorized keys...
Ensuring SSH configured...
Ensuring hostname is projectsdemo.rickardlindberg.me...
Ensuring folder /opt/projectsdemo/web...
Ensuring folder /opt/projectsdemo/web/artifacts...
Ensuring folder /opt/projectsdemo/web/scm...
Ensuring folder /opt/projectsdemo/events...
Ensuring pem...
Ensuring key...
Setting up tools...
Setting up CI...
Setting up timezone...
Configuring project demo...
Ensuring pre-receive hook...
Ensuring post-update hook...
Building /opt/projectsdemo/web/demo.html...
Building /opt/projectsdemo/web/index.html...

And now the demo project appears on the website along with the fixed texts:

The projectsdemo website which now also shows the
projects placeholder filled in with the demo project.

Pushing code to the project

We have an empty git project set up. Let’s push some code to it:

$ git init demo
$ cd demo
$ git branch -m main
$ vim README.md
$ git add README.md
$ git commit -m 'Add readme.'
$ git remote add origin scm@projectsdemo.rickardlindberg.me:demo.git
$ git push -u origin main
Enumerating objects: 3, done.
Counting objects: 100% (3/3), done.
Writing objects: 100% (3/3), 239 bytes | 239.00 KiB/s, done.
Total 3 (delta 0), reused 0 (delta 0), pack-reused 0 (from 0)
remote: Hello from projects2 pre-receive hook!
remote: Ensuring /opt/projectsdemo/web/demo gone...
remote: Building /opt/projectsdemo/web/index.html...
remote: Building /opt/projectsdemo/web/demo.html...
To projectsdemo.rickardlindberg.me:demo.git
 * [new branch]      main -> main
branch 'main' set up to track 'origin/main'.

The website updates to show that we pushed some code to the demo project:

The projectsdemo website which now also shows the
recent events placeholder filled in with one entry.

We can make more changes as usual and push them:

$ ...make changes...
$ git push
Enumerating objects: 5, done.
Counting objects: 100% (5/5), done.
Writing objects: 100% (3/3), 281 bytes | 281.00 KiB/s, done.
Total 3 (delta 0), reused 0 (delta 0), pack-reused 0 (from 0)
remote: Hello from projects2 pre-receive hook!
remote: Ensuring /opt/projectsdemo/web/demo gone...
remote: Building /opt/projectsdemo/web/index.html...
remote: Building /opt/projectsdemo/web/demo.html...
To projectsdemo.rickardlindberg.me:demo.git
   b81aad4..6f7b10a  main -> main

And the diff will appear in the event log:

The projectsdemo website which now shows two
events for the demo project where the most recent event also includes a diff.

CI

You might have noticed that the push log includes lines like these:

remote: Building /opt/projectsdemo/web/index.html...
remote: Building /opt/projectsdemo/web/demo.html...

When we push code, projects2 intercepts that push to update the website. This mechanism is also used for Continuous Integration (CI).

In our repo, we can add files called Dockerfile*.ci. Those define Docker images in which the CI scripts are run. Let’s add Dockerfile.py312.ci to our demo project:

FROM python:3.12

CMD ["python3.12", "build.py"]

It says that the CI command to run is python3.12 build.py. Here is what build.py looks like:

#!/usr/bin/env python3

import json
import os
import sys

binary_path = "binary"
site_root = "html"

with open(binary_path, "w") as f:
    f.write("the compiled binary")

os.makedirs(site_root)
with open(os.path.join(site_root, "index.html"), "w") as f:
    f.write("hello from demo site")

with open("Dockerfile.py312.ci.files", "w") as f:
    f.write(json.dumps({
        "artifacts": [
            {
                "source": binary_path,
                "destination": "binary",
            },
        ],
        "site": site_root,
    }))

It simulates building a binary which looks like this:

$ cat binary
the compiled binary

And it builds a project website that looks like this:

$ cat html/index.html
hello from demo site

It tells projects2 about these artifacts via the file Dockerfile.py312.ci.files that looks like this:

{
  "artifacts": [
    {
      "source": "binary",
      "destination": "binary"
    }
  ],
  "site": "html"
}
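
To make the role of this file concrete, here is a minimal sketch of how a consumer could act on it. This is not the projects2 implementation: the publish function, its parameters, and the folder layout in demo are assumptions made for illustration. Only the keys artifacts, source, destination, and site come from the file above.

```python
import json
import os
import shutil
import tempfile


def publish(manifest_path, build_dir, artifacts_dir, site_dir):
    """Copy what the manifest lists from build_dir (hypothetical helper)."""
    with open(manifest_path) as f:
        manifest = json.load(f)
    for artifact in manifest.get("artifacts", []):
        target = os.path.join(artifacts_dir, artifact["destination"])
        os.makedirs(os.path.dirname(target), exist_ok=True)
        shutil.copyfile(os.path.join(build_dir, artifact["source"]), target)
    if "site" in manifest:
        shutil.copytree(
            os.path.join(build_dir, manifest["site"]), site_dir, dirs_exist_ok=True
        )


def demo():
    """Recreate the build output from build.py in a temp folder and publish it."""
    with tempfile.TemporaryDirectory() as tmp:
        build_dir = os.path.join(tmp, "build")
        os.makedirs(os.path.join(build_dir, "html"))
        with open(os.path.join(build_dir, "binary"), "w") as f:
            f.write("the compiled binary")
        with open(os.path.join(build_dir, "html", "index.html"), "w") as f:
            f.write("hello from demo site")
        manifest_path = os.path.join(build_dir, "Dockerfile.py312.ci.files")
        with open(manifest_path, "w") as f:
            json.dump(
                {
                    "artifacts": [{"source": "binary", "destination": "binary"}],
                    "site": "html",
                },
                f,
            )
        # Assumed layout for the example, not the real server paths.
        artifacts_dir = os.path.join(tmp, "artifacts", "demo")
        site_dir = os.path.join(tmp, "web", "demo")
        publish(manifest_path, build_dir, artifacts_dir, site_dir)
        with open(os.path.join(artifacts_dir, "binary")) as f:
            binary = f.read()
        with open(os.path.join(site_dir, "index.html")) as f:
            site = f.read()
        return binary, site


print(demo())
```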

When we push this change, we can see the following addition in the log:

remote: Building Dockerfile.py312.ci...
remote: Running Dockerfile.py312.ci...

projects2 parses Dockerfile.py312.ci.files and saves the binary file as an artifact along with the project website. The link to the binary is shown in the event log:

The projectsdemo website which shows a link to an
artifact in the most recent event.

And we can verify that it is correct like this:

$ curl https://projectsdemo.rickardlindberg.me/artifacts/demo/binary
the compiled binary

The project website is also published at <domain>/demo:

The demo project website which shows the
placeholder text.

This CI workflow is so nice. In my opinion, it is also much better than Jenkins'. It implements real CI. We just push code as we normally do. If the build breaks, the code will not get pushed and we can try again.
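
The "broken build rejects the push" behavior can be sketched with git's pre-receive hook contract: the hook gets one "<old> <new> <ref>" line per updated ref on stdin, and a non-zero exit status makes git refuse the push. The sketch below only models that contract. How projects2 actually wires CI into its hook is not shown in this post, so the function names and the stand-in build commands are assumptions.

```python
import subprocess
import sys


def parse_ref_updates(stdin_text):
    """Parse pre-receive input: one '<old> <new> <ref>' triple per line."""
    updates = []
    for line in stdin_text.splitlines():
        old, new, ref = line.split()
        updates.append({"old": old, "new": new, "ref": ref})
    return updates


def gate(build_command):
    """Return the exit code a hook could use: 0 accepts, 1 rejects the push."""
    result = subprocess.run(build_command)
    return 0 if result.returncode == 0 else 1


# Stand-ins for a CI run; a real hook would start the build here instead.
passing = [sys.executable, "-c", "raise SystemExit(0)"]
failing = [sys.executable, "-c", "raise SystemExit(3)"]

print(parse_ref_updates("b81aad4 6f7b10a refs/heads/main\n"))
print(gate(passing), gate(failing))
```

A real hook would end with sys.exit(gate(...)) so that git sees the result.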

Summary

projects2 is now complete enough that I can start using it for my projects. For Timeline, we can replace all infrastructure from SourceForge and my Jenkins instance with projects2. Almost. We still use the mailing list from SourceForge. And maybe we will continue doing that. I’m not sure that registration free discussions and pull requests are as important to me as I thought. Mostly because there are not many contributors to my projects. But I might incorporate some kind of communication mechanism into projects2.

I couldn’t have implemented projects2 say 5 years ago. I wasn’t as good at Agile development and couldn’t have implemented the simplest thing that could possibly work. I have many prior projects to thank for that. In particular “I did the simplest thing that could possibly work. Here’s what happened.” and “Agile Game Development with Python and Pygame”. I also couldn’t have come up with this solution for CI without prior reading, thinking, and prototyping solutions. I think the takeaway here is that you need to do many projects. From each project you will learn something that you can incorporate into your next project. Most projects will fail, but you will learn something. And eventually you will have a success. I have a feeling that projects2 might be a success. It is successful now in the sense that I actually use it. Time will tell for how long.

Apr 2, 2025 ∞

Newsletter March 2025: Snowboarding

This month I have done nothing related to programming in my spare time. Partly because it is snowboard season.

Me standing on a snowboard in the mountains of
Åre.

I’m most interested in continuing work on my own code hosting platform that will host my projects. We’ll see if I have the time and motivation next month. Or if something completely different catches my interest.

Mar 19, 2025 ∞

Newsletter February 2025: A New Code Hosting Platform?

When I came across XXIIVV last month, I immediately got interested in the idea that in order to be able to run our software many, many years from now, we need to target a small virtual machine that we can re-implement in a weekend. Their virtual machine is called Uxn:

Uxn is the virtual machine powering the Hundred Rabbits software.

I spent some time reading about Uxn and trying it out. I also started documenting my research in a blog post. Then I got distracted by other things.

One thing that distracted me was starting to work on my own code hosting platform. It will be tailored to my specific needs and my projects. I wrote about some of my needs back in 2017 in A new home for Timeline and I also have new ideas for what I want today.

I made pretty good progress, and I think I can soon start using it for some of my projects.

However, as with most of my hobby projects, I got distracted. This time by a snowboarding vacation. We’ll see what my interest decides to continue working on when I have the time and motivation.

Feb 1, 2025 ∞

Newsletter January 2025: Inspired and Motivated by New Laptop and Reading

New Laptop

I got a new laptop this month. It was almost 10 years since I bought my previous one. I mainly needed a new one to be able to smoothly browse certain websites and for better performance when editing videos.

When installing the latest version of Fedora on it, I took the time to clean up my dotfiles. Since I jumped quite a few Fedora versions, my often used tool rlselect had stopped working. I figured out the problem and documented the fix in Replacing Ctrl-R in Bash without TIOCSTI.

That blog post came naturally to me. I was trying to find a solution to a problem. I found other people having the same problem. When I found a solution, I felt the need to share it to contribute to the discussion and hopefully help someone else. I even wrote a custom version of the blog post tailored to an issue on GitHub.

Bootstrapping

I came across Bootstrappable Builds. Bootstrapping is an interesting problem that I’ve mainly come across in my work on RLMeta. They write that

To gain trust in our computing platforms, we need to be able to tell how each part was produced from source.

I started thinking how this would apply to RLMeta. The RLMeta “binary” is a Python file. So it needs Python to bootstrap itself. I’m not sure if that qualifies as a problem according to Bootstrappable.

The “binary” is not really human readable, so it is not feasible to inspect it. On the other hand, the source code says exactly how it is produced, and we can verify that it produces itself.

One way to figure out if this is a problem or not is to see if it is vulnerable to the “Trusting Trust” attack. The article Reflections on Rusting Trust talks about how to make this attack in the Rust compiler. I don’t fully understand it, but it could be interesting to try with RLMeta.

XXIIVV

I came across XXIIVV. There are so many things in there that interest me.

One of those things is the idea that in order to be able to run our software many, many years from now, we need to target a small virtual machine that we can re-implement in a weekend. You can find more on this in devlog and the transcript of the talk Weathering Software Winter.

One thing that causes our software to break is when the things that it depends on change or go away. For that reason, I’m reluctant to pull in third party dependencies when building software. But what if the language our software is written in disappears? That is less likely than third party dependencies changing, but there is still a risk. Especially in the long run.

But if we cannot depend on third party software or languages, we have to implement the whole software stack ourselves. That is a lot of work. There is probably a balance where the trade-off of depending on something is worth it. And that balance differs depending on our goals with the software. However, my feeling is that many things that we pull in third party dependencies for, we can quite easily implement ourselves. And get rid of the bloat of the 80% of the third party dependency that we don’t use. In addition to getting rid of bloat, it also increases understandability. We don’t need to figure out how a third party dependency works, we just need to figure out how a much smaller set of our code works.

Another area where preserving software is of interest to me is my website. About half a year ago, I moved to Micro.blog. Some of my posts now only exist there. The platform gives me some things that I like such as ease of posting and interaction with others. But what if Micro.blog goes away? What happens to my words? I think I need to go back to having the source code for my website in a git repo. Then I should be able to compile my website for different targets. One for publishing online. It might be an export to Micro.blog so that I can continue to use some of its features. But it might also be a pdf export. That way I can print the pdf and have my whole website preserved as a physical book in my bookshelf. That will most likely live for much longer than any technology. And of course, compiling my website should depend on as few dependencies as possible. Perhaps even target a small VM as XXIIVV does it?

A Note on Reading

I was able to read and write about the topics above partly because of a realization that I had earlier this month:

Today’s realization is that you can get important things done by consistently working on them for 15 minutes at the start of every day.

By doing it at the start of the day, you ensure that it gets done. And the rest of the day you don’t need to be stressed about not working on your important thing, because you already have.

I want to read more. But it is easier to just scroll through my feeds and read headlines. What I’ve tried now is to bookmark things that look interesting. Then I spend some time in the mornings to carefully read those pieces. It’s been a quite positive experience for me. I’ve also used that trick to get more boring tasks done. It might not work if you are a night person though.

Jan 19, 2025 ∞

Replacing Ctrl-R in Bash without TIOCSTI

I have previously written about how I use rlselect as a replacement for Ctrl+R in Bash.

It works by creating a key binding in Bash for Ctrl+R that invokes rlselect instead of the default Bash interactive history search command. rlselect looks something like this:

Screenshot of rlselect showing two entries, hello
and world, with hello selected.

If you press tab, the current selection is inserted at the prompt. If you press enter, the current selection is executed. This is the same behavior as the default Ctrl+R.

The mechanism for this stopped working in recent Linux kernel versions. I figured out how to solve it and in this blog post I explain how.

Old Mechanism

When rlselect is invoked from Ctrl+R, it is invoked with the --tab and --action flags. The first flag allows the tab key to be used to select a line and the second makes rlselect print the action taken on the first line, before the selection.

Here is an example where enter is pressed when “hello” is selected:

$ (echo hello; echo world) | rlselect --tab --action
enter
hello

Here is an example where tab is pressed when “world” is selected:

$ (echo hello; echo world) | rlselect --tab --action
tab
world

Here is an example where Ctrl+G is pressed:

$ (echo hello; echo world) | rlselect --tab --action
ctrl-g

Ctrl+G aborts, so no selection is printed on the second line.
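
The three outputs above follow one shape: an action on the first line and, unless the action aborts, the selection on the second. A minimal sketch of a parser for that shape (the helper name parse_rlselect_output is made up for this example and is not part of rlselect):

```python
def parse_rlselect_output(output):
    """Split '--action' output into (action, selection or None)."""
    action, _, selection = output.partition("\n")
    # Drop the trailing newline; an empty second line means no selection.
    selection = selection.rstrip("\n")
    return action, (selection or None)


print(parse_rlselect_output("enter\nhello\n"))
print(parse_rlselect_output("tab\nworld\n"))
print(parse_rlselect_output("ctrl-g\n"))
```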

To feed this output to the prompt, TIOCSTI is used. It simulates that you type characters in the terminal. The full script that Ctrl+R invokes looks like this:

set -e

result=$(tac ~/.bash_history | rlselect --tab --action -- "$@")

python - "$result" << EOF
import fcntl
import sys
import termios

action, selection = sys.argv[1].split("\n", 1)

if action != "tab":
    selection += "\n"

for ch in selection:
    fcntl.ioctl(sys.stdout.fileno(), termios.TIOCSTI, ch)
EOF

The last part is where TIOCSTI is used to simulate that you press the keys of the selection. Unless tab is pressed, it appends a newline to the selection to simulate that Enter is pressed.

The Bash configuration looks like this:

if [[ $- =~ .*i.* ]]; then bind '"\C-r": "\C-a rlselect-history \C-j"'; fi

Here is how it works:

Ctrl+R is bound to a series of keystrokes.
First Ctrl+A is simulated which takes the cursor to the beginning of the line.
Then <space>rlselect-history<space> is typed.
Then Ctrl+J is simulated which means accept the current line. Or execute it. The initial space entered in the previous step ensures that the rlselect-history command does not end up in the history. The moving of the cursor to the beginning of the line ensures that anything typed at the prompt is passed as an argument to rlselect-history.

(This configuration also makes the text rlselect-history ... appear in the terminal. The new mechanism makes that go away.)

This mechanism stopped working in recent Linux kernel versions because TIOCSTI can no longer be used like this. There are apparently security issues with TIOCSTI and it is now only allowed as root.

New Mechanism

The new Bash configuration for Ctrl+R behavior that I came up with looks like this:

rlselect-history() {
    local action
    local selection
    {
        read action
        read selection
    } < <(tac ~/.bash_history | rlselect --tab --action -- "${READLINE_LINE}")
    if [ "$action" = "tab" ]; then
        READLINE_LINE="${selection}"
        READLINE_POINT=${#READLINE_LINE}
        bind '"\C-x2":' # Bind Ctrl+x+2 to do nothing
    elif [ "$action" = "enter" ]; then
        READLINE_LINE="${selection}"
        READLINE_POINT=${#READLINE_LINE}
        bind '"\C-x2": accept-line' # Bind Ctrl+x+2 to accept line
    else
        bind '"\C-x2":' # Bind Ctrl+x+2 to do nothing
    fi
}

if [[ $- =~ .*i.* ]]; then
    # Bind history command to Ctrl+x+1 followed by Ctrl+x+2:
    bind '"\C-r": "\C-x1\C-x2"'
    # Bind Ctrl+x+1 to execute rlselect-history which does two things:
    # 1. Sets READLINE_*
    # 2. Binds Ctrl+x+2 to either accept line or do nothing.
    bind -x '"\C-x1": rlselect-history'
fi

Let’s break this down.

Ctrl+R is bound to a series of keystrokes.
First Ctrl+X+1 is simulated.
Then Ctrl+X+2 is simulated.
Ctrl+X+1 is bound to execute the command rlselect-history. The -x to bind ensures that the variables READLINE_* can be set. From man bash on bind -x:

Cause shell-command to be executed whenever keyseq is entered. When shell-command is executed, the shell sets the READLINE_LINE variable to the contents of the readline line buffer and the READLINE_POINT and READLINE_MARK variables […] If the executed command changes the value of any of READLINE_LINE, READLINE_POINT, or READLINE_MARK, those new values will be reflected in the editing state.
rlselect-history is defined as a Bash function which allows it to reconfigure the key binding for Ctrl+X+2. Depending on if the current selection should be executed or not, it binds Ctrl+X+2 to either accept-line or nothing.

So the new mechanism relies on using two extra key bindings: Ctrl+X+1 and Ctrl+X+2. I chose them because I don’t use them otherwise. But they can be any two key bindings.
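
The decision that rlselect-history makes can be modeled outside of Bash. The following sketch mirrors the three branches of the function above in Python so that the logic can be read and tested on its own. The function name and the returned tuple are inventions for this illustration. In Bash the same effect comes from setting READLINE_LINE and READLINE_POINT and rebinding Ctrl+X+2.

```python
def plan_readline_update(action, selection, current_line):
    """Model of the Bash function: return (line, point, accept).

    line   -> what READLINE_LINE would hold afterwards
    point  -> new READLINE_POINT, or None if the cursor is left alone
    accept -> True if Ctrl+X+2 would be bound to accept-line
    """
    if action == "tab":
        return selection, len(selection), False
    if action == "enter":
        return selection, len(selection), True
    # Any other action (for example ctrl-g) leaves the prompt untouched.
    return current_line, None, False


print(plan_readline_update("tab", "git status", "git"))
print(plan_readline_update("enter", "git status", "git"))
print(plan_readline_update("ctrl-g", None, "git"))
```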

The trick to finding this solution for me was understanding Bash key bindings. This answer on StackOverflow writes the following:

With bind, you can bind keys to do one of three things, but no combination of them:

Execute a readline command: bind '"key": command'

Execute a series of keystrokes: bind '"key":"keystrokes"'

Execute a shell command: bind -x '"key": shell-command'

That made me understand that you can not call accept-line from within rlselect-history because it is executed in the context of bind -x, and readline commands can only be executed in the context of bind '"key": command'.

Resources

Here are some resources that talk about the problem with TIOCSTI that helped me:

hstr (the program that initially inspired me to write rlselect) had a similar problem and I found clues to my solution there.
The fzf-plugins repo and this discussion provide a similar solution for fzf.
The article Readline and Fuzzy Finder helped me understand how to work with READLINE_* in Bash.

Jan 13, 2025 ∞

Today’s realization is that you can get important things done by consistently working on them for 15 minutes at the start of every day.

By doing it at the start of the day, you ensure that it gets done. And the rest of the day you don’t need to be stressed about not working on your important thing, because you already have.

Jan 12, 2025 ∞

Newsletter December 2024: Advent of Code

December is the month of Advent of Code. I had told myself not to participate this year because I know I get completely consumed by the problems and it has a negative impact on the rest of my life. It worked. Until December 15th. More on that later.

Code Editor Update

Last month I started working on a new code editor. It is a mix of a text editor and a structured editor. It is all text, but parsers and pretty printers allow you to work with a tree structure and not think too much about syntax.

I continued working on it this month. The big achievement was that I added support for another language in addition to JSON. The other language is rlmeta. Here is a screenshot showing the parser opened in the editor:

A screenshot of releditor editing the parser of
rlmeta.

This is a big achievement because it ties everything together. You define a parser and a pretty printer for your language. That gives you all editing capabilities. However, you can also write a code generator, and now you have a full blown programming language with editing support “for free”. This potentially provides an environment to quickly experiment with new programming languages.

Conceptually, I’m quite happy with this achievement. However, there are many things to work on before this is “production ready”. First of all, the performance is pretty horrible because of the constant parsing and pretty printing. Second of all, I need to see if a tree based editor can actually become better than a regular text editor.

Advent of Code

I couldn’t help but participate this year as well. The experience was not as stressful as last year. I still got consumed by the problems, but the feeling was mostly positive. I managed to complete all but 3 problems. Right now, the interest to complete them is pretty low. I might take a look at other solutions to see if I can learn something from that.

My approach to solving the problems is that I try to solve them in order, and I don’t look at others' solutions until I have solved both parts. However, I’m out of ideas to try on the last problems, and I think the competition part is over by now. I might learn something for next year if I look at solutions now.

This year I also practiced object oriented design. So my solutions involve many small objects interacting with each other to produce a solution. It was mostly a success I think. One of my favorite solutions is for day 11.

This year I also think that I got the hang of Dijkstra and A*. (I found Introduction to the A* Algorithm from Red Blob Games really helpful.)

You can find all my solutions on GitHub.

Dec 8, 2024 ∞

Newsletter November 2024: A New Project

Compared to last month, this month I did some programming in my spare time. I had fewer commitments, and my mind started thinking about various programming projects. We also got the first snowfall of the season and I got to enjoy a run in it:

Me running in a snow-covered landscape with the
sun setting in the background.

A New Code Editor

The programming project that I started working on is a code editor that is a mix of a text editor and a structured editor. It is all text, but parsers and pretty printers allow you to work with a tree structure and not think too much about syntax. The code is available on GitHub, and here is what it looks like when editing a JSON document:

Screenshot of rledit editing a JSON document with a
selection.

I got the idea for this project when trying out Black. Black automatically formats Python code for you so that you don’t have to think about it. I’ve been interested in structured editors for some time, but my feeling is that they are not user friendly because they limit what you can type. Then I came up with this idea of an editor that constantly parses what you type. If the parse is successful, it pretty prints it for you and provides you with edit operations on the AST. But it is all still just text, so you can type whatever. In the worst case, the parse fails and you have to fix it manually.
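
The parse-then-pretty-print loop can be sketched in a few lines for JSON using only the standard library. This is an illustration of the idea, not the editor's code. The function name format_if_valid and the choice of four-space indentation are assumptions.

```python
import json


def format_if_valid(text):
    """Return (pretty_text, True) if text parses as JSON, else (text, False)."""
    try:
        value = json.loads(text)
    except json.JSONDecodeError:
        # The parse failed: keep the text exactly as typed.
        return text, False
    return json.dumps(value, indent=4), True


print(format_if_valid('{"a": [1, 2]}')[0])
print(format_if_valid('{"a": '))
```

An editor built this way would call something like this after every keystroke, which is also why repeated parsing and printing can become a performance concern.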

So far, it looks quite promising. And most importantly, I’m having fun experimenting. The most likely scenario is that the project will not be a success, but I will learn something and have fun doing so. But you never know. One day, one of these projects just might turn into something that is invaluable.

TDD

This month I also watched a presentation by Kent Beck called TDD: Theme & Variations. For me, it was a nice refresher on the origins of TDD.

One thing that I appreciate with TDD, that I partially had forgotten, is how it reduces anxiety. Kent reminded me of it in the presentation. When all tests are passing and you can’t think of any more tests to write, you are done, and you know that it works. That reduces anxiety a lot.

Dec 4, 2024 ∞

Today I ran part of the way to work. It was a cold, beautiful winter morning in Stockholm.

Me running with water and Stockholm City Hall in the background.

Nov 28, 2024 ∞

Sometimes, I solve programming problems by coding on paper. A few days ago, it looked like this:

A piece of paper with source code written on it with annotations.

Nov 28, 2024 ∞

I’ve started working on a code editor that is a mix of a text editor and a structured editor. It is all text, but parsers and pretty printers allow you to work with a tree structure and not think too much about syntax. It is a work in progress. Code is here.

Screenshot of rledit editing a JSON document with a selection.

Nov 23, 2024 ∞

We got some more snow. I like running in the winter. Especially when there is snow and the sun is shining.

Me running in a snow-covered landscape with the sun setting in the background.

Nov 20, 2024 ∞

I needed to submit some heic photos to a service that only accepted jpg. I didn’t know about the heic format, but a little searching gave me a solution:

$ heif-convert
bash: heif-convert: command not found...
Install package 'libheif' to provide command 'heif-convert'? [N/y] y
...
$ find . -iname '*.heic' -exec heif-convert -q 100 {} {}.jpg \;
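
The same batch conversion can be planned from Python. This sketch only builds the command lines, using the heif-convert arguments from the find command above, and leaves running them to the caller. The function name plan_conversions is made up for the example.

```python
import pathlib
import tempfile


def plan_conversions(root, quality=100):
    """List heif-convert commands for every *.heic file below root."""
    commands = []
    for path in sorted(pathlib.Path(root).rglob("*")):
        # Match the extension case-insensitively, like find's -iname.
        if path.is_file() and path.suffix.lower() == ".heic":
            commands.append(
                ["heif-convert", "-q", str(quality), str(path), str(path) + ".jpg"]
            )
    return commands


# Try it on a throwaway folder with one photo and one unrelated file.
with tempfile.TemporaryDirectory() as tmp:
    (pathlib.Path(tmp) / "photo.HEIC").write_bytes(b"")
    (pathlib.Path(tmp) / "notes.txt").write_bytes(b"")
    planned = plan_conversions(tmp)

print(len(planned))
```

Each planned command could then be passed to subprocess.run(command, check=True) on a machine where heif-convert is installed.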

Nov 20, 2024 ∞

Today was the first day of snow this season. Not much. I’m looking forward to many more runs on a white trail.

Me running on a trail with a little snow.

Nov 16, 2024 ∞

I was researching how to run Black (and possibly other formatters) from Vim and found Ergonomic mappings for code formatting in Vim. It was very helpful.

Nov 3, 2024 ∞

How would you improve this code?

def update_r_users(service):
    r_users = []
    for user in service.get_all_users():
        if "r" in user:
            r_users.append(user)
    service.set_users_in_group("users_with_r_in_name", r_users)

Find out what I did in my latest newsletter.

Nov 3, 2024 ∞

Newsletter October 2024: Primitive Obsession?

Normally I do something related to programming in my spare time every month. I read something that I find interesting and want to share. Or I have some thought related to programming that I want to share. This is the first month since I started these monthly updates in June 2019 that I’ve got nothing of that. I’ve been occupied with other things, and I’ve also done quite a bit of programming at work. Perhaps that has satisfied my interest for programming.

One thing that I’ve done a lot at work this month is wrapping simple data structures in classes. Instead of passing around lists and dictionaries, I’ve created classes holding that data and only serialized it at the edges of the application. Every time I have done this, I wish I had done it sooner. It’s so good. And in nine times out of ten, those classes have attracted some functionality that fits perfectly. They have provided one place to put functionality instead of scattering it throughout the codebase.

What am I talking about? Let me give an example.

Imagine that we talk to a user service that has an API something like this:

service.get_all_users() -> ["user1", "user2"]
service.set_users_in_group("group1", ["user1", "user2"]) -> OK

The API works with users represented as list of strings. Now we want to write a function that applies some domain logic to assign users to groups. It is so easy and tempting to write something like this:

def update_r_users(service):
    r_users = []
    for user in service.get_all_users():
        if "r" in user:
            r_users.append(user)
    service.set_users_in_group("users_with_r_in_name", r_users)

One problem with this code is that it mixes calls to the user service with domain logic, so it is difficult to test domain logic without invoking the service. It’s also more difficult to reason about.

What if we instead did this:

def update_r_users(service):
    service.set_users_in_group(
        "users_with_r_in_name",
        Users.from_service(service.get_all_users()).filter_name("r").serialize()
    )

class Users:

    @classmethod
    def from_service(cls, users):
        return cls(users)

    def __init__(self, users):
        self.users = users

    def filter_name(self, text):
        return Users([
            user
            for user in self.users
            if text in user
        ])

    def serialize(self):
        return self.users

This is what I mean by wrapping simple data structures in classes. Why is this better?

First of all, I think update_r_users now reads a lot better. The filtering logic is no longer mixed with the calls to the service.

This comes at the cost of writing the Users class which is quite long for the relative functionality that it provides. In the beginning, I often find it hard to motivate myself to write these classes. It feels like a lot of boilerplate code for not much benefit. However, I often find that these sorts of classes attract functionality, at which point they start to become more useful.
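
One concrete payoff is that the filtering can now be tested without touching the service at all. A minimal sketch, repeating the string-based Users class from above so that the example runs on its own (the test function and the sample names are made up):

```python
class Users:

    @classmethod
    def from_service(cls, users):
        return cls(users)

    def __init__(self, users):
        self.users = users

    def filter_name(self, text):
        return Users([
            user
            for user in self.users
            if text in user
        ])

    def serialize(self):
        return self.users


def test_filter_name_keeps_matching_users():
    # No service involved: plain data in, plain data out.
    users = Users.from_service(["rickard", "bob", "maria"])
    assert users.filter_name("r").serialize() == ["rickard", "maria"]


test_filter_name_keeps_matching_users()
filtered = Users(["rickard", "bob", "maria"]).filter_name("r").serialize()
print(filtered)
```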

Another thing that they do is encapsulate the data format from the user service. Say that the API of the service changes. There is now more information about users:

service.get_all_users() -> [{"name": "user1", "age": 21}, {"name": "user2", "age": 43}]
service.set_users_in_group("group1", ["user1", "user2"]) -> OK

We can update the Users class accordingly:

class Users:

    @classmethod
    def from_service(cls, users):
        return cls(users)

    def __init__(self, users):
        self.users = users

    def filter_name(self, text):
        return Users([
            user
            for user in self.users
            if text in user["name"]
        ])

    def serialize(self):
        return [user["name"] for user in self.users]

The update_r_users stays the same. We control the API of Users. Our application can safely depend on it. And we can change the internals.
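
To see that claim in action, here is a small runnable sketch. FakeService is a hypothetical stand-in that mimics the two API calls from the example and returns users in the new dictionary format. Users is the updated class from above and update_r_users is the unchanged caller.

```python
class FakeService:
    """Hypothetical in-memory stand-in for the user service."""

    def __init__(self, users):
        self.users = users
        self.groups = {}

    def get_all_users(self):
        return self.users

    def set_users_in_group(self, group, users):
        self.groups[group] = users
        return "OK"


class Users:

    @classmethod
    def from_service(cls, users):
        return cls(users)

    def __init__(self, users):
        self.users = users

    def filter_name(self, text):
        return Users([user for user in self.users if text in user["name"]])

    def serialize(self):
        return [user["name"] for user in self.users]


def update_r_users(service):
    service.set_users_in_group(
        "users_with_r_in_name",
        Users.from_service(service.get_all_users()).filter_name("r").serialize()
    )


service = FakeService([
    {"name": "rickard", "age": 21},
    {"name": "bob", "age": 43},
])
update_r_users(service)
print(service.groups)
```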

I couldn’t find a name for this sort of pattern. My first thought was that it was a way to avoid primitive obsession. And it is. But it feels like more than that. Bill suggested that it might be just “good old encapsulation”. Dan said that it depends on the context and that it could be an anti-corruption layer in DDD terms. What would you call it?

Anyway, I’ve been doing this sort of thing a lot this month, and I thought it was worth sharing.

Nov 2, 2024 ∞

Today I learned about the Rison data serialization format. I wrote a function to convert a Python value to Rison format. It was an elegant recursive function with partial support for the format.
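
For readers who have not seen Rison, here is a rough sketch of what such a recursive function can look like. It is not the function mentioned above, and it covers only part of the format: None, booleans, integers, strings, lists, and dictionaries with string keys. It also quotes more strings than Rison strictly requires, because it leaves a string unquoted only when it matches a deliberately conservative pattern. All names are made up for this example.

```python
import re

# Conservative subset of what Rison allows as an unquoted identifier.
SAFE_ID = re.compile(r"[A-Za-z_][A-Za-z0-9_]*\Z")


def quote(text):
    """Return text as a Rison string, quoting and escaping when needed."""
    if SAFE_ID.match(text):
        return text
    escaped = text.replace("!", "!!").replace("'", "!'")
    return "'" + escaped + "'"


def to_rison(value):
    """Convert a Python value to Rison (partial support only)."""
    if value is None:
        return "!n"
    if value is True:
        return "!t"
    if value is False:
        return "!f"
    if isinstance(value, int):
        return str(value)
    if isinstance(value, str):
        return quote(value)
    if isinstance(value, list):
        return "!(" + ",".join(to_rison(item) for item in value) + ")"
    if isinstance(value, dict):
        pairs = []
        for key, item in value.items():
            if not isinstance(key, str):
                raise TypeError("only string keys are supported")
            pairs.append(quote(key) + ":" + to_rison(item))
        return "(" + ",".join(pairs) + ")"
    raise TypeError("unsupported type: " + type(value).__name__)


print(to_rison({"a": 0, "b": ["it's", True, None]}))
```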

Nov 2, 2024 ∞

I’ve used testing without mocks quite extensively now. I’ve also used it in a work project for more than a year. My experience is that it’s the best testing strategy that I’ve ever used. I’ve never felt more confident that my code works. I refactor code without fear of it breaking. It’s so good.

Oct 28, 2024 ∞

It’s getting dark. It gives variation to the running.