Luis Orduz

A genetic algorithm implemented in Python

2023-03-01T00:00:00-05:00

Natural selection is, roughly, the likelihood of a given individual to survive long enough to reproduce, and thus continue its species. Factor in mutations—random changes in the genes—and the probability a given mutation has to help an individual survive (or not) in its environment and the result is that some individuals are more likely to reproduce than others. Those fitter individuals are more likely to pass on their mutations to the next generation, which will add mutations of its own, ultimately causing the population to slowly change as these mutations accumulate. Repeat this process over multiple generations across millions of years and we get evolution.

Turns out that implementing these ideas, or at least analogies, in software can be useful to solve certain problems, so let's write a simple program that exemplifies the process.

Seed

There are multiple types of genetic algorithms with multiple different uses, but usually they start with a data sample.

from collections.abc import Collection
from typing import Protocol


class Population[Individual](Collection, Protocol):
  def select_random(self) -> Individual: ...


def algorithm[Individual](population: Population[Individual]):
  parent_a = population.select_random()
  parent_b = population.select_random()

To keep things simple, we start with two parents that are selected randomly from the existing population, and we'll go from there.¹

Crossover

With our first pair in place, we can now produce the next "generation".

class Offspring[Individual](Protocol):
  def mutate(self) -> Individual: ...


class Population[Individual](Collection, Protocol):
  # ...
  def crossover(self, first: Individual, second: Individual)\
      -> Offspring[Individual]: ...

  def add(self, individual: Individual): ...


def algorithm[Individual](population: Population[Individual]):
  # ...
  base_offspring = population.crossover(parent_a, parent_b)
  real_offspring = base_offspring.mutate()
  population.add(real_offspring)

The crossover in genetic algorithms is the operation used to combine the data of the parents to produce offspring. But we can't just stop there, we need genetic variance to ensure the population actually evolves over time. One form of variance is of course that the parents contribute different characteristics selected at random from each parent, but even that isn't enough as it could leave us stuck².

Actual variance comes from the key element of mutation, the random chance that any given offspring individual will have genes not present in the parents.³

Finally, the new individual is, of course, a new member of the population so we add it⁴.

Natural Selection

At this point, we have parents and their offspring, what now? It's time to determine the goal. Genetic algorithms are commonly used to find a good enough solution to certain types of, often trial and error, problems that don't translate well to common normal algorithms. Fortunately, the only thing resembling a "goal" in nature is simply thriving, surviving long enough to reproduce... So let's do that, by introducing a "niche" and determining how well the individuals fit that niche.

# ...
class Population[Individual](Collection, Protocol):
  # ...
  def remove(self, individual: Individual): ...


class Niche[Individual](Protocol):
  def tournament(self, pop: Collection[Individual])\
    -> tuple[Individual, Individual]: ...


def algorithm[Individual](
  population: Population[Individual], niche: Niche[Individual]
):
  # ...
  fittest, unfit = niche.tournament(population)
  population.remove(unfit)

Nature is ruthless, and so is our algorithm. In nature, only the fittest perpetuate their genes, and in our algorithm, only that individual in a group⁵ that best fits the niche is the one to continue. This is usually called "tournament selection" in genetic algorithm jargon.

Finally, to maintain our analogy (and really to prevent our population from growing without bound) we remove the least fit individual from the population.

Generations

We have almost completed the algorithm, but the mere fact that we've found an individual that fits the niche better than others doesn't mean we've actually found one that thrives in the niche; the likelihood of achieving that in just the first generation is nil. We'll need many generations, so we need to repeat the process until we find such individual.

# ...
class Population[Individual](Collection, Protocol):
  # ...
  def find_mate(self, individual: Individual) -> Individual: ...


class Niche[Individual](Protocol):
  # ...
  def can_thrive(self, individual: Individual) -> bool: ...


def algorithm[Individual](
  population: Population[Individual], niche: Niche[Individual]
) -> int:
  parent_a = population.select_random()
  parent_b = population.select_random()

  generations = 0

  while not niche.can_thrive(parent_a):
    base_offspring = population.crossover(parent_a, parent_b)
    real_offspring = base_offspring.mutate()
    population.add(real_offspring)

    fittest, unfit = niche.tournament(population)
    population.remove(unfit)
    parent_a, parent_b = fittest, population.find_mate(fittest)

    generations += 1

  return generations

There are several options here, one commonly used in real genetic algorithms is to pick the two fittest instead of just one and make those "reproduce", producing an entirely new population and continue iterating from there. But to keep our natural analogy going, let's instead assume that our fittest finds a "suitable mate"⁶ in another member of the population, which also adds another source of variance.

Ultimately, the point here is iteration: continually doing the crossover and tournament selection until we meet our goal.

And there we have it, that algorithm function represents our full genetic algorithm, in a way I hope is self-explanatory enough. That function should work without change as long as it receives arguments that actually implement the protocols properly.

Here's a file with the complete definition, and here's a file with a string-based implementation of the algorithm, along with the function being run.

Now, before finishing, you'll notice that I talked very little about the actual problems that could be solved with this type of algorithm... Well that's true, because the point of this post was the algorithm itself. That said, I might write a follow up with a practical example.

Appendix (On implementation and testing)

I mentioned above that implementation doesn't matter and it indeed doesn't but for the sake of completeness—to fully explain the genetic algorithm—I wanted to go over what happens during crossover and tournament selection. However, we can write tests to do that instead of explaining the implementations line by line!⁷

from hypothesis import given, strategies as st

from implementation import Individual


st.register_type_strategy(Individual, st.builds(Individual, st.text(
  alphabet=Individual.POOL,
  min_size=Individual.LENGTH,
  max_size=Individual.LENGTH
)))

Before anything is done, we have to tell hypothesis how to create an Individual that actually fits our implementation. Only then we move onto the tests:

# ...
from implementation import Population

@given(...)
def test_crossover(parent_a: Individual, parent_b: Individual):
  offspring = Population().crossover(parent_a, parent_b)

  is_parent_a = is_parent_b = False

  for gene, a, b in zip(offspring, parent_a, parent_b):
    assert gene in (a, b)
    is_parent_a |= gene == a
    is_parent_b |= gene == b

  assert is_parent_a and is_parent_b

This test tells us everything we need to know about what happens in crossover without actually having to check the implementation: We don't care how it's done, but we do care that every gene in the offspring comes from one of the parents, and that both parents had an input.

# ...
from implementation import Niche


@given(...)
def test_tournament_selection(niche: Niche, population: Population):
  pop = set(population)
  winner, loser = niche.tournament(pop)

  while len(pop) > 1:
    assert winner in pop

    pop.remove(loser)
    _, loser = niche.tournament(pop)
  else:
    assert winner == loser

Tournament selection is a bit trickier to test because the calculation for each fittest is an implementation detail, but one the whole idea depends upon. We could simply repeat the implementation here and assert that the winner and loser were calculated correctly but then the test would no longer work if we changed the metric (or changed what an individual is entirely).

In these cases, we have to step back and think of invariants: what is always true about the tournament selection? As long as the winner remains in the population and no other individuals are added, it will always be the winner. And that's what we test: We systematically remove each loser until only one individual is left in the population; that individual must still be the original winner!

I use protocols because Python's structural subtyping is pretty good at properly representing a domain. In simpler terms: we only care about what our objects can do. On that note, the code I'll be showing shouldn't throw errors in mypy or pyright. ↩
If we stick to just the parents' genomes, then the target will never be reached if it requires a gene that neither of the parents has. ↩
The intermediate class Offspring fulfills two purposes here: to explicitly show the mutation step (instead of leaving it as an implementation detail of crossover) and to rely on the type system. We'll know we have a real individual only if it was selected from an existing population or if it's the result of mutation from the crossover of two parents. ↩
You might have noticed that an "Individual" is represented only by generic type arguments. This is on purpose: the algorithm doesn't need to care what an individual is. ↩
In the type annotation, I used Collection instead of Population (which is itself a collection) because the tournament could be done over any group of individuals; it doesn't have to be a population specifically, and we gotta be "liberal in what we accept". ↩
How it finds it is an implementation detail, hopefully one that excludes its parents. ↩
Since, as we all know, "code is for what, tests are for why, and comments are for jokes". ↩

Domains of engineers and users

2022-04-30T00:00:00-05:00

I finished my last post by mentioning how refactoring can help us achieve a code base where discussing the code and discussing the domain model can be analogous or even entail similar discussions. That's a topic that deserves a deeper looking into.

Experts on different domains

Imagine this conversation:

End user/Product owner: "We'd like for the URL of the Foo to have the provided identifier instead of these random characters."

Engineer: "Oh, by default the framework adds a UUID as the primary key, which is also used for URLs; we'll update it."

End user/Product owner: "Is that a front-end change or a back-end change?"

Now, now, that's an exaggeration as I've worked with plenty of users and product people who were well versed enough on technical details—often because they had to pick up the terminology—to understand what the developer was talking about in that exchange, but I hope this gets the idea across: Too often, there's a gap in shared terminology that hinders the communication between engineers and users.

And here's an example in the other direction:

End user/Product owner: "We need the aggregated results of the 'Foo' to be calculated from the monthly 'bar' instead of the weekly 'bar' from now on."

Engineer: "Understood, I'll add it to the backlog and change it as soon as possible."

Engineer (later, to another engineer or the tech lead): "Where the hell do the aggregated results value come from? Is that in the database? How is it called there? And what even is bar?"

Now that one is no exaggeration; I've had that exact conversation over the years, multiple times. And I've been on both sides of that last question too. Of course, I've also been in projects where there was enough rapport between the engineering and product teams for the engineer to simply ask the product owner right away... But the problem still persists, only in the form of periodic repetitive conversations instead of the latency caused by the developer looking for someone who understands.

The point is that engineers are experts on the domain of software engineering, while users are experts on their own domain, which they're hoping it gets easier to do with the application/platform/system they bought/subscribed to, or hired the engineers to create/maintain.

Code and stories

Let's imagine that the user/product owner comes at us with the following requirement:

As a user, I want to store my "Foo" with the "jon", "doe", "bar" and "baz" values. With the understanding that "baz" is seven times "bar".

After the appropriate back and forth, we come to the following user story:

Store Foo: Create a new model in cool_framework used to store the periodic Foo of the user. The fields are jon (unique string), doe (integer, 100 by default), and bar (float, cannot be empty); baz will be dynamically generated from bar. A UUID field will be included as primary key by the framework.

This user story is of course lacking a way for users to actually submit the data they want to store¹, but it already has more than enough to show how the mismatch in communication starts: The user/product owner could very well ask "what the hell is a UUID?"²

I should point out that I am in no way condemning this approach. It's perfectly fine and how around ninety percent of the projects I've worked on have looked.

But now let's look at the code that fulfills the user story:

from cool_framework import models


class Foo(models.Model):
  jon = models.StringField(unique=True)
  doe = models.IntegerField(default=100)
  bar = models.FloatField(nullable=False)

  @models.computed_property
  def baz(self):
    return self.bar * 7

Again, this is perfectly fine; I currently work in projects whose code looks like this (with a billion more fields and methods, of course), and there aren't problems, usually.

The thing is while this code can trivially be discused among engineers, it can't be discussed by or with users. By this point, both groups are essentially speaking different languages: The users talk about whatever Foo is, how the "jon" group performed this week, when the "baz" could change, etc.; the engineers, conversely, are talking about fields, properties, tables, migrations, etc.

And this works! Most of my experience has had this kind of separation and the teams work and the product is delivered. But what if the language gap could be narrowed?

A shared language

Let's rephrase that story:

Store Foo: Create a class/type/struct³ Foo where the user can store multiple instances of the following values: jon (type ProvidedIdentifier), doe (type UserEstimation), and bar (type PeriodicResults); baz (type AggregatedResults) is calculated from bar.

I just made up those types, but what's important here is that we're to assume that those types mean something to the users. Whatever is in "jon", the users usually call it the "provided identifier" of whatever Foo is; they know that "bar" is the "periodic results" of whatever it is that the users do. Ditto for the other fields.

With that understanding, let's rewrite the code to start illustrating why representing the domain more explicitly is important:

from decimal import Decimal


class ProvidedIdentifier(string):
  class Meta:
    unique = True

class UserEstimation(int):
  class Meta:
    default = 100

class PeriodicResults(Decimal):
  class Meta:
    nullable = False

class AggregatedResults(Decimal):
  FACTOR: int = 7

  def __new__(self, pr: PeriodicResults):
    return super().__new__(self, pr * self.FACTOR)


class Foo:
  jon: ProvidedIdentifier
  doe: UserEstimation
  bar: PeriodicResults

  @property
  def baz(self) -> AggregatedResults:
    return AggregatedResults(self.bar)

Is the code more verbose? In total, absolutely⁴ but it too became clearer about the what we're dealing with: We can assume that the signatures of those classes came from discussions with the users, where they described what that data means in their business and how it's supposed to behave. Sure, doe is still a string, bar a floating point value, etcetera, but now the developers can talk about the code in terms similar to what the users talk about, which also means they'll talk to the users in those terms too.

Let's reimage the conversations from the start, but now with the assumption that all the code is like this:

Well, for starters the first conversation no longer makes sense as another win from these changes is that we decoupled our business logic from whatever framework we're using⁵... As long as the framework treats the business code as source of truth anyway.

As for the second,

End user/Product owner: "We need the aggregated results of the 'Foo' to be calculated from the monthly 'bar' instead of the weekly 'bar' from now on."

Engineer: "Just to make sure, baz is the aggregated results and bar is the periodic results, correct?"

End user/Product owner: "Correct!"

Engineer: "Cool, I'll get it done."

Let's be real, it's likely that the engineer still doesn't know what the hell these "results" are or why the users care about them, but now at least it might be easier to pinpoint what needs to change without having to find someone who might be more familiar with the domain.

Finally, it bears mentioning that this approach doesn't solve all communication problems: The engineers will never be domain experts on the users' domain, so there will always be questions, specially when creating new features as the developers will need to ask what the new classes/types will need for fields and metadata⁶; as for the users/product owners, the decoupling with the framework and the increasing reliance on native language constructs means that the number of technical details they need to have an idea about becomes smaller, but not zero.

Regardless, giving the team (both involved users and developers) a way to more fluidly discuss the product is a huge step forward in my book.

A caveat

All of that is pretty nice; however, the boilerplate the users don't care about still needs to exist somewhere. Maybe abstracted and hidden away in specific modules or even internal libraries that have their own repos, but finding and correcting leaky abstractions is a neverending battle, so those discussions that ideally should be among engineers only might find their way in conversations with the users.

Imagine a world where we didn't need to abstract the boilerplate for every project; that it simply didn't exist. In that ideal world, we could just drop some native classes/structs/etc that contain all the business logic and only the business logic to some tool(s), and the tool would automagically take care of all the wiring needed for the application to reach the end user.

I've given a lot of thought to such an idea, and maybe it's a pipe dream, but stranger things happen in this industry all the time.

Let's say a form in some web site/application. ↩
Or, for that matter "what are strings and floats?" You never know! ↩
Or whatever construct your favorite language uses to group data, if any! ↩
But hey, the code of the Foo class itself got simpler. ↩
Yay for clean architecture! ↩
In other words, some conversations for the developers to make the business rules clearer, which they then can express in the code! ↩

Practical refactoring: Abstractions

2022-03-31T00:00:00-05:00

In my last post, we did a basic rundown of a very convoluted short algorithm to make more explicit what was actually happening in it. That by itself goes a long way in improving how readable the code is, and thus makes it easier to maintain. I've seen small improvements like that be welcome enthusiastically among different teams, but we can go further.

I remember a project I worked on where there was basically no separation of concerns between request handling boilerplate, database connection boilerplate and actual business logic; everything was handled within the same functions. It was a nightmare. I was hired to create some new APIs, but it took me just a week of trying to create new handlers like that to decide that such was no way to live. I took it upon myself to refactor that code. It bears reasserting that when refactoring, it's good—and often enough—to first reach for the lowest hanging fruit and in that code the easiest improvement was, of course, separating code that dealt with different things into different functions, and calling those new functions from the old ones.

Now, the small script we're going through does really only one thing, but that doesn't mean we can't divide some responsibilities by way of abstracting away some code not directly related to that one thing. If we do this, we improve the code in at least three ways:

The algorithm itself becomes more immediately obvious; it's easier to understand what the code does.
By abstracting the supporting/boilerplate code, we make it possible to reuse those same structures somewhere else.
This gives us a opportunity to bring the code in step with the domain: using names specific to what we're doing instead of talking exclusively about basic data types.

The abstractions

Something I didn't mention in my last post is that the small method we're refactoring is actually part of a basic genetic algorith I implemented for fun. Well, genetic algorithms deal with populations, so what if instead dealing with "lists" of "strings", we create an actual Population class/data type that does what we need our populations to do:

class Population(list):
    def __iter__(self) -> Iterable[Self]:
        return iter(self[x::2] for x in range(2))

Not complicated, now we know that a population is a list, but one that when iterated just produces two sublists: the two halves of the original. We can improve it further, but this is enough for what we need¹.

We have our population, but our genetic algorithm is not of random things; it specifically looks for individuals that match a given root individual, with the purpose of each generation to be closer to that individual.

Let's give it a try:

class Individual(str):
    def __and__(self, other: Self):
        return sum(ch_s == ch_o for ch_s, ch_o in zip(self, other))

For our "model"² we know that we need a specific type of string that when compared with another will return the number of identical characters. So we do just that: create a str subclass that can "intersect" with other strings of the same type, and give a numeric value representing how well they match.

With these two abstractions alone, our method improves considerably:

def get_best_matches(self) -> WinnerPair:
    # self.population: Population[Individual]
    # self.root: Individual
    winners = WinnerPair()
    for half in self.population:
        winner = max(half, key=lambda ind: ind & self.root)
        winners.append(winner)
    return winners

I like it. We get each member of the winner pair from half of the population, and notice how the code doesn't show things we don't need to know to understand the steps:

How do we get each half of the population? That doesn't matter for understanding the algorithm; we just need to know that we're getting a half. How that half is gathered is up to the implementtion in Population which we can check if we need to, or we could change it if we have to: Like instead of appending the odd-positioned items to a half and the even-positioned to the other half, we could just literally split it at the central index.
To get the winner we're comparing how well is the intersection between each individual and the root (self.root, renamed from self.word³, in this case). How is that intersection calculated/found? That's entirely an implementation detail, which again we can check and/or change in Individual if we have to. We could even make individuals a different type instead of strings, make a comparison appropriate for that type, and we wouldn't need to change anything in this method.

Notice that, at the end of my previous post, I said that we could use abstractions instead of relying on comments to tell us what the data types are supposed to represent. There are still comments in this piece of code... but that comment is just to note what abstractions we're using, and is there just for description purposes. In the full code it would be completely redundant, since the types of self.population and self.root would already be defined somewhere else. Likely at the top of the class this method belongs to.

Nonetheless, we could still make those comments explicit in the code⁴ by way of, say, making them arguments to the method. However, that would no longer be a refactoring⁵... But we can cheat a little bit:

def get_winners(
  population: Population[Individual], root: Individual
) -> WinnerPair:
    winners = WinnerPair()
    for half in population:
        winner = max(half, key=lambda ind: ind & root)
        winners.append(winner)
    return winners

def get_best_matches(self) -> WinnerPair:
    return get_winners(self.population, self.root)

Basically, we create a new function that isn't necessarily tied to a class (note the lack of self) and we call that function from our method. This gives us the potential advantage of reusing that very same winner finding logic somewhere else if the opportunity arises.

Hindsight

So there you have it, we improved the code, considerably, just by moving a few lines around and making what we're doing and using more explicit. It goes without saying that this would help us communicate between the developers and with the product team (if any) much more easily.

Ideally, in a good codebase, discussing the code and discussing the business model would entail very similar expressions, as the code would just be a literal (with caveats of course) representation of that business model. Hopefully this post showed how such a thing could be achieved.

This is a subject I really, really like and I'm hoping to keep writing about it.

When refactoring, it's easy to get lost making increasingly minute improvements. We always gotta remember that premature optimization is the root of all evil and that abstractions can easily become unnecessary indirections. This is a fine line, and sometimes I don't spot it, so it's important to remember that the goal is to make the code more maintainable instead of perfect from the get go (which is impossible anyway). In an actual product, it also helps to remember what I think as the rule zero of Software Engineering: "fight for the users". If something we're doing won't improve the users' experience in any meaningful way, it might be best left alone. ↩
This genetic algorithm in particular just iterates over populations that increasingly resemble the root word. ↩
In my opinion, the naming of the root individual as self.word was a code smell born out of the usage of basic data types instead of proper abstractions. It was a way to hint at the developer that the comparison was between strings. Using proper types/classes, we no longer need to do that. ↩
After all, "code is for what, tests are for why and comments are for jokes" (which is also a joke... or is it?) ↩
If "improve the easiest thing first" is rule one of refactoring, then "do not, under any circumstance, change the contract" is rule zero of refactoring. To put it another way: if we have a good test suite, a proper refactoring shouldn't change the result of any test. Anything other than that is an actual change in the behavior of the application/system, and it better have been agreed upon. ↩

Practical refactoring: 'clever' code

2022-02-28T00:00:00-05:00

Look at this code

def get_best_matches(self) -> WinnerPair:
    """
    Divide population in half.
    Pick the word closest to the matching word in each half.
    """
    return WinnerPair(*map(
        lambda population_half: max(
            population_half,
            default="",
            key=lambda word: sum(
                a == b for a, b in zip(word, self.word)
            )
        ),
        (self.population[::2], self.population[1::2])
    ))

I'm not gonna deny it, I liked writing it, I like that it is technically a single function call¹, the usage of lambdas and the built-in Python functions used for handling, well, functions and iterables. What can I say? It makes me feel "clever" because technically it's code that requires certain level of familiarity with the language.

It's also a total mess. I literally spent an entire afternoon explaining this "short" piece of code to an experienced engineer who had already invested a few months getting familiar with Python.

This code is me at my most self-indulgent and I'm well aware I would never have written this outside of a prototype meant only for me to play around. Code like this is not meant to live in a system worked at by more than one developer. It'd be a nightmare to maintain, as only the one who wrote it could possibly understand it. Hell, I wrote this and I had to struggle a bit puzzling what it actually did.

In short, this code is ripe for improvement, which is exactly what I'm gonna do.

Unclear iterations

The first thing that jumps at me upon seeing this code is that there are three nested iterations in it, but it's very difficult to tell which one is which or where each one ends. An easy first fix is then relying less on built-in functions and making the iterations more explicit via for statements.

def get_best_matches(self) -> WinnerPair:
    # Get the words closest to the target in each half of the population
    winners = WinnerPair()
    for population in (self.population[x::2] for x in range(2)):
        scores = []
        for word in population:
            similarity = 0
            for char_word, char_target in zip(word, self.word):
                similarity += chard_word == char_target
            scores.append((word, score))
        winner = max(scores, key=lambda score: score[1])
        winners.append(winner[0])
    return winners

Looks quite different, doesn't it? It would seem to someone completely unfamiliar with Python that I changed more than replacing the function calls with fors, but that's truly all I did:

The first function was map which is doing something to both halves of the population.
The second function was max which is picking the highest according to something in each word in the population.
The third function was sum which is actually calculating that previous "something": In this case, how similar is the current word with the target word.

I then reused max, but it's now clearer what maximum value of what it's being picked. I will not lie: I hesitated with leaving the sum as it was as I felt that with the other replacements it was clear enough, but then I saw the opportunity to further clarify that we were comparing the current word with the target word. On the other hand, I did leave zip as it was, as that one is clear enough to me.²

Aside: Someone with some familiarity with algorithm analysis might see three nested for loops and pale at the "cubic" complexity, but this function isn't iterating over the population input (let's call it "n") multiple times. It's instead iterating only once over the total characters input (let's say "m"). In short: This iteration only visits each character in the population once.

The word list (but not each word) is visited twice because of the max function, but since two is a constant, it remains of linear complexity.

Anyhow, those "straightfoward" changes are enough to at least being able to tell what the function is doing line by line, but it can be better.

As it is, we're doing a bunch of operations over basic data types with a comment explaining what those data types are supposed to represent. We could instead explicitly define our own abstractions over those data types and let those abstractions tell us what they can or can't do, or how they should be used.

But I feel like that is interesting enough for its own post, so see you in the next part!

Well, a function call wrapped in a class instantiation, but who's nitpicking? ↩
I firmly think that Software Engineering is engineering, and I have no problem calling myself "engineer" over, say, "craftsman", but there is a subjective factor to some decisions. ↩

21 & 22

2022-01-01T00:00:00-05:00

2021

I wish I could say 2021 was an eventful year, but frankly I spent most of it dealing with ennui, boredom and general disenchantment. One could even say I might have faced some burnout, but I don't think that's it; I still very much like writing code.

I also got covid right in the middle of the year so that might also have put a damper on things, but that doesn't mean that absolutely nothing happened last year.

Life

Back in June 2020, and rather suddenly, an opportunity to accomplish one of my personal goals presented itself, and I decided to go for it with the caveat that I had to complete it within ten months. Indeed, I finished it by April, and now that's one fewer thing for me to worry about, forever.

I moved back to Bucaramanga, my home city, after spending three years in Bogotá. I figured I wasn't doing much of anything there (even less with covid around) so I decided I wanted to spend more time with my folks, siblings and my sister's children.

I could finally go back to the movies after over a year of lockdown, but more on that later.

Work

I started working at a new company, with which I'm extremely happy. It fits all I want from a place to work nigh perfectly (remote, flexible, well organized, interest in new tech, etc) and the team is great. Some of the projects are better than others but regardless, it would take a lot, a lot, for me to leave.

On the personal projects front, I created a prototype to compare single page applications with backend generated fragments displayed by htmx.¹ Also I learned a little about genetic algorithms while on it.

Did some updates to this website, mainly related to indie web things.

Reading

Technical

I cut back on my HN surfing. Not because the signal to noise ratio has gotten lower (although I've gotten the feeling sometimes) but because I was simply spending too much time in, often repetitive, comment sections. And I didn't even comment, I mostly read.

Instead, I've been subscribing to more and more blogs (both company tech blogs and engineers' personal blogs), to the point that my subscription list in the old reader tripled over the year. I'd say the change is good in general.

Likewise, I've followed more people on social media (primarily the same engineers whose blogs I subscribed to), mostly for discovery value: Often those people share stuff by other people, which means potentially new blogs to follow.²

Fiction

I finally got an e-reader last year, and it has translated on my reading quite a few more books than in recent years. Mostly fantasy, and I'd say my favorite was probably "Prince of Thorns". I'll be reading its sequels and more this year.

Movies

According to my Criticker profile, I watched sixty six films last year. I love that almost half of them were watched in theaters, although I do stream movies; had to pick up the habit during 2020.

Apparently, the movie I hated the most overall was some cliched horror movie from 2019, but the one I hated the most that I watched in theaters was of course the new "matrix" movie.

Conversely, I gave the exact same rating, the highest of the 66, to two different movies: "Savig private Ryan", which I watched at home, and "Spider-Man: No way home", which I watched in theaters. Yup, I stand by it, definitely the two movies I liked the most that I saw last year.

Unfortunately, nothing over 85 (which would translate to a nine or higher in imdb), and indeed, no movie I watched last year really floored me. I'm hopeful for "The Northman" to pull it off this 2022.

Music

I don't stream music; I'd rather buy the songs I like and listen only to my collection... But that didn't happen often last year. Though I did remember adding "Courtesy Call" as well as some instrumental tracks to my "music for programming/writing" playlist.

2022

With 2021 out of the way, I do have plans for this year.

Life

I'm pretty happy with where my life is right now, materially speaking, so I'm focusing this year on finally losing weight. I have a plan this time.

Work

Will keep working in my current job. As I said, I have zero interest in leaving my current company.

Will do far more experimentation, both to practice stuff I might have gotten rusty with and to get to know new things... such as Rust.

I also have an idea for a potential tool/service that could become my first officially released open source project. Gotta work on it and iterate to see if it actually is viable or not.

Reading

Nothing much, will continue the trend of reading the blogs I already follow and subscribe to more.

Likewise, more usage of my e-reader for fiction.

Writing

I'll do my best to be more active on this blog. Can't promise a regular schedule to myself, but I will write at least a note of anything that comes to mind or that I find interesting.

Will finish some stories I am writing privately and will try to get started on something I feel I can actually publish. To get feedback if nothing else.

Movies

As I said, I'm really looking forward to "The Northman", but also there's a handful of other movies that I'm hoping they're as great as they look (looking at you "Doctor Strange") or that end up surprising me (looking at you "Avatar 2" or "The secrets of Dumbledore", among others).

Long story short: HTMX is absolutely magical and everyone should be using it. ↩
I still actively avoid the awful, outrage-obsessed side of social media, of course. ↩

Creating stories from requirements

2020-02-01T00:00:00-05:00

A few weeks ago, one of my best friends came to me with an idea for an application. Unlike most (if not all) of my ideas, this one I actually believe to have potential so I asked him to prepare a list of requirements we could use to at least have a rough goal to aim for with this application. He delivered, and I was presented a good enough list... And I did nothing with it, I've been letting it sit there, all unfulfilled potential. But that changes now!

I've been working as a software engineer for a while and that experience, the manifesto, what I remember from my college studies and what I've read regarding system design all point to the same conclusion: Having a fixed list of tasks and dedicating too much time to it is pointless because requirements change like waves in the sea.

However, I still feel it's important to have a set of stories to work on, both to have a general idea of what the system is and how it's supposed to work, as well as for documenting progress; nothing kills motivation like the feeling you're not actually advancing towards your goal. Besides, we control the requirements in this case and, even though waves are fickle, the global conveyor belt is still a thing. Much like with that overly elaborate metaphor, my goal is not to describe how every single detail in the application is supposed to work, but more a general description of each functionality, and general details on how to implement it.

With that in mind, I'll show how I approached turning that requirements list written in the "as a user" format into stories that developers can actually work on. I'll forgo things like "story points" or "acceptance criteria" because not only are those restrictive, I'm doing this as a developer for developers—all two of us—and what I care about is what to do and how to do it. And that's also the reason I won't be using stuff like certain project management tool that most software development teams are familiar with whose name I won't mention, at the risk of summoning giant lizards.¹

In fact, this is a good point to mention sourcehut, Drew DeVault's cool set of tools for creating software that's very reminiscent of how the biggest open source projects are maintained. I'm going to be creating these stories using sourcehut's todo, which is to say: simple issues.

Without further ado, let's take one of the requirements as example, one that is universal enough that I can use without risking IP: user accounts. This is an abbreviation of my friend's requirement:

As a user I can create an account in the application, with an email or using social networks (an email should be sent introducing the platform upon registration).

As mentioned, my goal with the stories is knowing what to do and how to do it, from a developer's perspective. Part by part, this is the architecture² that this requirement defines:

The application has a domain, and this domain includes the entity User because that's what we want to create.
The application has some sort of data layer that the application interfaces with to store this User data.
The application has adapters that interface with third parties to retrieve the user data, social networks in this case.
The application has the use case "create user" that is called in two different ways: using email or using the adapters mentioned above.
The application has an interface that the user employs to send the email and other data, or to trigger the retrieval from third parties.
The application has a second use case, which is to send an automated email to the, well, email provided by the user upon successful creation.

With that rough outline, we have an idea for two or three stories, because at this stage is better to restrict stories to the number of use cases, or to the number of times all use cases are instantiated across the application.³ Of course, stories ultimately can involve editing use cases too; the point is that we should make the stories about the application business rules whenever possible.

The stories mentioned here are deliberately vague on the tech stack because I want this to be applicable for as many developers as possible.

Create User from email: Create an Use Case that accepts raw data as well as a data repository⁴ and creates an instance of the User domain model using the raw data. It then passes this data to the repository for creation.; Create an interface adapter⁵ that receives data (including email) submitted by an user and passes it to the aforementioned Use Case, alongside the database repository.; Create a view⁶ that allows the user to submit this data to the adapter.⁷
Create User from third party: Create an interface adapter that gathers data (including email) about an user sent from third parties⁸. It passes this data to the create user Use Case alongside the database repository.⁹; Create a view that allows the user to go through the third party communication cycle.
Send email on user creation: Edit Create User Use case so that, upon successful storage, calls a new Use Case.; This new Use Case is in charge of sending the body and addressee of an email to an email sender interface adapter.¹⁰; Create an interface adapter that uses the data produced by the use case and sends it to an external queue¹¹, in the form of a message, that should send the email.

These stories are somewhat vague, and that is deliberate. I don't want to restrict—and not only because I would be restricting myself—and details of implementation are what code reviews and tests are for. The one exception I make on not specifying implementation is regarding the tech stack itself: it's such a big decision that all developers are benefitted if the stack is clear¹².

The stories should only define a rough end goal, and I believe the ones I wrote here achieve that. Anyhow, I feel that this is a good first step and a general description of how we'll be working on this project.

This is going to be part of a series, an idea born from a great article on blogging that I read the other day, which inspired me to write more and gave me the idea of how to actually write: I'm going to push myself into working on this application (and refactoring an older one, with another friend) so that I have material to write for this blog, and I can use the will to write for the blog as incentive to work on those projects, killing two birds with one stone.

IMPORTANT: Everything I'm writing in these series is my interpretation and general idea of a good process. Anything (or everything) I write might be entirely wrong and, in such event, I encourage you to correct me in the comments.

Just FYI, I don't hate it, it can be a great tool, as long as it isn't drowned in the bastardization of scrum, which I don't hate either... as long as it's used as a guideline instead of a forced two-week waterfall grind. ↩
I'm trying to describe this within the terms of the clean architecture. Results may vary. ↩
And debatably at any stage. I've seen projects where stories are created for everything, even changing the color of a button to a slightly differen shade of blue. YMMV on advantages and disadvantages of that practice. ↩
I'm a postgreSQL user. ↩
Probably a function that would retrieve the data in a given format, create a domain entity object from it that would then pass this object into the database adapter, probably a SQLAlchemy model. This function itself would be called from a flask endpoint. ↩
I've worked as a full stack web developer almost exclusively, so in here I'm thinking of a form, created either from a JS framework or just a simple web form. ↩
Of course, nobody is forced to use a client-server system, but it's what I'll use. ↩
Probably something like flask-dance. ↩
Remember not to pollute the business rules or the entities with data or steps specific to any third party. ↩
Remember not to break flow of control, these business rules don't care how the database signals success or how the sender adapter sends the message. But that's an implementation detail that should be discussed in review. ↩
Probably rabbitmq through Celery. ↩
Everything beyond that (meaning how the stack is used) is outside the scope of the story. That's for tests and reviews. ↩

Oracle in Docker

2017-11-13T00:00:00-05:00

A while ago, I had to work in a project that used oracle as its data layer (yeah, I know...). When we started, there was no such thing as an Oracle docker image so the development environment was either set-up manually or using bash scripts. I tried to create images but, first, it was hell and, second, I didn't want to bother with any license breach. I love bash and I'm often scripting away repetitive stuff but I am way too used to docker for my development environments (and also for deploying and in production); as such, it can be said that, whenever I had to rebuild the environment from scratch (and since a migration was being made towards data warehouses, that was more often than usual), I cursed my days.

Thankfully, by the time we were finishing and regressions were becoming more and more expected, Oracle released official images to the docker store. I didn't waste time and, with some effort as the documentation was quite sparse, I managed to set them up locally and turned the bash scripts (and some plain text instructions) and other requirements into a docker-compose file. This short guide is about duplicating the process (well, the Oracle part).

Initial steps

First of all, create an account in the docker store if you don't have one already.

Next, login with your account in the docker console, using the command docker login.

Getting Oracle

With that set up, head over to the oracle enterprise page in the docker store and click in the button that says "Proceed to Checkout".

At this point, fill the information requested and accept the terms, the process is similar to the one Oracle has for downloading the client and databases from their website. They require it here too because this is Oracle.

Now you can pull the docker image: docker pull store/oracle/database-enterprise:12.2.0.1. It'll take a while.

Using Oracle

At this point you're probably in the instructions page, which is now far more detailed than it was when the images were released, lucky you. They are relatively easy to follow but I'll write the last few commands required to use the image here anyway.

To start the image, run the command:

docker run -d --name <db-container-name> \
  store/oracle/database-enterprise:12.2.0.1

To connect to the database using Oracle's sqlplus client, use the following command:

docker exec -it <db-container-name> bash -c \
  "source /home/oracle/.bashrc; sqlplus /nolog"

Some options

Setting the DB_SID environment variable changes the name of the database. Default is ORCLCDB.
The port 1521 can be mapped so that the container can be accessed from the host. It can also, of course, be linked or set up in a network with other containers.
The data can be separated in a volume, the directory to be mapped is /ORCL.
Remember to change the password of the sys user (default is Oradoc_db1). This probably should be done in a Dockerfile that uses this image as base.
There's a smaller image (store/oracle/database-enterprise:12.2.0.1-slim) whose Oracle installation has fewer options and tools. This is what I'd probably use if I have to work with Oracle again.

And that's it for now. If you have any problems or corrections, let me know in the comments!

Continuous delivery with Gitlab

2016-10-20T00:00:00-05:00

When I worked full-time in unique projects, writing the code and tests before running everything manually and then deploying to staging or production, also manually, was good enough. Now, however, with the potential to work in several and vastly different projects and environments, this "process" has become increasingly tedious; as such, as I've become more interested in operations since I started working with Docker, I tasked myself with automating this.

After a while of trying different tools, and the impossibility to work with others due to the limited income of a recent freelancer, I've settled in a process that I believe will suit me just fine, thanks to GitLab.com's all-around awesomeness. It beats the other offerings I considered by a margin:

Over GitHub, GitLab has private repositories, something clients will want, in its free tier and their built-in CI.
Over Google Cloud Repositories, GitLab has their integrated CI.¹
Over Heroku (which I was reticent about anyway for different reasons), GitLab CI is far less opinionated and offers more freedom in setting up the delivery process while allowing itself to deploy different types of applications more easily.
Over dedicated automation tools such as Jenkins or Buildbot, GitLab CI has the advantage of being simpler and straightforward. It might not be as maneuverable, but I believe what it offers is more than I need.

Finally, GitLab has the advantage that its CI service is fully integrated and out-of-the-box with the git repositories, along with other useful or potentially useful features. One of these is that it works extremely well with docker, which I already use for local development.

My development process is now, roughly:

Running docker containers locally.
Using git hooks (set in place using bash scripts) to trigger tests.
Pushing to GitLab, where their CI will take charge of running tests again and, on success, pushing to the defined destination.

Right now, I'm pushing to Google App Engine, whose free tier, despite their lacking repositories², is still the best option for me.

But enough of introduction, let's get on with the guide:

Preparation

We need:

An account in GitLab.com or one's own GitLab server. I believe a private GitLab server would work too but I've only tested this on the website.
An account in Google Cloud. Or adjust the GAE-specific steps to your vendor of choice.
Docker, docker-compose, and git installed locally. Some familiarity with git might be required.

GAE Setup

(If you're already familiar with creating a project in Google App Engine or use a different vendor, skip ahead to GitLab setup).

In the cloud console, create a project:

Fill in the name you want (I called mine "gitlab-test").

For all of the following steps, remember the project ID that was returned upon creation.

Afterwards, go to IAM & Admin:

Once there, click on "Service Accounts":

Then click on "CREATE SERVICE ACCOUNT" and fill in the form that pops up like so:

After clicking "CREATE", this will download a json file that contains the key that GitLab CI needs to connect to GAE.

Now we need to enable the two APIs required to deploy to app engine remotely. There's a straightforward way that is merely clicking a link but, since I wouldn't trust it myself if I didn't see it firsthand, I won't expect you to trust it either. So the slightly longer way it is:

In the google cloud console sidebar, click on API manager:

Once in the API manager, click in "ENABLE API".

In the library that opens there's a search box, type "app engine admin api" there and click in the first result.

Once there, click in "ENABLE".

Now just repeat this process (API Manager > Enable API > Search > Enable) for "Google Cloud Storage".

GitLab setup

(If you're already familiar with creating a GitLab project and setting up project variables, go straight to Code).

First of all, create a project by clicking on the "New Project" button:

Fill up the "new project" form using the settings and name you want:

After sending the form, click on the project settings menu and select "Variables":

Set the GAE_PROJECT variable with the id of your Google Cloud Project:

Afterwards, set a new variable named GAE_KEY, whose value must be the contents of the json file we downloaded earlier from GAE. Delete the json file as it could be dangerous to have it lying around.

These variables might not be needed if the process to deploy to your vendor doesn't require authentication or there are other ways of authenticating.

Code

The app we're deploying will be a simple "Hello World" in Flask with the following structure:

app
| - app.yaml
| - docker-compose.yml
| - .gitlab-ci.yml
| - app
|   | - __init__.py
|   | - app.py
|   | - test.py
|   | - Dockerfile
|   | - requirements.txt

app.py is within a module and not in the root folder (which would be simpler) for ease of deployment to Google App Engine. This is its code:

from flask import Flask

app = Flask(__name__)

@app.route('/')
def hello():
    return 'Hello World!'

if __name__ == "__main__":
    app.run(debug=True)

test.py, as its name indicates, it's just a very simple unit test for app.py, as an example:

import unittest

from app import app

class Test(unittest.TestCase):
  def test(self):
    result = app.test_client().get('/')

    self.assertEqual(
      result.data.decode('utf-8'),
      'Hello World!'
    )

__init__.py has the GAE path setup:

import os, sys

lib_path = os.path.join(
  os.path.abspath(os.path.dirname(__file__)),
  'lib'
)
sys.path.insert(0, lib_path)

from .app import app

if __name__ == "__main__":
    app.run()

The app only has one dependency, flask, and that single word³ is the content of requirements.txt.

Now the stuff this guide is meant to be about. First the Dockerfile:

FROM python:latest

ADD requirements.txt /

RUN pip install -r requirements.txt

ADD . /code

WORKDIR /code

CMD ["python","-m", "unittest", "discover"]

Simple enough: from the python image install the requirements and run the test.

docker-compose.yml is very simple too:

app:
    build: app

Build and run what's in the app folder

Now what allows GitLab to perform its magic, the .gitlab-ci.yml file:

back:
  image: python
  stage: build
  script:
    - >-
        pip install -t app/lib
        -r app/requirements.txt
    - export PYTHONPATH=$PWD/app/lib:$PYTHONPATH
    - python -m unittest discover
  artifacts:
    paths:
      - app/lib/

deploy_production:
  image: google/cloud-sdk
  stage: deploy
  environment: production
  script:
    - echo $GAE_KEY > /tmp/key.json
    - gcloud config set project $GAE_PROJECT
    - >-
        gcloud
        auth activate-service-account
        --key-file /tmp/key.json
    - gcloud --quiet app deploy
  after_script:
    - rm /tmp/key.json

There are a couple things happening here, but nothing overly complicated:

In the build stage, run the python docker image, install the requirements locally in a folder called lib, run the tests and then make the lib folder available for next stages.
In the deployment stage... deploy the app to GAE (Adjust the commands for your vendor of choice).⁴

As you can see, to deploy we are using the variables (GAE_KEY and GAE_PROJECT) we set in the previous section.

Finally, app.yaml, which is specific to GAE:

runtime: python27
threadsafe: true

handlers:
  - url: /
    script: app.app

This uses the module structure so it can use the external libraries (flask) in the project.

Deployment

There's not much to this, just run docker-compose up and wait for the OK or possible errors. If there's nothing wrong, then we're ready to deploy. The Dockerfile and the docker-compose.yml file can be tweaked to actually run the server or perform any other task one might need.

In the root folder of our app, initialize git and add the repository URL of the gitlab project as remote. Then you only have to push the code and, after a few minutes, check the url [your-project-id].appspot.com and the "Hello World!" should be staring right back at you.

Conclusion

So that's it! We've deployed our app to GAE using GitLab. From then on, you can just dedicate yourself to writing the code and its tests. To deploy (to production or staging or any environment you choose), you just need to push and this process will take care testing and delivering the code if there aren't any errors.

Things we could do now is setting up automatic local testing on each commit, multiple stages and notifications for failed and successful builds, etc.

If you have any questions, let me know in the comments.

It amazes me how cloud repositories is almost completely isolated from all other Google Cloud services. They used to have a Push-to-Deploy feature but that's gone (if it isn't, it must be really well-hidden now because I spent days reading documentation, forums and question threads about this) and now they suggest setting up one's own continuous integration service. I can't imagine why they did that and, again, I'd rather not risk being charged for running their recommended Jenkins setup. ↩
Indeed, if their cloud repositories were integrated with their cloud platform, I might have never bothered to look into GitLab. A good thing in hindsight, all things considered. ↩
Not versioning your dependencies is, of course, not recommended. ↩
The script for the deploy_production stage in .gitlab-ci.yml is partly based on the one in this cool post by Dennis Alund. ↩

The Not-Invented-Here syndrome

2016-06-25T00:00:00-05:00

I read a while ago about the NIH syndrome and how it's generally not recommended because it unnecessarily increases the workload and the amount of code that needs to be maintained. Not to mention that using existing libraries or frameworks, specially open source ones, can also eventually involve helping the community and, thus, improving the code for everyone.

So, the recommendation is generally reusing as much code as possible, hopefully keeping the amount of original code reduced to the actual business logic of the project at hand. I think this is sound, but there's something to be said about producing in-house code and reinventing the wheel a little.

In my last official project, where I worked as a backend developer for a startup, we did use frameworks for all of backend, frontend and presentation, as well as several plugins for the frameworks to avoid increasing the workload too much, but we also wrote a lot of code that we might have found in existing libraries if we looked.

For a lot of the REST API, for example, I wrote all of the entrypoints and callback logic. I know now that proper usage of Flask-Restful (flask is, of course, my favorite framework) could have saved me a lot of work in that area... But I don't really regret it, I can say a learned a whole lot because I've always been a bit of a hands-on learner.

Of course, I've also seen first-hand that doing everything in-house can and does get out of control and, after a while, it becomes almost impossible for the handful of developers of a startup to maintain all that code.

In the project I'm working now, thanks to what I learned from writing a lot of my own code, it's been easier for me to research libraries and decide what would be a better fit as well as recognizing where I really do need to write; part of the reason I didn't reuse as much as I could in that other project was overestimating what was actual business logic and what were mere building blocks.

Given the chance, I'll probably refactor all that code and use Flask-Restful or similar to simplify it and make it more easily maintainable; but every learning opportunity is a good opportunity so I'm glad I went more zealous in the first go in that project.

What I'm trying to say is yes, one should avoid the NIH syndrome, surf the community, reuse stuff that hundreds if not thousands of people have polished (the more eyes, the better) and prevent getting the codebase from getting out of control due to reinventing the wheel. But one should also tackle at least one project where one writes as much code as possible, it highlights the importance of reusing the code in later projects, one gets first-hand experience on what leads people to write such libraries in the first place and, in general, one learns how the kind of projects one is working on generally work.

It goes without saying, of course, that doing such a thing is really only beneficial early in one's career. I see no reason for reinventing the wheel once one is already an experimented developer. Of course, someone more experienced than me could probably tell me otherwise.

In a different but related matter, in this website I've tried to avoid using frameworks of any kind; opting for more hands-on code. Just like doing it once helps to learn, I think that keeping a side, personal, project for practice keeps one from forgetting the basics. This site is, thus, my sandbox in a way, helping me practice HTML and Jinja templating (through the Pelican blog), and LESS and CSS for the themes.

Hello World!

2016-03-14T00:00:00-05:00

I'm a Software Developer and, as such, the first entry of my blog is to be a Hello World page, which I now proceed to write in Python...

print("Ahoy!")

... and browser ECMAScript (or JavaScript if you prefer, which I don't).

console.log("Good Day.")

So... that's it for today.

UPDATE: I decided to dedicate my site mostly to Software after all.