Comparing algorithms for extracting content from web pages
This study pits 14 open-source main content extractors against each other and arrives at a somewhat surprising conclusion.
This study pits 14 open-source main content extractors against each other and arrives at a somewhat surprising conclusion.
I went to summer school at Two Point Campus, the spiritual successor to Theme Hospital’s spiritual successor Two Point Hospital.
Sturgeon’s law states that ninety percent of everything is crap. It’s mostly true, except in this post where absolutely everything is crap.
Many people learn Python as their first or second programming language, but only few people bother to master it.
Three “fun” ways to throw off fellow PHP developers (and the “fractal of bad design” article isn’t one of them).
Flaky tests can cause CI builds to fail unexpectedly, and should be fixed as quickly as possible. This study shows why.
It’s easy to write Dockerfiles that work, but also to write Dockerfiles that suck. Here are some tips and tricks for writing better Dockerfiles.
Helm charts are re-usable packages for Kubernetes resources. They are easy to share and use, but this comes at a price.
I’m in the market for a tool that can help me analyse logs, traces, and metrics, and I was hoping that this paper could help me pick one.
Fonts in macOS look different from text that’s been rendered in Windows, mostly due to different philosophies about font rendering.
Spamming trains on Slack is likely to make some co-workers very happy and others very annoyed.
I built a completely useless Chrome extension that encourages people to complete their JIRA tickets (so I don’t have to).
End-to-end tests can help you discover problems in web applications, but sadly are not free of problems themselves.
Supersharers managed to reach 5.2% of registered voters. In contrast, Russia’s 2016 campaign only reached 3.4% of voters.
Although predatory journals may sound scary, they are actually quite friendly (and after your money).
Dat Utrecht Centraal druk is weten we allemaal wel. Gelukkig zijn er genoeg andere domme feitjes waarmee je mensen kunt lastigvallen.
I built a tool for the handful of people who listen to NPO radio stations, use Last.fm and know how to use the command line.
Hoe vind je je droombaan? In dit artikel vertel ik je hoe ik zelf (als softwareontwikkelaar) meestal te werk ga.
China is one of the world’s most populous countries, which means it also has some of the largest cities on the planet.
Depending on where you’re from, public transport in Hong Kong either offers a glimpse of the future or is stuck in the past.
Japan is good at inventing things, but in these five cases it was actually just good at adopting things.
Red Alert 2 has been available on Steam since March this year, but sadly the 24-year old RTS game no longer works out of the box.
Final Fantasy VIII is a classic JRPG in which everyone (including its developer) makes plenty of questionable decisions.
LEGO City Undercover is an underrated action-adventure game where you get to play an undercover cop in a major city.
Alright is a Google Chrome extension that automatically turns JIRA (and other) references in GitHub pull request titles into hyperlinks.
This blog provides a concise summary of the Scrum framework for anyone who needs a refresher or is just getting started.
Why an ACM membership is totally worth the money, even if you’re not an academic.
This page keeps track of my humble LEGO collection that I (passively) built up over the past decades.
I’ve published 250 articles on this website. All that work and what did it get me? Why did I do it? Let’s see if the page views justify it!
Another year, another retrospective, and also another excuse to discuss the past, the present, and the future.
What’s better than a crappy train travel planner that runs on my machine? A crappy train travel planner that runs on YOUR machine!
If all you have is CSS, everything looks like boxes and pseudo-elements.
Yet another blog post where I create an unmaintainable mess using our favourite yet very inadequate programming language, SQL.