The DeepSeek-R1 Training Pipeline

April 17, 2025

DeepSeek’s R1 had it’s time in the spotlight as a strong reasoning model that came ‘out of nowhere’. One of the highlights of the model was that it was released publicly, including both the training process and weights. However, one thing lacking from the paper was an overview of the pipeline. Unsurprisingly, there are a few steps involved to produce such great results.

Why we can't talk to dogs, yet

April 5, 2025

Wouldn’t it be great if you could communicate directly with your dog? If you could ask him why he bit your furniture, or just understand what he’s barking about? While research has tried address this in the past, the problem is still far from solved, and potentially unsolvable. Let’s see why.

LLM Readability as a Tool

March 30, 2025

When training multilingual models a common problem is language mixing or code switching, in which models may respond in multiple languages when we would expect them to use just one. This can also happen in reasoning models, such as DeepSeek-R1. In their paper, they found that

Matching Robots

March 19, 2025

Sometimes when reading books it can be hard to get a grasp for a problem in practice. This video popped up on IG showcasing one of the important principles of multi-agent systems.

Learning the Sine Function with Polynomials

March 6, 2025

Why Autonomous Software?

March 2, 2025

With “agents” the hot topic of 2025, we should take a step back and ask ourselves, why do we want autonomous software? Let’s explore some reasons we’d want agents making decisions themselves.