Why Autonomous Software?

With “agents” the hot topic of 2025, we should take a step back and ask ourselves: why do we want autonomous software? Let’s explore some reasons we’d want agents to make decisions on their own.

Read More

AGI as the Best Gamer

Games are fun. Humans love watching a variety of them, from the NFL to League of Legends to Mahjong. Their competitive nature is enticing, and the endless range of strategies keeps them fresh. These factors also make them great testing grounds for AI. For example, recent work from the same group behind the ARC-AGI challenge pits LLMs against each other in games of Snake to find out which is the best. In this article, let’s discuss that group’s findings along with other work on using competitive games to evaluate AI.

Read More

How to Evaluate Reasoning Capabilities

Reasoning is a trait that humanity has long lauded. Among many other capabilities, it allows us to take seemingly disparate ideas and combine them in interesting ways to form new ones. It’s something that is often sought after in the workplace, valued among friends, and now a trait that we wish to see in our programs. But how can you evaluate reasoning?

Read More