AGI as the Best Gamer

Games are fun. Humans love watching a variety of them from NFL to League of Legends to Mahjong. The competitive nature of games is enticing and the infinite strategies help keep them fresh. These factors also make them great testing grounds for AI. For example, recent work from the same group behind the ARC-AGI challenge pits LLMs against each other in games of snake to find out which is the best. In this article, let’s discuss their findings along with other work towards using competitive games to evaluate AI.

Read More

How to Evaluate Reasoning Capabilities

Reasoning is a trait that has long been lauded among humanity. Among many other capabilities, it allows us to take seemingly disparate ideas and combine them together in interesting ways to form new ideas. It’s something that is often sort out for in the workplace, ideal among friends, and now a trait that we wish to see in our programs. But how can you evaluate reasoning?

Read More

What is Reasoning?

There’s a lot of talk about whether LLM’s can reason or not. With the release of OpenAI’s o1 and now the upcoming release of o3 which are touted to be strong reasoners, it seems we’re getting close to ‘reasoning capabilities’, but what does that mean? Let’s try to debug what reasoning is.

Read More

Learning Efficiently

Learning has been the centre of the field of AI since its inception, however its meaning and focus has continued to evolve over time. An effective AGI must be able to learn effectively from its environment. In this post, let’s explore the different ways that systems can learn.

Read More