SOAR architecture image

Self-Improving Language Models for Evolutionary Program Synthesis: A Case Study on ARC-AGI

🤗 Hugging Face (data and model) | 📑 Paper | 📑 Blog

Large Language Models (LLMs) have become incredibly powerful, but they often hit a wall when faced with truly complex reasoning tasks that require discovering a solution from scratch. Simply throwing more computing power at the problem or using a bigger model often yields diminishing returns. But what if a model could learn from its own experience, getting smarter with every attempt? ...

May 14, 2025 · 5 min · Julien Pourcel
ACES architecture image

Generating a Diversity of Challenging Programming Puzzles with Autotelic Generative Models

Introduction

Human intelligence is marked not just by the ability to solve problems, but by the creative act of inventing them. Automating the generation of novel, diverse, and challenging problems has wide-ranging applications, from personalized education to robust benchmarking of AI systems. The ACES (Autotelic CodE Search) framework, accepted as a Spotlight Poster 💫 at NeurIPS 2024, introduces a principled method for generating Python programming puzzles that are both difficult and semantically varied, pushing the boundaries of what current generative models can achieve alone. ...

May 14, 2025 · 7 min · Julien Pourcel