Show HN: CellARC Measuring Intelligence with Cellular Automata

November 12, 2025

Share This Post

CellARC, a synthetic benchmark for abstraction and reasoning is built from multicolor 1D cellular automata (CA). Each episode has five support pairs and one query serialized in 256 tokens, enabling rapid iteration with small models while exposing a controllable task space with explicit knobs for alphabet size k, radius r, rule family, Langton’s lambda, query coverage, and cell entropy. We release 95k training episodes plus two 1k test splits (interpolation/extrapolation) and evaluate symbolic, recurrent, convolutional, transformer, recursive, and LLM baselines. CellARC decouples generalization from anthropomorphic priors, supports unlimited difficulty-controlled sampling, and enables reproducible studies of how quickly models infer new rules under tight budgets.

Paper: https://arxiv.org/abs/2511.07908
Code: https://github.com/mireklzicar/cellarc
Baselines: https://github.com/mireklzicar/cellarc_baselines
Dataset: https://huggingface.co/datasets/mireklzicar/cellarc_100k
Web & Leaderboard: https://cellarc.mireklzicar.com/

Comments URL: https://news.ycombinator.com/item?id=45897072

Points: 1

# Comments: 0

Source: arxiv.org

Subscribe To Our Newsletter

Get updates and learn from the best

More To Explore

Yt-dlp: External JavaScript runtime now required for full YouTube support

Article URL: https://github.com/yt-dlp/yt-dlp/issues/15012 Comments URL: https://news.ycombinator.com/item?id=45898407 Points: 1 # Comments: 0 Source: github.com

November 12, 2025

Windows Securitym Hackers Feeds

Security issues discovered in sudo-rs

Article URL: https://lists.debian.org/debian-security-announce/2025/msg00218.html Comments URL: https://news.ycombinator.com/item?id=45898377 Points: 2 # Comments: 0 Source: lists.debian.org

November 12, 2025

IT Support

Hosting & Email

Cloud Solutions

Cyber Security

Telephone & Internet