Show HN: CellARC: Measuring Intelligence with Cellular Automata

CellARC is a synthetic benchmark for abstraction and reasoning built from multicolor 1D cellular automata (CA). Each episode consists of five support pairs and one query, serialized in 256 tokens. This compact format enables rapid iteration with small models while exposing a controllable task space with explicit knobs for alphabet size k, radius r, rule family, Langton’s lambda, query coverage, and cell entropy. We release 95k training episodes plus two 1k test splits (interpolation and extrapolation) and evaluate symbolic, recurrent, convolutional, transformer, recursive, and LLM baselines. CellARC decouples generalization from anthropomorphic priors, supports unlimited difficulty-controlled sampling, and enables reproducible studies of how quickly models infer new rules under tight budgets.
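To make the knobs above concrete, here is a minimal Python sketch of how an episode of this general shape could be sampled from a 1D CA. It is not the CellARC generator: the function names (`random_rule`, `langton_lambda`, `step`, `make_episode`), the periodic boundary, the choice of state 0 as the quiescent state, the single update step per pair, and the dictionary layout are all assumptions made for illustration. The actual sampling procedure and token serialization are defined in the linked paper and code.

```python
import itertools
import random


def random_rule(k, r, lam, rng):
    """Sample a rule table over all neighborhoods of width 2r+1.

    Each neighborhood maps to a non-quiescent state (1..k-1) with
    probability lam, otherwise to the quiescent state 0 (an assumption).
    """
    table = {}
    for nbhd in itertools.product(range(k), repeat=2 * r + 1):
        table[nbhd] = rng.randrange(1, k) if rng.random() < lam else 0
    return table


def langton_lambda(table):
    """Fraction of rule-table entries that map to a non-quiescent state."""
    return sum(1 for v in table.values() if v != 0) / len(table)


def step(state, table, r):
    """Apply one synchronous update with a periodic (wrap-around) boundary."""
    n = len(state)
    return [
        table[tuple(state[(i + d) % n] for d in range(-r, r + 1))]
        for i in range(n)
    ]


def make_episode(k=3, r=1, lam=0.5, width=16, n_support=5, seed=0):
    """Build n_support input/output pairs plus one held-out query (hypothetical layout)."""
    rng = random.Random(seed)
    table = random_rule(k, r, lam, rng)
    pairs = []
    for _ in range(n_support + 1):
        x = [rng.randrange(k) for _ in range(width)]
        pairs.append((x, step(x, table, r)))
    query, target = pairs[-1]
    return {
        "support": pairs[:n_support],
        "query": query,
        "target": target,
        "lambda": langton_lambda(table),
    }
```

Calling `make_episode(k=4, r=2, lam=0.3)` would draw a rule over a larger alphabet and wider neighborhood. A solver sees only `support` and `query` and must predict `target`, which is the rule-inference setting the abstract describes.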

Paper: https://arxiv.org/abs/2511.07908
Code: https://github.com/mireklzicar/cellarc
Baselines: https://github.com/mireklzicar/cellarc_baselines
Dataset: https://huggingface.co/datasets/mireklzicar/cellarc_100k
Web & Leaderboard: https://cellarc.mireklzicar.com/


Comments URL: https://news.ycombinator.com/item?id=45897072

Source: arxiv.org
