Anyone used Reducto for parsing? How good is their embedding-aware chunking?

Curious if anyone here has used Reducto for document parsing or retrieval pipelines.

They seem to focus on generating LLM-ready chunks using a mix of vision-language models and what they call “embedding-optimized” or intelligent chunking. The idea is that the parser preserves document layout and meaning (tables, figures, etc.) before the chunks are embedded for RAG or vector search systems.
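
For anyone unfamiliar with the term, here is my rough mental model of layout-aware chunking as a minimal Python sketch. The `Element` type, the `kind` labels, and `layout_aware_chunks` are all hypothetical names I made up for illustration; none of this is Reducto's actual API:

```python
from dataclasses import dataclass

# Hypothetical typed layout element a parser might emit; not Reducto's API.
@dataclass
class Element:
    kind: str   # e.g. "paragraph", "table", "figure_caption"
    text: str

def layout_aware_chunks(elements: list[Element], max_chars: int = 1200) -> list[str]:
    """Merge adjacent paragraphs up to a size budget, but keep tables and
    figure captions atomic so a table is never split across two embeddings."""
    chunks: list[str] = []
    buf = ""
    for el in elements:
        if el.kind in ("table", "figure_caption"):
            if buf:                      # flush any accumulated prose first
                chunks.append(buf)
                buf = ""
            chunks.append(el.text)       # layout-bearing element stays whole
        elif len(buf) + len(el.text) + 1 <= max_chars:
            buf = f"{buf}\n{el.text}".strip()
        else:
            if buf:
                chunks.append(buf)
            buf = el.text
    if buf:
        chunks.append(buf)
    return chunks
```

The point of the sketch is just the contrast with naive fixed-size splitting: a table or caption is embedded as one unit instead of being cut mid-row.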

I’m mostly wondering how this works in practice:

– Does their “embedding-aware” chunking noticeably improve retrieval or reduce hallucinations?

– Did you still need to run additional preprocessing or custom chunking on top of it?

Would appreciate hearing from anyone who’s tried it in production or at scale.
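
If anyone wants to quantify the first question, the comparison I'd run myself is a small recall@k harness over the same query set with two different chunkings. A minimal sketch below; `embed` here is a deterministic toy hashed bag-of-words vector standing in for whatever real embedding model you use, and `recall_at_k` is my own helper, not anything from a vendor SDK:

```python
import math
import zlib
from collections import Counter

def embed(text: str, dim: int = 256) -> list[float]:
    """Toy hashed bag-of-words vector; swap in a real embedding model."""
    v = [0.0] * dim
    for tok, n in Counter(text.lower().split()).items():
        v[zlib.crc32(tok.encode()) % dim] += n
    norm = math.sqrt(sum(x * x for x in v)) or 1.0
    return [x / norm for x in v]

def recall_at_k(chunks: list[str], queries: list[tuple[str, str]], k: int = 5) -> float:
    """queries = (query_text, substring that marks a relevant chunk)."""
    embs = [embed(c) for c in chunks]
    hits = 0
    for q, expected in queries:
        qe = embed(q)
        ranked = sorted(range(len(chunks)),
                        key=lambda i: -sum(a * b for a, b in zip(qe, embs[i])))
        hits += any(expected in chunks[i] for i in ranked[:k])
    return hits / len(queries)

# Run the same queries against both chunkings and compare the two scores:
# recall_at_k(naive_fixed_size_chunks, queries) vs. recall_at_k(vendor_chunks, queries)
```

If the vendor chunking is doing its job, I'd expect the gap to show up mostly on queries that hit tables and figures.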


Comments URL: https://news.ycombinator.com/item?id=45703569

Points: 1

# Comments: 0

Source: news.ycombinator.com
