RAG Playground
Explore each step of the RAG pipeline through interactive visualizations
Text Splitting
Visualize how documents are split into meaningful chunks while preserving semantic coherence
- Fixed Character strategy: Simple uniform segmentation based on predetermined character length. Best for prototyping and resource-constrained environments with minimal computational overhead.
- Recursive character strategy: Multi-tier algorithm that preserves natural language boundaries and semantic coherence. Recommended for applications requiring semantic integrity.
- Parent-Child character strategy: Dual-tier architecture using fine-grained chunks for precise matching while maintaining parent documents for rich contextual information. Optimal for complex applications requiring balanced accuracy and completeness.
When merging and splitting segments, some segments themselves exceed the chunk size in length, or the splitting and reassembly logic causes the combined length to exceed the set value, resulting in over-limit chunks.
¶¶
¶
␣
Sample text from AWS Documentation
Hover over chunks to highlight their position in the source document
Chunks:0
Avg. Size:0 chars
With Overlap:0
Avg. Overlap:0 chars