PDF Splitter with Regex-Based Text Pattern Matching
The Split PDF converter: https://www.convertapi.com/pdf-to-split allows a single PDF document to be divided into several smaller ones. While converter already supported splitting by SplitByPattern
, SplitByCustomRange
, and ExtractPages
, a new parameter has been introduced: SplitByTextPattern
.
SplitByTextPattern
enables PDF splitting by identifying text segments that match a specific regex pattern. This feature divides the document into separate files at each occurrence of the specified pattern, making it ideal for segmenting content based on recurring headings, section labels, or custom text markers.
For example, setting the pattern Chapter \d+:
splits the PDF at each chapter title, such as Chapter 1:, Chapter 2. and avoid splitting by chapters that do not have number.
Example PDF:
Result of this split will be 3 PDF documents: