Summary of the Structure of PDF files
Summary of the Structure of PDF files PDF can be looked upon as a combination of different file types presented in a single container. The reason for this is that a PDF file contains Text, vector art, images, fonts and other file format can be embedded - even the native files that were used to create the PDF in the first place. An object orientated file format with were items can be connected directly or indirectly to each other. The objects within a PDF file can be divided into the following types: Dictionaries A group containing direct or references to indirect objects. Dictionaries can be seen as the glue holding together the elements in a PDF files. The example below shows the structure of a typical page dictionary: The Contents stream has an attributes dictionary that contains a filter name and the length of the stream The CropBox array contains the coordinates of the rectangle that defines the area that is visible on the page. The MediaBox array contains the coordinates of the r