The Apache FOP Project

Apache™ FOP: Graphics Formats

Introduction

After the Apache™ FOP 0.94 release, the image handling subsystem was rewritten to improve the range of supported images and image subtypes, lower the overall memory consumption when handling images, produce smaller output files, and increase performance in certain areas. This brings a few changes, most of which users will probably not notice. The most important changes are:

The actual image loading framework no longer resides in Apache FOP, but was instead placed in XML Graphics Commons.

Overview of Graphics Support

The table below summarizes the theoretical support for graphical formats within FOP. In other words, within the limitations listed here, these formats should work. However, many of them have not been tested, and there may be limitations that have not yet been discovered or documented. The packages needed to support some formats are not included in the FOP distribution and must be installed separately. Follow the links in the "Support Through" columns for more details.

| Format | Type | Support Through Apache FOP (native) | Support Through Apache Batik | Support Through Image I/O |
|---|---|---|---|---|
| BMP (Microsoft Windows Bitmap) | bitmap | | | X [1] |
| EMF (Windows Enhanced Metafile) | vector (with embedded bitmaps) | (X) | | |
| EPS (Encapsulated PostScript) | metafile (both bitmap and vector), most frequently used for vector drawings | (X) | | |
| GIF (Graphics Interchange Format) | bitmap | | | X |
| JPEG (Joint Photographic Experts Group) | bitmap | (X) | | X |
| PNG (Portable Network Graphics) | bitmap | | | X |
| SVG (Scalable Vector Graphics) | vector (with embedded bitmaps) | | X | |
| TIFF (Tagged Image File Format) | bitmap | (X) | | X [1] |
| WMF (Windows Metafile) | vector (with embedded bitmaps) | | (X) | |

Legend:

Map of supported image formats by output format

Not all image formats are supported for all output formats! For example, while you can use EPS (Encapsulated PostScript) files when you generate PostScript output, EPS is not supported by any other output format. Here's an overview of which image formats are supported by which output format:

| Image Format | PDF | PostScript | Java2D, PNG, TIFF, AWT | PCL | AFP | RTF |
|---|---|---|---|---|---|---|
| BMP (Microsoft Windows Bitmap) | X | X | X | X | X | X |
| EMF (Windows Enhanced Metafile) | | | | | | X [1] |
| EPS (Encapsulated PostScript) | | X [1] | | | | |
| GIF (Graphics Interchange Format) | X | X | X | X | X | X |
| JPEG (Joint Photographic Experts Group) | X [1] | X [1] | X | X | X [1] | X |
| PNG (Portable Network Graphics) | X | X | X | X | X | X |
| SVG (Scalable Vector Graphics) | X | X | X | X | X | X |
| TIFF (Tagged Image File Format) | X [2] | X [2] | X | X | X [2] | X |
| WMF (Windows Metafile) | X | X | X | X | X | X |

Legend:

Graphics Packages

XML Graphics Commons Native

XML Graphics Commons supports a number of graphic file formats natively as basic functionality: all bitmap formats for which there are Image I/O codecs available (JPEG, PNG, GIF, TIFF, etc.), EPS and EMF.

FOP Native

FOP has no native image plug-ins of its own for the image loading framework, but it currently hosts the Batik-dependent SVG and WMF plug-ins until they can be moved to Apache Batik.

Apache Batik

Apache Batik will later receive the SVG and WMF plug-ins for the image loading framework that are currently hosted inside FOP.

Current FOP distributions include a copy of Apache Batik. Because Batik's API changes frequently, it is highly recommended that you use the version that ships with FOP, at least when running FOP. Batik must be run in a graphical environment: it uses AWT classes for rendering SVG, which in turn require an X server on Unix-like systems. If you run a server without X, or if you can't connect to the X server due to security restrictions or policies (a so-called "headless" environment), SVG rendering will fail.

Here are some workarounds:
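
One option that is generally available on the JVM is AWT's headless mode, controlled by the standard java.awt.headless system property (it can also be passed on the command line as -Djava.awt.headless=true). The sketch below only illustrates that JDK facility. The class name HeadlessSetup is hypothetical, and whether headless mode is sufficient for every Batik feature in your setup is an assumption you should verify.

```java
import java.awt.GraphicsEnvironment;

public class HeadlessSetup {

    /**
     * Requests headless AWT and reports whether the JVM honours it.
     * The property has to be set before any AWT class is initialized,
     * so call this early (assumption: nothing has touched AWT yet).
     */
    public static boolean enableHeadless() {
        System.setProperty("java.awt.headless", "true");
        return GraphicsEnvironment.isHeadless();
    }

    public static void main(String[] args) {
        System.out.println("Headless AWT active: " + enableHeadless());
    }
}
```

Setting the property programmatically only helps if it happens before the first AWT access; on a server, the command-line form is usually the safer choice.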

Image I/O

The image loading framework in XML Graphics Commons provides a wrapper to load images through the JDK's Image I/O API (JSR 015). Image I/O allows additional image codecs to be added dynamically. An example of such an add-on library is the JAI Image I/O Tools available from Sun.
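
To see which Image I/O codecs are actually visible to your JVM, you can ask the JDK directly. The following is a minimal sketch using only the standard javax.imageio API; the class name ListImageIoCodecs is hypothetical, and the result depends on which codec libraries are on your classpath.

```java
import java.util.Locale;
import java.util.TreeSet;
import javax.imageio.ImageIO;

public class ListImageIoCodecs {

    /** Returns the lower-cased format names for which an Image I/O reader is registered. */
    public static TreeSet<String> readerFormats() {
        // Pick up codec plug-ins that were added to the classpath after startup.
        ImageIO.scanForPlugins();
        TreeSet<String> names = new TreeSet<>();
        for (String name : ImageIO.getReaderFormatNames()) {
            names.add(name.toLowerCase(Locale.ROOT));
        }
        return names;
    }

    public static void main(String[] args) {
        System.out.println("Image I/O can read: " + readerFormats());
    }
}
```

If a format you need is missing from the output, a decoding failure for that format is more likely a missing codec than a problem in FOP itself.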

Details on image formats

BMP

BMP images are supported through an Image I/O codec. There may be limitations of the codec which are outside the control of Apache FOP.

EMF

Windows Enhanced Metafiles (EMF) are only supported in RTF output where they are embedded without decoding.

EPS

Apache FOP allows the use of EPS files only when generating PostScript output.

Other output targets can't be supported at the moment because FOP lacks a PostScript interpreter. Furthermore, FOP is currently not able to parse the preview bitmaps sometimes contained in EPS files.

GIF

GIF images are supported through an Image I/O codec. Transparency is supported but not guaranteed to work with every output format.

JPEG

FOP native support (i.e. the handling of undecoded images) of JPEG does not include all variants, especially those containing unusual color lookup tables and color profiles. If you have trouble with a JPEG image in FOP, try opening it with an image processing program (such as Photoshop or Gimp) and then saving it. Specifying 24-bit color output may also help. For the PDF and PostScript renderers, most JPEG images can be passed through without decompression. User reports indicate that grayscale, RGB, and CMYK color spaces are all rendered properly. However, for other output formats, the JPEG images have to be decompressed. Tests have shown that there are some limitations in some Image I/O codecs concerning images in the CMYK color space. Work-arounds are in place but may not always work as expected.
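
The "open and re-save as 24-bit color" advice can also be scripted. The sketch below is not part of FOP: it uses only the JDK's Image I/O and Java2D classes to redraw an image into a 24-bit RGB buffer and write it back out as a baseline JPEG. The class name JpegNormalizer is hypothetical, and the approach assumes the installed Image I/O codec can decode the source image in the first place, which may not hold for CMYK JPEGs.

```java
import java.awt.Graphics2D;
import java.awt.image.BufferedImage;
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import javax.imageio.ImageIO;

public class JpegNormalizer {

    /** Redraws the image into a plain 24-bit RGB buffer (no alpha, no palette). */
    public static BufferedImage toRgb(BufferedImage src) {
        BufferedImage rgb = new BufferedImage(
                src.getWidth(), src.getHeight(), BufferedImage.TYPE_INT_RGB);
        Graphics2D g = rgb.createGraphics();
        try {
            g.drawImage(src, 0, 0, null);
        } finally {
            g.dispose();
        }
        return rgb;
    }

    /** Decodes the given image bytes and re-encodes them as an RGB JPEG. */
    public static byte[] reencode(byte[] data) throws IOException {
        BufferedImage src = ImageIO.read(new ByteArrayInputStream(data));
        if (src == null) {
            throw new IOException("No Image I/O reader could decode the input");
        }
        ByteArrayOutputStream out = new ByteArrayOutputStream();
        if (!ImageIO.write(toRgb(src), "jpg", out)) {
            throw new IOException("No JPEG writer available");
        }
        return out.toByteArray();
    }
}
```

Re-encoding recompresses the image and therefore loses some quality, so it is best kept as a fallback for files that fail to embed.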

PNG

PNG images are supported through an Image I/O codec. Transparency is supported but not guaranteed to work with every output format.

SVG

Introduction

FOP uses Apache Batik for SVG support. This format can be handled as an fo:instream-foreign-object or in a separate file referenced with fo:external-graphic. Batik's SVG Rasterizer utility may also be used to convert standalone SVG documents into PDF. For more information please see the SVG Rasterizer documentation on the Batik site.

Placing SVG Graphics into PDF

The SVG is rendered into PDF by using PDF commands to draw and fill lines and curves. This means that the graphical objects created with this remain as vector graphics. The same applies to PostScript output. For other output formats the SVG graphic may be converted to a bitmap image.

A number of SVG features cannot be converted directly into PDF. Parts of the graphic such as effects, patterns and images are inserted into the PDF as raster graphics. The resolution of these raster images can be controlled through the "target resolution" setting in the configuration.

Currently, transparency is limited in PDF, so many SVG images that contain effects or graphics with transparent areas may not be displayed correctly.

Placing SVG Text into PDF and PostScript

If possible, Batik will use normal PDF or PostScript text when inserting text. It does this by checking whether the text can be drawn normally and whether the font is supported. This example (text.svg / text.pdf) shows how various types of text and text effects are handled. Note that tspan and outlined text are not yet implemented.

Otherwise, Batik converts the text and draws it as a set of shapes, using the stroking text painter. This means that a typical character will have about 10 curves, and each curve takes at least 20 characters in the output, so a single glyph costs roughly 200 characters or more. This can make the output files large, and the viewer may not draw those fine curves very well (in Adobe Acrobat, turning on "Smooth Line Art" in the preferences will fix this). Copy/paste functionality will not be supported in this case. By contrast, text inserted into the output file using the built-in text commands uses a single character per glyph.

Note that because SVG text can be rendered as either text or a vector graphic, you may need to consider settings in your viewer for both. The Acrobat viewer has both "smooth line art" and "smooth text" settings that may need to be set for SVG images to be displayed nicely on your screen (see Edit / Preferences / Display). These settings do not affect the printing of your document, which should be OK in any case; they only affect the quality of the screen display.

Scaling

Currently, SVG images are rendered with the dimensions specified in the SVG file, within the viewport specified in the fo:external-graphic element. For everything to work properly, the two should be equal. The SVG standard leaves this issue as an implementation detail. Additional scaling options are available through XSL-FO means.

If you use pixels to specify the size of an SVG graphic, the "source resolution" setting in the configuration will be used to determine the size of a pixel. The use of pixels to specify sizes is discouraged, as they may be interpreted differently in different environments.

Known Problems

TIFF

FOP can embed TIFF images without decompression into PDF, PostScript and AFP if they have either CCITT T.4, CCITT T.6, or JPEG compression. Otherwise, a TIFF-capable Image I/O codec is necessary for decoding the image.

There may be some limitations concerning images in the CMYK color space.

WMF

Windows Metafiles (WMF) are supported through classes in Apache Batik. At the moment, support for this format is experimental and may not always work as expected.

Graphics Resolution

Some bitmapped image file formats store a dots-per-inch (dpi) or other resolution value. FOP tries to use this resolution information whenever possible to determine the image's intrinsic size. This size is used during the layout process when it is not superseded by an explicit size on fo:external-graphic (content-width and content-height properties).

Please note that not all images contain resolution information. If it's not available the source resolution set on the FopFactory (or through the user configuration XML) is used. The default here is 72 dpi.
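
The arithmetic behind the intrinsic size is simple: a length in pixels divided by the resolution gives inches, and one inch is 72 points. The sketch below illustrates that calculation with the fallback to the source resolution. It is an illustration only, not FOP's actual code; the class and method names are hypothetical, and a resolution of 0 or less stands in for "the image carries no resolution information".

```java
public class IntrinsicSize {

    /** Points per inch, as used for page-layout lengths. */
    private static final double POINTS_PER_INCH = 72.0;

    /**
     * Converts a pixel length to points.
     *
     * @param pixels    image width or height in pixels
     * @param imageDpi  resolution stored in the image, or a value <= 0 if none is stored
     * @param sourceDpi fallback source resolution (72 dpi by default according to the text)
     */
    public static double toPoints(int pixels, double imageDpi, double sourceDpi) {
        double dpi = imageDpi > 0 ? imageDpi : sourceDpi;
        if (dpi <= 0) {
            throw new IllegalArgumentException("resolution must be positive");
        }
        return pixels * POINTS_PER_INCH / dpi;
    }

    public static void main(String[] args) {
        // A 300-pixel-wide image tagged with 300 dpi is one inch (72 points) wide.
        System.out.println(toPoints(300, 300.0, 72.0));
        // The same image without resolution information falls back to 72 dpi.
        System.out.println(toPoints(300, 0.0, 72.0));
    }
}
```

This also shows why an untagged image can come out much larger than expected: at the 72 dpi default, every pixel occupies a full point.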

Bitmap images are generally embedded into the output format at their original resolution (as is). No resampling of the image is performed. Explicit resampling is on our wishlist but hasn't been implemented yet. Bitmaps included in SVG graphics may be resampled to the resolution specified in the "target resolution" setting in the configuration if SVG filters are applied. This can be used as a work-around to resample images in FO documents.

Page selection for multi-page formats

Some image formats, such as TIFF, support multiple pages/sub-images per file. You can select a particular page using a special URI fragment of the form #page=<nr> (for example: http://localhost/images/myimage.tiff#page=3).
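
If you generate such URIs programmatically, a small helper keeps the fragment syntax in one place. This is a minimal sketch using only java.net.URI; the class PageFragment and its methods are hypothetical helpers, not FOP API, and returning -1 for "no page selected" is a choice made for this example.

```java
import java.net.URI;

public class PageFragment {

    private static final String PREFIX = "page=";

    /** Appends a page-selection fragment to an image URI that has no fragment yet. */
    public static String withPage(String imageUri, int page) {
        return imageUri + "#" + PREFIX + page;
    }

    /** Returns the page number from a "#page=" fragment, or -1 if none is present. */
    public static int pageOf(String uri) {
        String fragment = URI.create(uri).getFragment();
        if (fragment != null && fragment.startsWith(PREFIX)) {
            return Integer.parseInt(fragment.substring(PREFIX.length()));
        }
        return -1;
    }

    public static void main(String[] args) {
        String uri = withPage("http://localhost/images/myimage.tiff", 3);
        System.out.println(uri + " selects page " + pageOf(uri));
    }
}
```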

Image caching

FOP caches images between runs. There is one cache per FopFactory instance. The URI is used as the key to identify images, which means that when a particular URI appears again, the image is taken from the cache. If you have a servlet that generates a different image each time it is called with the same URI, you need to add a constantly changing dummy parameter to the URI to avoid caching.
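
The dummy-parameter trick can be sketched as follows. This helper is hypothetical and not part of FOP: it appends a counter-based query parameter (the name "nocache" is an arbitrary choice for this example) so that each generated URI is a distinct cache key, while keeping any existing query string and fragment intact.

```java
import java.util.concurrent.atomic.AtomicLong;

public class CacheBustingUri {

    // Monotonic counter, so two calls never yield the same URI within one JVM.
    private static final AtomicLong COUNTER = new AtomicLong();

    /** Returns the URI with an extra, ever-changing dummy query parameter. */
    public static String unique(String uri) {
        int hash = uri.indexOf('#');
        String base = hash >= 0 ? uri.substring(0, hash) : uri;
        String fragment = hash >= 0 ? uri.substring(hash) : "";
        String separator = base.indexOf('?') >= 0 ? "&" : "?";
        return base + separator + "nocache=" + COUNTER.incrementAndGet() + fragment;
    }

    public static void main(String[] args) {
        System.out.println(unique("http://localhost/chart"));
        System.out.println(unique("http://localhost/chart?id=7#page=2"));
    }
}
```

The servlet on the other end simply ignores the extra parameter. Keep in mind that every distinct URI adds a new cache entry, so this is only appropriate for images that really do change on each request.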

The image cache has been improved considerably in the redesigned code. Therefore, resetting the image cache should be a thing of the past. If you still experience OutOfMemoryErrors, please notify us.

If all else fails, the image cache can be cleared like this: fopFactory.getImageManager().getCache().clearCache();