Add to
del.icio.us
Digg this
July 4, 2008
Sun Microsystems' PDF Import extension (SPI) is only available in beta for now, and only works with recent
developer builds of OpenOffice.org 3.0, which is scheduled for September release, meaning that the Linux
community will have to wait until then if it is to be used in a production environment.
Of course, the quality of the final release is anybody's guess, but the beta's capabilities fall in
the middle of the available PDF import tools.
SPI opens up PDF files in the Draw application, although you could also use Impress, which shares much of
the same code. This might seem strange at first, especially if your PDF file is text. But using Draw is logical,
given the limitations of the PDF format.
No application -- not even Acrobat, the proprietary PDF editor made by Adobe, is able to edit more than
a single line while preserving format. Given this limitation, importing to Draw makes sense, because it can
treat each line as a separate text object for editing.
Although rearrangement of a paragraph requires line by line editing with SPI, and can be tricky if you need
to add an extra line, the extension leaves you no worse off than any other PDF editor.
However, at least you are in a relatively friendly user interface. Aside from the limitation of editing one
line at a time, the worst problem you are likely to have is the automatic capitalization of the first character
of each line if you have Autocorrect turned on while you edit.
To investigate SPI, you need to download and install a snapshot build of OpenOffice.org 3.0.
Then you can install SPI from Tools -> Extension Manager. The next time you start OpenOffice.org 3.0, you'll
be able to open PDF files from any of the options for opening an existing document in the File menu.
In testing, SPI's success at importing text depended largely on the fonts being used in the document. For
best results, you need to have the fonts in the imported PDF file installed on your system. Otherwise, SPI
will use a substitute font that may not correspond to the letter spacing of the original.
Additionally, while common fonts such as Helvetica or Times Roman create few problems, SPI seems to have
trouble reading the metrics of some PostScript fonts and displaying them correctly. Usually, the display
problem takes the form of a line of text that, converted to a text object, extends far beyond the page margins,
and makes reformatting tedious if not impossible.
In its beta version, SPI cannot handle PDF forms, and text alignment is not always preserved, with fully
justified text showing a strong tendency to import as left-aligned. Nor are hyperlinks supported, although
they are a basic necessity for many online documents.
Otherwise, the list of what SPI can handle is much larger than the list of problems: Text frames, sections,
multilevel lists, and table formatting including border and background color, are all imported without any
problems.
Graphics in imported PDF files had similar mixed success during testing. Many import into Draw without any
trouble, with text wrapping around them in the same style as the original document. However, some PNG images -- but
not all -- were imported vertically inverted, and, in another case, a graphic became an uneditable object.
In some complex layouts, the positioning of some graphics was off by a few pixels though.
Source: eBusiness News.
Add to
del.icio.us
Digg this
ADVERTISERS:
Linux News Today.org is read by over 450,000 people involved in the field of Linux application development,
professional Web hosting services, Linux
security, Linux Web development, etc.
Inquire about our reasonable advertising rates
on our news website. One of our advertising representatives will be in touch with you. Simply email us to learn
about our ad rates and how we can help drive relevant traffic to your website. Advertising space is limited.