Поиск по блогу

вторник, 3 февраля 2015 г.

Ссылки на конвертеры pdf файлов в текст и html

Как парсить pdf файлы? Здесь ссылки на десктопные конвертеры и библиотеки. Есть даже список List of PDF software, но главные надежды я возлагаю на Xpdf, только что скачал и буду пробовать под Windows. Здесь же и справочники pdftotext.txt pdftohtml.txt

Здесь и первые эксперименты с pdftotext и pdftohtml. И чудный html файл после pdftohtml.

Xpdf 3.04 was released 2014-may-28
Python module for converting PDF to text
List of PDF software
slate 0.4Extract text from PDF documents easily
pdfreflow is a command line utility that operates on the output of the poppler utility called pdftohtml. pdfreflow reflows the texts into paragraphs, while at the same time removing hyphenation and page numbers, headers and footers.
pdftohtmlPdftohtml is a tool based on the Xpdf package which translates pdf documents into html format.
pdf2xml convertor based on Xpdf library (http://www.foolabs.com/xpdf/home.html). It converts information contained in a PDF file into XML. First, you need to install xpdf and libxml2 (see documentation). Hervé Déjean Xerox Research Centre Europe
How can I convert PDF to HTML? old post 2009 year
Reading data from PDF files into R
Poppler is a PDF rendering library based on the xpdf-3.0 code base
Converting PDF to HTML with Python [duplicate]
scrapy for table content in pdf file
Saving PDF files through callback function in a spider in scrapy
Portable Document Format (PDF) — межплатформенный формат электронных документов, разработанный фирмой Adobe Systems с использованием ряда возможностей языка PostScript.
PDFTK4ALLPDFtk is a simple tool for doing everyday things with PDF documents. It comes in three flavors: PDFtk Free, PDFtk Pro, and our original command-line tool PDFtk Server.

Распарсить PDF в TXT - здесь бедолаги решали ту же задачуЮ что и я, пока их читал установил на w8 утилиту pdftk4all - она слеивает и разделяет файлы и страницы. Они ее использовали для ремонта "плохих" pdf файлов.
Scraping large pdf tables which span accross multiple pages
PDF reference manual from Adobe
may be helpful to PDF users
Here are some other tools based on the Xpdf code
Pdf-parser

In []:
pdftk4all

Как конвертируется pdf файл (начнем с конца)

In [8]:
from IPython.display import Image
Image ("C:\\Users\\kiss\\Pictures\\pythonR\\pdftohtml.png")
Out[8]:

Под windows документации маловато, но есть

In []:
C:\Program Files\Xpdf\bin64>pdftohtml help
pdftohtml version 3.04
Copyright 1996-2014 Glyph & Cog, LLC
Usage: pdftohtml [options] <PDF-file> <html-dir>
  -f <int>               : first page to convert
  -l <int>               : last page to convert
  -r <int>               : resolution, in DPI (default is 150)
  -skipinvisible         : do not draw invisible text
  -opw <string>          : owner password (for encrypted files)
  -upw <string>          : user password (for encrypted files)
  -q                     : don't print any messages or errors
  -cfg <string>          : configuration file to use in place of .xpdfrc
  -v                     : print copyright and version info
  -h                     : print usage information
  -help                  : print usage information
  --help                 : print usage information
  -?                     : print usage information

C:\Program Files\Xpdf\bin64>
In [2]:
%load "C:\\Program Files\\Xpdf\\doc\\pdftotext.txt"
In []:
pdftotext(1)                                                      pdftotext(1)



NAME
       pdftotext  -  Portable Document Format (PDF) to text converter (version
       3.04)

SYNOPSIS
       pdftotext [options] [PDF-file [text-file]]

DESCRIPTION
       Pdftotext converts Portable Document Format (PDF) files to plain text.

       Pdftotext reads the PDF file, PDF-file, and writes a text  file,  text-
       file.   If  text-file  is not specified, pdftotext converts file.pdf to
       file.txt.  If text-file is '-', the text is sent to stdout.

CONFIGURATION FILE
       Pdftotext reads a configuration file at startup.   It  first  tries  to
       find the user's private config file, ~/.xpdfrc.  If that doesn't exist,
       it looks for a system-wide config file, typically /usr/local/etc/xpdfrc
       (but  this  location  can be changed when pdftotext is built).  See the
       xpdfrc(5) man page for details.

OPTIONS
       Many of the following options can be set with configuration  file  com-
       mands.  These are listed in square brackets with the description of the
       corresponding command line option.

       -f number
              Specifies the first page to convert.

       -l number
              Specifies the last page to convert.

       -layout
              Maintain (as best as possible) the original physical  layout  of
              the  text.   The  default is to 'undo' physical layout (columns,
              hyphenation, etc.) and output the text in reading order.  If the
              -fixed  option is given, character spacing within each line will
              be determined by the specified character pitch.

       -table Table mode is similar to physical layout mode, but optimized for
              tabular  data, with the goal of keeping rows and columns aligned
              (at the expense of inserting extra whitespace).  If  the  -fixed
              option  is  given,  character  spacing  within each line will be
              determined by the specified character pitch.

       -lineprinter
              Line  printer  mode  uses  a  strict  fixed-character-pitch  and
              -height  layout.   That  is, the page is broken into a grid, and
              characters are placed into that grid.  If the  grid  spacing  is
              too  small for the actual characters, the result is extra white-
              space.  If the grid spacing is too large, the result is  missing
              whitespace.   The grid spacing can be specified using the -fixed
              and -linespacing options.  If one or both are not given  on  the
              command  line,  pdftotext  will  attempt  to compute appropriate
              value(s).

       -raw   Keep the text in content stream order.  Depending on how the PDF
              file was generated, this may or may not be useful.

       -fixed number
              Specify  the  character  pitch (character width), in points, for
              physical layout, table, or line printer mode.  This  is  ignored
              in all other modes.

       -linespacing number
              Specify  the  line  spacing,  in  points, for line printer mode.
              This is ignored in all other modes.

       -clip  Text which is hidden because of clipping is removed before doing
              layout,  and then added back in.  This can be helpful for tables
              where clipped (invisible) text would overlap the next column.

       -enc encoding-name
              Sets the encoding to use for  text  output.   The  encoding-name
              must  be  defined  with  the unicodeMap command (see xpdfrc(5)).
              The encoding name is case-sensitive.  This defaults to  "Latin1"
              (which is a built-in encoding).  [config file: textEncoding]

       -eol unix | dos | mac
              Sets the end-of-line convention to use for text output.  [config
              file: textEOL]

       -nopgbrk
              Don't insert page breaks (form feed characters)  between  pages.
              [config file: textPageBreaks]

       -opw password
              Specify  the  owner  password  for the PDF file.  Providing this
              will bypass all security restrictions.

       -upw password
              Specify the user password for the PDF file.

       -q     Don't print any messages or errors.  [config file: errQuiet]

       -cfg config-file
              Read config-file in place of ~/.xpdfrc or the system-wide config
              file.

       -v     Print copyright and version information.

       -h     Print usage information.  (-help and --help are equivalent.)

BUGS
       Some  PDF  files contain fonts whose encodings have been mangled beyond
       recognition.  There is no way (short of OCR) to extract text from these
       files.

EXIT CODES
       The Xpdf tools use the following exit codes:

       0      No error.

       1      Error opening a PDF file.

       2      Error opening an output file.

       3      Error related to PDF permissions.

       99     Other error.

AUTHOR
       The  pdftotext software and documentation are copyright 1996-2014 Glyph
       & Cog, LLC.

SEE ALSO
       xpdf(1),  pdftops(1),  pdftohtml(1),  pdfinfo(1),  pdffonts(1),  pdfde-
       tach(1), pdftoppm(1), pdftopng(1), pdfimages(1), xpdfrc(5)
       http://www.foolabs.com/xpdf/



                                  28 May 2014                     pdftotext(1)
In [3]:
%load "C:\\Program Files\\Xpdf\\doc\\pdftohtml.txt"
In []:
pdftohtml(1)                                                      pdftohtml(1)



NAME
       pdftohtml  -  Portable Document Format (PDF) to HTML converter (version
       3.04)

SYNOPSIS
       pdftohtml [options] PDF-file HTML-dir

DESCRIPTION
       Pdftohtml converts Portable Document Format (PDF) files to HTML.

       Pdftohtml reads the PDF file, PDF-file, and places  an  HTML  file  for
       each page, along with auxiliary images in the directory, HTML-dir.  The
       HTML directory will be created; if it already  exists,  pdftohtml  will
       report an error.

CONFIGURATION FILE
       Pdftohtml  reads  a  configuration  file at startup.  It first tries to
       find the user's private config file, ~/.xpdfrc.  If that doesn't exist,
       it looks for a system-wide config file, typically /usr/local/etc/xpdfrc
       (but this location can be changed when pdftohtml is  built).   See  the
       xpdfrc(5) man page for details.

OPTIONS
       Many  of  the following options can be set with configuration file com-
       mands.  These are listed in square brackets with the description of the
       corresponding command line option.

       -f number
              Specifies the first page to convert.

       -l number
              Specifies the last page to convert.

       -r     Specifies  the  resolution,  in DPI, for background images.  The
              default is 150 DPI.

       -opw password
              Specify the owner password for the  PDF  file.   Providing  this
              will bypass all security restrictions.

       -upw password
              Specify the user password for the PDF file.

       -q     Don't print any messages or errors.  [config file: errQuiet]

       -cfg config-file
              Read config-file in place of ~/.xpdfrc or the system-wide config
              file.

       -v     Print copyright and version information.

       -h     Print usage information.  (-help and --help are equivalent.)

BUGS
       Some PDF files contain fonts whose encodings have been  mangled  beyond
       recognition.  There is no way (short of OCR) to extract text from these
       files.

EXIT CODES
       The Xpdf tools use the following exit codes:

       0      No error.

       1      Error opening a PDF file.

       2      Error opening an output file.

       3      Error related to PDF permissions.

       99     Other error.

AUTHOR
       The pdftohtml software and documentation are copyright 1996-2014  Glyph
       & Cog, LLC.

SEE ALSO
       xpdf(1),  pdftops(1),  pdftotext(1),  pdfinfo(1),  pdffonts(1),  pdfde-
       tach(1), pdftoppm(1), pdftopng(1), pdfimages(1), xpdfrc(5)
       http://www.foolabs.com/xpdf/



                                  28 May 2014                     pdftohtml(1)
In []:
Об установке 

Установка под Windows оказалось необычной. В файле install.txt было рекомендовано просто создать папку Xpdf в Program Files и скопировать все туда. Потом я создал целых два конфигурационных файла в bin6

In []:
xpdfrc
xpdfrc.txt
In []:
C:...>tree "C:\Program Files\Xpdf" /F
Структура папок
Серийный номер тома: 000000D7 6017:2A0B
C:\PROGRAM FILES\XPDF
   ANNOUNCE
   CHANGES
   COPYING
   COPYING3
   INSTALL
   README

├───bin32
       pdfdetach.exe
       pdffonts.exe
       pdfimages.exe
       pdfinfo.exe
       pdftohtml.exe
       pdftopng.exe
       pdftoppm.exe
       pdftops.exe
       pdftotext.exe

├───bin64
       demo1.pdf
       pdfdetach.exe
       pdffonts.exe
       pdfimages.exe
       pdfinfo.exe
       pdftohtml.exe
       pdftopng.exe
       pdftoppm.exe
       pdftops.exe
       pdftotext.exe
       xpdfrc
       xpdfrc.txt

└───doc
        pdfdetach.txt
        pdffonts.txt
        pdfimages.txt
        pdfinfo.txt
        pdftohtml.txt
        pdftopng.txt
        pdftoppm.txt
        pdftops.txt
        pdftotext.txt
        sample-xpdfrc
        xpdf.txt
        xpdfrc.txt

Файл конфигурации был полность закомментирован, я не нашел и не нагуглил никаких инструкций, кроме форума вот этих бедолаг Распарсить PDF в TXT

и раскоментировал следующее

In [3]:
%load "C:\\Program Files\\Xpdf\\bin64\\xpdfrc"
In []:
#========================================================================
#
# Sample xpdfrc file
#
# The Xpdf tools look for a config file in two places:
# 1. ~/.xpdfrc
# 2. in a system-wide directory, typically /usr/local/etc/xpdfrc
#
# This sample config file demonstrates some of the more common
# configuration options.  Everything here is commented out.  You
# should edit things (especially the file/directory paths, since
# they'll likely be different on your system), and uncomment whichever
# options you want to use.  For complete details on config file syntax
# and available options, please see the xpdfrc(5) man page.
#
# Also, the Xpdf language support packages each include a set of
# options to be added to the xpdfrc file.
#
# http://www.foolabs.com/xpdf/
#
#========================================================================

#----- display fonts

# These map the Base-14 fonts to the Type 1 fonts that ship with
# ghostscript.  You'll almost certainly want to use something like
# this, but you'll need to adjust this to point to wherever
# ghostscript is installed on your system.  (But if the fonts are
# installed in a "standard" location, xpdf will find them
# automatically.)

#fontFile Times-Roman  /usr/local/share/ghostscript/fonts/n021003l.pfb
#fontFile Times-Italic  /usr/local/share/ghostscript/fonts/n021023l.pfb
#fontFile Times-Bold  /usr/local/share/ghostscript/fonts/n021004l.pfb
#fontFile Times-BoldItalic /usr/local/share/ghostscript/fonts/n021024l.pfb
#fontFile Helvetica  /usr/local/share/ghostscript/fonts/n019003l.pfb
#fontFile Helvetica-Oblique /usr/local/share/ghostscript/fonts/n019023l.pfb
#fontFile Helvetica-Bold  /usr/local/share/ghostscript/fonts/n019004l.pfb
#fontFile Helvetica-BoldOblique /usr/local/share/ghostscript/fonts/n019024l.pfb
#fontFile Courier  /usr/local/share/ghostscript/fonts/n022003l.pfb
#fontFile Courier-Oblique /usr/local/share/ghostscript/fonts/n022023l.pfb
#fontFile Courier-Bold  /usr/local/share/ghostscript/fonts/n022004l.pfb
#fontFile Courier-BoldOblique /usr/local/share/ghostscript/fonts/n022024l.pfb
#fontFile Symbol   /usr/local/share/ghostscript/fonts/s050000l.pfb
#fontFile ZapfDingbats  /usr/local/share/ghostscript/fonts/d050000l.pfb

# If you need to display PDF files that refer to non-embedded fonts,
# you should add one or more fontDir options to point to the
# directories containing the font files.  Xpdf will only look at .pfa,
# .pfb, .ttf, and .ttc files in those directories (other files will
# simply be ignored).

fontDir  C:\Windows\Fonts

#----- PostScript output control

# Set the default PostScript file or command.

#psFile   "|lpr -Pmyprinter"

# Set the default PostScript paper size -- this can be letter, legal,
# A4, or A3.  You can also specify a paper size as width and height
# (in points).

#psPaperSize  letter

#----- text output control

# Choose a text encoding for copy-and-paste and for pdftotext output.
# The Latin1, ASCII7, and UTF-8 encodings are built into Xpdf.  Other
# encodings are available in the language support packages.

textEncoding  UTF-8

# Choose the end-of-line convention for multi-line copy-and-past and
# for pdftotext output.  The available options are unix, mac, and dos.

#textEOL  unix

#----- misc settings

# Enable FreeType, and anti-aliased text.

#enableFreeType  yes
#antialias  yes

# Set the command used to run a web browser when a URL hyperlink is
# clicked.

#launchCommand  viewer-script
#urlCommand "netscape -remote 'openURL(%s)'"

Попробовал запустить pdftohtml, получил ошибки

In []:
C:\Program Files\Xpdf\bin64>pdftohtml demo1.pdf C:\Users\kiss\Documents\Xpdf
Config Error: No display font for 'Symbol'
Config Error: No display font for 'ZapfDingbats'
I/O Error: Couldn't create HTML output directory 'C:\Users\kiss\Documents\Xpdf'
In []:
#Попытался добавить строчку со шрифтом в symbol.txt
# Заодно и создать xpdfrc.txt из xpdfrc
fontFile Symbol   C:\Windows\Fonts\symbol.ttf
In [4]:
%load "C:\\Program Files\\Xpdf\\bin64\\xpdfrc.txt"
In []:
#========================================================================
#
# Sample xpdfrc file
#
# The Xpdf tools look for a config file in two places:
# 1. ~/.xpdfrc
# 2. in a system-wide directory, typically /usr/local/etc/xpdfrc
#
# This sample config file demonstrates some of the more common
# configuration options.  Everything here is commented out.  You
# should edit things (especially the file/directory paths, since
# they'll likely be different on your system), and uncomment whichever
# options you want to use.  For complete details on config file syntax
# and available options, please see the xpdfrc(5) man page.
#
# Also, the Xpdf language support packages each include a set of
# options to be added to the xpdfrc file.
#
# http://www.foolabs.com/xpdf/
#
#========================================================================

#----- display fonts

# These map the Base-14 fonts to the Type 1 fonts that ship with
# ghostscript.  You'll almost certainly want to use something like
# this, but you'll need to adjust this to point to wherever
# ghostscript is installed on your system.  (But if the fonts are
# installed in a "standard" location, xpdf will find them
# automatically.)

#fontFile Times-Roman  /usr/local/share/ghostscript/fonts/n021003l.pfb
#fontFile Times-Italic  /usr/local/share/ghostscript/fonts/n021023l.pfb
#fontFile Times-Bold  /usr/local/share/ghostscript/fonts/n021004l.pfb
#fontFile Times-BoldItalic /usr/local/share/ghostscript/fonts/n021024l.pfb
#fontFile Helvetica  /usr/local/share/ghostscript/fonts/n019003l.pfb
#fontFile Helvetica-Oblique /usr/local/share/ghostscript/fonts/n019023l.pfb
#fontFile Helvetica-Bold  /usr/local/share/ghostscript/fonts/n019004l.pfb
#fontFile Helvetica-BoldOblique /usr/local/share/ghostscript/fonts/n019024l.pfb
#fontFile Courier  /usr/local/share/ghostscript/fonts/n022003l.pfb
#fontFile Courier-Oblique /usr/local/share/ghostscript/fonts/n022023l.pfb
#fontFile Courier-Bold  /usr/local/share/ghostscript/fonts/n022004l.pfb
#fontFile Courier-BoldOblique /usr/local/share/ghostscript/fonts/n022024l.pfb
fontFile Symbol   C:\Windows\Fonts\symbol.ttf
#fontFile ZapfDingbats  /usr/local/share/ghostscript/fonts/d050000l.pfb

# If you need to display PDF files that refer to non-embedded fonts,
# you should add one or more fontDir options to point to the
# directories containing the font files.  Xpdf will only look at .pfa,
# .pfb, .ttf, and .ttc files in those directories (other files will
# simply be ignored).

fontDir  C:\Windows\Fonts

#----- PostScript output control

# Set the default PostScript file or command.

#psFile   "|lpr -Pmyprinter"

# Set the default PostScript paper size -- this can be letter, legal,
# A4, or A3.  You can also specify a paper size as width and height
# (in points).

#psPaperSize  letter

#----- text output control

# Choose a text encoding for copy-and-paste and for pdftotext output.
# The Latin1, ASCII7, and UTF-8 encodings are built into Xpdf.  Other
# encodings are available in the language support packages.

textEncoding  UTF-8

# Choose the end-of-line convention for multi-line copy-and-past and
# for pdftotext output.  The available options are unix, mac, and dos.

#textEOL  unix

#----- misc settings

# Enable FreeType, and anti-aliased text.

#enableFreeType  yes
#antialias  yes

# Set the command used to run a web browser when a URL hyperlink is
# clicked.

#launchCommand  viewer-script
#urlCommand "netscape -remote 'openURL(%s)'"

Не помогло. Решил попробовать вариант pdftotext

In []:
C:\Program Files\Xpdf\bin64>pdftotext demo1.pdf C:\Users\kiss\Documents\Xpdf
I/O Error: Couldn't open text file 'C:\Users\kiss\Documents\Xpdf'

C:\Program Files\Xpdf\bin64>pdftotext demo1.pdf C:\Users\kiss\Documents\Xpdf\text1.txt

И нашел в заданном файле text1.txt вполне приличный текстовый файл. Но захотелось большего. Может быть другие шрифты найдутся?

In []:
C:\Program Files\Xpdf\bin64>pdftohtml demo1.pdf C:\Users\kiss\Documents\Xpdf\text1.html
Config Error: No display font for 'Symbol'
Config Error: No display font for 'ZapfDingbats'
Syntax Warning: Substituting font 'Helvetica' for 'HelveticaNeue-Roman'

Да, действительно, произошла замене шрифта, и была создана "странная" папка text1.html

In []:
C:\Users\kiss\SkyDrive\Docs\mailru\cars_mail_1\carmailPrice>tree C:\Users\kiss\Documents\Xpdf  /F
Структура папок
Серийный номер тома: 00000023 6017:2A0B
C:\USERS\KISS\DOCUMENTS\XPDF
   text1.txt

└───text1.html
        index.html
        page1.html
        page1.png
        page2.html
        page2.png
        page3.html
        page3.png

А в этой странной папке еще более странный html файл.

Казалось бы, такие стили с абсолютным позициированием - это безвредный изыск. Но нет, при последующем экспериментировании с другим файлом, содержащим таблицы, оказалось, что информация считана по столбцам, а позициируется по строкам.

In [6]:
%load C:/Users/kiss/Documents/Xpdf/text1.html/page3.html
In []:
<html>
<head>
<meta http-equiv="Content-Type" content="text/html; charset=UTF-8">
<style type="text/css">
.txt { white-space:nowrap; }
#f0 { font-family:sans-serif; font-weight:normal; font-style:normal; }
#f1 { font-family:serif; font-weight:normal; font-style:normal; }
#f2 { font-family:serif; font-weight:normal; font-style:italic; }
#f3 { font-family:serif; font-weight:bold; font-style:normal; }
#f4 { font-family:sans-serif; font-weight:normal; font-style:normal; }
#f5 { font-family:sans-serif; font-weight:bold; font-style:normal; }
#f6 { font-family:sans-serif; font-weight:normal; font-style:italic; }
#f7 { font-family:sans-serif; font-weight:normal; font-style:normal; }
#f8 { font-family:sans-serif; font-weight:normal; font-style:normal; }
#f9 { font-family:sans-serif; font-weight:normal; font-style:normal; }
#f10 { font-family:serif; font-weight:bold; font-style:italic; }
#f11 { font-family:sans-serif; font-weight:bold; font-style:normal; }
#f12 { font-family:sans-serif; font-weight:normal; font-style:normal; }
#f13 { font-family:sans-serif; font-weight:normal; font-style:normal; }
</style>
</head>
<body onload="start()">
<img id="background" style="position:absolute; left:0px; top:0px;" width="566" height="726" src="page3.png">
<div class="txt" style="position:absolute; left:47px; top:46px;"><span id="f5" style="font-size:28px;vertical-align:baseline;color:#094270;">Was zum Teufel hat ihn getrieben</span></div>
<div class="txt" style="position:absolute; left:138px; top:82px;"><span id="f1" style="font-size:11px;vertical-align:baseline;color:#000000;">Reuter-Erinnerungen: Die negativen Kommentare überwiegen</span></div>
<div class="txt" style="position:absolute; left:40px; top:102px;"><span id="f6" style="font-size:25px;vertical-align:baseline;color:#094270;">S</span><span id="f2" style="font-size:10px;vertical-align:super;color:#000000;">chein und Wirklichkeit  die</span></div>
<div class="txt" style="position:absolute; left:57px; top:113px;"><span id="f2" style="font-size:10px;vertical-align:baseline;color:#000000;">Memoiren von Edzard Reuter,</span></div>
<div class="txt" style="position:absolute; left:40px; top:124px;"><span id="f2" style="font-size:10px;vertical-align:baseline;color:#000000;">dem ehemaligen Vorstandsvorsitzen-</span></div>
<div class="txt" style="position:absolute; left:40px; top:134px;"><span id="f2" style="font-size:10px;vertical-align:baseline;color:#000000;">den der Daimler-Benz AG, sorgten</span></div>
<div class="txt" style="position:absolute; left:40px; top:145px;"><span id="f2" style="font-size:10px;vertical-align:baseline;color:#000000;">für Aufsehen, noch bevor sie erschie-</span></div>
<div class="txt" style="position:absolute; left:40px; top:156px;"><span id="f2" style="font-size:10px;vertical-align:baseline;color:#000000;">nen waren. Was Reuters grimmiger</span></div>
<div class="txt" style="position:absolute; left:40px; top:166px;"><span id="f2" style="font-size:10px;vertical-align:baseline;color:#000000;">Rückblick schwarz auf weiß bietet,</span></div>
<div class="txt" style="position:absolute; left:40px; top:177px;"><span id="f2" style="font-size:10px;vertical-align:baseline;color:#000000;">ist kein schöner Anblick. Aber lehr-</span></div>
<div class="txt" style="position:absolute; left:40px; top:187px;"><span id="f2" style="font-size:10px;vertical-align:baseline;color:#000000;">reich. So urteilte die Frankfurter</span></div>
<div class="txt" style="position:absolute; left:40px; top:198px;"><span id="f2" style="font-size:10px;vertical-align:baseline;color:#000000;">Rundschau, nachdem manager</span></div>
<div class="txt" style="position:absolute; left:40px; top:209px;"><span id="f2" style="font-size:10px;vertical-align:baseline;color:#000000;">magazin in seiner Februar-Ausgabe</span></div>
<div class="txt" style="position:absolute; left:40px; top:219px;"><span id="f2" style="font-size:10px;vertical-align:baseline;color:#000000;">vorab über das Buch berichtet hatte.</span></div>
<div class="txt" style="position:absolute; left:40px; top:230px;"><span id="f2" style="font-size:10px;vertical-align:baseline;color:#000000;">Kollegen und Geschäftspartner,</span></div>
<div class="txt" style="position:absolute; left:40px; top:240px;"><span id="f2" style="font-size:10px;vertical-align:baseline;color:#000000;">deutsche und internationale Medien</span></div>
<div class="txt" style="position:absolute; left:40px; top:251px;"><span id="f2" style="font-size:10px;vertical-align:baseline;color:#000000;">kommentierten Reuters Erinnerun-</span></div>
<div class="txt" style="position:absolute; left:40px; top:262px;"><span id="f2" style="font-size:10px;vertical-align:baseline;color:#000000;">gen. Auszüge:</span></div>
<div class="txt" style="position:absolute; left:40px; top:283px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">Dieser sich selbst überschätzende</span></div>
<div class="txt" style="position:absolute; left:40px; top:293px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">Einfaltspinsel.</span></div>
<div class="txt" style="position:absolute; left:90px; top:304px;"><span id="f2" style="font-size:8px;vertical-align:baseline;color:#000000;">Ex-Daimler-Chef Joachim Zahn</span></div>
<div class="txt" style="position:absolute; left:106px; top:312px;"><span id="f2" style="font-size:8px;vertical-align:baseline;color:#000000;">im Süddeutschen Rundfunk</span></div>
<div class="txt" style="position:absolute; left:40px; top:334px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">Einfalt meint, der versteht vom Ge-</span></div>
<div class="txt" style="position:absolute; left:40px; top:344px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">schäft nix, und Pinsel bedeutet, der</span></div>
<div class="txt" style="position:absolute; left:40px; top:355px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">schwätzt trotzdem scheinbar ge-</span></div>
<div class="txt" style="position:absolute; left:40px; top:366px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">scheit daher.</span></div>
<div class="txt" style="position:absolute; left:92px; top:376px;"><span id="f2" style="font-size:8px;vertical-align:baseline;color:#000000;">Joachim Zahn in Der Spiegel</span></div>
<div class="txt" style="position:absolute; left:40px; top:395px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">Was zum Teufel hat Reuter getrie-</span></div>
<div class="txt" style="position:absolute; left:40px; top:406px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">ben, mir angesichts meiner angeb-</span></div>
<div class="txt" style="position:absolute; left:40px; top:417px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">lich ekelhaften Charaktereigen-</span></div>
<div class="txt" style="position:absolute; left:40px; top:427px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">schaften auch noch eine Stabsstelle</span></div>
<div class="txt" style="position:absolute; left:40px; top:438px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">anzubieten, mit dem Ziel, in den</span></div>
<div class="txt" style="position:absolute; left:40px; top:448px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">Daimler-Vorstand zu gehen?“</span></div>
<div class="txt" style="position:absolute; left:98px; top:459px;"><span id="f2" style="font-size:8px;vertical-align:baseline;color:#000000;">Martine Dornier-Tiefenthaler</span></div>
<div class="txt" style="position:absolute; left:102px; top:467px;"><span id="f2" style="font-size:8px;vertical-align:baseline;color:#000000;">in der Stuttgarter Zeitung</span></div>
<div class="txt" style="position:absolute; left:40px; top:491px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">Reuter (hat sich) nie zugehörig ge-</span></div>
<div class="txt" style="position:absolute; left:40px; top:501px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">fühlt zu dieser Kaste reaktionärer In-</span></div>
<div class="txt" style="position:absolute; left:40px; top:512px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">dustrieller ... Reuter rechtfertigt sei-</span></div>
<div class="txt" style="position:absolute; left:205px; top:103px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">ne Unternehmenspolitik, und es ge-</span></div>
<div class="txt" style="position:absolute; left:205px; top:113px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">lingt ihm auch zum großen Teil.</span></div>
<div class="txt" style="position:absolute; left:240px; top:123px;"><span id="f2" style="font-size:8px;vertical-align:baseline;color:#000000;">Ex-Daimler-Sprecher Winfried Münster</span></div>
<div class="txt" style="position:absolute; left:314px; top:132px;"><span id="f2" style="font-size:8px;vertical-align:baseline;color:#000000;">in Die Woche</span></div>
<div class="txt" style="position:absolute; left:205px; top:160px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">Wir lehnen ein solches Vorgehen von</span></div>
<div class="txt" style="position:absolute; left:205px; top:170px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">jemandem, der eine herausragende</span></div>
<div class="txt" style="position:absolute; left:205px; top:181px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">Stellung in unserem Unternehmen in-</span></div>
<div class="txt" style="position:absolute; left:205px; top:192px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">nehatte, entschieden ab.</span></div>
<div class="txt" style="position:absolute; left:242px; top:202px;"><span id="f2" style="font-size:8px;vertical-align:baseline;color:#000000;">Schreiben des Daimler-Benz-Vorstands</span></div>
<div class="txt" style="position:absolute; left:288px; top:210px;"><span id="f2" style="font-size:8px;vertical-align:baseline;color:#000000;">an seine Führungskräfte</span></div>
<div class="txt" style="position:absolute; left:205px; top:238px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">So offen er Schrempp und den AEG-</span></div>
<div class="txt" style="position:absolute; left:205px; top:248px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">Managern Fehler anlastet, so vage</span></div>
<div class="txt" style="position:absolute; left:205px; top:259px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">bleibt seine Aussage über eigene Irrtü-</span></div>
<div class="txt" style="position:absolute; left:205px; top:270px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">mer und Versagen.</span></div>
<div class="txt" style="position:absolute; left:257px; top:280px;"><span id="f2" style="font-size:8px;vertical-align:baseline;color:#000000;">Reuter-Biograph Hans Otto Eglau</span></div>
<div class="txt" style="position:absolute; left:322px; top:288px;"><span id="f2" style="font-size:8px;vertical-align:baseline;color:#000000;">in Die Zeit</span></div>
<div class="txt" style="position:absolute; left:205px; top:310px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">Woher das Magazin das Manuskript</span></div>
<div class="txt" style="position:absolute; left:205px; top:320px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">wohl hat?“</span></div>
<div class="txt" style="position:absolute; left:257px; top:331px;"><span id="f2" style="font-size:8px;vertical-align:baseline;color:#000000;">Hannoversche Allgemeine Zeitung</span></div>
<div class="txt" style="position:absolute; left:205px; top:358px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">Man mag das Resultat seiner un-</span></div>
<div class="txt" style="position:absolute; left:205px; top:368px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">ternehmerischen Bemühungen als</span></div>
<div class="txt" style="position:absolute; left:205px; top:379px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">ziemlich erfolglos betrachten: Hier</span></div>
<div class="txt" style="position:absolute; left:205px; top:390px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">ist einer, der sich seinen Kritikern</span></div>
<div class="txt" style="position:absolute; left:205px; top:400px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">stellt und auch im nachhinein beim</span></div>
<div class="txt" style="position:absolute; left:205px; top:411px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">Austeilen noch keineswegs erlahmt</span></div>
<div class="txt" style="position:absolute; left:205px; top:421px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">ist.</span></div>
<div class="txt" style="position:absolute; left:315px; top:432px;"><span id="f2" style="font-size:8px;vertical-align:baseline;color:#000000;">Börsen-Zeitung</span></div>
<div class="txt" style="position:absolute; left:378px; top:103px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">Daß Reuter nur hadert, anstatt sei-</span></div>
<div class="txt" style="position:absolute; left:378px; top:113px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">nen Verstand einzusetzen, daß er alte</span></div>
<div class="txt" style="position:absolute; left:378px; top:124px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">Rechnungen begleicht, anstatt sich</span></div>
<div class="txt" style="position:absolute; left:378px; top:134px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">um ein abgewogenes Urteil zu</span></div>
<div class="txt" style="position:absolute; left:378px; top:145px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">bemühen, das verübeln Reuter selbst</span></div>
<div class="txt" style="position:absolute; left:378px; top:156px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">wohlgesonnene Leute.</span></div>
<div class="txt" style="position:absolute; left:454px; top:166px;"><span id="f2" style="font-size:8px;vertical-align:baseline;color:#000000;">Stuttgarter Nachrichten</span></div>
<div class="txt" style="position:absolute; left:378px; top:190px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">Sein Buch wird sicherlich nicht</span></div>
<div class="txt" style="position:absolute; left:378px; top:200px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">dazu beitragen, Reputation und An-</span></div>
<div class="txt" style="position:absolute; left:378px; top:211px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">sehen zurückzuerlangen.</span></div>
<div class="txt" style="position:absolute; left:488px; top:221px;"><span id="f2" style="font-size:8px;vertical-align:baseline;color:#000000;">Handelsblatt</span></div>
<div class="txt" style="position:absolute; left:378px; top:239px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">Es irritiert, wenn Reuter so rechtet</span></div>
<div class="txt" style="position:absolute; left:378px; top:249px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">und sich dabei selbst zum auf kläreri-</span></div>
<div class="txt" style="position:absolute; left:378px; top:260px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">schen Weltgeist hoch zu Pferd stili-</span></div>
<div class="txt" style="position:absolute; left:378px; top:270px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">siert. Haben am Ende doch diejeni-</span></div>
<div class="txt" style="position:absolute; left:378px; top:281px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">gen recht gehabt, die zwar seine bril-</span></div>
<div class="txt" style="position:absolute; left:378px; top:292px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">lante Intelligenz erkannten, ihn aber</span></div>
<div class="txt" style="position:absolute; left:378px; top:302px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">nicht an der Spitze des Konzerns ha-</span></div>
<div class="txt" style="position:absolute; left:378px; top:313px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">ben wollten?“</span></div>
<div class="txt" style="position:absolute; left:430px; top:323px;"><span id="f2" style="font-size:8px;vertical-align:baseline;color:#000000;">Frankfurter Allgemeine Zeitung</span></div>
<div class="txt" style="position:absolute; left:378px; top:347px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">Wie zu seiner Zeit als Vorsitzender</span></div>
<div class="txt" style="position:absolute; left:378px; top:357px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">weist Reuter heute in seinem Buch</span></div>
<div class="txt" style="position:absolute; left:378px; top:368px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">das Argument zurück, daß seine Ex-</span></div>
<div class="txt" style="position:absolute; left:378px; top:379px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">pansionsstrategie und seine Vision</span></div>
<div class="txt" style="position:absolute; left:378px; top:389px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">vom integrierten Technologiekon-</span></div>
<div class="txt" style="position:absolute; left:378px; top:400px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">zern das Unternehmen ins nanzi-</span></div>
<div class="txt" style="position:absolute; left:378px; top:410px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">elle Desaster führten.</span></div>
<div class="txt" style="position:absolute; left:437px; top:421px;"><span id="f2" style="font-size:8px;vertical-align:baseline;color:#000000;">International Herald Tribune</span></div>
<div class="txt" style="position:absolute; left:378px; top:440px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">Selbst die Hölle kennt keinen ge-</span></div>
<div class="txt" style="position:absolute; left:378px; top:451px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">waltigeren Zorn als den von abge-</span></div>
<div class="txt" style="position:absolute; left:378px; top:461px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">halfterten Executives. Doch Edzard</span></div>
<div class="txt" style="position:absolute; left:378px; top:472px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">Reuter ... wählte den falschen Mo-</span></div>
<div class="txt" style="position:absolute; left:378px; top:482px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">ment, um zurückzuschlagen. Seine</span></div>
<div class="txt" style="position:absolute; left:378px; top:493px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">Attacken dürften das Management</span></div>
<div class="txt" style="position:absolute; left:378px; top:504px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">ziemlich kaltlassen.</span></div>
<div class="txt" style="position:absolute; left:478px; top:514px;"><span id="f2" style="font-size:8px;vertical-align:baseline;color:#000000;">Financial Times</span></div>
<div class="txt" style="position:absolute; left:31px; top:551px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#231f1f;">die Führungsmannschaft um Ent-</span></div>
<div class="txt" style="position:absolute; left:31px; top:563px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#231f1f;">wicklungschef </span><span id="f3" style="font-size:10px;vertical-align:baseline;color:#231f1f;">Johann Tomforde</span></div>
<div class="txt" style="position:absolute; left:31px; top:575px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#231f1f;">und Finanzchef </span><span id="f3" style="font-size:10px;vertical-align:baseline;color:#231f1f;">Christoph Baubin</span></div>
<div class="txt" style="position:absolute; left:31px; top:587px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#231f1f;">feuern.</span></div>
<div class="txt" style="position:absolute; left:51px; top:599px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#231f1f;">Beim Führungskräfte-Forum leg-</span></div>
<div class="txt" style="position:absolute; left:31px; top:611px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#231f1f;">te der Vorsitzende nach: Es war offen-</span></div>
<div class="txt" style="position:absolute; left:31px; top:623px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#231f1f;">sichtlich, daß die Probleme bekannt</span></div>
<div class="txt" style="position:absolute; left:31px; top:635px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#231f1f;">waren, aber verschwiegen wurden.</span></div>
<div class="txt" style="position:absolute; left:31px; top:647px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#231f1f;">Wir haben daher umgehend personelle</span></div>
<div class="txt" style="position:absolute; left:31px; top:659px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#231f1f;">Konsequenzen gezogen.</span></div>
<div class="txt" style="position:absolute; left:51px; top:671px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#231f1f;">Das war eine Warnung, die</span></div>
<div class="txt" style="position:absolute; left:31px; top:683px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#231f1f;">Schrempp vor allem an die Fahrzeug-</span></div>
<div class="txt" style="position:absolute; left:31px; top:695px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#231f1f;">bauer in Untertürkheim richtete. Die</span></div>
<div class="txt" style="position:absolute; left:210px; top:671px;"><span id="f10" style="font-size:10px;vertical-align:baseline;color:#000000;">Buchautor Edzard Reuter:</span></div>
<div class="txt" style="position:absolute; left:210px; top:683px;"><span id="f10" style="font-size:10px;vertical-align:baseline;color:#000000;">Weltgeist hoch zu Pferd</span></div>
<div class="txt" style="position:absolute; left:378px; top:551px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">blieben, anders als die Kollegen des el-</span></div>
<div class="txt" style="position:absolute; left:378px; top:563px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">sässischen Ablegers Smart, bisher von</span></div>
<div class="txt" style="position:absolute; left:378px; top:575px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">Sanktionen verschont.</span></div>
<div class="txt" style="position:absolute; left:398px; top:587px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">Noch sind wir nicht überall da,</span></div>
<div class="txt" style="position:absolute; left:378px; top:599px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">wo wir hin wollen, trieb Schrempp</span></div>
<div class="txt" style="position:absolute; left:378px; top:611px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">seine OFKler an, wir müssen Großes</span></div>
<div class="txt" style="position:absolute; left:378px; top:623px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">vollbringen wollen und es einfach</span></div>
<div class="txt" style="position:absolute; left:378px; top:635px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">tun.</span></div>
<div class="txt" style="position:absolute; left:398px; top:647px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">Auch unter neuer Führung, so</span></div>
<div class="txt" style="position:absolute; left:378px; top:659px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">scheint es, klafft bei Daimler-Benz</span></div>
<div class="txt" style="position:absolute; left:378px; top:671px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">noch eine erhebliche Lücke zwischen</span></div>
<div class="txt" style="position:absolute; left:378px; top:683px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">Anspruch und Alltag. Zwischen</span></div>
<div class="txt" style="position:absolute; left:378px; top:695px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">Schein und Wirklichkeit.</span></div>
<div class="txt" style="position:absolute; left:526px; top:695px;"><span id="f2" style="font-size:10px;vertical-align:baseline;color:#231f1f;">fal</span></div>
</body>
</html>

Этот код показывается в браузере почти без ошибок (скринкаст в начале поста)



Посты чуть ниже также могут вас заинтересовать

Комментариев нет:

Отправить комментарий