Как парсить pdf файлы? Здесь ссылки на десктопные конвертеры и библиотеки. Есть даже список List of PDF software, но главные надежды я возлагаю на Xpdf, только что скачал и буду пробовать под Windows. Здесь же и справочники pdftotext.txt pdftohtml.txt
Здесь и первые эксперименты с pdftotext и pdftohtml. И чудный html файл после pdftohtml.
Xpdf 3.04 was released 2014-may-28
Python module for converting PDF to text
List of PDF software
slate 0.4Extract text from PDF documents easily
pdfreflow is a command line utility that operates on the output of the poppler utility called pdftohtml. pdfreflow reflows the texts into paragraphs, while at the same time removing hyphenation and page numbers, headers and footers.
pdftohtmlPdftohtml is a tool based on the Xpdf package which translates
pdf documents into html format.
pdf2xml convertor based on Xpdf library (http://www.foolabs.com/xpdf/home.html). It converts information contained in a PDF file into XML. First, you need to install xpdf and libxml2 (see documentation). Hervé Déjean Xerox Research Centre Europe
How can I convert PDF to HTML? old post 2009 year
Reading data from PDF files into R
Poppler is a PDF rendering library based on the xpdf-3.0 code base
Converting PDF to HTML with Python [duplicate]
scrapy for table content in pdf file
Saving PDF files through callback function in a spider in scrapy
Portable Document Format (PDF) — межплатформенный формат электронных документов, разработанный фирмой Adobe Systems с использованием ряда возможностей языка PostScript.
PDFTK4ALLPDFtk is a simple tool for doing everyday things with PDF documents. It comes in three flavors: PDFtk Free, PDFtk Pro, and our original command-line tool PDFtk Server.
Распарсить PDF в TXT - здесь бедолаги решали ту же задачуЮ что и я, пока их читал установил на w8 утилиту pdftk4all - она слеивает и разделяет файлы и страницы. Они ее использовали для ремонта "плохих" pdf файлов.
Scraping large pdf tables which span accross multiple pages
PDF reference manual from Adobe
may be helpful to PDF users
Here are some other tools based on the Xpdf code
Pdf-parser
pdftk4all
Как конвертируется pdf файл (начнем с конца)¶
from IPython.display import Image
Image ("C:\\Users\\kiss\\Pictures\\pythonR\\pdftohtml.png")
Под windows документации маловато, но есть
C:\Program Files\Xpdf\bin64>pdftohtml help
pdftohtml version 3.04
Copyright 1996-2014 Glyph & Cog, LLC
Usage: pdftohtml [options] <PDF-file> <html-dir>
-f <int> : first page to convert
-l <int> : last page to convert
-r <int> : resolution, in DPI (default is 150)
-skipinvisible : do not draw invisible text
-opw <string> : owner password (for encrypted files)
-upw <string> : user password (for encrypted files)
-q : don't print any messages or errors
-cfg <string> : configuration file to use in place of .xpdfrc
-v : print copyright and version info
-h : print usage information
-help : print usage information
--help : print usage information
-? : print usage information
C:\Program Files\Xpdf\bin64>
%load "C:\\Program Files\\Xpdf\\doc\\pdftotext.txt"
pdftotext(1) pdftotext(1)
NAME
pdftotext - Portable Document Format (PDF) to text converter (version
3.04)
SYNOPSIS
pdftotext [options] [PDF-file [text-file]]
DESCRIPTION
Pdftotext converts Portable Document Format (PDF) files to plain text.
Pdftotext reads the PDF file, PDF-file, and writes a text file, text-
file. If text-file is not specified, pdftotext converts file.pdf to
file.txt. If text-file is '-', the text is sent to stdout.
CONFIGURATION FILE
Pdftotext reads a configuration file at startup. It first tries to
find the user's private config file, ~/.xpdfrc. If that doesn't exist,
it looks for a system-wide config file, typically /usr/local/etc/xpdfrc
(but this location can be changed when pdftotext is built). See the
xpdfrc(5) man page for details.
OPTIONS
Many of the following options can be set with configuration file com-
mands. These are listed in square brackets with the description of the
corresponding command line option.
-f number
Specifies the first page to convert.
-l number
Specifies the last page to convert.
-layout
Maintain (as best as possible) the original physical layout of
the text. The default is to 'undo' physical layout (columns,
hyphenation, etc.) and output the text in reading order. If the
-fixed option is given, character spacing within each line will
be determined by the specified character pitch.
-table Table mode is similar to physical layout mode, but optimized for
tabular data, with the goal of keeping rows and columns aligned
(at the expense of inserting extra whitespace). If the -fixed
option is given, character spacing within each line will be
determined by the specified character pitch.
-lineprinter
Line printer mode uses a strict fixed-character-pitch and
-height layout. That is, the page is broken into a grid, and
characters are placed into that grid. If the grid spacing is
too small for the actual characters, the result is extra white-
space. If the grid spacing is too large, the result is missing
whitespace. The grid spacing can be specified using the -fixed
and -linespacing options. If one or both are not given on the
command line, pdftotext will attempt to compute appropriate
value(s).
-raw Keep the text in content stream order. Depending on how the PDF
file was generated, this may or may not be useful.
-fixed number
Specify the character pitch (character width), in points, for
physical layout, table, or line printer mode. This is ignored
in all other modes.
-linespacing number
Specify the line spacing, in points, for line printer mode.
This is ignored in all other modes.
-clip Text which is hidden because of clipping is removed before doing
layout, and then added back in. This can be helpful for tables
where clipped (invisible) text would overlap the next column.
-enc encoding-name
Sets the encoding to use for text output. The encoding-name
must be defined with the unicodeMap command (see xpdfrc(5)).
The encoding name is case-sensitive. This defaults to "Latin1"
(which is a built-in encoding). [config file: textEncoding]
-eol unix | dos | mac
Sets the end-of-line convention to use for text output. [config
file: textEOL]
-nopgbrk
Don't insert page breaks (form feed characters) between pages.
[config file: textPageBreaks]
-opw password
Specify the owner password for the PDF file. Providing this
will bypass all security restrictions.
-upw password
Specify the user password for the PDF file.
-q Don't print any messages or errors. [config file: errQuiet]
-cfg config-file
Read config-file in place of ~/.xpdfrc or the system-wide config
file.
-v Print copyright and version information.
-h Print usage information. (-help and --help are equivalent.)
BUGS
Some PDF files contain fonts whose encodings have been mangled beyond
recognition. There is no way (short of OCR) to extract text from these
files.
EXIT CODES
The Xpdf tools use the following exit codes:
0 No error.
1 Error opening a PDF file.
2 Error opening an output file.
3 Error related to PDF permissions.
99 Other error.
AUTHOR
The pdftotext software and documentation are copyright 1996-2014 Glyph
& Cog, LLC.
SEE ALSO
xpdf(1), pdftops(1), pdftohtml(1), pdfinfo(1), pdffonts(1), pdfde-
tach(1), pdftoppm(1), pdftopng(1), pdfimages(1), xpdfrc(5)
http://www.foolabs.com/xpdf/
28 May 2014 pdftotext(1)
%load "C:\\Program Files\\Xpdf\\doc\\pdftohtml.txt"
pdftohtml(1) pdftohtml(1)
NAME
pdftohtml - Portable Document Format (PDF) to HTML converter (version
3.04)
SYNOPSIS
pdftohtml [options] PDF-file HTML-dir
DESCRIPTION
Pdftohtml converts Portable Document Format (PDF) files to HTML.
Pdftohtml reads the PDF file, PDF-file, and places an HTML file for
each page, along with auxiliary images in the directory, HTML-dir. The
HTML directory will be created; if it already exists, pdftohtml will
report an error.
CONFIGURATION FILE
Pdftohtml reads a configuration file at startup. It first tries to
find the user's private config file, ~/.xpdfrc. If that doesn't exist,
it looks for a system-wide config file, typically /usr/local/etc/xpdfrc
(but this location can be changed when pdftohtml is built). See the
xpdfrc(5) man page for details.
OPTIONS
Many of the following options can be set with configuration file com-
mands. These are listed in square brackets with the description of the
corresponding command line option.
-f number
Specifies the first page to convert.
-l number
Specifies the last page to convert.
-r Specifies the resolution, in DPI, for background images. The
default is 150 DPI.
-opw password
Specify the owner password for the PDF file. Providing this
will bypass all security restrictions.
-upw password
Specify the user password for the PDF file.
-q Don't print any messages or errors. [config file: errQuiet]
-cfg config-file
Read config-file in place of ~/.xpdfrc or the system-wide config
file.
-v Print copyright and version information.
-h Print usage information. (-help and --help are equivalent.)
BUGS
Some PDF files contain fonts whose encodings have been mangled beyond
recognition. There is no way (short of OCR) to extract text from these
files.
EXIT CODES
The Xpdf tools use the following exit codes:
0 No error.
1 Error opening a PDF file.
2 Error opening an output file.
3 Error related to PDF permissions.
99 Other error.
AUTHOR
The pdftohtml software and documentation are copyright 1996-2014 Glyph
& Cog, LLC.
SEE ALSO
xpdf(1), pdftops(1), pdftotext(1), pdfinfo(1), pdffonts(1), pdfde-
tach(1), pdftoppm(1), pdftopng(1), pdfimages(1), xpdfrc(5)
http://www.foolabs.com/xpdf/
28 May 2014 pdftohtml(1)
Об установке
Установка под Windows оказалось необычной. В файле install.txt было рекомендовано просто создать папку Xpdf в Program Files и скопировать все туда. Потом я создал целых два конфигурационных файла в bin6
xpdfrc
xpdfrc.txt
C:...>tree "C:\Program Files\Xpdf" /F
Структура папок
Серийный номер тома: 000000D7 6017:2A0B
C:\PROGRAM FILES\XPDF
│ ANNOUNCE
│ CHANGES
│ COPYING
│ COPYING3
│ INSTALL
│ README
│
├───bin32
│ pdfdetach.exe
│ pdffonts.exe
│ pdfimages.exe
│ pdfinfo.exe
│ pdftohtml.exe
│ pdftopng.exe
│ pdftoppm.exe
│ pdftops.exe
│ pdftotext.exe
│
├───bin64
│ demo1.pdf
│ pdfdetach.exe
│ pdffonts.exe
│ pdfimages.exe
│ pdfinfo.exe
│ pdftohtml.exe
│ pdftopng.exe
│ pdftoppm.exe
│ pdftops.exe
│ pdftotext.exe
│ xpdfrc
│ xpdfrc.txt
│
└───doc
pdfdetach.txt
pdffonts.txt
pdfimages.txt
pdfinfo.txt
pdftohtml.txt
pdftopng.txt
pdftoppm.txt
pdftops.txt
pdftotext.txt
sample-xpdfrc
xpdf.txt
xpdfrc.txt
Файл конфигурации был полность закомментирован, я не нашел и не нагуглил никаких инструкций, кроме форума вот этих бедолаг Распарсить PDF в TXT
и раскоментировал следующее
%load "C:\\Program Files\\Xpdf\\bin64\\xpdfrc"
#========================================================================
#
# Sample xpdfrc file
#
# The Xpdf tools look for a config file in two places:
# 1. ~/.xpdfrc
# 2. in a system-wide directory, typically /usr/local/etc/xpdfrc
#
# This sample config file demonstrates some of the more common
# configuration options. Everything here is commented out. You
# should edit things (especially the file/directory paths, since
# they'll likely be different on your system), and uncomment whichever
# options you want to use. For complete details on config file syntax
# and available options, please see the xpdfrc(5) man page.
#
# Also, the Xpdf language support packages each include a set of
# options to be added to the xpdfrc file.
#
# http://www.foolabs.com/xpdf/
#
#========================================================================
#----- display fonts
# These map the Base-14 fonts to the Type 1 fonts that ship with
# ghostscript. You'll almost certainly want to use something like
# this, but you'll need to adjust this to point to wherever
# ghostscript is installed on your system. (But if the fonts are
# installed in a "standard" location, xpdf will find them
# automatically.)
#fontFile Times-Roman /usr/local/share/ghostscript/fonts/n021003l.pfb
#fontFile Times-Italic /usr/local/share/ghostscript/fonts/n021023l.pfb
#fontFile Times-Bold /usr/local/share/ghostscript/fonts/n021004l.pfb
#fontFile Times-BoldItalic /usr/local/share/ghostscript/fonts/n021024l.pfb
#fontFile Helvetica /usr/local/share/ghostscript/fonts/n019003l.pfb
#fontFile Helvetica-Oblique /usr/local/share/ghostscript/fonts/n019023l.pfb
#fontFile Helvetica-Bold /usr/local/share/ghostscript/fonts/n019004l.pfb
#fontFile Helvetica-BoldOblique /usr/local/share/ghostscript/fonts/n019024l.pfb
#fontFile Courier /usr/local/share/ghostscript/fonts/n022003l.pfb
#fontFile Courier-Oblique /usr/local/share/ghostscript/fonts/n022023l.pfb
#fontFile Courier-Bold /usr/local/share/ghostscript/fonts/n022004l.pfb
#fontFile Courier-BoldOblique /usr/local/share/ghostscript/fonts/n022024l.pfb
#fontFile Symbol /usr/local/share/ghostscript/fonts/s050000l.pfb
#fontFile ZapfDingbats /usr/local/share/ghostscript/fonts/d050000l.pfb
# If you need to display PDF files that refer to non-embedded fonts,
# you should add one or more fontDir options to point to the
# directories containing the font files. Xpdf will only look at .pfa,
# .pfb, .ttf, and .ttc files in those directories (other files will
# simply be ignored).
fontDir C:\Windows\Fonts
#----- PostScript output control
# Set the default PostScript file or command.
#psFile "|lpr -Pmyprinter"
# Set the default PostScript paper size -- this can be letter, legal,
# A4, or A3. You can also specify a paper size as width and height
# (in points).
#psPaperSize letter
#----- text output control
# Choose a text encoding for copy-and-paste and for pdftotext output.
# The Latin1, ASCII7, and UTF-8 encodings are built into Xpdf. Other
# encodings are available in the language support packages.
textEncoding UTF-8
# Choose the end-of-line convention for multi-line copy-and-past and
# for pdftotext output. The available options are unix, mac, and dos.
#textEOL unix
#----- misc settings
# Enable FreeType, and anti-aliased text.
#enableFreeType yes
#antialias yes
# Set the command used to run a web browser when a URL hyperlink is
# clicked.
#launchCommand viewer-script
#urlCommand "netscape -remote 'openURL(%s)'"
Попробовал запустить pdftohtml, получил ошибки
C:\Program Files\Xpdf\bin64>pdftohtml demo1.pdf C:\Users\kiss\Documents\Xpdf
Config Error: No display font for 'Symbol'
Config Error: No display font for 'ZapfDingbats'
I/O Error: Couldn't create HTML output directory 'C:\Users\kiss\Documents\Xpdf'
#Попытался добавить строчку со шрифтом в symbol.txt
# Заодно и создать xpdfrc.txt из xpdfrc
fontFile Symbol C:\Windows\Fonts\symbol.ttf
%load "C:\\Program Files\\Xpdf\\bin64\\xpdfrc.txt"
#========================================================================
#
# Sample xpdfrc file
#
# The Xpdf tools look for a config file in two places:
# 1. ~/.xpdfrc
# 2. in a system-wide directory, typically /usr/local/etc/xpdfrc
#
# This sample config file demonstrates some of the more common
# configuration options. Everything here is commented out. You
# should edit things (especially the file/directory paths, since
# they'll likely be different on your system), and uncomment whichever
# options you want to use. For complete details on config file syntax
# and available options, please see the xpdfrc(5) man page.
#
# Also, the Xpdf language support packages each include a set of
# options to be added to the xpdfrc file.
#
# http://www.foolabs.com/xpdf/
#
#========================================================================
#----- display fonts
# These map the Base-14 fonts to the Type 1 fonts that ship with
# ghostscript. You'll almost certainly want to use something like
# this, but you'll need to adjust this to point to wherever
# ghostscript is installed on your system. (But if the fonts are
# installed in a "standard" location, xpdf will find them
# automatically.)
#fontFile Times-Roman /usr/local/share/ghostscript/fonts/n021003l.pfb
#fontFile Times-Italic /usr/local/share/ghostscript/fonts/n021023l.pfb
#fontFile Times-Bold /usr/local/share/ghostscript/fonts/n021004l.pfb
#fontFile Times-BoldItalic /usr/local/share/ghostscript/fonts/n021024l.pfb
#fontFile Helvetica /usr/local/share/ghostscript/fonts/n019003l.pfb
#fontFile Helvetica-Oblique /usr/local/share/ghostscript/fonts/n019023l.pfb
#fontFile Helvetica-Bold /usr/local/share/ghostscript/fonts/n019004l.pfb
#fontFile Helvetica-BoldOblique /usr/local/share/ghostscript/fonts/n019024l.pfb
#fontFile Courier /usr/local/share/ghostscript/fonts/n022003l.pfb
#fontFile Courier-Oblique /usr/local/share/ghostscript/fonts/n022023l.pfb
#fontFile Courier-Bold /usr/local/share/ghostscript/fonts/n022004l.pfb
#fontFile Courier-BoldOblique /usr/local/share/ghostscript/fonts/n022024l.pfb
fontFile Symbol C:\Windows\Fonts\symbol.ttf
#fontFile ZapfDingbats /usr/local/share/ghostscript/fonts/d050000l.pfb
# If you need to display PDF files that refer to non-embedded fonts,
# you should add one or more fontDir options to point to the
# directories containing the font files. Xpdf will only look at .pfa,
# .pfb, .ttf, and .ttc files in those directories (other files will
# simply be ignored).
fontDir C:\Windows\Fonts
#----- PostScript output control
# Set the default PostScript file or command.
#psFile "|lpr -Pmyprinter"
# Set the default PostScript paper size -- this can be letter, legal,
# A4, or A3. You can also specify a paper size as width and height
# (in points).
#psPaperSize letter
#----- text output control
# Choose a text encoding for copy-and-paste and for pdftotext output.
# The Latin1, ASCII7, and UTF-8 encodings are built into Xpdf. Other
# encodings are available in the language support packages.
textEncoding UTF-8
# Choose the end-of-line convention for multi-line copy-and-past and
# for pdftotext output. The available options are unix, mac, and dos.
#textEOL unix
#----- misc settings
# Enable FreeType, and anti-aliased text.
#enableFreeType yes
#antialias yes
# Set the command used to run a web browser when a URL hyperlink is
# clicked.
#launchCommand viewer-script
#urlCommand "netscape -remote 'openURL(%s)'"
Не помогло. Решил попробовать вариант pdftotext
C:\Program Files\Xpdf\bin64>pdftotext demo1.pdf C:\Users\kiss\Documents\Xpdf
I/O Error: Couldn't open text file 'C:\Users\kiss\Documents\Xpdf'
C:\Program Files\Xpdf\bin64>pdftotext demo1.pdf C:\Users\kiss\Documents\Xpdf\text1.txt
И нашел в заданном файле text1.txt вполне приличный текстовый файл. Но захотелось большего. Может быть другие шрифты найдутся?
C:\Program Files\Xpdf\bin64>pdftohtml demo1.pdf C:\Users\kiss\Documents\Xpdf\text1.html
Config Error: No display font for 'Symbol'
Config Error: No display font for 'ZapfDingbats'
Syntax Warning: Substituting font 'Helvetica' for 'HelveticaNeue-Roman'
Да, действительно, произошла замене шрифта, и была создана "странная" папка text1.html
C:\Users\kiss\SkyDrive\Docs\mailru\cars_mail_1\carmailPrice>tree C:\Users\kiss\Documents\Xpdf /F
Структура папок
Серийный номер тома: 00000023 6017:2A0B
C:\USERS\KISS\DOCUMENTS\XPDF
│ text1.txt
│
└───text1.html
index.html
page1.html
page1.png
page2.html
page2.png
page3.html
page3.png
А в этой странной папке еще более странный html файл.¶
Казалось бы, такие стили с абсолютным позициированием - это безвредный изыск. Но нет, при последующем экспериментировании с другим файлом, содержащим таблицы, оказалось, что информация считана по столбцам, а позициируется по строкам.
%load C:/Users/kiss/Documents/Xpdf/text1.html/page3.html
<html>
<head>
<meta http-equiv="Content-Type" content="text/html; charset=UTF-8">
<style type="text/css">
.txt { white-space:nowrap; }
#f0 { font-family:sans-serif; font-weight:normal; font-style:normal; }
#f1 { font-family:serif; font-weight:normal; font-style:normal; }
#f2 { font-family:serif; font-weight:normal; font-style:italic; }
#f3 { font-family:serif; font-weight:bold; font-style:normal; }
#f4 { font-family:sans-serif; font-weight:normal; font-style:normal; }
#f5 { font-family:sans-serif; font-weight:bold; font-style:normal; }
#f6 { font-family:sans-serif; font-weight:normal; font-style:italic; }
#f7 { font-family:sans-serif; font-weight:normal; font-style:normal; }
#f8 { font-family:sans-serif; font-weight:normal; font-style:normal; }
#f9 { font-family:sans-serif; font-weight:normal; font-style:normal; }
#f10 { font-family:serif; font-weight:bold; font-style:italic; }
#f11 { font-family:sans-serif; font-weight:bold; font-style:normal; }
#f12 { font-family:sans-serif; font-weight:normal; font-style:normal; }
#f13 { font-family:sans-serif; font-weight:normal; font-style:normal; }
</style>
</head>
<body onload="start()">
<img id="background" style="position:absolute; left:0px; top:0px;" width="566" height="726" src="page3.png">
<div class="txt" style="position:absolute; left:47px; top:46px;"><span id="f5" style="font-size:28px;vertical-align:baseline;color:#094270;">„Was zum Teufel hat ihn getrieben“</span></div>
<div class="txt" style="position:absolute; left:138px; top:82px;"><span id="f1" style="font-size:11px;vertical-align:baseline;color:#000000;">Reuter-Erinnerungen: Die negativen Kommentare überwiegen</span></div>
<div class="txt" style="position:absolute; left:40px; top:102px;"><span id="f6" style="font-size:25px;vertical-align:baseline;color:#094270;">S</span><span id="f2" style="font-size:10px;vertical-align:super;color:#000000;">chein und Wirklichkeit“ – die</span></div>
<div class="txt" style="position:absolute; left:57px; top:113px;"><span id="f2" style="font-size:10px;vertical-align:baseline;color:#000000;">Memoiren von Edzard Reuter,</span></div>
<div class="txt" style="position:absolute; left:40px; top:124px;"><span id="f2" style="font-size:10px;vertical-align:baseline;color:#000000;">dem ehemaligen Vorstandsvorsitzen-</span></div>
<div class="txt" style="position:absolute; left:40px; top:134px;"><span id="f2" style="font-size:10px;vertical-align:baseline;color:#000000;">den der Daimler-Benz AG, sorgten</span></div>
<div class="txt" style="position:absolute; left:40px; top:145px;"><span id="f2" style="font-size:10px;vertical-align:baseline;color:#000000;">für Aufsehen, noch bevor sie erschie-</span></div>
<div class="txt" style="position:absolute; left:40px; top:156px;"><span id="f2" style="font-size:10px;vertical-align:baseline;color:#000000;">nen waren. Was Reuters grimmiger</span></div>
<div class="txt" style="position:absolute; left:40px; top:166px;"><span id="f2" style="font-size:10px;vertical-align:baseline;color:#000000;">Rückblick „schwarz auf weiß bietet,</span></div>
<div class="txt" style="position:absolute; left:40px; top:177px;"><span id="f2" style="font-size:10px;vertical-align:baseline;color:#000000;">ist kein schöner Anblick. Aber lehr-</span></div>
<div class="txt" style="position:absolute; left:40px; top:187px;"><span id="f2" style="font-size:10px;vertical-align:baseline;color:#000000;">reich“. So urteilte die „Frankfurter</span></div>
<div class="txt" style="position:absolute; left:40px; top:198px;"><span id="f2" style="font-size:10px;vertical-align:baseline;color:#000000;">Rundschau“, nachdem manager</span></div>
<div class="txt" style="position:absolute; left:40px; top:209px;"><span id="f2" style="font-size:10px;vertical-align:baseline;color:#000000;">magazin in seiner Februar-Ausgabe</span></div>
<div class="txt" style="position:absolute; left:40px; top:219px;"><span id="f2" style="font-size:10px;vertical-align:baseline;color:#000000;">vorab über das Buch berichtet hatte.</span></div>
<div class="txt" style="position:absolute; left:40px; top:230px;"><span id="f2" style="font-size:10px;vertical-align:baseline;color:#000000;">Kollegen und Geschäftspartner,</span></div>
<div class="txt" style="position:absolute; left:40px; top:240px;"><span id="f2" style="font-size:10px;vertical-align:baseline;color:#000000;">deutsche und internationale Medien</span></div>
<div class="txt" style="position:absolute; left:40px; top:251px;"><span id="f2" style="font-size:10px;vertical-align:baseline;color:#000000;">kommentierten Reuters Erinnerun-</span></div>
<div class="txt" style="position:absolute; left:40px; top:262px;"><span id="f2" style="font-size:10px;vertical-align:baseline;color:#000000;">gen. Auszüge:</span></div>
<div class="txt" style="position:absolute; left:40px; top:283px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">„Dieser sich selbst überschätzende</span></div>
<div class="txt" style="position:absolute; left:40px; top:293px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">Einfaltspinsel.“</span></div>
<div class="txt" style="position:absolute; left:90px; top:304px;"><span id="f2" style="font-size:8px;vertical-align:baseline;color:#000000;">Ex-Daimler-Chef Joachim Zahn</span></div>
<div class="txt" style="position:absolute; left:106px; top:312px;"><span id="f2" style="font-size:8px;vertical-align:baseline;color:#000000;">im Süddeutschen Rundfunk</span></div>
<div class="txt" style="position:absolute; left:40px; top:334px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">„Einfalt meint, der versteht vom Ge-</span></div>
<div class="txt" style="position:absolute; left:40px; top:344px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">schäft nix, und Pinsel bedeutet, der</span></div>
<div class="txt" style="position:absolute; left:40px; top:355px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">schwätzt trotzdem scheinbar ge-</span></div>
<div class="txt" style="position:absolute; left:40px; top:366px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">scheit daher.“</span></div>
<div class="txt" style="position:absolute; left:92px; top:376px;"><span id="f2" style="font-size:8px;vertical-align:baseline;color:#000000;">Joachim Zahn in „Der Spiegel“</span></div>
<div class="txt" style="position:absolute; left:40px; top:395px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">„Was zum Teufel hat Reuter getrie-</span></div>
<div class="txt" style="position:absolute; left:40px; top:406px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">ben, mir angesichts meiner angeb-</span></div>
<div class="txt" style="position:absolute; left:40px; top:417px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">lich ekelhaften Charaktereigen-</span></div>
<div class="txt" style="position:absolute; left:40px; top:427px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">schaften auch noch eine Stabsstelle</span></div>
<div class="txt" style="position:absolute; left:40px; top:438px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">anzubieten, mit dem Ziel, in den</span></div>
<div class="txt" style="position:absolute; left:40px; top:448px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">Daimler-Vorstand zu gehen?“</span></div>
<div class="txt" style="position:absolute; left:98px; top:459px;"><span id="f2" style="font-size:8px;vertical-align:baseline;color:#000000;">Martine Dornier-Tiefenthaler</span></div>
<div class="txt" style="position:absolute; left:102px; top:467px;"><span id="f2" style="font-size:8px;vertical-align:baseline;color:#000000;">in der „Stuttgarter Zeitung“</span></div>
<div class="txt" style="position:absolute; left:40px; top:491px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">„Reuter (hat sich) nie zugehörig ge-</span></div>
<div class="txt" style="position:absolute; left:40px; top:501px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">fühlt zu dieser Kaste reaktionärer In-</span></div>
<div class="txt" style="position:absolute; left:40px; top:512px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">dustrieller ... Reuter rechtfertigt sei-</span></div>
<div class="txt" style="position:absolute; left:205px; top:103px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">ne Unternehmenspolitik, und es ge-</span></div>
<div class="txt" style="position:absolute; left:205px; top:113px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">lingt ihm auch zum großen Teil.“</span></div>
<div class="txt" style="position:absolute; left:240px; top:123px;"><span id="f2" style="font-size:8px;vertical-align:baseline;color:#000000;">Ex-Daimler-Sprecher Winfried Münster</span></div>
<div class="txt" style="position:absolute; left:314px; top:132px;"><span id="f2" style="font-size:8px;vertical-align:baseline;color:#000000;">in „Die Woche“</span></div>
<div class="txt" style="position:absolute; left:205px; top:160px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">„Wir lehnen ein solches Vorgehen von</span></div>
<div class="txt" style="position:absolute; left:205px; top:170px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">jemandem, der eine herausragende</span></div>
<div class="txt" style="position:absolute; left:205px; top:181px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">Stellung in unserem Unternehmen in-</span></div>
<div class="txt" style="position:absolute; left:205px; top:192px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">nehatte, entschieden ab.“</span></div>
<div class="txt" style="position:absolute; left:242px; top:202px;"><span id="f2" style="font-size:8px;vertical-align:baseline;color:#000000;">Schreiben des Daimler-Benz-Vorstands</span></div>
<div class="txt" style="position:absolute; left:288px; top:210px;"><span id="f2" style="font-size:8px;vertical-align:baseline;color:#000000;">an seine Führungskräfte</span></div>
<div class="txt" style="position:absolute; left:205px; top:238px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">„So offen er Schrempp und den AEG-</span></div>
<div class="txt" style="position:absolute; left:205px; top:248px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">Managern Fehler anlastet, so vage</span></div>
<div class="txt" style="position:absolute; left:205px; top:259px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">bleibt seine Aussage über eigene Irrtü-</span></div>
<div class="txt" style="position:absolute; left:205px; top:270px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">mer und Versagen.“</span></div>
<div class="txt" style="position:absolute; left:257px; top:280px;"><span id="f2" style="font-size:8px;vertical-align:baseline;color:#000000;">Reuter-Biograph Hans Otto Eglau</span></div>
<div class="txt" style="position:absolute; left:322px; top:288px;"><span id="f2" style="font-size:8px;vertical-align:baseline;color:#000000;">in „Die Zeit“</span></div>
<div class="txt" style="position:absolute; left:205px; top:310px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">„Woher das Magazin das Manuskript</span></div>
<div class="txt" style="position:absolute; left:205px; top:320px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">wohl hat?“</span></div>
<div class="txt" style="position:absolute; left:257px; top:331px;"><span id="f2" style="font-size:8px;vertical-align:baseline;color:#000000;">Hannoversche Allgemeine Zeitung</span></div>
<div class="txt" style="position:absolute; left:205px; top:358px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">„Man mag das Resultat seiner un-</span></div>
<div class="txt" style="position:absolute; left:205px; top:368px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">ternehmerischen Bemühungen als</span></div>
<div class="txt" style="position:absolute; left:205px; top:379px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">ziemlich erfolglos betrachten: Hier</span></div>
<div class="txt" style="position:absolute; left:205px; top:390px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">ist einer, der sich seinen Kritikern</span></div>
<div class="txt" style="position:absolute; left:205px; top:400px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">stellt und auch im nachhinein beim</span></div>
<div class="txt" style="position:absolute; left:205px; top:411px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">Austeilen noch keineswegs erlahmt</span></div>
<div class="txt" style="position:absolute; left:205px; top:421px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">ist.“</span></div>
<div class="txt" style="position:absolute; left:315px; top:432px;"><span id="f2" style="font-size:8px;vertical-align:baseline;color:#000000;">Börsen-Zeitung</span></div>
<div class="txt" style="position:absolute; left:378px; top:103px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">„Daß Reuter nur hadert, anstatt sei-</span></div>
<div class="txt" style="position:absolute; left:378px; top:113px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">nen Verstand einzusetzen, daß er alte</span></div>
<div class="txt" style="position:absolute; left:378px; top:124px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">Rechnungen begleicht, anstatt sich</span></div>
<div class="txt" style="position:absolute; left:378px; top:134px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">um ein abgewogenes Urteil zu</span></div>
<div class="txt" style="position:absolute; left:378px; top:145px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">bemühen, das verübeln Reuter selbst</span></div>
<div class="txt" style="position:absolute; left:378px; top:156px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">wohlgesonnene Leute.“</span></div>
<div class="txt" style="position:absolute; left:454px; top:166px;"><span id="f2" style="font-size:8px;vertical-align:baseline;color:#000000;">Stuttgarter Nachrichten</span></div>
<div class="txt" style="position:absolute; left:378px; top:190px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">„Sein Buch wird sicherlich nicht</span></div>
<div class="txt" style="position:absolute; left:378px; top:200px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">dazu beitragen, Reputation und An-</span></div>
<div class="txt" style="position:absolute; left:378px; top:211px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">sehen zurückzuerlangen.“</span></div>
<div class="txt" style="position:absolute; left:488px; top:221px;"><span id="f2" style="font-size:8px;vertical-align:baseline;color:#000000;">Handelsblatt</span></div>
<div class="txt" style="position:absolute; left:378px; top:239px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">„Es irritiert, wenn Reuter so rechtet</span></div>
<div class="txt" style="position:absolute; left:378px; top:249px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">und sich dabei selbst zum auf kläreri-</span></div>
<div class="txt" style="position:absolute; left:378px; top:260px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">schen Weltgeist hoch zu Pferd stili-</span></div>
<div class="txt" style="position:absolute; left:378px; top:270px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">siert. Haben am Ende doch diejeni-</span></div>
<div class="txt" style="position:absolute; left:378px; top:281px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">gen recht gehabt, die zwar seine bril-</span></div>
<div class="txt" style="position:absolute; left:378px; top:292px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">lante Intelligenz erkannten, ihn aber</span></div>
<div class="txt" style="position:absolute; left:378px; top:302px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">nicht an der Spitze des Konzerns ha-</span></div>
<div class="txt" style="position:absolute; left:378px; top:313px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">ben wollten?“</span></div>
<div class="txt" style="position:absolute; left:430px; top:323px;"><span id="f2" style="font-size:8px;vertical-align:baseline;color:#000000;">Frankfurter Allgemeine Zeitung</span></div>
<div class="txt" style="position:absolute; left:378px; top:347px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">„Wie zu seiner Zeit als Vorsitzender</span></div>
<div class="txt" style="position:absolute; left:378px; top:357px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">weist Reuter heute in seinem Buch</span></div>
<div class="txt" style="position:absolute; left:378px; top:368px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">das Argument zurück, daß seine Ex-</span></div>
<div class="txt" style="position:absolute; left:378px; top:379px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">pansionsstrategie und seine Vision</span></div>
<div class="txt" style="position:absolute; left:378px; top:389px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">vom integrierten Technologiekon-</span></div>
<div class="txt" style="position:absolute; left:378px; top:400px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">zern das Unternehmen ins finanzi-</span></div>
<div class="txt" style="position:absolute; left:378px; top:410px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">elle Desaster führten.“</span></div>
<div class="txt" style="position:absolute; left:437px; top:421px;"><span id="f2" style="font-size:8px;vertical-align:baseline;color:#000000;">International Herald Tribune</span></div>
<div class="txt" style="position:absolute; left:378px; top:440px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">„Selbst die Hölle kennt keinen ge-</span></div>
<div class="txt" style="position:absolute; left:378px; top:451px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">waltigeren Zorn als den von abge-</span></div>
<div class="txt" style="position:absolute; left:378px; top:461px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">halfterten Executives. Doch Edzard</span></div>
<div class="txt" style="position:absolute; left:378px; top:472px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">Reuter ... wählte den falschen Mo-</span></div>
<div class="txt" style="position:absolute; left:378px; top:482px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">ment, um zurückzuschlagen. Seine</span></div>
<div class="txt" style="position:absolute; left:378px; top:493px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">Attacken dürften das Management</span></div>
<div class="txt" style="position:absolute; left:378px; top:504px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">ziemlich kaltlassen.“</span></div>
<div class="txt" style="position:absolute; left:478px; top:514px;"><span id="f2" style="font-size:8px;vertical-align:baseline;color:#000000;">Financial Times</span></div>
<div class="txt" style="position:absolute; left:31px; top:551px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#231f1f;">die Führungsmannschaft um Ent-</span></div>
<div class="txt" style="position:absolute; left:31px; top:563px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#231f1f;">wicklungschef </span><span id="f3" style="font-size:10px;vertical-align:baseline;color:#231f1f;">Johann Tomforde</span></div>
<div class="txt" style="position:absolute; left:31px; top:575px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#231f1f;">und Finanzchef </span><span id="f3" style="font-size:10px;vertical-align:baseline;color:#231f1f;">Christoph Baubin</span></div>
<div class="txt" style="position:absolute; left:31px; top:587px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#231f1f;">feuern.</span></div>
<div class="txt" style="position:absolute; left:51px; top:599px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#231f1f;">Beim Führungskräfte-Forum leg-</span></div>
<div class="txt" style="position:absolute; left:31px; top:611px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#231f1f;">te der Vorsitzende nach: „Es war offen-</span></div>
<div class="txt" style="position:absolute; left:31px; top:623px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#231f1f;">sichtlich, daß die Probleme bekannt</span></div>
<div class="txt" style="position:absolute; left:31px; top:635px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#231f1f;">waren, aber verschwiegen wurden.</span></div>
<div class="txt" style="position:absolute; left:31px; top:647px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#231f1f;">Wir haben daher umgehend personelle</span></div>
<div class="txt" style="position:absolute; left:31px; top:659px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#231f1f;">Konsequenzen gezogen.“</span></div>
<div class="txt" style="position:absolute; left:51px; top:671px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#231f1f;">Das war eine Warnung, die</span></div>
<div class="txt" style="position:absolute; left:31px; top:683px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#231f1f;">Schrempp vor allem an die Fahrzeug-</span></div>
<div class="txt" style="position:absolute; left:31px; top:695px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#231f1f;">bauer in Untertürkheim richtete. Die</span></div>
<div class="txt" style="position:absolute; left:210px; top:671px;"><span id="f10" style="font-size:10px;vertical-align:baseline;color:#000000;">Buchautor Edzard Reuter:</span></div>
<div class="txt" style="position:absolute; left:210px; top:683px;"><span id="f10" style="font-size:10px;vertical-align:baseline;color:#000000;">Weltgeist hoch zu Pferd</span></div>
<div class="txt" style="position:absolute; left:378px; top:551px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">blieben, anders als die Kollegen des el-</span></div>
<div class="txt" style="position:absolute; left:378px; top:563px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">sässischen Ablegers Smart, bisher von</span></div>
<div class="txt" style="position:absolute; left:378px; top:575px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">Sanktionen verschont.</span></div>
<div class="txt" style="position:absolute; left:398px; top:587px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">„Noch sind wir nicht überall da,</span></div>
<div class="txt" style="position:absolute; left:378px; top:599px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">wo wir hin wollen“, trieb Schrempp</span></div>
<div class="txt" style="position:absolute; left:378px; top:611px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">seine OFKler an, „wir müssen Großes</span></div>
<div class="txt" style="position:absolute; left:378px; top:623px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">vollbringen wollen und es einfach</span></div>
<div class="txt" style="position:absolute; left:378px; top:635px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">tun.“</span></div>
<div class="txt" style="position:absolute; left:398px; top:647px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">Auch unter neuer Führung, so</span></div>
<div class="txt" style="position:absolute; left:378px; top:659px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">scheint es, klafft bei Daimler-Benz</span></div>
<div class="txt" style="position:absolute; left:378px; top:671px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">noch eine erhebliche Lücke zwischen</span></div>
<div class="txt" style="position:absolute; left:378px; top:683px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">Anspruch und Alltag. Zwischen</span></div>
<div class="txt" style="position:absolute; left:378px; top:695px;"><span id="f1" style="font-size:10px;vertical-align:baseline;color:#000000;">„Schein und Wirklichkeit“.</span></div>
<div class="txt" style="position:absolute; left:526px; top:695px;"><span id="f2" style="font-size:10px;vertical-align:baseline;color:#231f1f;">fal</span></div>
</body>
</html>
Этот код показывается в браузере почти без ошибок (скринкаст в начале поста)
Посты чуть ниже также могут вас заинтересовать
Комментариев нет:
Отправить комментарий