Where can I find a PDF-to-HTML converter for PDFs with 2 columns?

Mariane
Posts: 75
Joined: 2009-11-19 21:29

#1 Post by Mariane

There is a package that converts PDF files to HTML files:
http://packages.debian.org/search?keywords=pdftohtml
So I ran apt-get install poppler-utils.

But my PDF file has two columns on each page, and the output comes out as a single column that alternates between a line of text from one column and a line of text from the other.
I guess a script already exists that puts the lines of text back in their proper order, since that does not sound very difficult to write. Or maybe there is a program that properly converts two-column PDFs to HTML?

Mariane
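
For what it's worth, the reordering script described in the post above could be sketched roughly as follows. This is only a sketch under a strong assumption: the text of one page has already been extracted to plain text, and its lines alternate strictly (line 1 from the left column, line 2 from the right, and so on), with no blank or missing lines. Real converter output is often messier than that. The function name `deinterleave` and the file names in the usage note are made up for illustration.

```shell
#!/bin/sh
# deinterleave: reorder text whose lines alternate between two columns.
# Assumption: odd-numbered lines belong to the left column and
# even-numbered lines to the right column, for a single page of text.
deinterleave() {
    awk '
        NR % 2 == 1 { left[++n_left] = $0; next }    # odd lines: left column
                    { right[++n_right] = $0 }        # even lines: right column
        END {
            # Print the whole left column first, then the whole right column.
            for (i = 1; i <= n_left; i++)  print left[i]
            for (i = 1; i <= n_right; i++) print right[i]
        }
    ' "$@"
}
```

It reads standard input or a file, e.g. `deinterleave page.txt > page_fixed.txt` (hypothetical file names). It would have to be run once per page, since the column order restarts on every page.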

mzilikazi
Forum Account
Posts: 3282
Joined: 2004-09-16 02:14
Location: Colorado Springs, CO

Re: Where can I find a PDF-to-HTML converter for PDFs with 2 columns?

#2 Post by mzilikazi

Mariane wrote: But my pdf file has 2 columns on each page
Have you tried this yet?

pdftohtml -c file.pdf file.html

If that fails, do you have a PDF with 2 columns that you could share?