MEPIS Community Forum

A Linux operating system based on Debian Stable



Reply to topic  [ 11 posts ]  Go to page 1, 2  Next
pandoc - universal document converter 
Forum Guide

Joined: Mon Jun 30, 2008 8:06 pm
Posts: 2952
Has thanked: 52 times
Have thanks: 376 times
Post # 300638
Post pandoc - universal document converter
From time to time, there have been questions posed about the best methods to batch-convert documents.

I happened across a solution called pandoc. I haven't tried it yet, but it looks promising.

http://johnmacfarlane.net/pandoc/


Fri Jun 01, 2012 11:40 am

GoManutd thanked by: Jerry3904
Forum Veteran

Joined: Wed Jul 19, 2006 6:13 am
Posts: 10069
Location: Geneva NY
Has thanked: 2876 times
Have thanks: 1863 times
Post # 300639
Post Re: pandoc - universal document converter
Certainly looks interesting...thanks.

_________________
Production: 32 bit, 3.14-0.bpo.1-686-pae. MX-14.2, AMD FX-4130 Quad-Core, GeForce GT 630/PCIe/SSE2, 8 GB DDR3, / and /home on Kingston SSD 120 GB, data on WesternDigital 1TB, Dell SP2309W monitor.


Fri Jun 01, 2012 11:43 am
Forum Regular

Joined: Fri Jul 28, 2006 8:19 pm
Posts: 479
Has thanked: 29 times
Have thanks: 35 times
Post # 300654
Post Re: pandoc - universal document converter
Interesting, but I don't see anything about PDF files in the mix. I could use a simple PDF-to-text converter. If you know of anything that doesn't require a degree to use, I would be happy to try it. I played with poppler, I think it's called, but it complained about the files being linear or something similar. I'm not up on things like that, but I would like to break out a paragraph now and then instead of putzing for ever.


Fri Jun 01, 2012 4:07 pm
Forum Regular

Joined: Wed Jul 29, 2009 6:05 pm
Posts: 351
Location: New Zealand
Has thanked: 25 times
Have thanks: 76 times
Post # 300656
Post Re: pandoc - universal document converter
I have used this site for file conversion, especially when MS changed their formats.

http://www.zamzar.com/

_________________
Mepis 11 64bit Linux 3.2.0-0.bpo.4-amd64, KDE 4.5.1
Intel i5 2400, Asus P8H67-M-EVO, G-Skill Ripjaws 2x4GB DDR3-1333, nVidia GeForce GT430, Seagate 500GB sata3 HDD


Fri Jun 01, 2012 4:21 pm
Forum Veteran

Joined: Wed Jul 12, 2006 5:54 am
Posts: 10896
Location: Tulsa, Oklahoma U.S.A.
Has thanked: 3650 times
Have thanks: 885 times
Post # 300658
Post Re: pandoc - universal document converter
fatjak wrote:
would like to break out a paragraph now and then instead of putzing for ever.


Wouldn't highlighting the text and Copy/Paste into an empty KWrite document do for a paragraph or two?

_________________
Yes, even I am dishonest. Not in many ways, but in some. Forty-one, I think it is.
--Mark Twain


Fri Jun 01, 2012 5:36 pm
Forum Guide

Joined: Mon Jun 30, 2008 8:06 pm
Posts: 2952
Has thanked: 52 times
Have thanks: 376 times
Post # 300659
Post Re: pandoc - universal document converter
It does PDF, but only as an output format (it can't read PDFs), and you need to have LaTeX installed, too.

This tool isn't meant for a few docs, but rather for batch conversions of many docs.
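
For what it's worth, a batch run could look roughly like the sketch below. It is untested against a real document set and assumes pandoc is on the PATH. The function names, the directory names and the default md-to-html choice are all made up for illustration; the only pandoc feature relied on is the plain `pandoc INPUT -o OUTPUT` form, where the formats are guessed from the file extensions.

```python
import subprocess
from pathlib import Path


def pandoc_command(src, out_dir, out_ext):
    """Build the argument list for converting one file.

    Hypothetical helper: pandoc infers the input and output formats
    from the two file extensions.
    """
    target = Path(out_dir) / (Path(src).stem + "." + out_ext)
    return ["pandoc", str(src), "-o", str(target)]


def batch_convert(in_dir, out_dir, in_ext="md", out_ext="html"):
    """Convert every *.in_ext file in in_dir; return the files processed."""
    out = Path(out_dir)
    out.mkdir(parents=True, exist_ok=True)
    done = []
    for src in sorted(Path(in_dir).glob("*." + in_ext)):
        # check=True raises CalledProcessError if pandoc exits with an error.
        subprocess.run(pandoc_command(src, out, out_ext), check=True)
        done.append(src)
    return done
```

Calling something like `batch_convert("docs", "converted")` would then write one .html file per .md file. Asking for "pdf" as the output extension should work the same way, provided LaTeX is installed.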


Fri Jun 01, 2012 5:42 pm
Forum Regular

Joined: Fri Jul 28, 2006 8:19 pm
Posts: 479
Has thanked: 29 times
Have thanks: 35 times
Post # 300687
Post Re: pandoc - universal document converter
lucky9 wrote:
fatjak wrote:
would like to break out a paragraph now and then instead of putzing for ever.


Wouldn't highlighting the text and Copy/Paste into an empty KWrite document do for a paragraph or two?


You would think so, but I haven't managed to find the key to doing so. All I get is a hand sliding up & down the page; I can't figure out how to highlight the text, and there are no right-click options...?? Something unusual about these PDF files, I guess. Document handling is not my long suit; wrench in hand & I'm at home.


Sat Jun 02, 2012 12:26 pm
Forum Veteran

Joined: Wed Jul 12, 2006 5:54 am
Posts: 10896
Location: Tulsa, Oklahoma U.S.A.
Has thanked: 3650 times
Have thanks: 885 times
Post # 300693
Post Re: pandoc - universal document converter
Might be a function of the PDF reader. I remember using Word for the first time and losing it when I couldn't use the _ to underline a few words. Computer nerds are just arcane.

_________________
Yes, even I am dishonest. Not in many ways, but in some. Forty-one, I think it is.
--Mark Twain


Sat Jun 02, 2012 12:54 pm
Forum Guide

Joined: Sat Jun 06, 2009 6:06 am
Posts: 1471
Location: Bristol, UK
Has thanked: 423 times
Have thanks: 309 times
Post # 300695
Post Re: pandoc - universal document converter
fatjak wrote:
lucky9 wrote:
fatjak wrote:
would like to break out a paragraph now and then instead of putzing for ever.


Wouldn't highlighting the text and Copy/Paste into an empty KWrite document do for a paragraph or two?


You would think so, but I haven't managed to find the key to doing so. All I get is a hand sliding up & down the page; I can't figure out how to highlight the text, and there are no right-click options...?? Something unusual about these PDF files, I guess. Document handling is not my long suit; wrench in hand & I'm at home.

There are 2 sorts of pdf files - those created using font details, and those created from an image. You can extract text from the former, but not the latter. Image pdfs tend to be much larger than font-based pdfs. It's also possible to create a hybrid pdf, where some elements are font-based and others are images. This is an example:
Attachment:
2012cert12-133.pdf

Open it in Okular, click the last icon on the toolbar (a sort of box with a pencil across it) which is the selection tool. You will find that you can extract text from the bold black areas (which are text-based and specific to the event) but can only extract an image from the other text areas (these are created as an image template).

Other pdf readers work similarly. If the text looks a bit fuzzy, you can be fairly certain it's an image-based pdf.
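
If you would rather check from a script than by eye, here is a rough sketch. It assumes poppler's `pdftotext` is installed, and uses its `-f`/`-l` page options and `-` for output to stdout. The function names and the 20-character threshold are arbitrary choices for illustration, not anything standard: a page that gives back almost no text is probably image-based.

```python
import subprocess


def pdftotext_command(pdf_path, first_page=1, last_page=1):
    """Build the pdftotext call for a page range (hypothetical helper)."""
    # A "-" in place of the output file sends the extracted text to stdout.
    return ["pdftotext", "-f", str(first_page), "-l", str(last_page),
            pdf_path, "-"]


def has_extractable_text(extracted, min_chars=20):
    """Heuristic: count non-whitespace characters against a threshold.

    min_chars is an assumed cut-off, chosen only for this sketch.
    """
    return sum(1 for ch in extracted if not ch.isspace()) >= min_chars


def page_text(pdf_path, first_page=1, last_page=1):
    """Run pdftotext and return whatever text it could pull out."""
    result = subprocess.run(
        pdftotext_command(pdf_path, first_page, last_page),
        capture_output=True, text=True, check=True,
    )
    return result.stdout
```

So `has_extractable_text(page_text("some.pdf"))` coming back False would suggest an image-based page. For a hybrid file like the one above, it only tells you that some text is there, not which areas are images.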

Phil



_________________

AsRock FM2A88X-ITX+, A8-6500, 8GB, 120GB Samsung SSD, 1TB HDD, MX-14.
Asus M4A88TD-V EVO/USB3, Phenom II X2 550 BE (now X3), 8GB.
120GB Corsair SSD, 500+250GB HDD, MX-14, Mepis 11.9.92-64.
Acer Aspire One 150, Atom N270, 120GB HDD, MX-14 / WinXP


Sat Jun 02, 2012 12:58 pm

Gaer Boy thanked by: lucky9
Forum Regular

Joined: Fri Jul 28, 2006 8:19 pm
Posts: 479
Has thanked: 29 times
Have thanks: 35 times
Post # 300702
Post Re: pandoc - universal document converter
Well, I found the key right under my nose, as usual. In Okular, under Tools, the selection tool lets you draw a box round the item; then the right-mouse context menu works to copy to clipboard, etc. Couldn't see the forest for the trees. Hard to teach an old dog new tricks at times.

Thanks for input guys, made the wheels go round finally.


Sat Jun 02, 2012 1:44 pm

fatjak thanked by: lucky9