DOGE employee (image post, lemmy.world)
Posted by @[email protected] to Programmer [email protected] • 3 months ago • 95 comments • cross-posted to: [email protected]
lime! • 3 months ago (edited)
$ pandoc doc.pdf -o doc.txt
Edit: welp, pandoc can’t do that. pdftotext it is.
@[email protected] • 3 months ago (edited)
magick file.jpg file.html
ImageMagick be converting anything into anything. (Actually, in this case it makes an HTML file and a PNG file; the HTML file references the PNG, and the page displays it.)
lime! • 3 months ago
not really a good way to get the text out of a pdf though. then again, turns out neither is pandoc.
@[email protected] • 3 months ago
I thought pandoc didn’t support converting from PDF, only to PDF?!
lime! • 3 months ago
damn it, you’re right. should probably have checked that…
@[email protected] • 3 months ago
Don’t worry, I didn’t know either and had to check too :P
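For anyone landing here later, a minimal sketch of the pdftotext route the thread settles on. It assumes Poppler's pdftotext is installed; txt_name and pdf_to_text are hypothetical helper names made up for illustration, not anything from the thread.

```shell
#!/bin/sh
# Hypothetical helper: derive the output name, e.g. doc.pdf -> doc.txt.
txt_name() {
    printf '%s\n' "${1%.pdf}.txt"
}

# Hypothetical helper: extract text from a PDF with pdftotext.
# pandoc can write PDF but cannot read it, hence the different tool.
pdf_to_text() {
    if ! command -v pdftotext >/dev/null 2>&1; then
        echo "pdftotext not found (assumed to come from Poppler)" >&2
        return 127
    fi
    # -layout tries to keep the original physical layout of the page.
    pdftotext -layout "$1" "$(txt_name "$1")"
}
```

Usage would be something like pdf_to_text doc.pdf, which should leave doc.txt next to the input. Scanned PDFs with no text layer would need OCR instead.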