Re: extracting text from docx files
- From: Christian Barthel <test@xxxxxxxxxxxxxxxxx>
- Date: Tue, 9 Aug 2011 21:16:11 +0200
On Tue, Aug 09, 2011 at 02:36:32PM +0100, Anton Shterenlikht wrote:
> I often receive information in *.docx format
> from my MS using colleagues. Sometimes I can
> ask for a pdf (or similar) instead, but not always.
You have a lot of nice options:
- Force them to use BSD/Linux ;)
- Explain to them why docx is shit!
- Don't read it
> Usually I unzip a docx and then search
> through all *xml files to find the
> useful data. However, I can't find any
> xml styles to use, so I have to convert
> the relevant xml file(s) to plain text
> by hand. I wonder if anybody can suggest
> a better way. Perhaps there's something
> in ports that can help.
But if you really, really need to read docx, you can try the web
application from Microsoft. A few months ago, I also got a lot of docx
files and opened them with the Microsoft web app; this worked for me to
extract the information...
More information:
http://office.microsoft.com/en-us/web-apps/
The downside: you have to sign up for a Microsoft service :(
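
For the unzip-and-read-the-XML approach described in the quoted message,
the manual step can probably be scripted. What follows is only a minimal
sketch, not a tested tool: it assumes the main text lives in
word/document.xml inside the archive (the usual layout for files written
by Word), and the function name docx_to_text is made up for illustration.
It ignores headers, footers, footnotes, and all formatting, and text
boxes nested inside a paragraph may show up twice.

```python
import zipfile
import xml.etree.ElementTree as ET

# WordprocessingML namespace used by the elements in word/document.xml
W_NS = "{http://schemas.openxmlformats.org/wordprocessingml/2006/main}"


def docx_to_text(source):
    """Return the body text of a .docx file, one line per paragraph.

    `source` is a file path or a binary file-like object.
    Assumption: the main document part is stored as word/document.xml.
    """
    # A .docx file is an ordinary zip archive.
    with zipfile.ZipFile(source) as archive:
        xml_bytes = archive.read("word/document.xml")

    root = ET.fromstring(xml_bytes)
    lines = []
    # Each <w:p> is a paragraph; the visible text sits in <w:t> elements.
    for para in root.iter(W_NS + "p"):
        parts = []
        for node in para.iter():
            if node.tag == W_NS + "t":
                parts.append(node.text or "")
            elif node.tag == W_NS + "tab":
                parts.append("\t")
            elif node.tag in (W_NS + "br", W_NS + "cr"):
                parts.append("\n")
        lines.append("".join(parts))
    return "\n".join(lines)
```

Calling print(docx_to_text("report.docx")), where report.docx is a
placeholder file name, should dump the paragraphs to the terminal, which
can then be piped through the usual text tools.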
cheers
--
Christian Barthel
Public-Key: http://bc.user-mode.org/bc.asc
Mail: bc@xxxxxxxxxxxxxxxxx
Web: http://bc.user-mode.org
_______________________________________________
freebsd-questions@xxxxxxxxxxx mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-questions
To unsubscribe, send any mail to "freebsd-questions-unsubscribe@xxxxxxxxxxx"