docxconv

Utility script to convert MS Word doc(x) files to clean HTML/Markdown.

HTML output is cleaned up using DOM cleanup and HTML Tidy.

docxconv [-fq] -o <path> [--watch] <file>|<path> ...
 
Options:
  -f, --format   conversion format                   [default: "html"]
  -o, --output   output destination <path>           [required]
  -q, --workers  queue worker concurrency <int>      [default: 4]
  --watch        watch for new documents
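
As an illustration of the synopsis and options above, a typical invocation might look like the sketch below. The output directory and file names (`./converted`, `report.docx`, `./incoming`) are placeholders, not part of the tool, and the `mkdir -p` step is a precaution in case the output directory is not created automatically (that behaviour is an assumption, not something documented here).

```shell
# Placeholder paths: adjust to your own layout.
OUT_DIR="./converted"
mkdir -p "$OUT_DIR"

if command -v docxconv >/dev/null 2>&1; then
  # Convert one document to HTML (the default format) with 2 queue workers.
  docxconv -f html -q 2 -o "$OUT_DIR" report.docx

  # To keep running and convert new documents as they appear in a
  # directory, add --watch (left commented out here because it does not return):
  # docxconv -o "$OUT_DIR" --watch ./incoming
else
  echo "docxconv not found on PATH" >&2
fi
```

Only `-o` is required; `-f` and `-q` fall back to the defaults listed above when omitted.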

Requires unoconv; see the unoconv requirements.

unoconv does not currently seem to work with LibreOffice version 4 and above. It has not been tried with OpenOffice.

Tested and working with LibreOffice v3.6.7.