Basically script the process of LaTeX -> HTML -> Word/LibreOffice. IIRC Libreoffice can be run in batch mode from a command line. Trying to extract text in any useful form from a PDF that TeX has generated is hopeless -- a lot of TeX's adjustments for kerning etc. get mixed in with the text stream.
(no subject)
Date: 2019-01-10 07:39 pm (UTC)latex2html -split 0 -no_navigation -info 0 whatever.tex