
























I took over a project that was built by an overseas team. They set up a data ingestion process. They have a step in the ingestion where they use Libre Office (in headless mode) to convert Microsoft Word documents to PDFs. Later we convert all PDFs to Markdown. They felt that it was best to convert everything to a PDF, and then convert all of the PDFs to Markdown.
What I notice is that LibreOffice can create very complex PDFs when the Microsoft Word document has:
1. tables
2. multiple columns
3. strikethrough text
I am thinking we should go straight from Microsoft Word to Markdown.
What is the right software for that?
此内容由惯性聚合(RSS阅读器)自动聚合整理,仅供阅读参考。 原文来自 — 版权归原作者所有。