Convert Word (.docx) manuscripts into MyST projects suitable for JDH publishing — structured myst.yml, article.md, extracted figures, citations, and author metadata.
The CLI runs a fixed pipeline (Pandoc → cleanup → figures/tables/crossrefs → citations → ROR enrichment). Example inputs for this monorepo live under ../docx-examples/; conversion quality is tracked in ../gap-analysis/.
You need Bun and Pandoc on your PATH.
cd doc-convert
bun install
bun run buildOptional: link the CLI globally (bun link after build).
Convert one manuscript (writes to _improved/ under the project root):
bun src/index.ts path/to/manuscript.docx --project-root path/to/project --workdir _improvedFrom the monorepo root, run all bundled examples:
../scripts/convert-docx-examples.shList all 13 pipeline steps with planned [run] / [skip] / [warn-skip] dispositions (no workdir, no Pandoc):
bun src/index.ts path/to/manuscript.docx --list-stepsOptional per-directory pipeline config — run from the project directory where you convert:
cd path/to/project
bun ../../doc-convert/src/index.ts configure
bun ../../doc-convert/src/index.ts manuscript.docx --list-steps # preview after editsFull author guide, CLI reference, and pipeline details: docs/ (guide, reference).
MIT — see LICENSE.
Made with love by
Continuous Science Foundation