Replies: 2 comments 4 replies
-
|
Beta Was this translation helpful? Give feedback.
0 replies
-
I'm not sure what the best improvement would be here. We could change the HTML reader so that (perhaps depending on the extensions enabled) it parses |
Beta Was this translation helpful? Give feedback.
4 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
I am trying to convert fairly complex
latex
documents to Microsoft Worddocx
format.Pandoc
has some limitations for this and I am thus looking into first producinghtml
usinglwarp
and then converting thehtml
todocx
using pandoc. Unfortunately, I can't seem to make pandoc understand themathjax
equations produced by lwarp.For the record, here is a simple latex file I am using for testing purposes:
The resulting mathjax html is as such:
Expand
This html renders well in a web browser:
However, it is not being parsed properly by pandoc. Here is a screenshot of the resulting docx conversion:
I tried converting the mathjax html document into a mathml html one before feeding this to pandoc. For this, I used the
tex2chtml-page
command line tool from MathJax-demos-node which seemed to do mostly what I wanted. However the equation numbers disappear in the output html fromtex2chtml-page
. And when I use this as input topandoc
, the generated docx file containsUndefined control sequence \seteqnumber
lines.I am not quite sure whether I am missing something in how I should use pandoc here, if this is an issue with
pandoc
or an issue withlwarp
. As such, this discussion is somewhat mirrored in bdtc/lwarp#25Any suggestion would be greatly appreciated.
Beta Was this translation helpful? Give feedback.
All reactions