tortistic_turtle@alien.topB to

LocalLLaMA@poweruser.forumEnglish · 2 年前

Colud LLaVa be finetuned to perform image to markdown or even image to html conversion?

3

1

Colud LLaVa be finetuned to perform image to markdown or even image to html conversion?

tortistic_turtle@alien.topB to

LocalLLaMA@poweruser.forumEnglish · 2 年前

3

Hello! I am wondering, since this would be a very interesting use case and there is more than enough training material out there (pretty much every MD file could be rendered, then the image and markdown code could be used for training/finetuning)

however I have pretty much no idea about llava. Do you think this would be feasible to do?

Chat

Byt3G33k@alien.topB
link
fedilink
English
arrow-up
1·
2 年前
Facebook Nougat OCR model does PDF to Markdown. Also fine tuned versions of it doing PDF to LaTeX. I plan on making a fine tuned version to do PDF to XML later this winter break too!