• 0 Posts
  • 10 Comments
Joined 11 months ago
cake
Cake day: October 17th, 2023

help-circle
  • You can try something like Claude.ai which has long context and is free to use.

    You can use a python script to load the model, split the text into chunks, and ask the model to translate per chunk, then you don’t need a model with 64K context window (which will take up a lot of memory and are not that common).

    It also depends on the language you are trying to translate, it would be best to find models that has been trained in the original language, most models have a large english corpus, with many finetuned with chinese data, but there are specialty models for German/arabic/japanese, try google search or find on hugging face.