I am beginner in ml,how do I get myself updated with current state of OCR. If I want to get better results than Tesseract or EasyOCR ,what path should I follow.i basically want near 100% accuracy in identifying typed/digital characters and their location in image. Is this solved ?? Any guidance would be helpful 🙏🙏
Look at Meta’s Nougat OCR, use the API, run a flask server and play with it
“Guys I’m new to medicine, I’m gonna solve cancer, any guidance would be helpful.”
😂😂
If you’re interested in a new task, it’s good for you to start by exploring Papers with Code leaderboard. You can find recent research papers related to it.
As of about a year ago, I haven’t seen anything that really outperforms Tesseract across multiple benchmarks. You can get near 100% accuracy if the image is clean and the font isn’t anything weird. But if you have image noise, you need to lower your expectations.
have you checked out the latest research papers on OCR? following top conferences like CVPR, ICCV, and NeurIPS can help you stay updated. consider exploring deep learning models like Transformers for improved accuracy. good luck! 🚀
Azure Form Recognizer OCR is very good.
hey there, as a beginner in ml, staying updated on OCR is key. to surpass Tesseract or EasyOCR, focus on deep learning models, like CNNs or transformers. achieving near 100% accuracy is tough, but pre-processing, data augmentation, and model fine-tuning can help get you there. it’s not fully solved, but keep experimenting and learning. good luck! 🙏