As a software engineer transitioning into data science, I’m undertaking the challenging project of incorporating document detection for KYC (Know Your Customer) verification. The aim is to prevent users from submitting non-document images via their phone cameras, especially after disabling the gallery upload option.
I have a pool of 3 to 4 million KYC verified user IDs. Is it feasible to utilize these IDs to enhance the document detection model? My dilemma lies in deciding whether to employ OCR (Optical Character Recognition) in part or in whole. Is it necessary to use OCR just to determine if an image contains a government ID, or CNN to verify they are of certain Id type or are there alternative approaches that could be clearer?
Additionally, I lack practical experience in deploying machine learning models and haven’t ventured beyond working with AI/ML concepts solely within notebooks.
This post is an automated archive from a submission made on /r/MachineLearning, powered by Fediverser software running on alien.top. Responses to this submission will not be seen by the original author until they claim ownership of their alien.top account. Please consider reaching out to them let them know about this post and help them migrate to Lemmy.
Lemmy users: you are still very much encouraged to participate in the discussion. There are still many other subscribers on !machinelearning@academy.garden that can benefit from your contribution and join in the conversation.
Reddit users: you can also join the fediverse right away by getting by visiting https://portal.alien.top. If you are looking for a Reddit alternative made for and by an independent community, check out Fediverser.