We DELIVER amazing creative experiences


Driver licence data extraction

OCR to Recognize and Extract Data from Driver’s Licences, ID Books, and Smart Cards

 Overview


Challenge

Identification and deriving sensitive data from image scans and facilitating document-based processes


Solution

 OCR-based solution to capture and retrieve data from document scans automatically


Tools Utilised

Python PyTorch Tesseract Opencv

capture and derive data from document scans and facilitate document workflow

 Manual data search and its recognition from ID books, smart cards, driver’s licences or any other documents are time-consuming and tedious.

That’s why the out team was looking to develop an automated data capture and extraction solution.


The challenge was integrating OCR into their ERP system to handle critical business functions. But the primary objective was to leverage data from document scans and reduce its manual processing and entry because it was taking much time.


The major challenge for our team was to capture and retrieve data from photo scans that often vary in orientations, positions, colours, backgrounds, and lighting conditions. We delivered a service that helps recognize objects in the image scans and determine what data needs to be derived.

OCR-based solution to automate data capture and extraction from document scans

Having broad experience in OCR implementation, the Alfa Xprienz team provided the client with a custom-made solution that helps automate data-based business processes in the client’s workplace and improve the overall work performance.


Our team investigated the feasibility of extracting and recognizing text fields such as name, surname, ID, valid from/to from driver’s licenses from DMV databases, business cards, ID books, and smart cards using OCR. When it comes to leveraging data, these types of documents are uneasy to deal with. In this case, facial recognition comes to aid and helps derive data from damaged driver’s licenses or blurred ID books.


The next step was finding out how to use facial recognition to identify an individuals face in their driver’s license photo or any other document scan. We started developing image pre-processing functionality that significantly improved the efficiency of a commercial built-in OCR engine by automated cropping, rotation and de-skewing. We also advanced the existing document classifier to distinguish 3 types of documents: a smart card, ID book, and a driver’s license. We split this process into two phases: image normalization and field recognition.


1. Image normalization

The main challenge for the OCR engines is to recognize rotated, skewed, darkened and blurred scans as well as photo scans with a complex background that usually leads to misunderstanding and errors. To mitigate that, we trained a model to capture a document, crop it, and de-skew an image. This phase significantly improved default pipeline recognition. As a result, we got a document without any background positioned correctly.


2. Field recognition

The next task was to capture and identify the required fields in the normalized image. To complete this, our data scientists trained another ML model that predicts where exactly the particular field is located. Having the precise location of the required fields they could crop them from the normalized image and identify them independently. That gave our team more freedom in pre-processing and optimizing the performance of the OCR engine.


OCR data capture and extraction solution to reduce manual effort

Having extensive technical expertise in developing image recognition solutions, our engineers have delivered a robust solution that automatically captures and derives sensitive data from driver’s licences, ID books, and smart cards and increases the overall operational efficiency in the client’s workplace.


Our solution enables the processing of a document scan in 2 seconds. 4 document scans can be processed simultaneously. The recognition accuracy is about 85-90%.


Collaboration with us has empowered the client in the following aspects:-

  • Automated text and image recognition
  • Easy and smart data extraction from scanned papers
  • Overall speedup when processing scanned text and images
  • OCR integration into the client’s ERP system
  • Reducing the manual effort in the client’s workplace

TALK TO US

Talk to one of our experts to discover how Alfa Xperienz Solutions can help you achieve your AI and data-driven aspirations.

Book Consultation Now
Share by: