Contact Us

Keller Williams, a real estate agent prepares documents with home buyers

Paper to Digital

CHALLENGE

Much of the world's information is locked away in paper and PDF documents. Keller Williams, the world’s largest real estate company, processes hundreds of thousands of contracts each year. The data within these contracts, however, remains largely locked away in paper and PDF documents.

SOLUTION

Extracting the data is challenging as document layouts and data field locations vary widely from document to document. For certain documents, the pages are often scanned, out of order, and the data itself may be typed, handwritten, or a combination of both. We created a system that uses content classification and computer vision algorithms (CNN and OCR) to extract data from their offer contracts.

OUTCOME

We produced a highly accurate text field detection model with a mAP (mean average precision) of over 0.9, significantly improved over the off-the-shelf open source model (with 0.3 mAP) of an SSD-based deep convolutional neural network. The system gives Keller Williams an enormous new source of data for predictive analytics by unlocking data previously trapped in hundreds of thousands of documents.

Computer Vision
OCR

Paper to Digital

CHALLENGE

Much of the world's information is locked away in paper and PDF documents. Keller Williams, the world’s largest real estate company, processes hundreds of thousands of contracts each year. The data within these contracts, however, remains largely locked away in paper and PDF documents.

SOLUTION

Extracting the data is challenging as document layouts and data field locations vary widely from document to document. For certain documents, the pages are often scanned, out of order, and the data itself may be typed, handwritten, or a combination of both. We created a system that uses content classification and computer vision algorithms (CNN and OCR) to extract data from their offer contracts.

OUTCOME

We produced a highly accurate text field detection model with a mAP (mean average precision) of over 0.9, significantly improved over the off-the-shelf open source model (with 0.3 mAP) of an SSD-based deep convolutional neural network. The system gives Keller Williams an enormous new source of data for predictive analytics by unlocking data previously trapped in hundreds of thousands of documents.

Computer Vision
OCR

Download the Case Study

More case studies
The Waste Management Early Alert Rear Safety Device
Object Detection

Waste Management Early Alert Rear Safety Device

Realty Austin Case Study cover
Recommendation Modeling

Case Study: Realty Austin

Realty Austin needed to improve the available housing recommendations sent to its customers via email. Find out how KUNGFU.AI created the system that generated significantly improved email click-through rates.
Case Study: Natural Language Processing for Audio Search, a broadcaster records a podcast that will then be indexed and archived with NLP
Natural Language Processing (NLP)

NLP for Audio Search