AI comparator by Eden AI

Benchmark OCR

Choose a dataset

The Street View Text (SVT) dataset was harvested from Google Street View. Image text in this data exhibits high variability and often has low resolution. In dealing with outdoor street level imagery, we note two characteristics. Image text often comes from business signage and business names are easily available through geographic business searches. These factors make the SVT set uniquely suited for word spotting in the wild: given a street view image, the goal is to identify words from nearby businesses.

Overview

Files	Tag(s)
/svt/img/00_00.jpg	DOLL, HUT
/svt/img/00_01.jpg	ASTORIA, BEST, INN, SUITES, VALUE
/svt/img/00_08.jpg	MARBLE, YARD, ORION, TILE
/svt/img/00_12.jpg	SUBWAY
/svt/img/00_14.jpg	HOUSE, ORIGINAL, PANCAKE
/svt/img/01_02.jpg	SOL
/svt/img/01_09.jpg	NICK
/svt/img/01_10.jpg	PAYLESS, SHOE, SOURCE

AWS

Microsoft

Google Cloud

Tesseract

Results

	Google Cloud	AWS	Microsoft	Tesseract
Percentage of words detected	89.88326848	69.64980545	38.13229572	6.61478599
Average execution time (second)	1.72686089105058	1.32396451361868	0.194967273743017	0.37840094
Pricing ($) (per image)	0.0015	0.0015	0.001	Free