|
Scanning and OCR
more...
Home
Cables, Connectors
Filers, Load Balancers
Home Networking, Cable & DSL
Hubs
KVM Switch Boxes, Cables
Mainframe, DEC, VAX, AS/400
Network Interface Cards,...
Networking, Telecom Tools
Other Networking Equipment
Print Servers, Wired
Racks, Mounts & Patch Panels
Router Components, Memory
Routers, Wired
Security, Firewall, VPN
Server Components, Memory
Servers
Software
Antivirus, Security,...
Apple, Macintosh Software
Business & Productivity
Accounting, Finance
Microsoft Money
Other
Peachtree
QuickBooks
QuickBooks Basic
Quickbooks Other
QuickBooks Premier
QuickBooks Professional
Quicken
Quicken Deluxe
Quicken Other
Quicken Premier
Business Card
Business Planning
Business Suites
Corel
Lotus
Microsoft
Office 2000
Office 2000 Small Business
Office 2003 Other
Office 2003 Professional
Office 2003 SBE
Office 2003 Standard
Office 95
Office 97
Office XP Professional
Office XP Small Business
Office XP Standard
Other
Contact Management, PIM
Document Management
Groupware
Other Business &...
Point of Sale Systems (POS)
Presentation
Project Management, Flow...
Resume & Career Management
Scanning and OCR
Spreadsheets
Tax Preparation
H&R Block
Other Tax Software
Turbo Tax
Voice Recognition
Word Processing
Other Word Processing
Word
WordPerfect
Works
Database & Development Tools
Digital Music & Video...
Downloadable Software
Education & Reference
Games & Entertainment
Graphics, Photo & Publishing
Handheld Software
Internet Related Utilities
Kids' Software
Networking
Operating Systems
Other Software
Wholesale Lots
Software, Operating Systems
Storage Equipment, NAS, SAN
Switch Components, Memory
Switches
Telephone Systems, Telecom
UPS, Power Protection, APC
Wholesale Lots
Wireless Networking, WiFi
Workstation Components,...
Workstations, Terminals
Optical character recognition, usually abbreviated to OCR, is computer software designed to translate images of handwritten or typewritten text (usually captured by a scanner) into machine-editable text, or to translate pictures of characters into a standard encoding scheme representing them (e.g. ASCII or Unicode). OCR began as a field of research in pattern recognition, artificial intelligence and machine vision. Though academic research in the field continues, the focus on OCR has shifted to implementation of proven techniques.
Optical character recognition (using optical techniques such as mirrors and lenses) and digital character recognition (using scanners and computer algorithms) were originally considered separate fields. Because very few applications survive that use true optical techniques, the optical character recognition term has now been broadened to cover digital character recognition as well.
Early systems required "training" (essentially, the provision of known samples of each character) to read a specific font. Currently, though, "intelligent" systems that can recognize most fonts with a high degree of accuracy are now common. Some systems are even capable of reproducing formatted output that closely approximates the original scanned page including images, columns and other non-textual components.
History
In 1929, G. Tauschek obtained a patent on OCR in Germany, followed by Handel who obtained a US patent on OCR in USA in 1933 (U.S. Patent 1,915,993). Tauschek was in 1935 also granted a US patent on his method (U.S. Patent 2,026,329).
Tauschek's machine was a mechanical device that used templates. A photodetector was placed so that when the template and the character to be recognised was lined up for an exact match, and a light was directed towards it, no light would reach the photodetector.
In 1950, David Shepard, a cryptanalyst at the Armed Forces Security Agency in the United States, was asked by Frank Rowlett, who had broken the Japanese PURPLE diplomatic code, to work with Dr. Louis Tordella to recommend data automation procedures for the Agency. This included the problem of converting printed messages into machine language for computer processing. Shepard decided it must be possible to build a machine to do this, and, with the help of Harvey Cook, a friend, built "Gismo" in his attic during evenings and weekends. This was reported in the Washington Daily News on April 27, 1951 and in the New York Times on December 26, 1953 after his U.S. Patent Number 2,663,758 was issued. Shepard then founded Intelligent Machines Research Corporation (IMR), which went on to deliver the world's first several OCR systems used in commercial operation. While both Gismo and the later IMR systems used image analysis, as opposed to character matching, and could accept some font variation, Gismo was limited to reasonably close vertical registration, whereas the following commercial IMR scanners analyzed characters anywhere in the scanned field, a practical necessity on real world documents.
Read more at Wikipedia.org
|
|