Python图片文字识别——Windows下Tesseract-OCR的安装与使用

作者：System 时间：2024年08月12日分类：所有,python 字数：693

这篇文章距离上次修改已过509天，其中的内容可能已经有所变动。




# 安装Tesseract-OCR
brew install tesseract
 
# 验证安装是否成功
tesseract --version
 
# 安装Python包识别图片中的文字
pip install pytesseract




# Python代码使用Tesseract-OCR识别图片中的文字
from PIL import Image
import pytesseract
 
# 指定tesseract.exe的安装路径
pytesseract.pytesseract.tesseract_cmd = r'C:\Program Files\Tesseract-OCR\tesseract.exe'
 
# 要识别的图片文件
image_path = 'example.png'
image = Image.open(image_path)
 
# 使用Tesseract-OCR识别图片中的文字
text = pytesseract.image_to_string(image)
print(text)

以上代码首先使用Homebrew（macOS的包管理器）安装Tesseract-OCR，然后验证安装是否成功。接着，使用pip安装Python的Tesseract-OCR包，并在Python脚本中展示如何使用它来识别图片中的文字。在使用前需要指定Tesseract-OCR的安装路径。这是在Windows系统中安装和使用Tesseract-OCR的一个基本示例。

Python图片文字识别——Windows下Tesseract-OCR的安装与使用

评论已关闭

推荐阅读